To Optimize Robots.txt file is a challenging task. It plays an important role in SEO of your blog. You must have definitely heard about Robots.txt. But are you taking full use of this file? Overlooking this factor can harm your site rankings. If it is wrongly configured, search engines may completely ignore your entire site which can lead to completely disappearing your blog from searches. Today, in this article, I will be explaining how you can edit and optimize Robots.txt file for better SEO. I will be breaking down this article to keep it simple and easy to read.
What Is Robots.txt File?
The robots.txt file is also known as the Robots Exclusions Protocol. It instructs search engine bots on how to crawl a website i.e. what pages to crawl and what pages to ignore. Whenever any search engine bots come to your site, it reads the robots.txt file and follows the instructions. If you have not configured it properly, search engines crawlers and spiders may not index important pages or may index unwanted pages/folders also. So, it is very important to optimize Robots.txt File. This file is placed under root domain directory (E.g. www.yourdomain.com/robots.txt). There can be only one robots.txt on your site and it will be under root directory as mentioned earlier. You can either use cPanel or FTP client to view this file. It is just like any ordinary text file, and you can open it with a plain text editor like Notepad and edit accordingly.
How To create Robots.txt File?
WordPress provides automatic creation of the robots.txt file. So, if you are using WordPress, you should already have a robots.txt file under your root directory. If you are using other CMS or your website doesn’t have any robot.txt file, then just create a simple notepad file and name it as robots.txt and upload it to your site’s root folder using FTP client or cPanel. There are number of robot txt file generators available online.
To check in cPanel, Go to File Manager –> public_html folder. You should have robots.txt file present here.
Understanding Content Of Robots.txt File
Before jumping directly to optimize robots.txt file, let us understand the basics of robots.txt file. There are three commands mainly – User-agent, allow, disallow. We will be configuring these commands to gain better SEO. Let’s see what these commands mean –
Here’s a basic sample of Robots.txt file.
User-agent: *
Disallow: /wp-admin/
Allow: /
I hope you can understand the above sample now. Here, we are instructing all the search engine bots (since we are using User-agent: * // line 1) not to crawl or index /wp-admin/ part of your website (// line 2) and then allowing to crawl and index other parts of your website(// line 3).
Some more examples for your better understanding:
Allow indexing of everything
User-agent: *
Allow: /
Disallow indexing of everything
User-agent: *
Disallow: /
Disallow any particular bot (say Googlebot) from indexing of a particular folder (myfolder) but allowing a page (mypage) in that folder.
User-agent: Googlebot
Disallow: /myfolder/
Allow: /myfolder/mypage
Now, you are familiar with the robots.txt file. Let’s proceed to how you can edit and optimize robots.txt file to gain maximum benefits. Editing robots.txt is one of the things you need to do after Installing WordPress.
Editing the Robots.txt File
You can edit the robots.txt file from cPanel or your FTP client. To edit this through cPanel, follow below steps:
Step 1: Log in to your cPanel account
Step 2: Go to File Manager
Step 3: Go to the public_html folder of your website
Step 4: Locate robots.txt file
Step 5: Right-click on file and select ‘Edit’ as shown below:
Step 6: It will prompt the below message. Take the backup if you want and click on ‘Edit’.
It will open the file in an editable mode. Make necessary changes and click on save changes.
How To Optimize Robots.txt File?
As I mentioned earlier, it is a very challenging task to optimize robots.txt considering all the factors. An un-optimized robots.txt file can harm your SEO and can completely de-index your blog (E.g. if you use the command “Disallow: /” in Robots.txt file, your site will be de-indexed from search engines.). Keep the following things in mind when you start to optimize robots.txt file.
Also Read: What is .htaccess File? How To Secure Your Blog With It
Following is the robots.txt file of my blog.
This is just a sample robot txt file for you. You can easily see what folders I have disallowed specifically. You might be wondering about the/go/ folder. I am using /go/ to denote my cloaked affiliated links. Since I do not want crawlers and bots to index this, I am disallowing. It is always a good practice to include your website’s sitemap in the robots.txt file. It can be placed anywhere in the robots.txt instruction. Mostly it is placed either at the top or bottom of the instruction.
Your robots.txt file can differ from mine since you might have different requirements and other private folders. Few other things that you can do are:
For example, # Allowing xyz bot to crawl xyz folder.
To do so write: Disallow: /readme.html
You should also disallow the WordPress plugin directory for security reasons. Simply write Disallow: /wp-content/plugins/
User-agent: *
Disallow: /*?
User-agent: Googlebot
Disallow: /*.html$
Other tips to optimize robots.txt file
Testing Robot.txt File in Google WebMaster Tool (now Google Search Console)
After you have edited and optimize robots.txt file, your first thing should be to test this file if that is properly configured or not. To do so:
Step 1) log in to your Google Search Console account
Step 2) Navigate to the ‘Crawl’ section from the left sidebar.
Step 3) Click on the ‘robots.txt Tester’
Step 4) It will show the latest robots.txt file on your website. If you have not changed permanently as of now, you can simply paste the content of the optimized robot.txt file and test. Below is the snapshot
Step 5) Select the bot for which you want to test. There are many bots available such as – Googlebot-Video, Googlebot-News, Googlebot-Image, etc.
Step 6) Click on the ‘Test’ button.
If everything is good and bots are allowed to crawl your website, it will prompt ‘ALLOWED’ with green color showing your settings are fine.
Step 7) You can submit the robots.txt file now by clicking on the ‘Submit’ button.
Congratulation! You have successfully optimized robots.txt file now.
Also Read: How To Optimize URL Structure Of Your Blog?
Hope this guide has helped you in understanding various aspects of Robots.txt. Don’t Forget To Share It with your friends and subscribe to our email newsletter for more such updates. If you have any questions on how to optimize robots.txt file, please feel free to ask in the comments section below.
FreshBooks is a very popular cloud-based accounting software. Freshbooks accounting software is used by all…
Blogging is not an easy job. As a blogger, you not only need to write…
Are you looking for A2 Hosting Coupon Codes on this year's most awaited Black Friday…
If you’ve got a stash of unused gift cards lying around, you might be wondering…
There's no denying it. At the point when social media works, it can change your…
The USP DropGenius offers is the power to be able to launch a profitable dropshipping…
View Comments
Nice Post..!
thanks for proving this article is my robots.txt is ok please tell me sir http://tricks11.com/
Useful Post, Can i add rss feed url in robots tag ?