Thursday 29 December 2011

Robots.txt


Robots.txt
 
The concept of robot file is developed more than a decade ago. Robots file is a text file that you can create and put on your website to guide the search engine crawlers, about which pages you would like them to visit and which ones you not like them to visit.

There is no need that search engines must do according to the robots text, but often they tend to obey what the robots file tells them. It not like preventing the search engines from entering your site like the firewalls or password protection, but it is like instruction from the site that “please do not enter this page or something”. 

In case your site contains two version of a page say for example one for the site visitors’ purpose and another one for the printing purpose, there are chances that the search engine crawl twice and black list your site, because of duplicity of contents. But using Robots text, you can command the search engines not to enter certain pages and keep it in your control. 

In case where, you have sensitive data on your site that you don’t want the world to see and utilize it, you can use the robot text to stop the search engines from entering into your site. The robots text can also be used to restrict the search engines from loading the unnecessary items such as images, style sheets and javascript from indexing, so that you can reduce the loading time of the site. 

There should be caution while using the robots text as it should be in main directory since otherwise search engines will not be able to find it. Usually they never search the entire site for file name robot.txt. But they look at the main directory. If they don’t find it there, they tend understand that there is no such things and it will index everything they find in the site.
Do you Like this story..?

Get Free Email Updates Daily!

Follow us!

0 comments:

Post a Comment