I don't know anything about robots.txt. What is robots.txt?
Does it apply only to websites that have their own domain? Please explain. I don't understand robots.txt.
Robots.txt is a file that must be placed in the root directory of the domain, so its address is the domain followed directly by /robots.txt. It is used to prevent specific pages of the website from being crawled by bots.
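To make the "root of the domain" point concrete, here is a minimal Python sketch that works out where a crawler would look for robots.txt, given any page URL. The function name robots_url and the example.com addresses are made up for illustration. Note that the file is looked up per host, so a subdomain would need its own robots.txt.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the robots.txt address for the host that serves page_url."""
    parts = urlsplit(page_url)
    # Keep only the scheme and host (including any port); the path is
    # always /robots.txt, and query string and fragment are dropped.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Hypothetical page URL, used only for illustration.
print(robots_url("https://www.example.com/blog/post.html?id=7"))
# prints https://www.example.com/robots.txt
```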
Robots.txt is the file that spiders read first. It contains exclusion rules for files and directories. We mainly use it to disallow the pages we want to hide from search engine crawlers.
robots.txt is simply a plain-text file that a web publisher should put in the root directory of their website. The text file includes instructions that tell indexing spiders, or "robots," what content and directories on that website they may, or may not, look at.
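As a rough illustration of what those instructions look like and how a robot reads them, here is a small sketch using Python's standard urllib.robotparser module. The sample rules, the /private/ and /tmp/ directories, the ExampleBot name, and the example.com host are all assumptions made up for the example, not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# A made-up robots.txt: every robot is asked to stay out of two directories.
SAMPLE_RULES = """\
User-agent: *
Disallow: /private/
Disallow: /tmp/
"""

parser = RobotFileParser()
parser.parse(SAMPLE_RULES.splitlines())


def allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the sample rules let `agent` fetch `path`."""
    return parser.can_fetch(agent, "https://www.example.com" + path)


print(allowed("/index.html"))           # True
print(allowed("/private/report.html"))  # False
```

Each Disallow line is matched as a prefix of the URL path, which is why everything under /private/ is excluded.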
The robots.txt file is plain text, and the search engine robot that reads it works a bit like a web browser. The robot browses the pages of a website, crawls them, and sends them to the indexer. A search engine is usually described in parts: the spider (also called the crawler) and the indexer.
You can block search engine spiders with a robots.txt file. In this file you can disallow whichever pages you don't want crawled.
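If it helps, here is a small Python sketch that writes out such a file from a list of paths you want to block. The helper name build_robots_txt and the two paths are hypothetical; the output would be saved as robots.txt in the root directory of the site.

```python
def build_robots_txt(disallowed_paths, user_agent="*"):
    """Build robots.txt text that asks `user_agent` to skip the given paths."""
    for path in disallowed_paths:
        # Rules are matched against URL paths, which start with a slash.
        if not path.startswith("/"):
            raise ValueError(f"path must start with '/': {path!r}")
    lines = [f"User-agent: {user_agent}"]
    lines += [f"Disallow: {path}" for path in disallowed_paths]
    return "\n".join(lines) + "\n"


# Hypothetical paths, used only for illustration.
print(build_robots_txt(["/private/", "/tmp/"]), end="")
```

With the two example paths, this prints a "User-agent: *" line followed by one Disallow line per path.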
For more detail, you can refer to this link: The Web Robots Pages
Last edited by Alan Smith; 09-11-2012 at 04:27 AM.
Robots.txt is a file that must be placed in the root directory of the domain. It is used to prevent specific pages of the website from being crawled by bots. Whether links are followed is controlled separately, with the robots meta tag or rel="nofollow", not by robots.txt.
Simply put, it is a text file that helps search engines know which parts of your site to crawl.
Thanks for the help and for the thread. Now it's very clear what robots.txt is. I didn't know that before. Thanks.
It's a file through which you can stop search engines from crawling and indexing certain pages or files of your website. You can read more about it on the Google Webmaster Central blog.
Basically, it is a file used to allow or deny bots, spiders, or any search engine access to your site.
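To show the allow/deny idea per bot, here is a hedged sketch, again with Python's standard urllib.robotparser. The bot names BadBot and GoodBot, the /admin/ directory, and the example.com host are invented for the example. One group of rules shuts a single bot out of the whole site, while every other bot is only kept out of one directory.

```python
from urllib.robotparser import RobotFileParser

# Made-up rules: BadBot is denied everything, all other robots
# are only asked to stay out of /admin/.
RULES = """\
User-agent: BadBot
Disallow: /

User-agent: *
Disallow: /admin/
"""

rp = RobotFileParser()
rp.parse(RULES.splitlines())

SITE = "https://www.example.com"  # hypothetical host

print(rp.can_fetch("BadBot", SITE + "/index.html"))    # False
print(rp.can_fetch("GoodBot", SITE + "/index.html"))   # True
print(rp.can_fetch("GoodBot", SITE + "/admin/login"))  # False
```

Keep in mind this only describes what a well-behaved bot will do with the file; the file itself cannot stop a bot that ignores it.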
Robots.txt is a simple text file that tells search engines which pages should or should not be crawled.
Robots.txt is a text (not HTML) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means binding on search engines, but they generally obey what they are asked not to do.
The robots.txt file is a regular text file that, through its name, has special meaning to the majority of "honourable" robots on the web. The text file includes instructions that tell indexing spiders, or robots, what content and directories on the website they may or may not look at.
robots.txt tells a crawler whether or not it should crawl and index a URL.