Welcome to the IMTalk - Internet Marketing & SEO Forum.
  • Login:
+ Reply to Thread
Results 1 to 16 of 16
  1. #1
    Garyu73 is offline IM & SEO Quiet One Garyu73 is on a distinguished road
    Join Date
    May 2011
    Posts
    8
    Thanks Given
    0
    Thanked 2 Times in 2 Posts

    What is robots.txt?

    I dont know about robots.txt. What is robots.txt?
    What only to websites that have a domain only? please explain. I dont understaind about robots.txt

  2. #2
    Hogward is offline IM & SEO Mumbler Hogward is on a distinguished road
    Join Date
    Apr 2011
    Posts
    330
    Thanks Given
    11
    Thanked 36 Times in 29 Posts
    Robots.txt is an file which must be placed immediately after the root domain.. It is used to prevent the specific pages of the website from being crawled by the Bots.

  3. #3
    redchillimedia's Avatar
    redchillimedia is offline IM & SEO Weak Jaw redchillimedia is on a distinguished road
    Join Date
    Jul 2011
    Posts
    105
    Thanks Given
    3
    Thanked 15 Times in 14 Posts
    Robots.txt is a file to which spiders read first. It contain file exclusion property. Mainly we use it to disallow those pages to which we want to hide from search engine crawlers.

  4. #4
    Sherly is offline IM & SEO Weak Jaw Sherly is on a distinguished road
    Join Date
    May 2011
    Posts
    233
    Thanks Given
    1
    Thanked 11 Times in 9 Posts
    robots.txt is simply a plain-text file that a Web publisher should put in the root directory of their website. The text files includes instructions that tell indexing spiders, or "robots," what content and directories on that website they may, or may not, look at.

  5. #5
    yamunacbs is offline IM & SEO Quiet One yamunacbs is on a distinguished road
    Join Date
    Aug 2011
    Posts
    3
    Thanks Given
    0
    Thanked 0 Times in 0 Posts
    robot.txt file is plain text work like a web browser. It browse the all web page and index page of website. It crawl your website and then send it to indexer. It include three part spider,crawler and Indexer.

  6. #6
    Alan Smith is offline IM & SEO Weak Jaw Alan Smith will become famous soon enough
    Join Date
    Jul 2011
    Posts
    184
    Thanks Given
    0
    Thanked 63 Times in 53 Posts
    You can block search engine spider through robots.txt file. In this file you can disallow that pages whichever you don’t want to crawl.
    For getting more detail you can refer this link The Web Robots Pages
    Last edited by Alan Smith; 09-11-2012 at 04:27 AM.

  7. #7
    webhost is offline IM & SEO Quiet One webhost is on a distinguished road
    Join Date
    Sep 2011
    Posts
    11
    Thanks Given
    0
    Thanked 0 Times in 0 Posts
    Robots.txt is a file which must be placed immediately after the root domain. It is used to prevent the specific pages of the website from being crawled by the Bots and it also indicate the follow links.

  8. #8
    semwinner is offline IM & SEO Whisperer semwinner is on a distinguished road
    Join Date
    Sep 2011
    Posts
    27
    Thanks Given
    0
    Thanked 0 Times in 0 Posts
    simply saying, it is a text help search engine know your site.

  9. #9
    royjonesjr is offline IM & SEO Quiet One royjonesjr is on a distinguished road
    Join Date
    Oct 2011
    Posts
    17
    Thanks Given
    0
    Thanked 0 Times in 0 Posts
    Thanks for help and for thread. Now it's very clear what is robots.txt.. dont knew that before. thanks.

  10. #10
    paddy is offline IM & SEO Whisperer paddy is on a distinguished road
    Join Date
    Jul 2011
    Posts
    32
    Thanks Given
    0
    Thanked 0 Times in 0 Posts
    Its a file through which you can stop SE to index & crawl your website certain pages or files. You can read more on it on google central blog.

  11. #11
    Ammy Tisdale is offline Banned Ammy Tisdale is on a distinguished road
    Join Date
    Oct 2011
    Posts
    41
    Thanks Given
    0
    Thanked 2 Times in 2 Posts
    Quote Originally Posted by redchillimedia View Post
    Robots.txt is a file to which spiders read first. It contain file exclusion property. Mainly we use it to disallow those pages to which we want to hide from search engine crawlers.
    Agree with you. There is also an option that you can allow to any search engine or any to not to crawl.

  12. #12
    idreesfarooq's Avatar
    idreesfarooq is offline IM & SEO Chatty idreesfarooq is a jewel in the rough idreesfarooq is a jewel in the rough idreesfarooq is a jewel in the rough idreesfarooq is a jewel in the rough
    Join Date
    Mar 2011
    Location
    Pakistan
    Posts
    1,315
    Thanks Given
    214
    Thanked 306 Times in 213 Posts
    Basically a file used to allow or deny access to Bots, Spiders or any SE to your site.

  13. #13
    isoftx is offline IM & SEO Quiet One isoftx is on a distinguished road
    Join Date
    Oct 2011
    Posts
    15
    Thanks Given
    0
    Thanked 1 Time in 1 Post
    Robots.txt is a simple text file that tells the search which page should be crawled or not.

  14. #14
    googlesiterank is offline IM & SEO Quiet One googlesiterank is on a distinguished road
    Join Date
    Aug 2011
    Posts
    21
    Thanks Given
    0
    Thanked 1 Time in 1 Post
    Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.

  15. #15
    enni is offline IM & SEO Quiet One enni is on a distinguished road
    Join Date
    Aug 2011
    Posts
    10
    Thanks Given
    0
    Thanked 0 Times in 0 Posts
    The robot.txt is a regular text file that through its name has special meaning to the majority of honourable robot on the web.The text file include instruction that tell indexing spider or robots, what content and directories on the website they may or may not.

  16. #16
    cirerare is offline IM & SEO Quiet One cirerare is on a distinguished road
    Join Date
    Oct 2011
    Posts
    3
    Thanks Given
    0
    Thanked 0 Times in 0 Posts
    robot.txt was warning crawler should to index url or not


 

Similar Threads

  1. Robots.txt and extensions
    By Hajoless in forum General SEO Talk
    Replies: 1
    Last Post: 08-22-2011, 03:05 PM
  2. Twitter's robots.txt question:
    By Hema in forum Social Networks & Community Websites
    Replies: 0
    Last Post: 12-09-2010, 08:42 AM

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts