  1. #1
    Registered User
    Join Date
    May 2015
    Location
    Augusta
    Posts
    107

    What is a robots.txt file?

    I don't know what it is, so please explain.
    Thanks & Regards
    Ryan Smith

  2. #2
    Senior Member
    Join Date
    Jun 2013
    Location
    Forum
    Posts
    5,019
    Robots.txt is a text file that is used to define crawler activity on your website. It lists the paths that crawlers are asked not to crawl.

  3. #3
    Junior Member
    Join Date
    Nov 2015
    Posts
    11
    Robots.txt tells Google which files it may crawl and which it may not.
    Below is the basic syntax of robots.txt:

    User-agent: [the name of the robot the following rule applies to]

    Disallow: [the URL path you want to block]

    Allow: [the URL path of a subdirectory, within a blocked parent directory, that you want to unblock]
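To see how the three directives interact, here is a minimal sketch using Python's standard `urllib.robotparser` module. The paths, the `ExampleBot` name, and the `allowed` helper are made up for illustration. Note that this module applies the first rule that matches, so the Allow line is placed before the Disallow line; other crawlers may resolve conflicts differently.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: /private/ is blocked, but one page inside it is unblocked.
# Allow comes first because urllib.robotparser applies the first matching rule.
RULES = """\
User-agent: *
Allow: /private/public-report.html
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())


def allowed(path: str) -> bool:
    """Ask whether a generic crawler may fetch the given path on example.com."""
    return parser.can_fetch("ExampleBot", "http://www.example.com" + path)


print(allowed("/private/public-report.html"))  # True
print(allowed("/private/secret.html"))         # False
print(allowed("/index.html"))                  # True
```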

  4. #4
    Registered User
    Join Date
    Aug 2014
    Posts
    658
    Robots.txt:
    It is a plain text file placed in the root directory of a website that tells search engine spiders which files to crawl and which to skip.

  5. #5
    Senior Member
    Join Date
    Nov 2015
    Location
    United Kingdom
    Posts
    241
    Website owners use the /robots.txt file to give instructions about their site to web robots; this is called the Robots Exclusion Protocol. It works like this: a robot wants to visit a website URL, say http://www.example.com/welcome.html. Before it does so, it first checks for http://www.example.com/robots.txt.
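A crawler finds the file by keeping the scheme and host of the URL it wants to visit and swapping the path for /robots.txt. A small sketch of that step in Python follows; the `robots_url` helper name is an assumption for illustration, not part of any library.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the robots.txt location for the site that serves page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host; drop the original path, query, and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_url("http://www.example.com/welcome.html"))
# http://www.example.com/robots.txt
```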

  7. #7
    Registered User
    Join Date
    May 2015
    Location
    Augusta
    Posts
    107
    Thanks, all, for providing such valuable information about the robots.txt file. I was very curious about this, and now I am even more interested in it.
    Thanks & Regards
    Ryan Smith

  8. #8
    Registered User
    Join Date
    Aug 2015
    Location
    Dhaka
    Posts
    31
    Robots.txt is a text file that tells Googlebot which pages it should crawl and which pages it should not. For example:
    User-agent: *
    Disallow: /
    The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.
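The effect of "Disallow: /" can be checked with Python's standard `urllib.robotparser` module. This is only a sketch: the `ExampleBot` name and the example.com URLs are assumptions, and the second parser is added for contrast to show that an empty Disallow value blocks nothing.

```python
from urllib.robotparser import RobotFileParser

# "Disallow: /" blocks every path for every robot.
block_all = RobotFileParser()
block_all.parse(["User-agent: *", "Disallow: /"])

# For contrast: an empty Disallow value blocks nothing.
allow_all = RobotFileParser()
allow_all.parse(["User-agent: *", "Disallow:"])

url = "http://www.example.com/any/page.html"
print(block_all.can_fetch("ExampleBot", url))  # False
print(allow_all.can_fetch("ExampleBot", url))  # True
```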

  9. #9
    Registered User
    Join Date
    Jul 2015
    Posts
    8
    Robots.txt is a text file that is mainly used to instruct search engine robots to crawl or not crawl a particular page or website.

  10. #10
    Registered User
    Join Date
    Sep 2015
    Posts
    270
    The robots.txt file is a very powerful file if you're working on a site's SEO, but one that also has to be used with care, because a single wrong rule such as "Disallow: /" can block crawlers from the whole site.

  11. #11
    Registered User
    Join Date
    Dec 2012
    Location
    India
    Posts
    928
    Robots.txt is part of on-page SEO and is used to give instructions to web robots, also known as web wanderers, crawlers, or spiders. A robot is a program that traverses websites automatically, which is how popular search engines like Google index a website and its content.

  12. #12
    Senior Member
    Join Date
    Mar 2015
    Posts
    136
    Robots.txt is a text file that you place in your site's root folder to tell Googlebot which pages you would like it not to crawl. It looks like this:

    User-agent: *
    Allow: /
    Disallow: /admin

    Note: Use Allow for the paths you want Googlebot to crawl. On the other hand, use Disallow for the paths you don't want it to crawl.
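When both an Allow and a Disallow rule match a path, as with "Allow: /" and "Disallow: /admin" here, Google and the Robots Exclusion Protocol standard (RFC 9309) use the most specific rule, meaning the one with the longest matching path. Python's `urllib.robotparser` uses the first matching rule instead, so the sketch below hand-rolls the longest-match idea. It is a simplification: `is_allowed` is a hypothetical helper, and wildcards (`*`, `$`) are not handled.

```python
def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Longest-match precedence over (directive, path_prefix) rules.

    Simplified sketch: plain prefix matching only, no wildcard support.
    """
    best = None  # the most specific matching rule seen so far
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        if (
            best is None
            or len(prefix) > len(best[1])
            # On a tie in length, prefer the less restrictive Allow rule.
            or (len(prefix) == len(best[1]) and directive == "allow")
        ):
            best = (directive, prefix)
    # No matching rule means the path is allowed by default.
    return best is None or best[0] == "allow"


rules = [("allow", "/"), ("disallow", "/admin")]
print(is_allowed("/admin/login", rules))  # False
print(is_allowed("/blog/post", rules))    # True
```

Because matching is by prefix, "Disallow: /admin" also covers paths such as /administrator; writing "Disallow: /admin/" limits the rule to that folder.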

  13. #13
    Registered User
    Join Date
    Aug 2014
    Posts
    34
    The robots.txt file informs search engine crawlers which pages or folders they may crawl and which they may not. To set rules for all bots, use an asterisk (*) as the user agent. If you also add a group for a specific bot (for instance Googlebot), that bot follows its own group instead of the general one. Both cases are shown below.

    User-agent: *
    Disallow: /admin

    The second group applies only to Googlebot:

    User-agent: googlebot
    Disallow: /se/
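The two groups can be combined in one file and checked with Python's standard `urllib.robotparser` module. This is a sketch under assumptions: `OtherBot` is a made-up crawler name and the example.com URLs are placeholders.

```python
from urllib.robotparser import RobotFileParser

# One file with a general group and a Googlebot-specific group.
RULES = """\
User-agent: *
Disallow: /admin

User-agent: googlebot
Disallow: /se/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

site = "http://www.example.com"

# Googlebot follows only its own group, so /admin is not blocked for it.
print(parser.can_fetch("Googlebot", site + "/se/page"))  # False
print(parser.can_fetch("Googlebot", site + "/admin"))    # True

# Any other bot falls back to the "*" group.
print(parser.can_fetch("OtherBot", site + "/admin"))     # False
print(parser.can_fetch("OtherBot", site + "/se/page"))   # True
```

If Googlebot should also stay out of /admin, the rule has to be repeated inside the googlebot group.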

  14. #14
    Registered User
    Join Date
    Dec 2015
    Posts
    14
    robots.txt is a file that tells crawlers which pages they may or may not crawl. Index/noindex and follow/nofollow are controlled separately, through the robots meta tag.

  15. #15
    Senior Member
    Join Date
    Mar 2011
    Location
    New Delhi
    Posts
    642
    Robots.txt is used by site owners to give web robots instructions about their site, such as which pages to crawl and which to skip. This is called the Robots Exclusion Protocol.
