Page 1 of 2 12 LastLast
Results 1 to 15 of 16
  1. #1
    Senior Member
    Join Date
    Sep 2017
    Location
    New york
    Posts
    117

    What is robots.txt?

    Hlo Friends,
    What is robots.txt?

  2. #2
    Registered User
    Join Date
    Sep 2017
    Posts
    90
    The robots.txt file is a text file. This is a file that lets you compose a syntax. Read the Search Engine's Spiders for this file. In Spiders are the robots.txt.Syntax is the most popular way to go from the internet to another computer. The robot.txt is the simplest way to help us, we have a robots.txt file.

  3. #3
    Senior Member
    Join Date
    Jul 2017
    Posts
    227
    Robots.txt is a text file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.

  4. #4
    Senior Member
    Join Date
    Feb 2017
    Posts
    219
    It's a special kind of text file containing the instruction for Crawlers to crawl webpage(s), domains, files or directory.

  5. #5
    Registered User 24x7servermanag's Avatar
    Join Date
    Jul 2017
    Location
    India
    Posts
    1,020
    Robot.txt is used to crawl the pages website. It tells that which part of the area should not be accessed. We can define the pages which should not be accessed by putting the disallow tag in robot.txt. Those disallow pages are restricted to visit. It also help to index the web content.

    There is limit of 500 Kb for robot.txt file.
    Server Management Company
    India's Leading Managed Service Provider | Skype: techs24x7
    Cpanel Technical Discussions - Lets talk !

  6. #6
    Registered User
    Join Date
    Jul 2015
    Location
    jaipur
    Posts
    251
    It is a special file which tells the crawler which part of the website should index or not

  7. #7
    Registered User
    Join Date
    Sep 2017
    Posts
    1,192
    Robotx.txt file is a standard used as a means of communication between the website and crawlers.

    Robots.txt file instructs or tells the crawlers about which pages should be crawled and which shouldn't be crawled.

    If Robots.txt file doesn't exists then the crawler will asssume by default that the whole website has to be crawled. It mifght happen that some pages which you don't want to be cralwed may also be crawled due to the absence of a Robots.txt file.

  8. #8
    Registered User
    Join Date
    Feb 2017
    Posts
    754
    Robots.txt is a file which is used to instruct the search engines about crawling and indexing of the particular web page.

  9. #9
    Registered User
    Join Date
    May 2016
    Posts
    151
    The robots exclusion standard, also known as the robots exclusion protocol or simply robots.txt, is a standard used by websites to communicate with web crawlers and other web robots. The standard specifies how to inform the web robot about which areas of the website should not be processed or scanned.

  10. #10
    Member
    Join Date
    Sep 2017
    Posts
    55
    We use Robot.txt file to indicate web crawler not to search some content like admin etc.

  11. #11
    Registered User
    Join Date
    Dec 2015
    Location
    USA
    Posts
    1,060
    HI

    A robots.txt file is a file at the root of your site that indicates those parts of your site you don't want accessed by search engine crawlers.
    https://www.welllivingshop.com/bedding/duvet-covers/

  12. #12
    Registered User
    Join Date
    Jul 2016
    Posts
    256
    Robots.txt is the file through that you can give permission to search engine to crawl your website pages or not.

  13. #13

  14. #14
    Registered User
    Join Date
    Sep 2017
    Posts
    259
    Robots.txt is a content document you put on your site to tell look robots which pages you might want them not to visit. Robots.txt is in no way, shape or form obligatory for web search tools however for the most part web crawlers obey what they are requested that not do.

  15. #15
    Senior Member
    Join Date
    Sep 2017
    Posts
    153
    Robots.txt is a record which is utilized to teach the web indexes about slithering and ordering of the specific website page.

Page 1 of 2 12 LastLast

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  

  Find Web Hosting      
  Shared Web Hosting UNIX & Linux Web Hosting Windows Web Hosting Adult Web Hosting
  ASP ASP.NET Web Hosting Reseller Web Hosting VPS Web Hosting Managed Web Hosting
  Cloud Web Hosting Dedicated Server E-commerce Web Hosting Cheap Web Hosting


Premium Partners:


Visit forums.thewebhostbiz.com: to discuss the web hosting business, buy and sell websites and domain names, and discuss current web hosting tools and software.