Page 2 of 2 FirstFirst 12
Results 16 to 28 of 28
  1. #16
    Registered User
    Join Date
    Feb 2015
    Posts
    237
    Quote Originally Posted by davidweb09 View Post
    Robots.txt file have set of rules that are used to send instructions to search engine bots while indexing the website.
    Thanks for sharing this post. Because that's information is really very nice and informative.

  2. #17
    Registered User
    Join Date
    Jun 2018
    Posts
    1,416
    Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do. It is important to clarify that robots.txt is not a way from preventing search engines from crawling your site (i.e. it is not a firewall, or a kind of password protection) and the fact that you put a robots.txt file is something like putting a note “Please, do not enter” on an unlocked door – e.g. you cannot prevent thieves from coming in but the good guys will not open to door and enter. That is why we say that if you have really sen sitive data, it is too naïve to rely on robots.txt to protect it from being indexed and displayed in search results.

  3. #18
    Senior Member
    Join Date
    Feb 2018
    Location
    USA
    Posts
    138
    Well, its really an impressive information. Thanks for share it.

  4. #19
    Member
    Join Date
    Dec 2018
    Location
    USA
    Posts
    94
    Robots.txt is a text file webmasters build to advise web robots (typically search engine robots) how to crawl pages on the website. The robots.txt file is the piece of the robots exclusion protocol (REP), an association of web standards that regulate how robots crawl the web, access, and index content, and serve that content up to users.Robots.txt is by no means compulsory for search engines but commonly, search engines observe what they are demand not to do.

  5. #20
    Registered User
    Join Date
    Dec 2016
    Location
    Chennai,India.
    Posts
    120
    Robots.txt is a file on a website that instructs search engine crawlers which parts of the site should not be accessed by search engine bot programs. Robots.txt is a plaintext file but uses special commands and syntax for webcrawlers. Though not officially standardized, robots.txt is generally followed by all search engines.

  6. #21

  7. #22
    Member
    Join Date
    Nov 2018
    Posts
    65
    Robots.txt is a content record website admins make to teach web robots (ordinarily web crawler robots) how to slither pages on their site.

  8. #23
    Registered User
    Join Date
    Dec 2018
    Posts
    2
    robot.txt is used to instruct google and other search engine bots which file you have indexed or crawled and vice versa so we use robot.txt to give instruction and stop from unusual crawling

  9. #24
    Registered User
    Join Date
    Dec 2018
    Posts
    2
    Thanks for sharing this information and robot.txt we basically use to give instruction google and other Bot

  10. #25
    Member
    Join Date
    Aug 2018
    Posts
    97
    Robot.txt is utilized to crawl the webpage internet site. It informs. We can specify the pages that must not be obtained by putting the disallow label in robot.txt. Those disallow pages have been limited to see. Additionally, it help index the internet content.

  11. #26
    Registered User
    Join Date
    Dec 2018
    Location
    pune
    Posts
    13
    The robots.txt file is primarily used to specify which parts of your website should be crawled by spiders or crawlers. Googlebot, bingbot are the examples of a web spider. Spider look for this file in host directory.

  12. #27
    Registered User
    Join Date
    Dec 2018
    Posts
    154
    A robot.txt file is a file at the root of your site that indicates those parts of your site you do not want to be accessed by search engine crawlers. The file uses the Robots Exclusion Standard, which is a protocol with a small set of commands that can be used to indicate access to your site by section and by specific kinds of web crawlers (such as mobile crawlers vs desktop crawlers).

  13. #28
    Registered User
    Join Date
    Dec 2017
    Location
    Chennai
    Posts
    114
    Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do. It is important to clarify that robots.txt is not a way from preventing search engines from crawling your site (i.e. it is not a firewall, or a kind of password protection) and the fact that you put a robots.txt file is something like putting a note “Please, do not enter” on an unlocked door – e.g. you cannot prevent thieves from coming in but the good guys will not open to door and enter. That is why we say that if you have really sen sitive data, it is too naïve to rely on robots.txt to protect it from being indexed and displayed in search results.

Page 2 of 2 FirstFirst 12

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  

  Find Web Hosting      
  Shared Web Hosting UNIX & Linux Web Hosting Windows Web Hosting Adult Web Hosting
  ASP ASP.NET Web Hosting Reseller Web Hosting VPS Web Hosting Managed Web Hosting
  Cloud Web Hosting Dedicated Server E-commerce Web Hosting Cheap Web Hosting


Premium Partners:


Visit forums.thewebhostbiz.com: to discuss the web hosting business, buy and sell websites and domain names, and discuss current web hosting tools and software.