  1. #1
    Registered User
    Join Date
    Jul 2018
    Posts
    152

    What is robots.txt?


  2. #2
    Registered User
    Join Date
    Jun 2018
    Posts
    1,416
    Robots.txt is one way of telling search engine bots which web pages on your website you do not want them to visit. It is useful for keeping crawlers away from the parts of a site that the owner does not want fetched; on its own it does not guarantee that those pages stay out of the index.

  3. #3
    Registered User
    Join Date
    Jan 2019
    Posts
    529
    Robots.txt is a plain text file that you place in the root of your site to tell search engines which pages you don't want them to visit.

  4. #4
    Senior Member
    Join Date
    Aug 2015
    Location
    Dubai
    Posts
    512
    Originally posted by pharmasecure:
    Robots.txt is one way of telling search engine bots which web pages on your website you do not want them to visit. It is useful for keeping crawlers away from the parts of a site that the owner does not want fetched; on its own it does not guarantee that those pages stay out of the index.
    100% right.

  5. #5
    Member
    Join Date
    Feb 2019
    Posts
    63
    The robots.txt file, which implements the robots exclusion protocol (also called the robots exclusion standard), is a document that tells internet robots (most often search engine crawlers) which pages on your website they may crawl and which they may not. For example, a lone slash after "Disallow" (that is, "Disallow: /") tells the robot not to visit any pages on the website.
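
To see what the lone slash does, here is a minimal sketch using Python's standard-library parser. The bot name "ExampleBot", the example.com URL, and the helper name is_allowed are placeholders made up for illustration, and real crawlers may interpret rules somewhat differently from this parser.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if user_agent may fetch url under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# "Disallow: /" blocks every path; an empty "Disallow:" blocks nothing.
BLOCK_ALL = "User-agent: *\nDisallow: /\n"
ALLOW_ALL = "User-agent: *\nDisallow:\n"

if __name__ == "__main__":
    page = "https://example.com/about.html"  # placeholder URL
    print(is_allowed(BLOCK_ALL, "ExampleBot", page))  # False
    print(is_allowed(ALLOW_ALL, "ExampleBot", page))  # True
```

Leaving the value after "Disallow:" empty is the opposite case: nothing is blocked.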

  6. #6
    Senior Member
    Join Date
    Nov 2018
    Posts
    1,853
    The robots.txt file is very important. Using a robots.txt file with a Disallow directive, we can keep bots and search engine crawlers out of a whole website or out of certain folders and files.
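
As a rough illustration of blocking only certain folders and files, the sketch below checks a few paths against a sample rule set with Python's standard-library parser. The folder /private/, the file /drafts/notes.html, the bot name "ExampleBot", and the helper blocked_paths are all hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: block one folder and one single file for all bots.
RULES = """\
User-agent: *
Disallow: /private/
Disallow: /drafts/notes.html
"""


def blocked_paths(robots_txt: str, user_agent: str, paths: list[str]) -> list[str]:
    """Return the subset of paths that user_agent is not allowed to fetch."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [p for p in paths if not parser.can_fetch(user_agent, p)]


if __name__ == "__main__":
    candidates = [
        "/index.html",
        "/private/report.html",
        "/drafts/notes.html",
        "/drafts/other.html",
    ]
    print(blocked_paths(RULES, "ExampleBot", candidates))
```

Rules match by path prefix here, so everything under /private/ is blocked while /drafts/other.html stays crawlable.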

  7. #7
    Senior Member
    Join Date
    Jul 2018
    Location
    Chennai
    Posts
    311
    Robots.txt is used primarily to manage crawler traffic to your site, and occasionally to keep a page off Google, depending on the file type. It is a text file that webmasters create to instruct web robots how to crawl pages on their website. The robots.txt file is part of the robots exclusion protocol (REP), a group of web standards that regulate how robots crawl the web, access and index content, and serve that content up to users.
    Basic format of the robots.txt file:
    User-agent: [user-agent name]
    Disallow: [URL string not to be crawled]
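
One way to fill in that format is sketched below: a small helper that renders User-agent/Disallow groups as text, then checks the result with Python's standard-library parser. The helper build_robots_txt, the bot name "ExampleBot", and the /admin/ path are assumptions for illustration only.

```python
from urllib.robotparser import RobotFileParser


def build_robots_txt(groups: dict[str, list[str]]) -> str:
    """Render {user-agent: [disallowed paths]} in the User-agent/Disallow format."""
    blocks = []
    for agent, paths in groups.items():
        lines = [f"User-agent: {agent}"]
        lines += [f"Disallow: {path}" for path in paths]
        blocks.append("\n".join(lines))
    # Groups are separated by a blank line.
    return "\n\n".join(blocks) + "\n"


if __name__ == "__main__":
    # Hypothetical policy: ExampleBot is blocked entirely,
    # every other bot is only kept out of /admin/.
    text = build_robots_txt({"ExampleBot": ["/"], "*": ["/admin/"]})
    print(text)

    parser = RobotFileParser()
    parser.parse(text.splitlines())
    print(parser.can_fetch("ExampleBot", "/index.html"))
    print(parser.can_fetch("OtherBot", "/index.html"))
    print(parser.can_fetch("OtherBot", "/admin/login"))
```

The generated text would be saved as robots.txt in the site root so that crawlers can find it.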

  8. #8
    Registered User
    Join Date
    Jan 2019
    Posts
    25
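    It prevents a webpage from being crawled by crawlers. Some pages, such as login or credentials pages, are kept away from crawlers with a robots.txt file. Keep in mind that the file is publicly readable and only advisory, so it is not a substitute for real access control.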
