PDA

View Full Version : what is robotxt file?



john12345
04-23-2012, 08:59 AM
what is robotxt file?

jasoncamroon
04-23-2012, 09:16 AM
Robots.txt is a text file (no html) you put on your site to tell search robots which pages you don't want them to visit.By defining a few lines in this text file, you can instruct robots not crawling and indexing of certain files, folders within your site, or at all.

danish00
04-23-2012, 09:18 AM
google crawls your sites with help of robotxt file. without it, your sites cannot crawl the page.

watson123
04-24-2012, 12:23 AM
Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.

Lucy Jackson
04-24-2012, 03:02 AM
robots.txt file is used to direct robot and tell the robots that which file you have to read and which you donot.

Click SSL
04-24-2012, 03:02 AM
Robots.txt is a text file (no html) you put on your site to tell search robots which pages you don't want them to visit.By defining a few lines in this text file, you can instruct robots not crawling and indexing of certain files, folders within your site, or at all.

You make it so easy to understand about robots.txt

Great Job :)

rosesmark
04-24-2012, 04:23 AM
Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called The Robots Exclusion Protocol

sslrenewals
04-24-2012, 05:48 AM
Robort.txt is text file which tells to search engine crawl webpages.

adumpaul
04-24-2012, 05:56 AM
Robots.txt file is a text file that resides on your server, and controls a whole lot of features on your website. It’s a simple text file in which there are a few lines of text, but it’s very powerful that it can even decide whether your website should be shown on Google or not, what part of your website should be shown to the search engines.

Steve Michael
04-27-2012, 02:18 AM
Robotxt file is the file in the website which contains the details of accessable and non-accessable pages of the website crawlers. The search engine crawlers can only crawl into the accessable pages of the website. It enables the privacy of the website.

jasoncamroon
04-27-2012, 05:45 AM
Robot.txt one type of text file and when search engine robot/spider enter of site then this text file pass the information to the search engine robot to leave these page with out crawling.

mnichs23
04-27-2012, 06:18 AM
If you want to hide some pages of your website from search engine then you just need to mention all those pages into this text file and put it as disallow. so, by this way search engine spider will not crawl all those pages which you mention in this file. Also as search engine crawler first go to this file, you can also add your sitemap.xml url here. So, that you can increase the crawling of your website.

alexisrois
04-27-2012, 06:42 AM
Robots.txt file utilization is sometimes ignored. However, it is most essential aspect for any websites being listed effectively & very easy to setup.

Muirenn
04-27-2012, 09:49 AM
Robots.txt File is a regular text(not html)file that you put on your site. When a search engine crawler comes to your site. it will look for a special file on your site That file is called robots.txt File.

ajaykrgc
04-27-2012, 11:37 AM
Hi,
i am agree with your answer

Kevin Farrell
04-27-2012, 12:51 PM
robots.txt is a simple text file which tell search engine what to index in website.

Openxcell Inc
04-30-2012, 02:43 AM
Robots.txt is a text file which instruct crawler about the index and crawling of the pages of website. It is very useful for hiding some important pages of website.

ebriks
04-30-2012, 03:52 AM
what is robotxt file?

Robots.txt is a simple text (not html) file you put on your website root directory to tell search robots which pages you would like them not to visit. By defining a few rules in this text file, you can instruct robots to not crawl certain files, directories within your site, or at all.

john515
05-01-2012, 02:37 AM
Robots.txt file is a set of instructions that tell search engine robots which pages of your site to be crawled and indexed. In most cases, your site is consist of many files or folders i.e. admin folders, cgi-bin, image folder, which are not relevant to the search engines. Robots.txt helps tell spiders what is useful and public for sharing in the search engine indexes and what is not. It should also be noted that not all search spiders will follow your instructions left in the robots.txt file. In addition, a poorly done robots.txt file can stop the search spiders from crawling and indexing your website properly.

johnwilson639
05-01-2012, 03:11 AM
Robot.txt is a text file that tells the google robot ignore these pages from a websites.in a simple manner,if you want to uncrawl any page from search engine,then use a robot.txt file in front of that page.

john12345
05-07-2012, 09:12 AM
Define Seo?

john12345
05-07-2012, 09:15 AM
what is on page optimization?

john12345
05-07-2012, 09:16 AM
Tell me about Penguin Update?

john12345
05-07-2012, 09:17 AM
Define sitemap?

john12345
05-07-2012, 09:18 AM
Tell me differnce between Blackhatseo and whitehatseo?

ChrisDevlin
05-11-2012, 05:03 AM
In robots.txt file we can give command to all search engines bots or spiders, it helps to prevent web crawlers and other web robots from accessing all or part of a website. We can block any search engine bots by giving command in Robots.txt file.

sabrinasai
05-11-2012, 05:39 AM
Robots.txt is a text (not html) file you put on your site to tell search robots which pages you would like them not to visit. Robots.txt is by no means mandatory for search engines but generally search engines obey what they are asked not to do.

Raju
05-11-2012, 08:04 AM
Robot.txt file is the file used to hide the some content or page of our website from the spider.So that Spider not reach your personal data or privacy policy.Suppose you have a website and you don't want disclose privacy of a company then you use the robot.txt file.

ajaykrgc
05-11-2012, 09:12 AM
hi,
It is called spider or crawler and say about web page i.e which page you would like to spide or not

Brian-Alfaro
05-11-2012, 09:29 AM
Helps to protect data over web from Google "not to crawl that".Like JavaScript codes

ilmkidunya1
05-12-2012, 02:23 AM
It is great when search engines frequently visit your site and index your content but often there are cases when indexing parts of your online content is not what you want. For instance, if you have two versions of a page (one for viewing in the browser and one for printing), you'd rather have the printing version excluded from crawling, otherwise you risk being imposed a duplicate content penalty. Also, if you happen to have sensitive data on your site that you do not want the world to see, you will also prefer that search engines do not index these pages (although in this case the only sure way for not indexing sensitive data is to keep it offline on a separate machine). Additionally, if you want to save some bandwidth by excluding images, stylesheets and javascript from indexing, you also need a way to tell spiders to keep away from these items.

eliteinfo
05-12-2012, 06:59 AM
Robotxt is created for informing Google that which file or page it has to craw or which is not...


_______________
Web Design India (http://www.eliteinfoworld.com/services/web-design-service-india/web-design-india)

davidweb09
07-29-2019, 01:31 PM
Robots.txt file help to send instructions to search engines for website indexing. https://www.thekelleyfinancialgroup.com/

vinu
07-31-2019, 04:25 AM
The robots.txt file is primarily used to specify which parts of your website should be crawled by spiders or web crawlers.

AtharavD10
07-31-2019, 04:46 AM
robot.txt is the file used to hide the content or privacy terms to the bot or crawler.

pranav
08-01-2019, 02:04 AM
Robots.txt is a file that tells search engine spiders to not crawl certain pages or sections of a website.

Pooja16
08-03-2019, 05:47 AM
The robots.txt file, also called the robots exclusion protocol or standard, is a text file that tells internet robots (most usually search engines) that pages on your web site to crawl. It additionally tells internet robots that pages to not crawl. The slash once “Disallow” tells the robot to not visit any pages on the site.

http://falconwingsaviation.com/

amarnathsmm
08-03-2019, 09:06 AM
robots.txt disallow doesn't work anymore use meta no index to exclude files from indexing

yuva12
08-03-2019, 09:12 AM
A robots.txt shows which pages or files the Googlebot can or can't request from a website. Webmasters usually use this method to avoid overloading the website with requests.