<ol> <li>Is it better to use meta tags* or the robots.txt file for informing spiders/crawlers to include or exclude a page?</li> <li>Are there any issues in using both the meta tags and the robots.txt?</li> </ol> *Eg: <code><#META name="robots" content="index, follow"></code>

Both are supported by all crawlers which respect webmasters wishes. Not all do, but against them neither technique is sufficient. You can use robots.txt rules for general things, like disallow whole sections of your site. If you say <code>Disallow: /family</code> then all links starting with <code>/family</code> are not indexed by a crawler. Meta tag can be used to disallow a single page. Pages disallowed by meta tags do not affect sub pages in the page hierarchy. If you have meta disallow tag on <code>/work</code>, it does not prevent a crawler from accessing <code>/work/my-publications</code> if there is a link to it on an allowed page.

Meta tag vs robots.txt

2 Answers

There is one significant difference. According to Google they will still index a page behind a robots.txt DENY, if the page is linked to via another site.

However, they will not if they see a metatag:

While Google won't crawl or index the content blocked by robots.txt, we might still find and index a disallowed URL from other places on the web. As a result, the URL address and, potentially, other publicly available information such as anchor text in links to the site can still appear in Google search results. You can stop your URL from appearing in Google Search results completely by using other URL blocking methods, such as password-protecting the files on your server or using the noindex meta tag or response header.

111

answered Sep 20 '22 18:09

user2696762

Both are supported by all crawlers which respect webmasters wishes. Not all do, but against them neither technique is sufficient.

You can use robots.txt rules for general things, like disallow whole sections of your site. If you say Disallow: /family then all links starting with /family are not indexed by a crawler.

Meta tag can be used to disallow a single page. Pages disallowed by meta tags do not affect sub pages in the page hierarchy. If you have meta disallow tag on /work, it does not prevent a crawler from accessing /work/my-publications if there is a link to it on an allowed page.

answered Sep 19 '22 18:09

jmz

Related questions
                            
                                MVC: How to route /sitemap.xml to an ActionResult?
                            
                                Google Not Showing React-Helmet Title And Description
                            
                                How does the Android Market search engine work? [closed]
                            
                                Correct microdata markup for breadcrumbs
                            
                                How do you handle HTML Metadata in Progressive Web Apps (PWA)
                            
                                Using node.js to serve content from a Backbone.js app to search crawlers for SEO
                            
                                Does inline CSS and JavaScript really affect site SEO?
                            
                                HTTP status code for overloaded server
                            
                                Apache Redirect 301 fails when using GET parameters, such as ?blah=
                            
                                Internationalization and Search Engine Optimization
                            
                                Is text-indent: -9999px a bad technique for replacing text with images, and what are the alternatives?
                            
                                How can I create custom SEO-friendly URLs in OpenCart?
                            
                                Remove index.php?route=common/home from OpenCart
                            
                                Doing links like Twitter, Hash-Bang #! URL's [duplicate]
                            
                                Should I use <meta name="author" content="Your Name" /> or <link rel="author" href="http://mysite.com/about/" />?
                            
                                AngularJS SEO for static webpages (S3 CDN)
                            
                                Is including <meta name="fragment" content="!"> harmful for pages with hashbang?
                            
                                html5: titles in sectioning elements - document outline and SEO implications
                            
                                What is the purpose of meta tag MSSmartTagsPreventParsing?
                            
                                Validation error: "The itemprop attribute was specified, but the element is not a property of any item"

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Meta tag vs robots.txt

Tags:

meta-tags

seo

robots.txt

keruilin

People also ask

2 Answers

user2696762

jmz

Recent Activity

Donate For Us