Leave a reply to meta name=robots content=noindex, follow Show
Meta robots tag controls spiders from crawling and indexing your webpage or post. In this post, we will see all types of tags in detail. We can use more than one meta tags in a webpage but its not a good practice because it can cause conflicts. Hence it should occur only once in a webpage. Default Meta robots tagIf you don’t specify any meta tag in a webpage then by default spider will crawl and index your webpage. you can say that it considers below meta tag by default. <meta name="robots" content="index, follow"" /> <meta name="robots" content="none"> Meta robots noindexIt prevents page from being indexed. It means if you specify below meta tag for a page then that page would not come up in search results. For example: you wouldn’t want your author pages to come up in search results so to prevent it you can use below Meta robots tag in author pages. <meta name="robots" content="noindex" /> Meta robots nofollowThis meta tag prevents spiders to follow any link on the page. If you are a newbie then it may be confusing to you. Let me explain you in detail. Whenever spider( Google bot) finds a link (internal or external) on a page, it lands on that webpage for crawling and indexing, to enforce spider for not following any link on the page, we can use this tag. To understand it better you can refer my post Dofollow vs nofollow. <meta name="robots" content="nofollow" /> Meta robots noarchiveYou must have seen in search results that whenever you hover on any result it will show a snapshot of that page at the right side of the Google search (see the below sreenshot!!). That we call a cached copy of the webpage. To prevent a page from being cached we can specify below tag. <meta name="robots" content="noarchive" /> Meta robots noodpPrevents Search engine (Google, Yahoo and MSN) to display its own description taken from its directory, instead of you meta description. Using below tag would ensure that Search engine would display your meta description below search results and it would be useful to improve your CTR. <meta name="robots" content="noodp " /> Meta robots noydirIt is same as above NOODP tag, the only difference is that it would be applicable for Yahoo only. <meta name="robots" content="noydir" /> Best Practice to use noodp and noydir: You can use the below tag for all of your webpages- <meta name="robots" content="noodp, noydir" /> OR (above tag and below two tags- both are same) <meta name="robots" content="noodp " /> <meta name="robots" content="noydir" /> What if page has more than one Meta robots tagWe should use it only once per page and its the best practice, but sometimes because of combination of SEO friendly themes and SEO plugins, we may have multiple instances of such tags. So for such cases lets discuss how spiders (or crawlers) would interpret it. CASE 1: No conflict values in robots meta tag. <meta name="robots" content="noindex" /> <meta name="robots" content="nofollow" /> Above two tags would be taken same as: <meta name="robots" content="noindex, nofollow" /> CASE 2: Conflicts value present in robots meta tag. First instance: <meta name="robots" content="noindex" /> Second instance: <meta name="robots" content="index" /> In this case both values are conflicting as one tell to noindex and other is enforcing spider to index the page. For such cases crawlers considers first instance of conflicts. So search engine bot would take it as <meta name="robots" content="noindex" /> because noindex encountres before than index one. How to block specific search engine botBy specifying meta name as “robots”,you are actually blocking all search engine robots. However if you want to block specific search engine bot then you can specify bot name instead of “robots” BOT Name: Google: GOOGLEBOT Yahoo: SLURP MSN: MSNBOT ASK: TEOMA So, if you want to make your page nofollow for Google bot only then you can use below tag. <meta name="Googlebot" content="noindex" /> Note: In meta name I have given “Googlebot” instead of “robots” to make it applicable for Google search engine bot only. ReferenceMeta robots tag: Google Webmaster blog <Image Credit>
Robots meta directives (sometimes called "meta tags") are pieces of code that provide crawlers instructions for how to crawl or index web page content. Whereas robots.txt file directives give bots suggestions for how to crawl a website's pages, robots meta directives provide more firm instructions on how to crawl and index a page's content. There are two types of robots meta directives: those that are part of the HTML page (like the meta robots tag) and those that the web server sends as HTTP headers (such as x-robots-tag). The same parameters (i.e., the crawling or indexing instructions a meta tag provides, such as "noindex" and "nofollow" in the example above) can be used with both meta robots and the x-robots-tag; what differs is how those parameters are communicated to crawlers. Meta directives give crawlers instructions about how to crawl and index information they find on a specific webpage. If these directives are discovered by bots, their parameters serve as strong suggestions for crawler indexation behavior. But as with robots.txt files, crawlers don’t have to follow your meta directives, so it’s a safe bet that some malicious web robots will ignore your directives. Below are the parameters that search engine crawlers understand and follow when they’re used in robots meta directives. The parameters are not case-sensitive, but do note that it is possible some search engines may only follow a subset of these parameters or may treat some directives slightly differently. Indexation-controlling parameters:
Types of robots meta directivesThere are two main types of robots meta directives: the meta robots tag and the x-robots-tag. Any parameter that can be used in a meta robots tag can also be specified in an x-robots-tag. We’ll talk about both the meta robots and x-robots tag directives below. Meta robots tagThe meta robots tag, commonly known as "meta robots" or colloquially as a "robots tag," is part of a web page’s HTML code and appears as code elements within a web page’s <head> section: Code sample:<meta name="robots" content="[PARAMETER]">While the general <meta name="robots" content="[PARAMETER]"> tag is standard, you can also provide directives to specific crawlers by replacing the "robots" with the name of a specific user-agent. For example, to target a directive specifically to Googlebot, you’d use the following code: <meta name="googlebot" content="[DIRECTIVE]">Want to use more than one directive on a page? As long as they’re targeted to the same "robot" (user-agent), multiple directives can be included in one meta directive – just separate them by commas. Here’s an example: <meta name="robots" content="noimageindex, nofollow, nosnippet">This tag would tell robots not to index any of the images on a page, follow any of the links, or show a snippet of the page when it appears on a SERP. If you’re using different meta robots tag directives for different search user-agents, you’ll need to use separate tags for each bot. X-robots-tagWhile the meta robots tag allows you to control indexing behavior at the page level, the x-robots-tag can be included as part of the HTTP header to control indexing of a page as a whole, as well as very specific elements of a page. While you can use the x-robots-tag to execute all of the same indexation directives as meta robots, the x-robots-tag directive offers significantly more flexibility and functionality that the meta robots tag does not. Specifically, the x-robots permits the use of regular expressions, executing crawl directives on non-HTML files, and applying parameters at a global level. To use the x-robots-tag, you’ll need to have access to either your website’s header .php, .htaccess, or server access file. From there, add your specific server configuration’s x-robots-tag markup, including any parameters. This article provides some great examples of what x-robots-tag markup looks like if you’re using any of these three configurations. Here are a few use cases for why you might employ the x-robots-tag:
SEO best practices with robots meta directives
Improve your technical site health with Moz ProKeep tabs on critical crawler issues (and tons of other details) with the help of Moz Pro's Site Crawl. Take a 30-day free trial on us and see what you can achieve: Start my free trial Keep learningPut your skills to workMoz Pro lets you run crawls, research and report on keyword ranking, and track your site's SEO performance, including its accessibility, over time. Try it >> |