
Stopping index of Github pages


I have a github page from my repository username.github.io

However I do not want Google to crawl my website and absolutely do not want it to show up on search results.

Will just using a robots.txt on GitHub Pages work? I know there are tutorials on stopping a GitHub repository from being indexed, but what about the actual GitHub Pages site?

asked Sep 25 '15 by user2961712


People also ask

How do I restrict access to GitHub Pages?

With access control for GitHub Pages, you can restrict access to your project site by publishing the site privately. A privately published site can only be accessed by people with read access to the repository the site is published from.

Where do I put robots txt in GitHub?

However, if you're using a custom domain with your GitHub Pages site, you can place a robots.txt file at the root of your repo and it will work as expected.

Is GitHub Pages good for SEO?

If your blog or product landing page uses GitHub Pages, it can now be optimized for SEO. By adding a simple {% seo %} tag to your site, GitHub will automatically add SEO metadata to each page, including the page title, description, canonical URL, next/previous URLs, and post metadata.
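
For illustration, a minimal sketch of how this usually looks, assuming a Jekyll-based GitHub Pages site with the jekyll-seo-tag plugin enabled in _config.yml (the layout file name is just an example):

<!-- _layouts/default.html (hypothetical layout file) -->
<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="utf-8">
  {% seo %} <!-- jekyll-seo-tag expands this into title, description, canonical URL, etc. -->
</head>
<body>
  {{ content }}
</body>
</html>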


2 Answers

I don't know if it is still relevant, but Google says you can stop spiders with a meta tag:

<meta name="robots" content="noindex"> 

I'm not sure, however, whether that works for all spiders or only for Google.
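
As a minimal sketch of where that tag goes (the page content here is hypothetical), it belongs in the <head> of every page you want kept out of the index:

<!-- any page you do not want indexed, e.g. index.html -->
<!DOCTYPE html>
<html lang="en">
<head>
  <meta charset="utf-8">
  <title>My page</title>
  <!-- ask crawlers not to index this page; add "nofollow" as well if links should not be followed -->
  <meta name="robots" content="noindex">
</head>
<body>
  <p>This page should stay out of search results.</p>
</body>
</html>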

answered Oct 03 '22 by Gumbo


Short answer:

You can use a robots.txt to stop indexing of your user's GitHub Pages by adding it to your User Page. This robots.txt will be the active robots.txt for all your project pages, since project pages are reachable as subdirectories (username.github.io/project) of your subdomain (username.github.io).


Longer answer:

You get your own subdomain for GitHub Pages (username.github.io). According to this question on MOZ and Google's reference, each subdomain has/needs its own robots.txt.

This means that the valid/active robots.txt for the project projectname by user username lives at username.github.io/robots.txt. You can put a robots.txt file there by creating a GitHub Pages page for your user.

This is done by creating a new project/repository named username.github.io where username is your username. You can now create a robots.txt file in the master branch of this project/repository and it should be visible at username.github.io/robots.txt. More information about project, user and organization pages can be found here.
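
As a rough sketch of the resulting layout (file names other than robots.txt are just examples), the user repository ends up looking like this, with robots.txt in the root so it is served at the root of the subdomain:

username.github.io/          repository named after your username
├── index.html               your User Page content
└── robots.txt               served at https://username.github.io/robots.txt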

I have tested this with Google, confirming ownership of myusername.github.io by placing an HTML file in my project/repository https://github.com/myusername/myusername.github.io/tree/master, creating a robots.txt file there, and then verifying that my robots.txt works using Google's Search Console webmaster tools (googlebot-fetch). Google does indeed list it as blocked, and the Google Search Console webmaster tools (robots-testing-tool) confirm it.

To block robots for one project's GitHub Page:

User-agent: *
Disallow: /projectname/

To block robots for all GitHub Pages for your user (User Page and all Project Pages):

User-agent: *
Disallow: /

Other options

  • Look into the HTML meta tag
  • Look into custom domain (redirects) for GitHub Pages
answered Oct 03 '22 by olavimmanuel