I'm currently working on an application built with Express (Node.js) and I want to know the smartest way to serve a different robots.txt for different environments (development, production).
This is what I have right now, but I'm not convinced by the solution; I think it is dirty:
app.get '/robots.txt', (req, res) ->
  res.set 'Content-Type', 'text/plain'
  if app.settings.env == 'production'
    res.send 'User-agent: *\nDisallow: /signin\nDisallow: /signup\nDisallow: /signout\nSitemap: /sitemap.xml'
  else
    res.send 'User-agent: *\nDisallow: /'
(NB: it is CoffeeScript)
There should be a better way. How would you do it?
Thank you.
You may want to block URLs in robots.txt to keep crawlers away from private photos, expired special offers, or other pages that you're not ready for users to access. Blocking URLs this way can also help your SEO efforts by focusing crawl activity on the pages that matter.
A robots.txt file tells search engine crawlers which URLs the crawler can access on your site. This is used mainly to avoid overloading your site with requests; it is not a mechanism for keeping a web page out of Google. To keep a web page out of Google, block indexing with noindex or password-protect the page.
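If the goal is to keep a page out of search results rather than merely uncrawled, one option is to send a noindex directive from Express itself. The sketch below is illustrative only and not part of the original answers; it assumes an Express 4 app, borrows the /signin route from the question, and uses the standard X-Robots-Tag response header to express noindex.

// Hypothetical sketch: mark a page as noindex so crawlers drop it from their index.
// Blocking the URL in robots.txt alone would only stop them from crawling it.
app.get('/signin', function (req, res) {
  // X-Robots-Tag is honored by major search engines and works for any content type.
  res.set('X-Robots-Tag', 'noindex');
  res.render('signin'); // assumes a view named 'signin' exists
});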
Use a middleware function. This way the robots.txt will be handled before any session, cookieParser, etc:
app.use('/robots.txt', function (req, res, next) {
  res.type('text/plain');
  res.send("User-agent: *\nDisallow: /");
});
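To make the ordering point concrete, here is a hedged sketch of how that handler might sit in front of the rest of the middleware stack; the cookie-parser and express-session packages are only examples I'm assuming here, not part of the original answer.

// Hypothetical ordering sketch: because the robots.txt handler is registered first
// and ends the response without calling next(), requests for /robots.txt never
// reach the heavier middleware registered below it.
var cookieParser = require('cookie-parser');
var session = require('express-session');

app.use('/robots.txt', function (req, res, next) {
  res.type('text/plain');
  res.send("User-agent: *\nDisallow: /");
});

app.use(cookieParser());
app.use(session({ secret: 'example-secret', resave: false, saveUninitialized: false }));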
With Express 4, app.get now gets handled in the order it appears, so you can just use that:
app.get('/robots.txt', function (req, res) {
  res.type('text/plain');
  res.send("User-agent: *\nDisallow: /");
});
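To tie this back to the original question about environments, here is a hedged sketch (an extension of this answer, not part of it) that picks the response body based on Express's env setting; the two bodies are taken from the question's CoffeeScript.

// Hypothetical sketch: serve the permissive rules in production and block everything
// elsewhere. app.get('env') reads NODE_ENV and defaults to 'development'.
app.get('/robots.txt', function (req, res) {
  res.type('text/plain');
  if (app.get('env') === 'production') {
    res.send('User-agent: *\nDisallow: /signin\nDisallow: /signup\nDisallow: /signout\nSitemap: /sitemap.xml');
  } else {
    res.send('User-agent: *\nDisallow: /');
  }
});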
Create a robots.txt file with the following content:

User-agent: *
Disallow: # your rules here

Put it in your project's public/ directory and serve that directory statically:

app.use(express.static('public'))

Your robots.txt will then be available to any crawler at http://yoursite.com/robots.txt
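If you still need different rules per environment with this static-file approach, one possibility (my assumption, not part of the original answer) is to keep one file per environment and pick it with res.sendFile, which is available in Express 4.8+; the robots/ directory and file names below are hypothetical.

// Hypothetical layout: robots/production.txt and robots/development.txt next to this file.
// res.sendFile needs an absolute path, hence the path.join with __dirname, and it sets
// Content-Type to text/plain automatically from the .txt extension.
var path = require('path');

app.get('/robots.txt', function (req, res) {
  var env = app.get('env') === 'production' ? 'production' : 'development';
  res.sendFile(path.join(__dirname, 'robots', env + '.txt'));
});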