404 vs 403 when directory index is missing

Tags:

This is mostly a philosophical question about the best way to interpret the HTTP spec. Should a directory with no directory index (e.g. index.html) return 404 or 403? (403 is the default in Apache.)

For example, suppose the following URLs exist and are accessible:

http://example.com/files/file_1/
http://example.com/files/file_2/

But there's nothing at:

http://example.com/files/

(Assume we're using 301s to force trailing slashes for all URLs.)

I think several things should be taken into account:

By default, Apache returns 403 in this scenario. That's significant to me. They've thought about this stuff, and they made the decision to use 403.
According to W3C, 403 means "The server understood the request, but is refusing to fulfill it." I take that to mean you should return 403 if the URL is meaningful but nonetheless forbidden.
403 might result in information disclosure if the client correctly guesses that the URL maps to a real directory on disk.
http://example.com/files/ isn't a resource, and the fact that it internally maps to a directory shouldn't be relevant to the status code.
If you interpret the URL scheme as defining a directory structure from the client's perspective, the internal implementation is still irrelevant, but perhaps the outward appearance should indeed have some bearing on the status codes. Maybe, even if you created the same URL structure without using directories internally, you should still use 403s, because it's about the client's perception of a directory structure.

In the balance, what do you think is the best approach? Should we just say "a resource is a resource, and if it doesn't exist, it's a 404?" Or should we say, "if it has slashes, it looks like a directory to the client, and therefore it's a 403 if there's no index?"

If you're in the 403 camp, do you think you should go out of your way to return 403s even if the internal implementation doesn't use directories? Suppose, for example, that you have a dynamic web app with this URL: http://example.com/users/joe, which maps to some code that generates the profile page for Joe. Assuming you don't write something that lists all users, should http://example.com/users/ return 403? (Many if not all web frameworks return 404 in this case.)

759

asked Feb 22 '11 07:02

rlkw1024

2 Answers

The first step to answering this is to refer to RFC 2616: HTTP/1.1. Specifically the sections talking about 403 Forbidden and 404 Not Found.

10.4.4 403 Forbidden

The server understood the request, but is refusing to fulfill it. Authorization will not help and the request SHOULD NOT be repeated. If the request method was not HEAD and the server wishes to make public why the request has not been fulfilled, it SHOULD describe the reason for the refusal in the entity. If the server does not wish to make this information available to the client, the status code 404 (Not Found) can be used instead.

10.4.5 404 Not Found

The server has not found anything matching the Request-URI. No indication is given of whether the condition is temporary or permanent. The 410 (Gone) status code SHOULD be used if the server knows, through some internally configurable mechanism, that an old resource is permanently unavailable and has no forwarding address. This status code is commonly used when the server does not wish to reveal exactly why the request has been refused, or when no other response is applicable.

My interpretation of this is that 404 is the more general error code that just says "there's nothing there". 403 says "there's nothing there, don't try again!".

One reason why Apache might return 403 on directories without explicit index files is that auto-indexing (i.e. listing all files in it) is disabled (a.k.a "forbidden"). In that case saying "listing all files in this directory is forbidden" makes more sense than saying "there is no directory".

196

answered Sep 22 '22 10:09

Joachim Sauer

Another argument why 404 is preferable: google webmaster tools.

Indeed, for a 404, Google Webmaster Tool displays the referer (allowing you to clean up the bad link to the directory), whereas for a 403, it doesn't display it.

answered Sep 22 '22 10:09

Alain Knaff

Related questions
                            
                                Forward Headers not working in Spring Boot + Kubernetes
                            
                                Parse a String containing multipart/form-data request body in Java
                            
                                WSGI/Django: pass username back to Apache for access log
                            
                                How do I run Apache (httpd) and Tomcat together?
                            
                                URL rewriting for different protocols in .htaccess
                            
                                Is there any harm in running PHP and Ruby on the same server?
                            
                                gunicorn via mod_proxy is redirecting outside of the project's scope, despite ProxyPassReverse
                            
                                Handling log and configuration files when load balancing apache
                            
                                Making a two way SSL authentication between apache httpd reverse proxy and Tomcats
                            
                                Always send Content-Length in Apache?
                            
                                Warning: ob_start() [ref.outcontrol]: output handler 'ob_gzhandler' conflicts with zlib output compression'
                            
                                Apache or nginx ? I like to understand the basic working flow of Nginx , its advantage and disadvantage
                            
                                OSX 10.10 Yosemite apache upgrade to 2.4, server side includes no longer works
                            
                                Ubuntu , Apache2 , Django ) Fatal Python error: Py_Initialize: Unable to get the locale encoding ImportError: No module named 'encodings'
                            
                                How to Run my Asp.net Core application on Apache server on Windows?
                            
                                Custom 404 shows "File not found" for php files instead of redirecting
                            
                                How to call R from within a web server (like Apache)?
                            
                                HttpServletRequest.getRemoteAddr() returning wrong address [duplicate]
                            
                                How do I add paths to the Apache PATH variable?
                            
                                PHP emitting 500 on errors - where is this documented?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

404 vs 403 when directory index is missing

Tags:

http

http-status-code-403

http-status-codes

apache

http-status-code-404

rlkw1024

People also ask

2 Answers

Joachim Sauer

Alain Knaff

Recent Activity

Donate For Us