I am trying to set Apache environment variables (for use in PHP) with the [E=VAR:VAL]
flag on RewriteRule rules in an .htaccess file.
I have already discovered the variables are accessed in PHP as server variables $_SERVER
rather than $_ENV
(which makes a certain amount of sense). However, my problem is for some rules the [E=VAR:VAL]
flag works as expected and I end up with a variable $_SERVER['VAR']
but for other rules I end with a variable $_SERVER['REDIRECT_VAR']
or $_SERVER['REDIRECT_REDIRECT_VAR']
, etc
A. What causes an environment variable set in Apache using the [E=VAR:VAL]
flag to get renamed by having "REDIRECT_" prepended to the variable name?
B. What can I do to make sure I end up with an Environment Variable with an unchanged name so I can access it in PHP as $_SERVER['VAR']
without having to resort to checking for variations of the variable name having one of more instances of "REDIRECT_" prepended to it?
Partial solution found. Adding the following to the start of the rewrite rules recreates the original ENV:VAR on each redirect (as well as leaving the REDIRECT_VAR versions there) if they're needed:
RewriteCond %{ENV:REDIRECT_VAR} !^$ RewriteRule .* - [E=VAR:%{ENV:REDIRECT_VAR}]
There are two kinds of environment variables that affect the Apache HTTP Server. First, there are the environment variables controlled by the underlying operating system. These are set before the server starts. They can be used in expansions in configuration files, and can optionally be passed to CGI scripts and SSI using the PassEnv directive.
Apache ordinarily labels this text according to the character set which it uses, which is ISO-8859-1. However, if the redirection is to a page that uses a different character set, some broken browser versions will try to use the character set from the redirection text rather than the actual page.
Second, the Apache HTTP Server provides a mechanism for storing information in named variables that are also called environment variables. This information can be used to control various operations such as logging or access control.
First, there are the environment variables controlled by the underlying operating system. These are set before the server starts. They can be used in expansions in configuration files, and can optionally be passed to CGI scripts and SSI using the PassEnv directive.
This behavior is unfortunate and doesn't even appear to be documented.
Here's what appears to happen in .htaccess
per-directory (per-dir) context:
Assume that Apache processes an .htaccess
file that includes rewrite directives.
Apache populates its environment variable map with all of the standard CGI / Apache variables
Rewriting begins
Environment variables are set in RewriteRule
directives
When Apache stops processing the RewriteRule
directives (because of an L
flag or the end of the ruleset) and the URL has been changed by a RewriteRule
, Apache restarts request processing.
If you're not familiar with this part, see the L
flag documentation:
thus the ruleset may be run again from the start. Most commonly this will happen if one of the rules causes a redirect - either internal or external - causing the request process to start over.
From what I can observe, I believe that when #4 happens, #1 is repeated, then the environment variables that were set in RewriteRule
directives are prepended with REDIRECT_
and added to the environment vars map (not necessarily in that order, but the end result consisting of that combination).
This step is where the chosen variable names are wiped out, and in a moment I will explain why that is so important and inconvenient.
When I originally ran into this issue, I was doing something like the following in .htaccess
(simplified):
RewriteCond %{HTTP_HOST} (.+)\.projects\. RewriteRule (.*) subdomains/%1/docroot/$1 RewriteRule (.+/docroot)/ - [L,E=EFFECTIVE_DOCUMENT_ROOT:$1]
If I were to set the environment variable in the first RewriteRule
, Apache would restart the rewriting process and prepend the variable with REDIRECT_
(steps #4 & 5 above), thus I'd lose access to it via the name I assigned.
In this case, the first RewriteRule
changes the URL, so after both RewriteRule
s are processed, Apache restarts the procedure and processes the .htaccess
again. The second time, the first RewriteRule
is skipped because of the RewriteCond
directive, but the second RewriteRule
matches, sets the environment variable (again), and, importantly, doesn't change the URL. So the request / rewriting process does not start over, and the variable name I chose sticks. In this case I actually have both REDIRECT_EFFECTIVE_DOCUMENT_ROOT
and EFFECTIVE_DOCUMENT_ROOT
. If I were to use an L
flag on the first RewriteRule
, I'd only have EFFECTIVE_DOCUMENT_ROOT
.
@trowel's partial solution works similarly: the rewrite directives are processed again, the renamed variable is assigned to the original name again, and if the URL does not change, the process is over and the assigned variable name sticks.
Both of those techniques suffer from a major flaw: when the rewrite rules in the .htaccess
file where you set environment variables rewrite the URL to a more deeply nested directory that has an .htaccess
file that does any rewriting, your assigned variable name is wiped out again.
Say you have a directory layout like this:
docroot/ .htaccess A.php B.php sub/ .htaccess A.php B.php
And a docroot/.htaccess
like this:
RewriteRule ^A\.php sub/B.php [L] RewriteRule .* - [E=MAJOR:flaw]
So you request /A.php
, and it's rewritten to sub/B.php
. You still have your MAJOR
variable.
However, if you have any rewrite directives in docroot/sub/.htaccess
(even just RewriteEngine Off
or RewriteEngine On
), your MAJOR
variable disappears. That's because once the URL is rewritten to sub/B.php
, docroot/sub/.htaccess
is processed, and if it contains any rewrite directives, rewrite directives in docroot/.htaccess
are not processed again. If you had a REDIRECT_MAJOR
after docroot/.htaccess
was processed (e.g. if you omit the L
flag from the first RewriteRule
), you'll still have it, but those directives won't run again to set your chosen variable name.
So, say you want to:
set environment variables in RewriteRule
directives at a particular level of the directory tree (like docroot/.htaccess
)
have them available in scripts at deeper levels
have them available with the assigned names
be able to have rewrite directives in more deeply nested .htaccess
files
A possible solution is to use RewriteOptions inherit
directives in the more deeply nested .htaccess
files. That allows you to re-run the rewrite directives in less deeply nested files and use the techniques outlined above to set the variables with the chosen names. However, note that this increases complexity because you have to be more careful crafting the rewrite directives in the less deeply nested files so that they don't cause problems when run again from the more deeply nested directories. I believe Apache strips the per-dir prefix for the more deeply nested directory and runs the rewrite directives in the less deeply nested files on that value.
As far as I can see, support for using a construct like %{ENV:REDIRECT_VAR}
in the value component of a RewriteRule
E
flag (e.g. [E=VAR:%{ENV:REDIRECT_VAR}]
) does not appear to be documented:
VAL may contain backreferences ($N or %N) which will be expanded.
It does appear to work, but if you want to avoid relying on something undocumented (please correct me if I'm wrong about that), it can easily be done this way instead:
RewriteCond %{ENV:REDIRECT_VAR} (.+) RewriteRule .* - [E=VAR:%1]
I don't recommend relying on this, because it doesn't seem to be consistent with the documented behavior (see below), but this (in docroot/.htaccess
, with Apache 2.2.20) works for me:
SetEnvIf REDIRECT_VAR (.+) VAR=$1
Only those environment variables defined by earlier SetEnvIf[NoCase] directives are available for testing in this manner.
I don't know what the rationale for prefixing these names with REDIRECT_
is -- not surprising, since it doesn't appear to be mentioned in the Apache documentation sections for mod_rewrite directives, RewriteRule
flags, or environment variables.
At the moment it seems like a big nuisance to me, in the absence of an explanation for why it's better than leaving the assigned names alone. The lack of documentation only contributes to my skepticism about it.
Being able to assign environment variables in rewrite rules is useful, or at least, it would be. But the usefulness is greatly diminished by this name-changing behavior. The complexity of this post illustrates how nuts this behavior and the hoops that have to be jumped through to try to overcome it are.
I haven't tested this at all and I know it doesn't address points A or B, but there is some description of this issue in the comments in PHP documentation and some possible solutions for accessing these variables using $_SERVER['VAR']
:
http://www.php.net/manual/en/reserved.variables.php#79811
EDIT - some more responses to the question offered:
A: The environment variables are renamed by Apache if they are involved in a redirect. For example, if you have the following rule:
RewriteRule ^index.php - [E=VAR1:'hello',E=VAR2:'world']
Then you may access VAR1 and VAR2 using $_SERVER['VAR1']
and $_SERVER['VAR2']
. However, if you redirect the page like so:
RewriteRule ^index.php index2.php [E=VAR1:'hello',E=VAR2:'world']
Then you must use $_SERVER['REDIRECT_VAR1']
, etc.
B: The best way to overcome this issue is to process the variables that you're interested in using PHP. Create a function that runs through the $_SERVER
array and finds the items that you need. You might even use a function like this:
function myGetEnv($key) { $prefix = "REDIRECT_"; if(array_key_exists($key, $_SERVER)) return $_SERVER[$key]; foreach($_SERVER as $k=>$v) { if(substr($k, 0, strlen($prefix)) == $prefix) { if(substr($k, -(strlen($key))) == $key) return $v; } } return null; }
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With