The project that I'm working on is using MySQL on RDS (mysql2 gem specifically). When I use a hash of conditions including a range in a <code>where</code> statement I'm getting a bit of an odd addition to my query. <pre class="prettyprint"><code>User.where(id: [1..5]) </code></pre> and <pre class="prettyprint"><code>User.where(id: [1...5]) </code></pre> Result in the following queries respectively: <pre class="prettyprint"><code>SELECT `users`.* FROM `users` WHERE ((`users`.`id` BETWEEN 1 AND 5 OR 1=0)) SELECT `users`.* FROM `users` WHERE ((`users`.`id` >= 1 AND `users`.`id` < 5 OR 1=0)) </code></pre> The queries work perfectly fine since <code>OR FALSE</code> is effectively a no-op. I'm just wondering why Rails or ARel is adding this snippet into the query. <h3>EDIT</h3> It looks like the line that could explain this is line 26 in <code>ActiveRecord::PredicateBuilder</code>. Still no idea how the hash could be <code>empty?</code> at that point but maybe someone else does. <h3>EDIT 2</h3> This is intersting. I was looking into Filip's comment to see why he made it since it seems just like a clarification but he is correct that <code>1..5 != [1..5]</code>. The former is an inclusive range from 1 to 5 where as the latter is an array whose first element is the former. I tried putting these into an ARel <code>where</code> call to see the SQL produced and the <code>OR 1=0</code> is not there! <pre class="prettyprint"><code>User.where(id: 1..5) #=> SELECT "users".* FROM "users" WHERE ("users"."id" BETWEEN 1 AND 5) User.where(id: 1...5) #=> SELECT "users".* FROM "users" WHERE ("users"."id" >= 1 AND "users"."id" < 5) </code></pre> While I still do not know why ARel is adding the <code>OR 1=0</code> which will always be false and seemingly unnecessary. It may be due to how <code>Array</code>s and <code>Range</code>s are handled differently.

This is strictly speaking a guess, since I did something similar in a project of my own (although I used <code>AND 1</code>). For whatever reason, when generating a query, it is easier to always have a <code>WHERE</code> clause containing a no-op than it is to conditionally generate the <code>WHERE</code> clause at all. That is, if you don't include any <code>where</code> sections it will end up generating something still valid. On the other hand, I'm not sure why it's taking this form: when I did it I use <code>1 [<AND (generated code)>...]</code> it allowed arbitrary chaining, but I don't see how what you're seeing would allow it. None the less, I still think it likely to be a result of an algorithmic code generation scheme.

Why is Rails is adding `OR 1=0` to queries using the where clause hash syntax with a range?

Tags:

mysql

ruby-on-rails

between

The project that I'm working on is using MySQL on RDS (mysql2 gem specifically).

When I use a hash of conditions including a range in a where statement I'm getting a bit of an odd addition to my query.

User.where(id: [1..5])

and

User.where(id: [1...5])

Result in the following queries respectively:

SELECT `users`.* FROM `users` WHERE ((`users`.`id` BETWEEN 1 AND 5 OR 1=0))
SELECT `users`.* FROM `users` WHERE ((`users`.`id` >= 1 AND `users`.`id` < 5 OR 1=0))

The queries work perfectly fine since OR FALSE is effectively a no-op. I'm just wondering why Rails or ARel is adding this snippet into the query.

EDIT

It looks like the line that could explain this is line 26 in ActiveRecord::PredicateBuilder. Still no idea how the hash could be empty? at that point but maybe someone else does.

EDIT 2

This is intersting. I was looking into Filip's comment to see why he made it since it seems just like a clarification but he is correct that 1..5 != [1..5]. The former is an inclusive range from 1 to 5 where as the latter is an array whose first element is the former. I tried putting these into an ARel where call to see the SQL produced and the OR 1=0 is not there!

User.where(id: 1..5) #=> SELECT "users".* FROM "users"  WHERE ("users"."id" BETWEEN 1 AND 5)
User.where(id: 1...5) #=> SELECT "users".* FROM "users"  WHERE ("users"."id" >= 1 AND "users"."id" < 5)

While I still do not know why ARel is adding the OR 1=0 which will always be false and seemingly unnecessary. It may be due to how Arrays and Ranges are handled differently.

754

asked Feb 19 '14 16:02

Aaron

2 Answers

Building on the fact, which you've discovered, that [1..5] is not the correct way to specify the range... I have discovered why [1..5] behaves as it does. To get there, I first found that an empty array in a hash condition produces the 1=0 SQL condition:

User.where(id: []).to_sql
# => "SELECT \"users\".* FROM \"users\"  WHERE 1=0"

And, if you check the ActiveRecord::PredicateBuilder::ArrayHandler code, you'll see that array values are always partitioned into ranges and other values.

ranges, values = values.partition { |v| v.is_a?(Range) }

This explains why you don't see the 1=0 when using non-range values. That is, the only way to get 1=0 from an array without including a range is to supply an empty array, which yields the 1=0 condition, as shown above. And when all the array has in it is a range you're going to get the range conditions (ranges) and, separately, an empty array condition (values) executed. My guess is that there isn't a good reason for this... it just simply is easier to let this be than to avoid it (since the result set is equivalent either way). If the partition code was a bit smarter then it wouldn't have to tack on the additional, empty values array and could skip the 1=0 condition.

As for where the 1=0 comes from in the first place... I think that comes from the database adapter, but I couldn't find exactly where. However, I would call it an attempt to fail to find a record. In other words, WHERE 1=0 isn't ever going to return any users, which makes sense over alternative SQL like WHERE id=null which will find any users whose id is null (realizing that this isn't really correct SQL syntax). And this is what I'd expect when attempting to find all Users whose id is in the empty set (i.e. we're not asking for nil ids or null ids or whatever). So, in my mind, leaving the bit about exactly where 1=0 comes from as a black box is OK. At least we now can reason about why the range inside of the array is causing it to show up!

UPDATE

I've also found that, even when using ARel directly, you can still get 1=0:

User.arel_table[:id].in([]).to_sql
# => "1=0"

104

answered Sep 17 '22 13:09

pdobb

This is strictly speaking a guess, since I did something similar in a project of my own (although I used AND 1).

For whatever reason, when generating a query, it is easier to always have a WHERE clause containing a no-op than it is to conditionally generate the WHERE clause at all. That is, if you don't include any where sections it will end up generating something still valid.

On the other hand, I'm not sure why it's taking this form: when I did it I use 1 [<AND (generated code)>...] it allowed arbitrary chaining, but I don't see how what you're seeing would allow it. None the less, I still think it likely to be a result of an algorithmic code generation scheme.