take vs first performance in Ruby on Rails

Tags:

This is a question regarding ActiveRecord query methods:

first Find the first record (or first N records if a parameter is supplied). If no order is defined it will order by primary key.
take Gives a record (or N records if a parameter is supplied) without any implied order. The order will depend on the database implementation. If an order is supplied it will be respected.

usecase: retrieve record from database based on unique attribute, example.

User.where(email: '[email protected]')

here, first generates

SELECT "users".* FROM "users" WHERE "users"."email" = '[email protected]' ORDER BY "users"."id"` ASC LIMIT 1

take generates

SELECT "users".* FROM "users" WHERE "users"."email" = '[email protected]' LIMIT 1

so as seen above first adds additional ordering clause. I am wondering if there a performance difference between take vs first.

Is take faster than first or vice-versa?

976

asked Aug 28 '13 19:08

CuriousMind

1 Answers

In general "take" will be faster, because the database does not have to identify all of the rows that meet the criteria and then sort them and find the lowest-sorting row. "take" allows the database to stop as soon as it has found a single row.

The degree to which it is faster is going to vary according to:

How much time is saved in not having to look for more than one row. The worst case here is where a full scan of a large table is required, but one matching row is found very early in the scan. "take" would allow the scan to be stopped.
How many rows would need to be sorted to find the one with the lowest id. The worst case here is where every row in the table matches the criteria and needs to be included in the sort.

There are some other factors to consider -- for example for a "first" query the optimiser might be able to access the table via a scan of the primary key index and check each row to see if it matches the condition. If there is a very high likelihood of that then both a complete scan of the data and a sort can be avoided if the query optimiser is sophisticated enough.

In many cases, where there are very few matching records and index-based access to find them, you'll find that the difference is trivial (where there is a unique index on "email" in your example). However, I would still use "take" in preference to first even then.

Edit: I'll just add, though it's a little off-topic, that in your example you might as well use:

User.find_by(email: '[email protected]')

The generated query should be exactly the same as for take, but the semantics are a bit more clear I think.

172

answered Oct 13 '22 10:10

David Aldridge

Related questions
                            
                                Filtering by window function result in Postgresql
                            
                                Postgres "missing FROM-clause entry" error on query with WITH clause
                            
                                Entity-Attribute-Value Table Design
                            
                                Find longest matching ngrams in MySQL
                            
                                How can I create a unique index in Oracle but ignore nulls?
                            
                                Why is selecting specified columns, and all, wrong in Oracle SQL?
                            
                                Sql Conditional Not Null Constraint
                            
                                Import CSV into SQL Server (including automatic table creation) [duplicate]
                            
                                Is this a good way to model address information in a relational database?
                            
                                SQL/Database Views in Grails
                            
                                SQL server profiler not showing LINQ To Sql queries
                            
                                Postgres: define a default value for CAST failures?
                            
                                PostgreSQL table variable
                            
                                What value could I insert into a bit type column?
                            
                                What is datetime2?
                            
                                Can you replace or update a SQL constraint?
                            
                                Is it faster to check if length = 0 than to compare it to an empty string?
                            
                                Will multiple calls to `now()` in a single postgres query always give same result?
                            
                                UPDATE with CASE and IN - Oracle
                            
                                UPDATE Query without WHERE Clause

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

take vs first performance in Ruby on Rails

Tags:

sql

ruby-on-rails

ruby-on-rails-4

activerecord

CuriousMind

People also ask

1 Answers

David Aldridge

Recent Activity

Donate For Us