When creating a simple MongoDB query, I have a question about the ordering of conditions in the query - for example (Mongoose.js syntax): <pre class="prettyprint"><code>conditions = { archived: false, first_name: "Billy" }; </code></pre> vs. <pre class="prettyprint"><code>conditions = { first_name: "Billy", archived: false }; </code></pre> ..in a simple find() function: <pre class="prettyprint"><code>User.find(conditions, function(err, users) { <some logic> }); </code></pre> ..assuming a simple single-key indexing strategy: <pre class="prettyprint"><code>UserSchema.index( { first_name: 1, archived: 1} ); </code></pre> ..does the order of the conditions listed above matter? IMPORTANT: I know the order DOES MATTER for compound indexes, but per above I am curious about single-key index queries. Also interested in cases of totally non-indexed queries since we're here. :) ALTERNATE EXPLANATION: Put another way, assuming 100 <code>User</code>s (50 archived and 50 not), given two possible internal MongoDB searching strategies: <ol> <li>First filter out all 50 of the <code>archived</code> users, then search through the remaining 50 non-archived users with the <code>first_name</code> value of "Billy"</li> <li>First search through ALL 100 <code>User</code> documents for the <code>first_name</code> value "Billy", and then filter the found objects by removing any Billys that are archived. </li> </ol> ..I would assume #1 to be faster (potentially MUCH faster in large queries with more than two conditions). But regardless of which is faster and why, surely one of them is. CORE QUESTION: Outside the vast and powerful world of compound indexes, does MongoDB know how to perform its most performant/quick searches/filters automatically, regardless of which fields and which ordering? Or do we need to tell the system what is best programmatically (via the order of conditions presented, etc)?

I'm a little confused by your question, simply because the index you provide (<code>{ first_name: 1, archived: 1 }</code>) is a compound index. All of the following queries will make use of that compound index: <pre class="prettyprint"><code>conditions = { archived: false, first_name: "Billy" }; conditions = { first_name: "Billy", archived: false }; conditions = { first_name: "Billy" }; </code></pre> Now, let's assume we have two separate indexes, <code>{ first_name: 1 }</code> and <code>{ archived: 1 }</code>. In this case, MongoDB will do query optimization to determine which index is the most efficient to use. You can read more about the query optimization performed by MongoDB here. The MongoDB query optimizer will thus likely use the same index for both of the multicondition queries you provided: <pre class="prettyprint"><code>conditions = { archived: false, first_name: "Billy" }; conditions = { first_name: "Billy", archived: false }; </code></pre> Alternatively, you can use <code>hint</code> to force MongoDB to use an index of your choosing. In general, this is probably not a good idea. You can also manually check which index is the most efficient for a specific query as detailed here. You can see which index a query is using by using the <code>.explain()</code> functionality in the Mongo shell. (If no index is used, you'll see <code>"cursor" : "BasicCursor"</code> in the resulting document. On the other hand, if the compound index is being used, you'll see something like <code>"cursor" : "BtreeCursor first_name_1_archived_1"</code>. If one of the single-field indexes was used, you might see <code>"cursor" : "BtreeCursor archived_1"</code>. Additionally, the search strategy for MongoDB works like this: <ul> <li>first, traverse the index, using the index bounds to filter out as many documents as possible; </li> <li>next, if there are additional predicates that cannot be satisfied using the index, <ul> <li>fetch the document, </li> <li>apply the predicate, </li> <li>and include/exclude the document from the results appropriately.</li> </ul> </li> </ul> The query optimizer runs all possible query plans in parallel and picks the "best" one, however all of the query plans follow the strategy above. (The BasicCursor is a degenerate case: it traverses all of the documents & applies the predicate to each one.) tl;dr? The Matcher is smart enough to match equality predicates when they're presented in any order. Does that make sense?

MongoDB (and Mongoose.js): Does the order of query conditions matter?

Tags:

mongodb

mongoose

When creating a simple MongoDB query, I have a question about the ordering of conditions in the query - for example (Mongoose.js syntax):

conditions = { archived: false, first_name: "Billy" };

vs.

conditions = { first_name: "Billy", archived: false };

..in a simple find() function:

User.find(conditions, function(err, users) { <some logic> });

..assuming a simple single-key indexing strategy:

UserSchema.index( { first_name: 1, archived: 1} );

..does the order of the conditions listed above matter?

IMPORTANT: I know the order DOES MATTER for compound indexes, but per above I am curious about single-key index queries. Also interested in cases of totally non-indexed queries since we're here. :)

ALTERNATE EXPLANATION: Put another way, assuming 100 Users (50 archived and 50 not), given two possible internal MongoDB searching strategies:

First filter out all 50 of the archived users, then search through the remaining 50 non-archived users with the first_name value of "Billy"
First search through ALL 100 User documents for the first_name value "Billy", and then filter the found objects by removing any Billys that are archived.

..I would assume #1 to be faster (potentially MUCH faster in large queries with more than two conditions). But regardless of which is faster and why, surely one of them is.

CORE QUESTION: Outside the vast and powerful world of compound indexes, does MongoDB know how to perform its most performant/quick searches/filters automatically, regardless of which fields and which ordering? Or do we need to tell the system what is best programmatically (via the order of conditions presented, etc)?

494

asked Aug 12 '13 04:08

toblerpwn

1 Answers

I'm a little confused by your question, simply because the index you provide ({ first_name: 1, archived: 1 }) is a compound index. All of the following queries will make use of that compound index:

conditions = { archived: false, first_name: "Billy" };
conditions = { first_name: "Billy", archived: false };
conditions = { first_name: "Billy" };

Now, let's assume we have two separate indexes, { first_name: 1 } and { archived: 1 }. In this case, MongoDB will do query optimization to determine which index is the most efficient to use. You can read more about the query optimization performed by MongoDB here.

The MongoDB query optimizer will thus likely use the same index for both of the multicondition queries you provided:

conditions = { archived: false, first_name: "Billy" };
conditions = { first_name: "Billy", archived: false };

Alternatively, you can use hint to force MongoDB to use an index of your choosing. In general, this is probably not a good idea. You can also manually check which index is the most efficient for a specific query as detailed here.

You can see which index a query is using by using the .explain() functionality in the Mongo shell. (If no index is used, you'll see "cursor" : "BasicCursor" in the resulting document. On the other hand, if the compound index is being used, you'll see something like "cursor" : "BtreeCursor first_name_1_archived_1". If one of the single-field indexes was used, you might see "cursor" : "BtreeCursor archived_1".

Additionally, the search strategy for MongoDB works like this:

first, traverse the index, using the index bounds to filter out as many documents as possible;
next, if there are additional predicates that cannot be satisfied using the index,
- fetch the document,
- apply the predicate,
- and include/exclude the document from the results appropriately.

The query optimizer runs all possible query plans in parallel and picks the "best" one, however all of the query plans follow the strategy above. (The BasicCursor is a degenerate case: it traverses all of the documents & applies the predicate to each one.)

tl;dr? The Matcher is smart enough to match equality predicates when they're presented in any order.

Does that make sense?

answered Oct 20 '22 20:10

Amalia

Related questions
                            
                                Hierarchical queries with Mongo using $graphLookup
                            
                                bson.D vs bson.M for find queries
                            
                                Range based paging mongodb
                            
                                How do I get the date a MongoDB collection was created using MongoDB C# driver?
                            
                                NodeJs Mongoose + Mongo, connecting to localhost
                            
                                Setting up MongoDB river for Elasticsearch
                            
                                How to convert casbah mongodb list to json in scala / play
                            
                                Getting started with Node.js, angular.js and MongoDB, modeling relations and other ramp up tips [closed]
                            
                                Why can't I run explain on MongoDB update?
                            
                                Query on top N rows in mongodb
                            
                                findAndModify - MongoError: exception: must specify remove or update
                            
                                PyMongo/Mongoengine equivalent of mongodump
                            
                                Mongotemplate - Query ObjectId according to greater than (gt) or less than (lt) operator
                            
                                MongoDB / Mongoose: MarkModified a nested object
                            
                                Mongodb Skip() and limit()
                            
                                How can I connect to mongodb using express without mongoose?
                            
                                MockBean annotation in Spring Boot test causes NoUniqueBeanDefinitionException
                            
                                Case insensitive sorting with Mongoid
                            
                                nodejs - mongodb - how to remove a record
                            
                                solr Data Import Handlers for MongoDB

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With