I'm trying to use the aggregation framework with <code>$match</code> and <code>$group</code> stages. Does the <code>$group</code> stage use index data? I'm using the latest available MongoDB version, <code>2.5.4</code>.


MongoDB Aggregation Framework: Does $group use index?

2 Answers

$group does not use index data.

From the MongoDB docs:

<blockquote>
The $match and $sort pipeline operators can take advantage of an index when they occur at the beginning of the pipeline.

The $geoNear pipeline operator takes advantage of a geospatial index. When using $geoNear, the $geoNear pipeline operation must appear as the first stage in an aggregation pipeline.
</blockquote>
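To make the quoted rule concrete, here is a minimal Python sketch of a pipeline in which $match comes first, so it is the stage that can be served by an index, while $group works on whatever documents $match passes along. The collection name `orders` and the fields `status`, `customer_id` and `amount` are hypothetical, and the helper `leading_index_stages` only illustrates the quoted sentence; it is not something MongoDB provides.

```python
# Hypothetical pipeline: collection and field names are assumptions for illustration.
pipeline = [
    {"$match": {"status": "shipped"}},  # first stage: can use an index on "status"
    {"$group": {"_id": "$customer_id", "total": {"$sum": "$amount"}}},
]

# Stages the quoted docs passage says can use an index at the start of a pipeline.
INDEX_AWARE_AT_START = {"$match", "$sort"}


def leading_index_stages(stages):
    """Return the names of the leading stages covered by the quoted rule."""
    names = []
    for stage in stages:
        (name,) = stage.keys()  # each stage is a single-key dict
        if name not in INDEX_AWARE_AT_START:
            break
        names.append(name)
    return names


print(leading_index_stages(pipeline))
# prints ['$match']

# With pymongo (assumed to be installed and connected), the pipeline would be run as:
# results = db.orders.aggregate(pipeline)
```

Note that $group never appears in the result, whatever its position.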

answered Sep 29 '22 11:09

4J41

As 4J41's answer says, $group does not (directly) use an index, although $sort does if it is the first stage in the pipeline. However, $group could, in principle, have an optimised implementation when it immediately follows a $sort, in which case you could make it effectively use an index by putting a $sort beforehand.
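The idea behind such an optimisation can be sketched outside MongoDB. When the input is already sorted by the grouping key, each group is contiguous, so a grouper can emit a finished group as soon as the key changes and only ever hold one group in memory. A minimal Python illustration follows; the documents and field names are made up, and this is not how the server implements $group.

```python
from itertools import groupby
from operator import itemgetter


def stream_group_sum(sorted_docs, key_field, sum_field):
    """Yield (key, total) pairs from documents already sorted by key_field.

    Only one group is processed at a time, because sorted input
    guarantees that all documents for a key arrive together.
    """
    for key, docs in groupby(sorted_docs, key=itemgetter(key_field)):
        yield key, sum(doc[sum_field] for doc in docs)


# Hypothetical documents, sorted by "customer_id" as an index scan would return them.
docs = [
    {"customer_id": "a", "amount": 5},
    {"customer_id": "a", "amount": 7},
    {"customer_id": "b", "amount": 3},
]

print(list(stream_group_sum(docs, "customer_id", "amount")))
# prints [('a', 12), ('b', 3)]
```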

The docs do not give a straight answer either way about whether $group has this optimisation (although I bet they would mention it if it did, which suggests it doesn't). The answer is in MongoDB bug 4507: currently $group does NOT have this optimisation, so the top line of 4J41's answer is right after all. If you really need efficiency, then depending on the application it may be quickest to use a regular query and do the grouping in your client code.
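As a rough illustration of the client-side approach, the sketch below groups the results of an ordinary query in Python. The function accepts any iterable of documents, so a pymongo cursor would work too; the collection and field names in the commented call are hypothetical.

```python
from collections import defaultdict


def group_totals(docs, key_field, sum_field):
    """Sum sum_field per distinct key_field value, entirely on the client."""
    totals = defaultdict(int)
    for doc in docs:
        totals[doc[key_field]] += doc[sum_field]
    return dict(totals)


# Stand-in for a query result; with pymongo (an assumption) this could be
# db.orders.find({"status": "shipped"}, {"customer_id": 1, "amount": 1})
sample = [
    {"customer_id": "a", "amount": 5},
    {"customer_id": "b", "amount": 3},
    {"customer_id": "a", "amount": 7},
]

print(group_totals(sample, "customer_id", "amount"))
# prints {'a': 12, 'b': 3}
```

Unlike a grouper that relies on sorted input, this version needs no ordering but keeps one running total per distinct key in memory.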

Edit: As sebastian's answer says, in practice putting a $sort (that can take advantage of an index) before a $group can give a very large speed improvement. The bug above is still open, so it seems that $group is not yet making the best possible use of the index (that is, starting to group items as they are loaded, rather than loading them all into memory first). But it is still certainly worth doing.
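For reference, the arrangement described in the edit looks roughly like this as a pipeline, written here as Python data. It assumes a hypothetical `orders` collection with an index on `customer_id`; the helper name `sorted_group_pipeline` is also made up. Whether it helps on a given deployment is worth measuring rather than assuming.

```python
def sorted_group_pipeline(group_field, sum_field):
    """Build a pipeline that sorts on the grouping field before grouping.

    The $sort is the first stage, so it can use an index on group_field.
    """
    return [
        {"$sort": {group_field: 1}},
        {"$group": {"_id": "$" + group_field, "total": {"$sum": "$" + sum_field}}},
    ]


pipeline = sorted_group_pipeline("customer_id", "amount")
print(pipeline)

# With pymongo (assumed), after db.orders.create_index("customer_id"):
# results = db.orders.aggregate(pipeline)
```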

answered Sep 29 '22 10:09

Arthur Tacca


Tags:

performance

mongodb

aggregation-framework

fedor.belov
