I run this command: db.ads_view.aggregate({$group: {_id : "$campaign", "action" : {$sum: 1} }}); ads_view : 500 000 documents. this queries take 1.8s . this is its profile : https://gist.github.com/afecec63a994f8f7fd8a indexed : db.ads_view.ensureIndex({campaign: 1}); But mongodb don't use index. Anyone know if can aggregate framework use indexes, how to index this query.

This is a late answer, but since <code>$group</code> in Mongo as of version 4.0 still won't make use of indexes, it may be helpful for others. To speed up your aggregation significantly, performe a <code>$sort</code> before <code>$group</code>. So your query would become: <pre class="prettyprint"><code>db.ads_view.aggregate({$sort:{"campaign":1}},{$group: {_id : "$campaign", "action" : {$sum: 1} }}); </code></pre> This assumes an index on <code>campaign</code>, which should have been created according to your question. In Mongo 4.0, create the index with <code>db.ads_view.createIndex({campaign:1})</code>. I tested this on a collection containing 5.5+ Mio. documents. Without <code>$sort</code>, the aggregation would not have finished even after several hours; with <code>$sort</code> preceeding <code>$group</code>, aggregation is taking a couple of seconds.

Aggregate framework can't use indexes

2 Answers

This is a late answer, but since $group in Mongo as of version 4.0 still won't make use of indexes, it may be helpful for others.

To speed up your aggregation significantly, performe a $sort before $group.

So your query would become:

db.ads_view.aggregate({$sort:{"campaign":1}},{$group: {_id : "$campaign", "action" : {$sum: 1} }});

This assumes an index on campaign, which should have been created according to your question. In Mongo 4.0, create the index with db.ads_view.createIndex({campaign:1}).

I tested this on a collection containing 5.5+ Mio. documents. Without $sort, the aggregation would not have finished even after several hours; with $sort preceeding $group, aggregation is taking a couple of seconds.

183

answered Oct 17 '22 19:10

sebastian

The $group operator is not one of the ones that will use an index currently. The list of operators that do (as of 2.2) are:

$match
$sort
$limit
$skip

From here:

http://docs.mongodb.org/manual/applications/aggregation/#pipeline-operators-and-indexes

Based on the number of yields going on in the gist, I would assume you either have a very active instance or that a lot of this data is not in memory when you are doing the group (it will yield on page fault usually too), hence the 1.8s

Note that even if $group could use an index, and your index covered everything being grouped, it would still involve a full scan of the index to do the group, and would likely not be terrible fast anyway.

answered Oct 17 '22 19:10

Adam Comerford

Related questions
                            
                                MongoDB Object Serialized as JSON
                            
                                How to take the average of big data in MongoDB vs CouchDB?
                            
                                Doctrine ODM - like operator syntax
                            
                                Mongo DB Delete a field and value
                            
                                MongoDB C# Driver 'Cursor not found'
                            
                                Create MongoDB ObjectID from date in the past using PHP driver
                            
                                What is a good alternative to Kue that works with MongoDB instead of Redis?
                            
                                What's the proper way to handle mongoose connections with express.js?
                            
                                How to AND and NOT in MongoDB $text search
                            
                                Mongodb not able to start in Ubuntu 15.04
                            
                                The type or namespace MongoServer could not be found
                            
                                Mongodb group aggregation using ProjectionDefinition with c# driver
                            
                                And operator in MongoDB to perform query with multi-filters using the official .NET driver
                            
                                How to connect Django ORM to mongo atlas?
                            
                                UnhandledPromiseRejectionWarning: MongooseServerSelectionError
                            
                                MongoDB atlas connection fails with error MongoServerSelectionError: connection <monitor> to 52.64.0.234:27017 closed
                            
                                Robomongo: Cannot connect to replica set. Set's primary is unreachable
                            
                                Mongo does not have a max() function, how do I work around this?
                            
                                Generate user friendly id's in MongoDb
                            
                                mongodb not using indexes

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Aggregate framework can't use indexes

Tags:

indexing

mongodb

meotimdihia

People also ask

2 Answers

sebastian

Adam Comerford

Recent Activity

Donate For Us