I have a mongoDB collection with millions of rows and I'm trying to optimize my queries. I'm currently using the aggregation framework to retrieve data and group them as I want. My typical aggregation query is something like : <code>$match > $group > $ group > $project</code> However, I noticed that the last parts only take a few ms, the beginning is the slowest. I tried to perform a query with only the $match filter, and then to perform the same query with collection.find. The aggregation query takes ~80ms while the find query takes 0 or 1ms. I have indexes on pretty much each field so I guess this isn't the problem. Any idea on what could go wrong ? Or is it just a "normal" drawback of the aggregation framework ? I could use find queries instead of aggregation queries, however I would have to perform a lot of processing after the request and this process can be done quickly with <code>$group</code> etc. so I would rather keep the aggregation framework. Thanks, EDIT : Here is my criteria : <pre class="prettyprint"><code>{ "action" : "click", "timestamp" : { "$gt" : ISODate("2015-01-01T00:00:00Z"), "$lt" : ISODate("2015-02-011T00:00:00Z") }, "itemId" : "5" } </code></pre>

The main purpose of the <code>aggregation framework</code> is to ease the query of a big number of entries and generate a low number of results that hold value to you. As you have said, you can also use multiple <code>find</code> queries, but remember that you can not create new fields with <code>find</code> queries. On the other hand, the <code>$group</code> stage allows you to define your new fields. If you would like to achieve the functionality of the <code>aggregation framework</code>, you would most likely have to run an initial <code>find</code> (or chain several ones), pull that information and further manipulate it with a programming language. The <code>aggregation pipeline</code> might seem to take longer, but at least you know you only have to take into account the performance of one system - MongoDB engine. Whereas, when it comes to manipulating the data returned from a <code>find</code> query, you would most likely have to further manipulate the data with a programming language, thus increasing the complexity depending on the intricacies of the programming language of choice.

MongoDB {aggregation $match} vs {find} speed

Tags:

mongodb

aggregation-framework

I have a mongoDB collection with millions of rows and I'm trying to optimize my queries. I'm currently using the aggregation framework to retrieve data and group them as I want. My typical aggregation query is something like : $match > $group > $ group > $project

However, I noticed that the last parts only take a few ms, the beginning is the slowest.

I tried to perform a query with only the $match filter, and then to perform the same query with collection.find. The aggregation query takes ~80ms while the find query takes 0 or 1ms.

I have indexes on pretty much each field so I guess this isn't the problem. Any idea on what could go wrong ? Or is it just a "normal" drawback of the aggregation framework ?

I could use find queries instead of aggregation queries, however I would have to perform a lot of processing after the request and this process can be done quickly with $group etc. so I would rather keep the aggregation framework.

Thanks,

EDIT :

Here is my criteria :

{     "action" : "click",     "timestamp" : {             "$gt" : ISODate("2015-01-01T00:00:00Z"),             "$lt" : ISODate("2015-02-011T00:00:00Z")     },     "itemId" : "5" }

696

asked Feb 06 '15 11:02

Owumaro

1 Answers

The main purpose of the aggregation framework is to ease the query of a big number of entries and generate a low number of results that hold value to you.

As you have said, you can also use multiple find queries, but remember that you can not create new fields with find queries. On the other hand, the $group stage allows you to define your new fields.

If you would like to achieve the functionality of the aggregation framework, you would most likely have to run an initial find (or chain several ones), pull that information and further manipulate it with a programming language.

The aggregation pipeline might seem to take longer, but at least you know you only have to take into account the performance of one system - MongoDB engine.

Whereas, when it comes to manipulating the data returned from a find query, you would most likely have to further manipulate the data with a programming language, thus increasing the complexity depending on the intricacies of the programming language of choice.

answered Oct 06 '22 13:10

vladzam

Related questions
                            
                                What's the difference between replaceOne() and updateOne() in MongoDB?
                            
                                MongoDB Show Current User
                            
                                MongoDb sum query
                            
                                How to use MongoDBs aggregate `$lookup` as `findOne()`
                            
                                Update field in exact element array in MongoDB
                            
                                How to pass argument to Mongo Script
                            
                                What does it mean to fit "working set" into RAM for MongoDB?
                            
                                How does MongoDB avoid the SQL injection mess?
                            
                                Updating the path 'x' would create a conflict at 'x'
                            
                                MongoError: ns not found when try to drop collection
                            
                                MongoDB ranged pagination
                            
                                Mongodb Query To select records having a given key
                            
                                MongoDB Opensource vs MongoDB Enterprise
                            
                                .NET best practices for MongoDB connections?
                            
                                Is there a simple way to export the data from a meteor deployed app?
                            
                                Mongorestore to a different database
                            
                                Is there a sample MongoDB Database along the lines of world for MySql? [closed]
                            
                                Is it safe to delete the journal file of mongodb?
                            
                                MongoDB aggregate by field exists
                            
                                Using Active Record generators after Mongoid installation?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With