<p>Is there an explain function for the Aggregation framework in MongoDB? I can't see it in the documentation.</p>
<p>If not, is there some other way to check how a query performs within the aggregation framework?</p>
<p>I know that with <code>find</code> you just do:</p>
<pre class="prettyprint lang-js prettyprint-override"><code>db.collection.find().explain()
</code></pre>
<p>But with the aggregation framework I get an error:</p>
<pre class="prettyprint lang-js prettyprint-override"><code>db.collection.aggregate(
    { $project : { "Tags._id" : 1 }},
    { $unwind : "$Tags" },
    { $match: {$or: [{"Tags._id":"tag1"},{"Tags._id":"tag2"}]}},
    {
        $group:
        {
            _id : { id: "$_id"},
            "count": { $sum:1 }
        }
    },
    { $sort: {"count":-1}}
).explain()
</code></pre>

<p>Starting with MongoDB version 3.0, simply changing the order from</p>
<pre class="prettyprint"><code>collection.aggregate(...).explain()
</code></pre>
<p>to</p>
<pre class="prettyprint"><code>collection.explain().aggregate(...)
</code></pre>
<p>will give you the desired results (see the <code>db.collection.explain()</code> documentation).</p>
<p>For older versions (2.6 and later), you will need to use the <code>explain</code> option for aggregation pipeline operations.</p>
<h3><code>explain:true</code></h3>
<pre class="prettyprint"><code>db.collection.aggregate([
    { $project : { "Tags._id" : 1 }},
    { $unwind : "$Tags" },
    { $match: {$or: [{"Tags._id":"tag1"},{"Tags._id":"tag2"}]}},
    { $group: {
        _id : "$_id",
        count: { $sum:1 }
    }},
    { $sort: {"count":-1}}
  ],
  {
    explain:true
  }
)
</code></pre>
<p>An important consideration with the Aggregation Framework is that an index can only be used to fetch the initial data for a pipeline (e.g. usage of <code>$match</code>, <code>$sort</code>, <code>$geoNear</code> at the beginning of a pipeline) as well as in subsequent <code>$lookup</code> and <code>$graphLookup</code> stages. Once data has been fetched into the aggregation pipeline for processing (e.g. passing through stages like <code>$project</code>, <code>$unwind</code>, and <code>$group</code>), further manipulation will be in-memory (possibly using temporary files if the <code>allowDiskUse</code> option is set).</p>
<h3>Optimizing pipelines</h3>
<p>In general, you can optimize aggregation pipelines by:</p>
<ul>
<li>Starting a pipeline with a <code>$match</code> stage to restrict processing to relevant documents.</li>
<li>Ensuring the initial <code>$match</code> / <code>$sort</code> stages are supported by an efficient index.</li>
<li>Filtering data early using <code>$match</code>, <code>$limit</code>, and <code>$skip</code>.</li>
<li>Minimizing unnecessary stages and document manipulation (perhaps reconsidering your schema if complicated aggregation gymnastics are required).</li>
<li>Taking advantage of newer aggregation operators if you have upgraded your MongoDB server. For example, MongoDB 3.4 added many new aggregation stages and expressions, including support for working with arrays, strings, and facets.</li>
</ul>
<p>There are also a number of Aggregation Pipeline Optimizations that happen automatically depending on your MongoDB server version. For example, adjacent stages may be coalesced and/or reordered to improve execution without affecting the output results.</p>
<h3>Limitations</h3>
<p>As of MongoDB 3.4, the Aggregation Framework <code>explain</code> option provides information on how a pipeline is processed but does not support the same level of detail as the <code>executionStats</code> mode for a <code>find()</code> query. If you are focused on optimizing initial query execution, you will likely find it beneficial to review the equivalent <code>find().explain()</code> query with <code>executionStats</code> or <code>allPlansExecution</code> verbosity.</p>
<p>There are a few relevant feature requests to watch/upvote in the MongoDB issue tracker regarding more detailed execution stats to help optimize/profile aggregation pipelines:</p>
<ul>
<li>SERVER-19758: Add "executionStats" and "allPlansExecution" explain modes to aggregation explain</li>
<li>SERVER-21784: Track execution stats for each aggregation pipeline stage and expose via explain</li>
<li>SERVER-22622: Improve $lookup explain to indicate query plan on the "from" collection</li>
</ul>
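<p>As a rough sketch of the first two optimization points above, the pipeline from the question could be restructured so that it starts with a <code>$match</code> that an index is able to support. This assumes an index on <code>"Tags._id"</code> exists (the question does not say so), and the names <code>tagFilter</code> and <code>pipeline</code> are only illustrative. The same filter is repeated after <code>$unwind</code> so that non-matching tags inside matching documents are still dropped, which should leave the grouped counts unchanged.</p>

```javascript
// Filter taken from the question's $match stage.
var tagFilter = { $or: [{ "Tags._id": "tag1" }, { "Tags._id": "tag2" }] };

var pipeline = [
    // Leading $match: the only place an index (assumed here on "Tags._id")
    // can be used to select the initial documents.
    { $match: tagFilter },
    { $project: { "Tags._id": 1 } },
    { $unwind: "$Tags" },
    // Same filter again: after $unwind each document holds a single tag,
    // so this removes the unwound tags that are not tag1 or tag2.
    { $match: tagFilter },
    { $group: { _id: "$_id", count: { $sum: 1 } } },
    { $sort: { count: -1 } }
];

// Only runs inside the mongo shell, where `db` is defined.
if (typeof db !== "undefined") {
    // MongoDB 3.0+: explain() comes before aggregate().
    printjson(db.collection.explain().aggregate(pipeline));
    // MongoDB 2.6: pass the option instead.
    // printjson(db.collection.aggregate(pipeline, { explain: true }));
}
```

<p>Comparing the explain output of this pipeline with that of the original ordering is one way to check whether the leading <code>$match</code> is actually served by an index on your data.</p>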

Mongodb Explain for Aggregation framework

Tags:

mongodb

aggregation-framework

949

asked Oct 03 '12 04:10

SCB

1 Answer

123

answered Sep 30 '22 15:09

Stennie
