I am unwinding an array using MongoDB aggregation framework and the array has duplicates and I need to ignore those duplicates while doing a grouping further. How can I achieve that?

you can use $addToSet to do this: <pre class="prettyprint"><code>db.users.aggregate([ { $unwind: '$data' }, { $group: { _id: '$_id', data: { $addToSet: '$data' } } } ]); </code></pre> It's hard to give you more specific answer without seeing your actual query.

You have to use $addToSet, but at first you have to group by _id, because if you don't you'll get an element per item in the list. Imagine a collection posts with documents like this: <pre class="prettyprint"><code>{ body: "Lorem Ipsum...", tags: ["stuff", "lorem", "lorem"], author: "Enrique Coslado" } </code></pre> Imagine you want to calculate the most usual tag per author. You'd make an aggregate query like that: <pre class="prettyprint"><code>db.posts.aggregate([ {$project: { author: "$author", tags: "$tags", post_id: "$_id" }}, {$unwind: "$tags"}, {$group: { _id: "$post_id", author: {$first: "$author"}, tags: {$addToSet: "$tags"} }}, {$unwind: "$tags"}, {$group: { _id: { author: "$author", tags: "$tags" }, count: {$sum: 1} }} ]) </code></pre> That way you'll get documents like this: <pre class="prettyprint"><code>{ _id: { author: "Enrique Coslado", tags: "lorem" }, count: 1 } </code></pre>

MongoDB - Unwind array using aggregation and remove duplicates

2 Answers

you can use $addToSet to do this:

db.users.aggregate([   { $unwind: '$data' },   { $group: { _id: '$_id', data: { $addToSet: '$data' } } } ]);

It's hard to give you more specific answer without seeing your actual query.

116

answered Oct 08 '22 22:10

Roman Pekar

You have to use $addToSet, but at first you have to group by _id, because if you don't you'll get an element per item in the list.

Imagine a collection posts with documents like this:

{      body: "Lorem Ipsum...",       tags: ["stuff", "lorem", "lorem"],      author: "Enrique Coslado" }

Imagine you want to calculate the most usual tag per author. You'd make an aggregate query like that:

db.posts.aggregate([     {$project: {         author: "$author",          tags: "$tags",          post_id: "$_id"     }},       {$unwind: "$tags"},       {$group: {         _id: "$post_id",          author: {$first: "$author"},          tags: {$addToSet: "$tags"}     }},       {$unwind: "$tags"},      {$group: {         _id: {             author: "$author",             tags: "$tags"         },         count: {$sum: 1}     }} ])

That way you'll get documents like this:

{      _id: {          author: "Enrique Coslado",           tags: "lorem"      },      count: 1 }

answered Oct 08 '22 23:10

Enrique Coslado

Related questions
                            
                                Why does upsert a record using update_one raise ValueError?
                            
                                Mongodb: failed to connect to server on first connect
                            
                                How to join query in mongodb?
                            
                                mongodb cursor id not valid error
                            
                                Finding mongoDB records in batches (using mongoid ruby adapter)
                            
                                Warning on Connecting to MongoDB with a Node server
                            
                                How to find mongo documents with a same field
                            
                                Mongo C# driver - Building filter dynamically with nesting
                            
                                Remove multiple documents from mongo in a single query
                            
                                Running mongodb on ubuntu 16.04 LTS
                            
                                mongoose custom validation using 2 fields
                            
                                How to connect to MongoDB running in Docker container?
                            
                                Mongodb: when to call ensureIndex?
                            
                                Force mongodb to output strict JSON
                            
                                Severe performance drop with MongoDB Change Streams
                            
                                How do I abort a running query in the MongoDB shell?
                            
                                What is returned from Mongoose query that finds no matches?
                            
                                How to resolve error :dbpath (/data/db/) does not exist permanently in MongoDB
                            
                                How to set _id to db document in Mongoose?
                            
                                MongoDB How to know Primary DB server ip in a replica set?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

MongoDB - Unwind array using aggregation and remove duplicates

Tags:

mongodb

l a s

People also ask

2 Answers

Roman Pekar

Enrique Coslado

Recent Activity

Donate For Us