How do I append Mongo DB aggregation results to an existing collection?

Tags:

mongodb

aggregation-framework

I am trying to perform several insertions on an existent Mongo DB collection using the following code

db.dados_meteo.aggregate( [
                  { $match : { "POM" : "AguiardaBeira" } },
                  { $project : {
                     _id : { $concat: [
                        "0001:",
                      { $substr: [ "$DTM", 0, 4 ] },
                      { $substr: [ "$DTM", 5, 2 ] },
                      { $substr: [ "$DTM", 8, 2 ] },
                      { $substr: [ "$DTM", 11, 2 ] },
                      { $substr: [ "$DTM", 14, 2 ] },
                      { $substr: [ "$DTM", 17, 2 ] }
                       ] },
                    "RNF" : 1, "WET":1,"HMD":1,"TMP":1 } },
                  { $out : "dados_meteo_reloaded" }
              ] )

But each time I change the $match parameters and make a new aggregation, Mongo DB deletes the previous documents and inserts the new result.

Could you help me?

241

asked Feb 26 '15 11:02

Hugo

3 Answers

Starting Mongo 4.2, the new $merge aggregation operator (similar to $out) allows merging the result of an aggregation pipeline into the specified collection:

Given this input:

db.source.insert([
  { "_id": "id_1", "a": 34 },
  { "_id": "id_3", "a": 38 },
  { "_id": "id_4", "a": 54 }
])
db.target.insert([
  { "_id": "id_1", "a": 12 },
  { "_id": "id_2", "a": 54 }
])

the $merge aggregation stage can be used as such:

db.source.aggregate([
  // { $whatever aggregation stage, for this example, we just keep records as is }
  { $merge: { into: "target" } }
])

to produce:

// > db.target.find()
{ "_id" : "id_1", "a" : 34 }
{ "_id" : "id_2", "a" : 54 }
{ "_id" : "id_3", "a" : 38 }
{ "_id" : "id_4", "a" : 54 }

Note that the $merge operator comes with many options to specify how to merge inserted records conflicting with existing records.

In this case (with the default options), this:

keeps the target collection's existing documents (this is the case of { "_id": "id_2", "a": 54 })
inserts documents from the output of the aggregation pipeline into the target collection when they are not already present (based on the _id - this is the case of { "_id" : "id_3", "a" : 38 })
replaces the target collection's records when the aggregation pipeline produces documents existing in the target collection (based on the _id - this is the case of { "_id": "id_1", "a": 12 } replaced by { "_id" : "id_1", "a" : 34 })

answered Oct 27 '22 11:10

Xavier Guihot

The short answer is "you can't":

If the collection specified by the $out operation already exists, then upon completion of the aggregation, the $out stage atomically replaces the existing collection with the new results collection. The $out operation does not change any indexes that existed on the previous collection. If the aggregation fails, the $out operation makes no changes to the pre-existing collection.

As a workaround, you can copy the collection document specified by $out to a "permanent" collection just after aggregation, in one of a several ways (non of which is ideal though):

copyTo() is the easiest, mind the Warning. Don't use other for small results.
Use JS: db.out.find().forEach(function(doc) {db.target.insert(doc)})
Use mongoexport / mongoimport

answered Oct 27 '22 12:10

Ori Dar

It's not the prettiest thing ever, but as another alternative syntax (from a post-processing archive/append operation)...

db.targetCollection.insertMany(db.runCommand(
{
    aggregate: "sourceCollection",
    pipeline: 
    [
        { $skip: 0 },
        { $limit: 5 },
        { 
            $project:
            {
                myObject: "$$ROOT",
                processedDate: { $add: [new ISODate(), 0] }
            }
        }
    ]
}).result)

I'm not sure how this stacks up against the forEach variant, but i find it more intuitive to read.

answered Oct 27 '22 13:10

Jesse MacNett

Related questions
                            
                                how do i import data into mongodb from sql server?
                            
                                MongoDB Save and update
                            
                                Get one element from an array of objects that's part of one document (mongoose)
                            
                                Two nodes MongoDB replica set without arbiter
                            
                                MongoError: connection 0 to localhost:27017 timed out
                            
                                How to define a generic nested object in Mongoose
                            
                                Java MongoDB Object Versioning
                            
                                Create an index with MongoDb
                            
                                understanding mongoose [Schema.Types.Mixed]
                            
                                How to do HAVING COUNT in MongoDB?
                            
                                RangeError: Invalid status code: 0
                            
                                What's the difference between findOneAndUpdate and findOneAndReplace?
                            
                                What is the "admin" database in mongodb?
                            
                                Java, MongoDB: How to update every object while iterating a huge collection?
                            
                                How to get all items from IMongoCollection in C#?
                            
                                Filter MongoDb collection if field array and argument array intersect
                            
                                How to sort documents based on length of an Array field
                            
                                How to install a specific version of MongoDB?
                            
                                Insert Array inside an object in MongoDB
                            
                                pymongo default database connection

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With