 

MongoDB workaround for documents above the 16MB size limit?

Tags:

mongodb

The MongoDB collection I am working on takes sensor data from a cellphone, and it is pinged to the server roughly every 2-6 seconds.

The data is huge, and the 16MB limit is crossed after 4-5 hours; there doesn't seem to be any workaround for this.

I have tried searching on Stack Overflow and went through various questions, but no one actually shared their hack.

Is there any way... on the DB side, maybe, that will distribute the chunks like it is done for big files via GridFS?

asked Oct 21 '16 by DeathNote

1 Answer

To fix this problem you will need to make some small amendments to your data structure. By the sounds of it, for your documents to exceed the 16MB limit, you must be embedding your sensor readings into an array in a single document.

I would not suggest using GridFS here; I do not believe it to be the best solution, and here is why.

There is a technique known as bucketing that you could employ which will essentially split your sensor readings out into separate documents, solving this problem for you.

The way it works is this:

Let's say I have a document with some embedded readings for a particular sensor that looks like this:

{
    _id : ObjectId("xxx"),
    sensor : "SensorName1",
    readings : [
        { date : ISODate("..."), reading : "xxx" },
        { date : ISODate("..."), reading : "xxx" },
        { date : ISODate("..."), reading : "xxx" }
    ]
}

With the structure above there is already a major flaw: the readings array can grow without bound and exceed the 16MB document limit.

So what we can do is change the structure slightly to look like this, to include a count property:

{
    _id : ObjectId("xxx"),
    sensor : "SensorName1",
    readings : [
        { date : ISODate("..."), reading : "xxx" },
        { date : ISODate("..."), reading : "xxx" },
        { date : ISODate("..."), reading : "xxx" }
    ],
    count : 3
}

The idea behind this is that when you $push a reading into the embedded array, you also increment ($inc) the count field for every push that is performed. And when you perform this update (push) operation, you include a filter on this count property, which might look something like this:

{ count : { $lt : 500} }

Then set your update options so that "upsert" is true:

db.sensorReadings.update(
    { sensor : "SensorName1", count : { $lt : 500 } },
    {
        // Your update: $push the new reading and $inc the count.
        // ReadingDocumentToPush is a placeholder for your incoming reading document.
        $push : { readings : ReadingDocumentToPush },
        $inc : { count : 1 }
    },
    { upsert : true }
)
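
On MongoDB 3.2 and newer you can express the same operation with the CRUD-style updateOne helper. A minimal sketch, assuming the same sensorReadings collection, the 500-reading bucket size, and a hypothetical inline reading in place of the placeholder above:

db.sensorReadings.updateOne(
    // Match the current, not-yet-full bucket for this sensor
    { sensor : "SensorName1", count : { $lt : 500 } },
    {
        // Append the new reading and bump the bucket's counter
        $push : { readings : { date : new Date(), reading : "xxx" } },
        $inc : { count : 1 }
    },
    // Create a fresh bucket when no matching document exists
    { upsert : true }
)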

See the MongoDB update documentation for more info on update and the upsert option.

What will happen is this: when the filter condition is not met (i.e. when there is either no existing document for this sensor, or the count is greater than or equal to 500, because you increment it every time an item is pushed), a new document will be created, and subsequent readings will be embedded in that new document. So you will never hit the 16MB limit if you do this properly.
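
For illustration, once the first bucket fills up, the collection might contain documents along these lines (the counts and reading ranges here are hypothetical):

{ _id : ObjectId("..."), sensor : "SensorName1", count : 500, readings : [ /* readings 1-500 */ ] }
{ _id : ObjectId("..."), sensor : "SensorName1", count : 123, readings : [ /* readings 501-623 */ ] }

Only the second document still matches the count filter, so new readings keep flowing into it until it fills up as well.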

Now, when querying the database for the readings of a particular sensor, you may get back multiple documents for that sensor (instead of just one with all the readings in it). For example, if you have 10,000 readings, you will get 20 documents back, each with 500 readings.

You can then use the aggregation pipeline with $unwind to filter your readings as if they were their own individual documents.
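
As a sketch, such a pipeline might look like this (collection and field names as in the examples above; the sort stage is an assumption that you want readings in chronological order):

db.sensorReadings.aggregate([
    // Select only the buckets for the sensor of interest
    { $match : { sensor : "SensorName1" } },
    // Turn each embedded reading into its own document
    { $unwind : "$readings" },
    // Sort the readings chronologically (assumed requirement)
    { $sort : { "readings.date" : 1 } },
    // Reshape the output into plain reading documents
    { $project : { _id : 0, sensor : 1, date : "$readings.date", reading : "$readings.reading" } }
])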

For more information on $unwind, see the MongoDB $unwind documentation; it's very useful.

I hope this helps.

answered Oct 06 '22 by pieperu