storing data as object vs array in MongoDb for write performance

Tags:

mongodb

Should I store objects in an Array or inside an Object with top importance given Write Speed?

I'm trying to decide whether data should be stored as an array of objects, or using nested objects inside a mongodb document.

In this particular case, I'm keeping track of a set of continually updating files that I add and update and the file name acts as a key and the number of lines processed within the file.

the document looks something like this

Click to copy

{
  t_id:1220,
  some-other-info: {}, // there's other info here not updated frequently
  files: {
    log1-txt: {filename:"log1.txt",numlines:233,filesize:19928},
    log2-txt: {filename:"log2.txt",numlines:2,filesize:843}
  }
}

or this

Click to copy

{
  t_id:1220,
  some-other-info: {},
  files:[
    {filename:"log1.txt",numlines:233,filesize:19928},
    {filename:"log2.txt",numlines:2,filesize:843}
  ]
}

I am making an assumption that handling a document, especially when it comes to updates, it is easier to deal with objects, because the location of the object can be determined by the name; unlike an array, where I have to look through each object's value until I find the match.

Because the object key will have periods, I will need to convert (or drop) the periods to create a valid key (fi.le.log to filelog or fi-le-log). I'm not worried about the files' possible duplicate names emerging (such as fi.le.log and fi-le.log) so I would prefer to use Objects, because the number of files is relatively small, but the updates are frequent.

Or would it be better to handle this data in a separate collection for best write performance...

Click to copy

{
    "_id": ObjectId('56d9f1202d777d9806000003'),"t_id": "1220","filename": "log1.txt","filesize": 1843,"numlines": 554
},
{
    "_id": ObjectId('56d9f1392d777d9806000004'),"t_id": "1220","filename": "log2.txt","filesize": 5231,"numlines": 3027
}

292

asked Mar 04 '16 20:03

Daniel

1 Answers

From what I understand you are talking about write speed, without any read consideration. So we have to think about how you will insert/update your document.

We have to compare (assuming you know the _id you are replacing, replace {key} by the key name, in your example log1-txt or log2-txt):

Click to copy

db.Col.update({ _id: '' }, { $set: { 'files.{key}': object }})

Click to copy

db.Col.update({ _id: '', 'files.filename': '{key}'}, { $set: { 'files.$': object }})

The second one means that MongoDB have to browse the array, find the matching index and update it. The first one means MongoDB just update the specified field.

The worst: The second command will not work if the matching filename is not present in the array! So you have to execute it, check if nMatched is 0, and create it if it is so. That's really bad write speed (see here MongoDB: upsert sub-document).

If you will never/almost never use read queries / aggregation framework on this collection: go for the first one, that will be faster. If you want to aggregate, unwind, do some analytics on the files you parsed to have statistics about file size and line numbers, you may consider using the second one, you will avoid some headache.

Pure write speed will be better with the first solution.

113

answered Oct 04 '22 00:10

Jonathan Muller

Related questions
                            
                                Determining when focus occurs outside an element
                            
                                Wordpress customizer custom control transport postMessage not working
                            
                                How to detect changes with Date objects in Angular2?
                            
                                How to get checkboxes to initialize based on model?
                            
                                SVGPathData chrome 48
                            
                                Why HMAC sha256 return different value on PHP & Javascript
                            
                                Enable button when scroll bootstrap modal to bottom
                            
                                How to enhance a server side generated page with Aurelia.io?
                            
                                What do the functions 'beforeShowDay' and 'onSelect' actually do in following Datepicker widget implementation?
                            
                                Typescript Compiler error TS2307: Cannot find module 'jquery'
                            
                                How to prevent invoking 'Meteor.call' from JavaScript Console?
                            
                                ES6 - Exporting module with a getter
                            
                                Components and directives in angular 1.5
                            
                                How to parse JSON format date string into date format
                            
                                Why does v8 run out of memory in this situation?
                            
                                Read <input> in a functional (stateless) component
                            
                                Eslint config file from codacy
                            
                                Not able to send a DELETE request with fetch api
                            
                                Issues connecting to Amazon RDS Postgres database on node.js using sequelize ORM
                            
                                Why doesn't my directive's enter animation using ng-if run on first time when using AngularJS 1.5?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

storing data as object vs array in MongoDb for write performance

Tags:

javascript

mongodb

Daniel

People also ask

1 Answers

Jonathan Muller

Recent Activity

Donate For Us