Large document vs many documents

Just wanted an opinion, or at least a rule of thumb, on which database structure is better for CouchDB. Is it better to keep all related data for an item in a single document, or to spread parts of all items across many documents?

Let me illustrate what I mean with an example. I currently log 4 events from our system at 1-minute intervals; let's call them event_1, event_2, event_3 and event_4. Data is stored for each of the 4 events regardless of value (you'll always get a value, even if everything is okay).

Option 1: Group events, and append new timestamp/values to the document...

{
    event_1: [ 
        { timestamp, value },
        { timestamp, value },
        { timestamp, value },
        ...etc
    ]
},
{
    event_2: [ 
        { timestamp, value },
        { timestamp, value },
        { timestamp, value },
        ...etc
    ]
},
{
    event_3: [ 
        { timestamp, value },
        { timestamp, value },
        { timestamp, value },
        ...etc
    ]
}
...etc

Option 2: Keep a huge list of documents, one per timestamp, each holding the latest values (which is how they're actually delivered from the system).

{
    timestamp: {
        { event_1, value },
        { event_2, value },
        { event_3, value },
        { event_4, value }
    }
},
{
    timestamp: {
        { event_1, value },
        { event_2, value },
        { event_3, value },
        { event_4, value }
    }
},
{
    timestamp: {
        { event_1, value },
        { event_2, value },
        { event_3, value },
        { event_4, value }
    }
}
...etc

I'm currently using the 2nd option, but was curious to see people's opinions on what would be considered best practice. I'm starting to think that Option 1 might be better, because my reports group results by event (each event is shown as its own line graph).
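As a side note on the reporting concern, grouping by event does not necessarily require Option 1's storage layout. The following is a minimal sketch, assuming each Option 2 document has a `timestamp` field and an `events` mapping (both field names are hypothetical and not taken from the real system). It regroups per-timestamp documents into one series per event, which is roughly what a CouchDB view emitting `[event, timestamp]` keys would provide on the server side.

```python
from collections import defaultdict


def group_by_event(docs):
    """Turn per-timestamp documents into one time-ordered series per event."""
    series = defaultdict(list)
    for doc in docs:
        # "events" and "timestamp" are assumed field names for illustration.
        for event, value in doc["events"].items():
            series[event].append((doc["timestamp"], value))
    for points in series.values():
        # Order each series by timestamp so it can be plotted as a line.
        points.sort(key=lambda point: point[0])
    return dict(series)


# Two made-up Option 2 documents, deliberately out of order.
docs = [
    {"timestamp": 120, "events": {"event_1": 5, "event_2": 0}},
    {"timestamp": 60, "events": {"event_1": 3, "event_2": 1}},
]

if __name__ == "__main__":
    for event, points in group_by_event(docs).items():
        print(event, points)
```

Under these assumptions, the storage layout and the reporting layout can be chosen separately.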

asked Jun 30 '11 11:06

crawf

1 Answer

I would definitely prefer your Option 2.

Since CouchDB keeps the previous revisions of a document (at least until the database is compacted), Option 1 would consume a huge amount of storage. Every update writes a complete new revision, so with each new value you store the new value and also a copy of all the old ones. Using Option 2 you only store the new values without touching the old ones.
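To get a feel for the difference, here is a rough sketch that adds up how many bytes of JSON get written under each option. It is only an illustration under stated assumptions: the document shapes and `_id` values are made up, sizes are plain JSON lengths rather than CouchDB's actual on-disk format, and compaction (which discards old revision bodies) is ignored.

```python
import json


def encoded_size(doc: dict) -> int:
    """Size in bytes of the document's compact JSON encoding."""
    return len(json.dumps(doc, separators=(",", ":")).encode("utf-8"))


def bytes_written_option_1(samples: int) -> int:
    """Total bytes written when one per-event document grows by appending.

    Every append saves a whole new revision, so the array collected so far
    is written out again each time.
    """
    doc = {"_id": "event_1", "values": []}  # hypothetical document shape
    total = 0
    for minute in range(samples):
        doc["values"].append({"timestamp": minute * 60, "value": 1.0})
        total += encoded_size(doc)  # the entire document is rewritten
    return total


def bytes_written_option_2(samples: int) -> int:
    """Total bytes written when every sample is its own small document."""
    total = 0
    for minute in range(samples):
        # Hypothetical shape: one document per timestamp, never updated.
        doc = {"_id": f"sample-{minute}", "timestamp": minute * 60, "event_1": 1.0}
        total += encoded_size(doc)
    return total


if __name__ == "__main__":
    one_day = 24 * 60  # one sample per minute
    print("Option 1:", bytes_written_option_1(one_day), "bytes")
    print("Option 2:", bytes_written_option_2(one_day), "bytes")
```

In this model the bytes written for Option 1 grow roughly with the square of the number of samples, while Option 2 grows linearly. Compaction would reclaim old revisions in Option 1, but each individual write still has to carry the whole array.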

answered Sep 29 '22 10:09

phlogratos

Tags: database, couchdb
