mongodb schema design for blogs

Tags:

How would you design the schema for a blog-like site with document-based databases (mongodb). The site has the following objects: User, Article, Comment. User can add Comments to Article. Each User can also vote exactly once per Comment.

I want to be able to do these queries efficiently:
1. get Article A, comments on Article A and # of votes per comments
2. get all comments by User B across all articles
3. get all comments User B voted for

My first attempt is to put articles and comments in separate collections and comment can contain a list of users that voted for it. This makes query 1 and 2 simple. And for 3, I added Vote collection which keep tracks of votes by users.

There's some obvious drawback such as duplicating user vote data and query 1 will take two calls to the database. Is there a better approach?

Article {   "user_id" }  Comment {    "user_id",    "article_id",    [user_voted], }  Vote {     "user_id",     "comment_id", }

839

asked Mar 07 '11 20:03

kefeizhou

1 Answers

Article {   "_id" : "A",   "title" : "Hello World",   "user_id" : 12345,   "text" : 'My test article',    "comments" : [     { 'text' : 'blah', 'user_id' : 654321, 'votes' : [987654]},     { 'text' : 'foo', 'user_id' : 987654, 'votes' : [12345, 654321] },     ...   ] }

The basic premise here is that I've nested the Comments inside of the Article. The Votes only apply to a Comment, so they've been stored as an array with each Comment. In this case, I've just stored the user_id. If you want to store more information (time_created, etc.), then you can votes an array of objects:

... 'votes' : [ { user_id : 987654, ts : 78946513 } ] ...

How to perform your queries efficiently:

get Article A, comments on Article A and # of votes per comments

db.articles.find( { _id : 'A' } )

This gets everything with one query. You may have to do some client-side logic to count votes per comment, but this is pretty trivial.

get all comments by User B across all articles

db.articles.ensureIndex( { "comments.user_id" : 1 } ) db.articles.find( { "comments.user_id" : 987654 } ) // returns all document fields

The index will allow for efficiently searching the comments within a document.

There's currently no way to extract only the matches from a sub-array. This query will in fact return all of the articles with comments by that user. If this is potentially way too much data, you can do some trimming.

db.articles.find( { "comments.user_id" : 987654 }, { "title" : 1, "comments.user_id" : 1 })

get all comments User B voted for

db.articles.ensureIndex( { "comments.votes" : 1 } ) db.articles.find( { "comments.votes" : 987654 } )

Again, this will return all of the Articles, not just the comments.

There's a trade-off to be made here. Returning the article may seem like we're bringing back too much data. But what are you planning to display to the user when you make query #3?

Getting a list of "comments I've voted for" is not terribly useful without the comment itself. Of course the comment is not very useful without the article itself (or at least just the title).

Most of the time, query #3 devolves into a join from Votes to Comments to Articles. If that's the case, then why not just bring back the Articles to start with?

117

answered Oct 03 '22 22:10

Gates VP

Related questions
                            
                                When should database synonyms be used?
                            
                                Is there a difference between Surrogate key, Synthetic Key, and Artificial Key?
                            
                                Why is negative id or zero considered a bad practice?
                            
                                Multi currency - what to store and when to convert?
                            
                                What are the different types of indexes, what are the benefits of each?
                            
                                Do link tables need a meaningless primary key field?
                            
                                Add ASP.NET Membership tables to my own existing database, or should I instead configure a separate ASP.NET membership database?
                            
                                Array Attribute for Ruby Model
                            
                                Simple way to parse a person's name into its component parts? [closed]
                            
                                What is the recommended equivalent of cascaded delete in MongoDB for N:M relationships?
                            
                                Choice of Database schema for storing folder system
                            
                                Best practice - logging events (general) and changes (database)
                            
                                A good database modeling tool? [closed]
                            
                                Techniques for database inheritance?
                            
                                Should you make a self-referencing table column a foreign key?
                            
                                Storing day and month (without year)
                            
                                One table or many? [closed]
                            
                                designing database to hold different metadata information
                            
                                How do I implement threaded comments?
                            
                                How would I implement separate databases for reading and writing operations?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

mongodb schema design for blogs

Tags:

mongodb

database-design

kefeizhou

People also ask

1 Answers

Gates VP

Recent Activity

Donate For Us