Is mongodb fit for sites like stackoverflow? [closed]

1 Answers

Put simply: Yes, it could be.

Let's break down the various pages/features and see how they could be stored/reproduced in MongoDB.

The whole information in this page could be stored in a single document under the collection questions. This could include "sub-documents" for each answer to keep the retrieval of this page fast.

Edit: as @beagleguy pointed out, you could hit the document size limit of 4MB quite quickly this way, so it would be better to store answers in separate documents and link them to the question by storing the ObjectIDs in an array.

The votes could be stored in a separate collection, with simple links to the question and to the user who voted. A db.eval() call could be executed to increment/decrement the vote count directly in the document when a vote is added (though it blocks so wouldn't be very performant), or a MapReduce call could be made regularly do offset that work. It could work the same way for favourites.

Things like the "viewed" numbers, logging user's access times, etc. would generally be handled using a modifier operation to increment a counter. Since v1.3 there is a new "Find and Modify" command which can issue an update command when retrieving the document, saving you an extra call.

Any sort of statistical data (such as reputation, badges, unique tags) could be collected using MapReduce and pushed to specific collections. Things like notifications could be pushed to another collection acting as a job queue, with a number of workers listening for new items in the queue (think badge notifications, new answers since user's last access time, etc).

The Questions page and it's filters could all be handled with capped-collections rather than querying for that data immediately.

Ultimately, YMMV. As with all tools, there are advantages and costs. There are some SO features which would take a lot of work in an RDBMS but could be handled quite simply in Mongo, and vice-versa.

I think the main advantage of Mongo over RDBMSs is the schema-less approach and replication. Changing the schema regularly in a "live" RDMBS-based app can be painful, even impossible if it's heavily used with large amounts of data - those types of ops can lock the tables for far too long. In Mongo, adding new fields is trivial since you may not need to add them to every document. If you do its a relatively quick operation to run a map/reduce to update documents.

As for replication, Mongo has the advantage that the DB doesn't need to be paused to take a snapshot for slaves. Many RDBMSs can't set up replication without this approach, which on large DBs can take the master down for a long time (I'm looking at you, MySQL!). This can be a blessing for StackOverflow-type sites, where you need to scale over time - no taking the master down every time you need to add a node.

155

answered Oct 01 '22 04:10

Phillip B Oldham

Related questions
                            
                                using sphinx search with mongodb as datasource
                            
                                MongoDB using NOT and AND together
                            
                                Get particular element from mongoDB array [duplicate]
                            
                                Mongodb array $push and $pull
                            
                                MongoDB not allowing using '.' in key [duplicate]
                            
                                MongoDB as a Time Series Database
                            
                                MongoDB empty string value vs null value
                            
                                MongoDB Java Inserting Throws org.bson.codecs.configuration.CodecConfigurationException: Can't find a codec for class io.github.ilkgunel.mongodb.Pojo
                            
                                Mongoose.js transactions
                            
                                Pull and addtoset at the same time with mongo
                            
                                Spring-data-mongodb connect to multiple databases in one Mongo instance
                            
                                Simple Node/Express app, the functional programming way (How to handle side-effects in JavaScript?)
                            
                                What is the equivalent of the collection.getIndexes() shell command in pymongo?
                            
                                Implement auto-complete feature using MongoDB search
                            
                                Using Mongoose / MongoDB $addToSet functionality on array of objects
                            
                                connecting to Mongodb inside a docker with mongodb compass GUI
                            
                                Where do I find my dbname for MongoDB connection string?
                            
                                Is MongoDB _id (ObjectId) generated in an ascending order?
                            
                                NODE.JS: FATAL ERROR- JS Allocation failed - process out of memory, while parsing large excel files
                            
                                How do I use the AsQueryable method asynchronously with MongoDb C# Driver 2.1?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is mongodb fit for sites like stackoverflow? [closed]

Tags:

mongodb

unpangloss

People also ask

1 Answers

Phillip B Oldham

Recent Activity

Donate For Us