OK, there are already questions about "Why Is MongoDB So Fast". I appreciate those answers, but they are quite general. Yes, I know:

- MongoDB is document-based, but why does being document-based lead to much higher speed?
- MongoDB is NoSQL, but why does NoSQL mean higher performance?
- SQL databases do a lot more than MongoDB for consistency, ACID, etc., but I believe MongoDB also does something similar to keep data safe, maintain indexes, etc., right?

I am writing this question to find out:

1. What are the detailed and specific reasons for MongoDB's high performance?
2. What exactly does SQL do that MongoDB does not, so that MongoDB gains such high performance?
3. If an interviewer (a MongoDB and SQL expert) asks you "Why is MongoDB so fast?", how would you answer? Obviously, just answering "because MongoDB is NoSQL" is not enough.

Thanks


Any detailed and specific reasons for Why MongoDB is much faster than SQL DBs?

2 Answers

First, let's compare apples with apples: Reads and writes with MongoDB are like single reads and writes by primary key on a table with no non-clustered indexes in an RDBMS.

So let's benchmark exactly that: http://mysqlha.blogspot.de/2010/09/mysql-versus-mongodb-yet-another-silly.html

And it turns out, the speed difference in a fair comparison of exactly the same primitive operation is not big. In fact, MySQL is slightly faster. I'd say, they are equivalent.

Why? Because actually, both systems are doing similar things in this particular benchmark. Returning a single row, searched by primary key, is actually not that much work. It is a very fast operation. I suspect that cross-process communication overheads are a big part of it.

My guess is that MySQL's more heavily tuned code outweighs MongoDB's slightly lower systematic overhead (no logical locks and probably some other small things).

This leads to an interesting conclusion: You can use MySQL like a document database and get excellent performance out of it.
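
As a rough illustration of that idea, here is a minimal sketch of a relational table used as a document store: one row per document, fetched only by primary key. It assumes Python's built-in `sqlite3` as a stand-in for MySQL so that it runs without a server, and the table, column, and function names are made up for this example.

```python
import json
import sqlite3

# Assumption: sqlite3 stands in for MySQL so the sketch needs no server.
# The table and column names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE documents (id TEXT PRIMARY KEY, body TEXT NOT NULL)")


def put_document(doc_id, document):
    """Store a whole document as a single row, keyed by primary key."""
    # INSERT OR REPLACE is SQLite syntax; MySQL would use REPLACE INTO
    # or INSERT ... ON DUPLICATE KEY UPDATE instead.
    conn.execute(
        "INSERT OR REPLACE INTO documents (id, body) VALUES (?, ?)",
        (doc_id, json.dumps(document)),
    )
    conn.commit()


def get_document(doc_id):
    """Fetch one document by primary key: a single index lookup, no joins."""
    row = conn.execute(
        "SELECT body FROM documents WHERE id = ?", (doc_id,)
    ).fetchone()
    return json.loads(row[0]) if row is not None else None


put_document("user:1", {"name": "Ada", "tags": ["admin", "dev"]})
print(get_document("user:1"))
```

The document body is opaque to the database here, so queries on fields inside it would need extra indexes or columns. That is part of the feature trade-off discussed in this answer.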

If the interviewer said: "We don't care about documents or styles, we just need a much faster database, do you think we should use MySQL or MongoDB?", what would I answer?

I'd recommend disregarding performance for a moment and looking at the relative strengths of the two systems. Things like scaling (way up) and replication come to mind for MongoDB. MySQL has many more features, such as rich queries, concurrency models, better tooling, and maturity.

Basically, you can trade features for performance. Are you willing to do that? That choice cannot be made in general. If you opt for performance at any cost, consider tuning MySQL first before adding another technology.

Here is what happens when a client retrieves a single row/document by primary key. I'll annotate the differences between both systems:

Client builds a binary command (same)
Client sends it over TCP (same)
Server parses the command (same)
Server accesses query plan from cache (SQL only, not MongoDB, not HandlerSocket)
Server asks B-Tree component to access the row (same)
Server takes a physical readonly-lock on the B-Tree path leading to the row (same)
Server takes a logical lock on the row (SQL only, not MongoDB, not HandlerSocket)
Server serializes the row and sends it over TCP (same)
Client deserializes it (same)

There are only two additional steps for typical SQL-based RDBMSs. That's why there isn't really a difference.
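
The step comparison above can also be written down as data, which makes the "only two extra steps" claim easy to check. This is purely an illustrative model: the step names are copied from the list, and the structure and function names are invented for this sketch, not taken from any database.

```python
# Each step of a primary-key read, tagged with the systems that perform it.
# The tags mirror the annotations in the list above; nothing here is measured.
ALL = {"sql", "mongodb", "handlersocket"}
SQL_ONLY = {"sql"}

STEPS = [
    ("client builds a binary command", ALL),
    ("client sends it over TCP", ALL),
    ("server parses the command", ALL),
    ("server accesses query plan from cache", SQL_ONLY),
    ("server asks B-Tree component to access the row", ALL),
    ("server takes a physical readonly-lock on the B-Tree path", ALL),
    ("server takes a logical lock on the row", SQL_ONLY),
    ("server serializes the row and sends it over TCP", ALL),
    ("client deserializes it", ALL),
]


def steps_for(system):
    """Return the names of the steps that the given system performs."""
    return [name for name, systems in STEPS if system in systems]


def extra_steps(system, baseline):
    """Return the steps that `system` performs but `baseline` does not."""
    skipped = set(steps_for(baseline))
    return [name for name in steps_for(system) if name not in skipped]


for name in extra_steps("sql", "mongodb"):
    print("SQL only:", name)
```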

answered Oct 04 '22 15:10

usr

In general, MySQL and MongoDB are quite similar in "durable" write performance on a single machine. Simple key/value lookups are almost the same... if you want to use MySQL that way. Document support is, obviously, a big productivity benefit and a big win for performance.

With automatic sharding... MongoDB is faster in indescribable ways. Out of the box, with proper design, you can scale out almost linearly without building any logic into your code whatsoever.

Read/write splitting is also built into almost every driver, most of which are sponsored or developed by 10gen themselves.

I've scaled applications before and written read/write splitting code, distributed hashes for sharding, rebalancing jobs running continuously, and added gzip to MySQL "document" stores. Ugh.

It's faster because it's simple and focused. It's designed with all of this in mind. Scale on commodity hardware is a priority. The priorities of an RDBMS are quite different.
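
To give a feel for the kind of hand-written plumbing described above, here is a minimal sketch of hash-based shard routing combined with read/write splitting. It is not the code from any real application: the host names, the two-shard layout, and the function names are all hypothetical.

```python
import hashlib
import itertools

# Hypothetical cluster layout, for illustration only.
SHARDS = [
    {"primary": "db0-primary", "replicas": ["db0-replica-a", "db0-replica-b"]},
    {"primary": "db1-primary", "replicas": ["db1-replica-a", "db1-replica-b"]},
]

# One round-robin iterator per shard, used to spread reads across replicas.
_replica_cycles = [itertools.cycle(shard["replicas"]) for shard in SHARDS]


def shard_index(key):
    """Map a key to a shard with a stable hash (not Python's salted hash())."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % len(SHARDS)


def route(key, write):
    """Pick a host: writes go to the shard's primary, reads to a replica."""
    index = shard_index(key)
    if write:
        return SHARDS[index]["primary"]
    return next(_replica_cycles[index])


print(route("user:42", write=True))
print(route("user:42", write=False))
```

Because the shard is chosen with a plain modulo, changing the number of shards remaps most keys. That is where the continuously running rebalancing jobs come from.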

answered Oct 04 '22 14:10

pestilence669

Tags:

performance

sql

mongodb

nosql

Jackson Tale
