When to start MongoDB sharding

Tags:

At the moment we run a MongoDB Replicaset containing 2 Servers + 1 Arbiter.

And we store about 150 GB of data in the databases on the replicaset.

Right now we are thinking about when to start with sharding. Because we are wondering if there is a point where you can't start sharding anymore.

It is obvious that we would have to start sharding before we run out of hard disk space, our cpu is overloaded or the overall performance goes down because of too little RAM.

Somebody also told me that there is a limit of 256 GB data size after which you can't start sharding anymore. Also I read the official documentation http://docs.mongodb.org/manual/sharding/ and "MongoDB the definitive guide", I could not proove that.

From your experience is there a limit where you should have started with sharding ?

865

asked Jul 23 '13 12:07

Dukeatcoding

2 Answers

I would start sharding when you hit about 60-70% resource utilisation. This could be both hard disk space and RAM. The 256 GB limit is indeed there, it's documented at http://docs.mongodb.org/manual/reference/limits/#Sharding%20Existing%20Collection%20Data%20Size

176

answered Oct 05 '22 02:10

Derick

I have found the limit to be based on reads/writes; afterall sharding is about increasing capacity, mainly writes, while replica sets being more concerned with reads. However, using separate servers (nodes) for ranges of data (shard key) can help reads too so it does have a knock on effect for both.

For example you could be only using 40% of your current servers memory with your current working set but due to the amount of writes being sent to that single server you could actually be seeing speed problems due to IO. At this time you would take sharding into account.

So really I would personally say, and this question is heavily opinion based, that you should shard when you feel as though you need more capacity for operations than is cost effective for a single replica set.

I have known of single replica setups that can take what, normally, an entire cluster would but it depends on how big your budget is. As a computer gets bigger it gets more expensive.

answered Oct 05 '22 02:10

Sammaye

Related questions
                            
                                Using variables in MongoDB update statement
                            
                                How to configure a replica set with MongoDB
                            
                                i/o timeout with mgo and mongodb
                            
                                MongoDB search to return only specific fields
                            
                                How to get rid of cursor id error in mongodb?
                            
                                mongoDB : Creating An ObjectId For Each New Child Added To The Array Field
                            
                                MongoDB limit find results
                            
                                Python OOP: how to share a MongoDB connection with all classes
                            
                                how can run mongoose query in forEach loop
                            
                                passing arguments to docker exec from bash prompt and script
                            
                                Mongoose update an item from array by id
                            
                                Convert string array into object Id array
                            
                                MongoDB Compass - stuck on connecting to Atlas
                            
                                Mongoose embedded document updating
                            
                                What the overhead of Java ORM for MongoDB
                            
                                Extremely high QPS - DynamoDB vs MongoDB vs other noSQL?
                            
                                insert current time into mongo using pymongo
                            
                                Mongoose: Cast to ObjectId failed for value
                            
                                Mongodb upsert throwing DuplicateKeyException
                            
                                MongoDB Aggregation: How do I recombine a date using $project?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

When to start MongoDB sharding

Tags:

database

mongodb

sharding

Dukeatcoding

People also ask

2 Answers

Derick

Sammaye

Recent Activity

Donate For Us