MongoDB sharded cluster 25 slower than standalone node

Tags:

I'm confused by the situation and trying to fix this for a couple of days now. I'm running 3 shard on top of three 3-members replica sets (rs0, rs1 and rs2). All is working so far. Data is distributed over the 3 shards as well as cloned within the replica sets.

BUT: importing data into one of the replica set works fine with constantly 40k docs/s but by enabling sharding slows the entire process down to just 1.5k docs/s.

I've populated the data via different methods:

generated some random data in the mongo shell (running in my mongos)
JSON data import via mongoimport
MongoDB dump restore from another server via mongorestore

All of them result in just 1.5k doc/s which is disappointing. The mongod's are physical Xeon boxes with 32GB each, the 3 config servers are virtual servers (40 GB HDD, 2 GB RAM, if that matters), the mongos is running on my app server. By the way, the value of 1.5k inserts/s doesn't depend on the shard key, same behaviour for a dedicated shard key (single field key as well as compound key) as well as hashed shard key on _id field.

I tried a lot, even reinstalled the entire cluster twice. The question is: what is the bottleneck in this setup:

config servers running on virtual server? -> shouldn't be problematic due to the low resource consumption of config servers
mongos? -> running multiple Mongos on a dedicated box behind HAproxy might be an alternative, haven't tested that yet

830

asked Feb 04 '14 14:02

ctp

2 Answers

Let's do the math first: how big are your documents? Keep in mind that they have to be transferred over the net multiple times depending on your write concern.

May be you are experiencing this because of the indices which have to be build.

Please try this:

Disable all indices except the one for _id (which is not possible anyway, iirc)
Load your data
Reenable indices.
Enable sharding and balancing if not done already

This is the suggested way for importing data into a shared cluster anyway and should speed up your import considerably. Some (cautious !) fiddling with storage.syncPeriodSecs and storage.journal.commitIntervalMs might help, too.

The delay can occur even when storing the data on the primary shard. Depending on the size of your indices, they may slow down bulk operations considerably. You might also want to have a look at the replication.secondaryIndexPrefetch config option.

Another thing might be that your oplog simply gets filled faster than the replication can take place. Problem here: once it is created, you can not increase it's size. I am not sure wether it is safe to delete and recreate it in standalone mode and then reshare the replica set, but I doubt it. So the safe option would be to have the instance actually leave the replica set, reinstall it with a more appropriate oplog size and add the instance to the replica set as if it were the first time. If you don't care for the data, simply shut the replica set down, adjust the oplog size in the config file, delete the data dir and restart and reinitialize the replica set. Thinking of your problem twice, this sounds like the best bet to me, since the opllog isn't involved in standalone mode, iirc.

If you still have the same performance issues, my bet is on problems with disk or network IO.

You have a fairly standard setup, your mongos instance is running on a different machine than your mongod (be it a standalone or the primary of a replica set). You might want to check a few things:

Name resolution latency for resolving the name of your primary and secondary shards from the machine running your mongos instance. I can not count the times installing nscd improved performance for various operations.
network latency from your mongos instance to your primary shard. Assuming you have a firewall between your AppServer and your cluster, you might want to talk to the respective administrator.
In case you are using external authentication, try to measure how long it takes.
When using some sort of tunneling (e.g. stunnel or encryption like SSL/TLS), make sure you disable name resolution. Please keep in mind that encrypting and decrypting may take a relatively long time.
Measure random disk IO on the mongod instances

106

answered Oct 13 '22 11:10

Markus W Mahlberg

I was facing a similar performance issue. What helped to solve the performance issue was I ended up setting the mongod instance that was running on the same host as the mongos as the primary shard.

using the following command:

mongos> use admin
mongos> db.runCommand( { movePrimary: "mydb", to: "shard0003" } )

After making this change (without touching the load balancer or tweaking anything else), I was able to load a relatively large dataset (25 million rows) using a loader I had written, and the entire procedure took about 15 minutes instead of hours/days.

answered Oct 13 '22 09:10

user3892260

Related questions
                            
                                foreach %dopar% slower than for loop [duplicate]
                            
                                Execute more or complicated SQL queries, or use PHP to filter data?
                            
                                Max achievable polling frequency using Bluetooth LE GATT profile?
                            
                                d3.js : orthographic rotation optimization
                            
                                How to do nothing in an SQL case statement?
                            
                                Does a IEnumerable store objects after use
                            
                                Overhead involved in nested sequences in F#
                            
                                measuring runtime of bash script with & parameter in the body
                            
                                VBA, File System Object, speed/advantages/disadvantages
                            
                                Unexpected slowdown of function that modifies array in-place
                            
                                Is Java's LinkedList optimized to do get(index) in reverse when necessary?
                            
                                HashSet or Distinct to read distinct values of property in List<> of objects
                            
                                Java fastest way to get matching range
                            
                                Get cumulative value from a List of Map<String, Integer>
                            
                                How to improve the speed of InnoDB writes per second of MySQL DB
                            
                                Turn off ALL optimization by Dalvik VM
                            
                                select non-duplicated records
                            
                                does unrolling loops in x86-64 actually make code faster?
                            
                                Adding collections/entities makes Form Rendering terrible slow
                            
                                Fastest way to sort a python 3.7+ dictionary

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

MongoDB sharded cluster 25 slower than standalone node

Tags:

performance

mongodb

replication

sharding

ctp

People also ask

2 Answers

Markus W Mahlberg

user3892260

Recent Activity

Donate For Us