CouchDB view is extremely slow

Tags:

performance

couchdb

I have a CouchDB (v0.10.0) database that is 8.2 GB in size and contains 3890000 documents.

The view has the following map function:

function(doc) { emit([doc.Status], doc); }

Querying it takes forever (4 hours and still no result).

Here's some extra information that might help describe the situation:

The view is not a temp view. It was defined before the 3890000 documents were inserted.
Nothing else is running on the server. It is an Ubuntu box with nothing but the defaults installed.
The CPU is working hard (sometimes spiking to 100%). Memory usage fluctuates but does not grow.

So my question is:

What is actually happening in the background?
Is this a "one time" thing, where I have to wait once and it will work from then on?


asked Oct 11 '10 20:10

Chi Chan

1 Answer

Don't emit the whole doc. It's unnecessary. You can instead run your query with include_docs=true, which will let you access the document via each row's doc attribute.

When you emit the whole doc you make the index as large or larger than your entire database. :)
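As a rough illustration of the advice above, here is a minimal sketch of a leaner map function that emits null instead of the document, run against a stand-in for CouchDB's emit() so it works outside the server. The sample documents, the database name mydb, the design document app, and the view name by_status are all hypothetical; only doc.Status and include_docs=true come from the discussion above.

```javascript
// Stand-in for CouchDB's emit(), so the map function can run outside the server.
const rows = [];
function emit(key, value) {
  rows.push({ key: key, value: value });
}

// Leaner map function: same key as the original view, but the value is null,
// so the index stores only keys and document ids, not a copy of every document.
// (Inside a design document this would be the anonymous form: function(doc) { ... })
function map(doc) {
  emit([doc.Status], null);
}

// Hypothetical sample documents, for illustration only.
const docs = [
  { _id: "a", Status: "open", payload: "x".repeat(1000) },
  { _id: "b", Status: "closed", payload: "y".repeat(1000) }
];
docs.forEach(map);

// Query path that asks CouchDB to attach each document to its row.
// "mydb", "app", and "by_status" are assumed names, not taken from the question.
const queryPath =
  "/mydb/_design/app/_view/by_status?key=" +
  encodeURIComponent(JSON.stringify(["open"])) +
  "&include_docs=true";

console.log(rows);
console.log(queryPath);
```

With include_docs=true, each row in the response carries the full document in its doc attribute, so the client gets the same data without it being duplicated in the index.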


answered Sep 23 '22 23:09

mikeal

