Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How do I iterate over an entire MongoDB collection using mongojs?

I'm using mongojs and I'm trying to iterate over all elements in a collection

index = 0

db.keys.find({}, {uid: 1, _id: 0}).forEach((err, key) =>
    if err?
        console.log err
    else 
        console.log (++index) + " key: " + key_uid

which logs

1 key: bB0KN
2 key: LOtOL
3 key: 51xJM
4 key: x9wFP
5 key: hcJKP
6 key: QZxnE
.
.
.
96 key: EeW6E
97 key: wqfmM
98 key: LIGHK
99 key: bjWTI
100 key: 2zNGE
101 key: F71mL

and then stops. However when I log into mongo from the terminal and run

> db.keys.count()
2317381

So clearly it should be returning a lot more keys. Do you have any ideas what could be causing this behavior?

like image 762
Loourr Avatar asked Jun 13 '14 23:06

Loourr


People also ask

How do I get an entire collection in MongoDB?

To obtain a list of MongoDB collections, we need to use the Mongo shell command show collections . This command will return all collections created within a MongoDB database.

How do I run a loop in MongoDB?

Overview of forEach Loop in mongoDB Using of forEach loop in mongoDB is straight forward, if you know javascript . By using forEach loop you can manipulate documents on your collection at any level and at any size of data. you can run javascript code at client side of MongoDB and server side of MongoDB both.

How do I clone a collection in MongoDB?

Right-click on collection1 collection in DB Explorer and select Duplicate 'collection1' Collection... item in the popup menu. Specify destination collection name, duplication parameters and click Duplicate.

What does $and do in MongoDB?

$and performs a logical AND operation on an array of one or more expressions ( <expression1> , <expression2> , and so on) and selects the documents that satisfy all the expressions.


2 Answers

You need to use the each() method, not forEach(). forEach() is going to iterate over every document in the batch - as you've discovered this defaults to 101. each() will iterate over every document in the cursor. From the documentation:

each

Iterates over all the documents for this cursor. As with {cursor.toArray}, not all of the elements will be iterated if this cursor had been previouly accessed. In that case, {cursor.rewind} can be used to reset the cursor. However, unlike {cursor.toArray}, the cursor will only hold a maximum of batch size elements at any given time if batch size is specified. Otherwise, the caller is responsible for making sure that the entire result can fit the memory.

http://mongodb.github.io/node-mongodb-native/api-generated/cursor.html

Example code:

// Grab a cursor
      var cursor = collection.find();

      // Execute the each command, triggers for each document
      cursor.each(function(err, item) {

        // If the item is null then the cursor is exhausted/empty and closed
        if(item == null) {

          // Show that the cursor is closed
          cursor.toArray(function(err, items) {
            assert.ok(err != null);

            // Let's close the db
            db.close();
          });
        };
      });
like image 138
John Petrone Avatar answered Sep 23 '22 02:09

John Petrone


You're seeing only first 101 documents because that's the default number of documents MongoDB driver fetched from the server in the first batch.

For most queries, the first batch returns 101 documents or just enough documents to exceed 1 megabyte. Subsequent batch size is 4 megabytes.

You can try to use find and then iterate over documents.

coll.find({}, {uid:1, _id : 0}, function(err, docs){
    if (err) {
        console.log(err);
        return;
    }
    docs.forEach(function(doc, index) { 
        console.log(index + " key: " + doc.uid) 
    });
});
like image 24
Christian P Avatar answered Sep 24 '22 02:09

Christian P