Count how many documents contain a field

Tags:

mongodb

mongodb-query

aggregation-framework

I have these three MongoDB documents:

{ 
    "_id" : ObjectId("571094afc2bcfe430ddd0815"), 
    "name" : "Barry", 
    "surname" : "Allen", 
    "address" : [
        {
            "street" : "Red", 
            "number" : NumberInt(66), 
            "city" : "Central City"
        }, 
        {
            "street" : "Yellow", 
            "number" : NumberInt(7), 
            "city" : "Gotham City"
        }
    ]
}

{ 
    "_id" : ObjectId("57109504c2bcfe430ddd0816"), 
    "name" : "Oliver", 
    "surname" : "Queen", 
    "address" : {
        "street" : "Green", 
        "number" : NumberInt(66), 
        "city" : "Star City"
    }
}
{ 
    "_id" : ObjectId("5710953ac2bcfe430ddd0817"), 
    "name" : "Tudof", 
    "surname" : "Unknown", 
    "address" : "homeless"
}

The address field is an Array of Objects in the first document, an Object in the second and a String in the third. My goal is to find how many documents in my collection contain the field address.street. In this case the right count is 1, but my query returns 2:

db.coll.find({"address.street":{"$exists":1}}).count()

I also tried map/reduce. It works, but it is slower, so I would like to avoid it if possible.

asked Apr 15 '16 11:04

DistribuzioneGaussiana

2 Answers

The distinction here is that the .count() operation is actually "correct": it returns the number of "documents" where the field is present, and dot notation also matches a field found inside array elements. So the general considerations break down as follows:

If you just want to exclude the documents with the array field

The most effective way to exclude the documents where "street" is a property of elements in an "address" array is to use dot notation and require that the 0 index does not exist:

db.coll.find({
  "address.street": { "$exists": true },
  "address.0": { "$exists": false }
}).count()

Because $exists is a natively coded operator, it does the correct job in both conditions, and does it efficiently.
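To see why the plain query returns 2 and the extra condition brings it down to 1, here is a minimal sketch in plain JavaScript (no MongoDB driver). The helpers `hasAddressStreet` and `lacksAddressZero` are hypothetical names that only approximate how the two `$exists` conditions behave on the sample documents; they are not how the server implements matching.

```javascript
// Sample data mirroring the three documents from the question (ids omitted).
const docs = [
  { name: "Barry", address: [
    { street: "Red", number: 66, city: "Central City" },
    { street: "Yellow", number: 7, city: "Gotham City" }
  ] },
  { name: "Oliver", address: { street: "Green", number: 66, city: "Star City" } },
  { name: "Tudof", address: "homeless" }
];

const isPlainObject = (v) => v !== null && typeof v === "object" && !Array.isArray(v);

// Hypothetical helper: approximates { "address.street": { $exists: true } }.
// Dot notation looks inside embedded documents and inside array elements.
function hasAddressStreet(doc) {
  const a = doc.address;
  if (Array.isArray(a)) return a.some((el) => isPlainObject(el) && "street" in el);
  return isPlainObject(a) && "street" in a;
}

// Hypothetical helper: approximates { "address.0": { $exists: false } }.
// For an array, "address.0" is its first element; for an embedded
// document it would be a key literally named "0".
function lacksAddressZero(doc) {
  const a = doc.address;
  if (Array.isArray(a)) return a.length === 0;
  return !(isPlainObject(a) && "0" in a);
}

const plainCount = docs.filter(hasAddressStreet).length;
const filteredCount = docs.filter((d) => hasAddressStreet(d) && lacksAddressZero(d)).length;

console.log(plainCount, filteredCount); // 2 1
```

Only the second document survives both conditions, which matches the count of 1 the question expects.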

If you intended to count field occurrences

Perhaps what you are actually asking for is the "field count", where some "documents" contain array entries in which that "field" may be present several times.

For that you need the aggregation framework or mapReduce like you mention. MapReduce uses JavaScript based processing and is therefore going to be considerably slower than the .count() operation. The aggregation framework also needs to calculate and "will" be slower than .count(), but not by as much as mapReduce.

In MongoDB 3.2 you get some help here from the expanded ability of $sum to work on an array of values as well as being a grouping accumulator. The other helper here is $isArray, which allows a different processing method via $map when the data is in fact "an array":

db.coll.aggregate([
  { "$group": {
    "_id": null,
    "count": {
      "$sum": {
        "$sum": {
          "$cond": {
            "if": { "$isArray": "$address" },
            "then": {
              "$map": {
                "input": "$address",
                "as": "el",
                "in": {
                  "$cond": {
                    "if": { "$ifNull": [ "$$el.street", false ] },
                    "then": 1,
                    "else": 0
                  }
                }
              }
            },
            "else": {
              "$cond": {
                "if": { "$ifNull": [ "$address.street", false ] },
                "then": 1,
                "else": 0
              }
            }
          }
        }
      }
    }
  }}
])
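The logic of that pipeline can be sketched in plain JavaScript to check what it should return for the sample data. This is only an illustration under assumptions: `streetFlag` is a hypothetical helper standing in for the `$cond`/`$ifNull` pair, and it treats only null or missing values as absent.

```javascript
// Sample data mirroring the three documents from the question (ids omitted).
const docs = [
  { name: "Barry", address: [
    { street: "Red", number: 66, city: "Central City" },
    { street: "Yellow", number: 7, city: "Gotham City" }
  ] },
  { name: "Oliver", address: { street: "Green", number: 66, city: "Star City" } },
  { name: "Tudof", address: "homeless" }
];

// Hypothetical helper: 1 when the value is an object whose "street"
// is neither null nor missing, otherwise 0.
const streetFlag = (v) =>
  v !== null && typeof v === "object" && v.street != null ? 1 : 0;

// Inner sum: for an array, add the flags of every element (the $map branch);
// otherwise use the single flag (the "else" branch).
const perDocument = docs.map((doc) =>
  Array.isArray(doc.address)
    ? doc.address.reduce((sum, el) => sum + streetFlag(el), 0)
    : streetFlag(doc.address)
);

// Outer sum: the grouping accumulator over all documents.
const total = perDocument.reduce((sum, n) => sum + n, 0);

console.log(perDocument, total); // per document: 2, 1, 0; total: 3
```

For the three sample documents the field occurs three times in total, as opposed to the two documents that contain it.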

Earlier versions hinge on a bit more conditional processing in order to treat the array and non-array data differently, and generally require $unwind to process array entries.

Either transposing the array via $map with MongoDB 2.6:

db.coll.aggregate([
  { "$project": {
    "address": {
      "$cond": {
        "if": { "$ifNull": [ "$address.0", false ] },
        "then": "$address",
        "else": {
          "$map": {
            "input": ["A"],
            "as": "el",
            "in": "$address"
          }
        }
      }
    }
  }},
  { "$unwind": "$address" },
  { "$group": {
    "_id": null,
    "count": {
      "$sum": {
        "$cond": {
          "if": { "$ifNull": [ "$address.street", false ] },
          "then": 1,
          "else": 0
        }
      }
    }
  }}
])
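The idea behind this variant is to make every "address" an array first, so that a single $unwind can treat all documents the same way. A rough plain JavaScript sketch of the three stages, assuming the sample data from the question; note it uses `Array.isArray` for the array check, whereas the pipeline tests "$address.0" with $ifNull:

```javascript
// Sample data mirroring the three documents from the question (ids omitted).
const docs = [
  { name: "Barry", address: [
    { street: "Red", number: 66, city: "Central City" },
    { street: "Yellow", number: 7, city: "Gotham City" }
  ] },
  { name: "Oliver", address: { street: "Green", number: 66, city: "Star City" } },
  { name: "Tudof", address: "homeless" }
];

// Stage 1 ($project): keep arrays as they are, wrap anything else
// in a one-element array (what the $map over ["A"] achieves).
const normalized = docs.map((doc) =>
  Array.isArray(doc.address) ? doc.address : [doc.address]
);

// Stage 2 ($unwind): one entry per address value.
const unwound = normalized.flat();

// Stage 3 ($group): count the entries that carry a non-null "street".
const count = unwound.filter(
  (a) => a !== null && typeof a === "object" && a.street != null
).length;

console.log(unwound.length, count); // 4 3
```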

Or providing conditional selection with MongoDB 2.2 or 2.4:

db.coll.aggregate([
  { "$group": {
    "_id": "$_id",
    "address": { 
      "$first": {
        "$cond": [
          { "$ifNull": [ "$address.0", false ] },
          "$address",
          { "$const": [null] }
        ]
      }
    },
    "other": {
      "$push": {
        "$cond": [
          { "$ifNull": [ "$address.0", false ] },
          null,
          "$address"
        ]
      }
    },
    "has": { 
      "$first": {
        "$cond": [
          { "$ifNull": [ "$address.0", false ] },
          1,
          0
        ]
      }
    }
  }},
  { "$unwind": "$address" },
  { "$unwind": "$other" },
  { "$group": {
    "_id": null,
    "count": {
      "$sum": {
        "$cond": [
          { "$eq": [ "$has", 1 ] },
          { "$cond": [
            { "$ifNull": [ "$address.street", false ] },
            1,
            0
          ]},
          { "$cond": [
            { "$ifNull": [ "$other.street", false ] },
            1,
            0
          ]}
        ]
      }
    }
  }}
])

So the latter form "should" perform a bit better than mapReduce, but probably not by much.

In all cases the logic falls to using $ifNull as the "logical" form of $exists for the aggregation framework. Paired with $cond, a "truthy" result is obtained when the property actually exists, and a false value is returned when it does not. This determines whether 1 or 0 respectively is returned to the overall accumulation via $sum.
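As a small illustration of that pairing, the sketch below models the two operators in plain JavaScript. The names `ifNull`, `isTrueInAggregation` and `existsFlag` are hypothetical stand-ins, written on the assumption that $cond treats null, missing, 0 and false as false and everything else as true.

```javascript
// Hypothetical stand-in for $ifNull: return the value unless it is null or missing.
const ifNull = (value, fallback) =>
  value === null || value === undefined ? fallback : value;

// Hypothetical stand-in for the boolean test in $cond:
// null, missing, 0 and false count as false; anything else counts as true.
const isTrueInAggregation = (v) =>
  !(v === null || v === undefined || v === 0 || v === false);

// The combined "exists" check: 1 when the property is considered present, else 0.
const existsFlag = (value) => (isTrueInAggregation(ifNull(value, false)) ? 1 : 0);

console.log(existsFlag("Red"));      // 1
console.log(existsFlag(undefined));  // 0
console.log(existsFlag(null));       // 0
console.log(existsFlag(0));          // 0
```

One consequence under these assumptions: a field that exists but holds 0, false or null would be counted as absent, so this is only a stand-in for $exists when the field holds other values, as "street" does here.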

Ideally you have the modern version that can do this in a single $group pipeline stage, but otherwise you need the longer path.

answered Oct 07 '22 19:10

Blakes Seven

Can you try this:

db.getCollection('collection_name').find({
        "address.street":{"$exists":1},
        "$where": "Array.isArray(this.address) == false && typeof this.address === 'object'"
});

The $where clause excludes documents where address is an array and includes those where its type is object.
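The expression inside $where is ordinary JavaScript, so it can be tried outside the database. A minimal sketch, assuming the sample documents from the question and a hypothetical helper `isEmbeddedObject` that takes the document as a parameter instead of `this`:

```javascript
// Sample data mirroring the three documents from the question (ids omitted).
const docs = [
  { name: "Barry", address: [
    { street: "Red", number: 66, city: "Central City" },
    { street: "Yellow", number: 7, city: "Gotham City" }
  ] },
  { name: "Oliver", address: { street: "Green", number: 66, city: "Star City" } },
  { name: "Tudof", address: "homeless" }
];

// Same test as the $where string, with `this` replaced by a parameter.
const isEmbeddedObject = (doc) =>
  Array.isArray(doc.address) == false && typeof doc.address === "object";

// typeof null is also "object", so null is ruled out before checking for "street"
// (in the query, the "address.street" $exists condition takes care of that).
const matches = docs.filter(
  (d) => isEmbeddedObject(d) && d.address !== null && "street" in d.address
);

console.log(matches.map((d) => d.name)); // [ 'Oliver' ]
```

Keep in mind that $where evaluates JavaScript for each document, so it is likely to be slower than a query built only from native operators such as $exists.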

answered Oct 07 '22 18:10

Hiren S.
