
Aggregate Count Array Members Matching Condition

Tags: mongodb, aggregation-framework

I'm having trouble, as the title says, counting elements in an array using MongoDB. I have a database with only one document, structured as follows:

{
  _id: ObjectId("abcdefghilmnopq"),
  "Array": [
    {
      field1: "val1",
      field2: "val2",
      field3: "val3",
      ...
    },
    {
      field1: "Value1",
      field2: "Value2",
      field3: "Value3",
      ...
    },
    ...
  ]
}

I want to count the elements of the array that satisfy a certain condition (e.g. all elements where field1 is "a"). I'm trying this code:

db.collection.aggregate([
  { $unwind: { path: "$Array", includeArrayIndex: "arrayIndex" } },
  { $match: { "Array.field1": "a" } },
  { $project: {
    _id: 0,
    Array: 1,
    arrayIndex: 1,
    total: { $size: "$Array" }
  }}
])

but I receive this error:

Command failed with error 17124: 'The argument to $size must be an array, but was of type: object' on server

I looked through several answers to this problem, but none of them resolved it. I mean, 'Array' IS an array!


asked Jun 01 '18 10:06

Andrea Cristiani

1 Answer

The error occurs because "Array" is no longer an array after you $unwind it, and is therefore no longer a valid argument to $size.
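To make the cause concrete, here is a rough plain-JavaScript analogue of what $unwind emits. This only mimics the stage in memory: the unwind helper and the sample values are made up for illustration and are not MongoDB API.

```javascript
// Hypothetical sample document shaped like the one in the question.
const doc = {
  _id: 1,
  Array: [
    { field1: "a", field2: "x" },
    { field1: "b", field2: "y" },
    { field1: "a", field2: "z" }
  ]
};

// Illustrative stand-in for { $unwind: "$Array" }: one output document per
// array element, with the "Array" field replaced by that single element.
function unwind(document, field) {
  return document[field].map(element => ({ ...document, [field]: element }));
}

const unwound = unwind(doc, "Array");

console.log(unwound.length);                  // 3
console.log(Array.isArray(unwound[0].Array)); // false
```

Each emitted document carries a single embedded object in "Array", which is exactly what $size is complaining about.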

You appear to be attempting to "merge" a couple of existing answers without understanding what they are doing. What you really want here is $filter and $size:

db.collection.aggregate([
  { "$project": {
    "total": {
      "$size": {
        "$filter": {
          "input": "$Array",
          "cond": { "$eq": [ "$$this.field1", "a" ] }
        }
      }
    }
  }}
])

Or "reinvent the wheel" using $reduce:

db.collection.aggregate([
  { "$project": {
    "total": {
      "$reduce": {
        "input": "$Array",
        "initialValue": 0,
        "in": {
          "$sum": [
            "$$value",
            { "$cond": [{ "$eq": [ "$$this.field1", "a" ] }, 1, 0] }
          ]
        }
      }
    }
  }}
])

Or, for what you were trying to do with $unwind, you would actually $group again in order to "count" how many matches there were:

db.collection.aggregate([
  { "$unwind": "$Array" },
  { "$match": { "Array.field1": "a" } },
  { "$group": {
    "_id": "$_id",
    "total": { "$sum": 1 }
  }}
])

The first two forms are the "optimal" ones for modern MongoDB environments. The final form with $unwind and $group is a "legacy" construct which has not really been necessary for this type of operation since MongoDB 2.6, although that release required some slightly different operators.

In those first two we are basically comparing the field1 value of each array element whilst it's still an array. Both $filter (available from MongoDB 3.2) and $reduce (available from MongoDB 3.4) are modern operators designed to work with an existing array in place. The same comparison is done in each one using the aggregation $eq operator, which returns a boolean value based on whether the arguments given are "equal" or not. In this case it compares the field1 value of each array member to the expected value of "a".

In the case of $filter, the array actually remains intact, except that any elements which did not meet the condition supplied in "cond" are removed from it. Since we still have an "array" as output, we can then use the $size operator to measure the number of array elements left after that filter condition was processed.
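As a rough mental model only (plain JavaScript on an in-memory object, not MongoDB API, with invented sample values), the $filter plus $size combination behaves much like Array.prototype.filter followed by .length:

```javascript
// Hypothetical sample document; the field values are assumptions.
const doc = {
  _id: 1,
  Array: [
    { field1: "a", field2: "x" },
    { field1: "b", field2: "y" },
    { field1: "a", field2: "z" }
  ]
};

// Analogue of { $filter: { input: "$Array", cond: { $eq: ["$$this.field1", "a"] } } }:
// the result is still an array, just without the non-matching elements.
const filtered = doc.Array.filter(element => element.field1 === "a");

// Analogue of $size applied to that filtered array.
const total = filtered.length;

console.log(total); // 2
```

Because the array is never taken apart, the document is processed in a single stage and a document with no matching elements still yields a total of 0.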

The $reduce, on the other hand, works through the array elements and applies an expression to each element together with a stored "accumulator" value, which we initialized with "initialValue". In this case the same $eq test is applied within the $cond operator. This is a "ternary" or if/then/else conditional operator, which evaluates an expression returning a boolean value and returns the then value when it is true or the else value when it is false.

In that expression we return 1 or 0 respectively, and use the $sum operator to add that returned value to the current "accumulator" "$$value". The result becomes the new accumulator value for the next element.
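The same accumulation can be sketched with Array.prototype.reduce in plain JavaScript. This is only an analogy for how "$$value" and "$$this" are threaded through, using invented sample data rather than anything MongoDB-specific:

```javascript
// Hypothetical sample document; the field values are assumptions.
const doc = {
  _id: 1,
  Array: [
    { field1: "a", field2: "x" },
    { field1: "b", field2: "y" },
    { field1: "a", field2: "z" }
  ]
};

// "value" plays the role of "$$value" (the accumulator) and "element" the
// role of "$$this". The ternary mirrors $cond, and the addition mirrors $sum.
const total = doc.Array.reduce(
  (value, element) => value + (element.field1 === "a" ? 1 : 0),
  0 // corresponds to "initialValue"
);

console.log(total); // 2
```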

The final form uses $unwind on the array. What this actually does is deconstruct the array to create a "new document" for every array member, along with its related parent fields from the original document. This effectively "copies" the main document for every array member.

Once you $unwind, the structure of the documents is changed to a "flatter" form. This is why you can then use the subsequent $match pipeline stage to remove the un-matched documents.

This brings us to $group which is applied to "bring back together" all of the information related to a common key. In this case it's the _id field of the original document, which was of course copied into every document produced by the $unwind. As we go back to this "common key" as a single document, we can "count" the remaining "documents" extracted from the array using the $sum accumulator.
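The three stages can be approximated in plain JavaScript as follows. Treat this as a sketch of the data flow only: the two sample documents are invented, and the in-memory Map stands in for what $group does on the server.

```javascript
// Hypothetical collection contents; document 2 has no element with field1 "a".
const docs = [
  { _id: 1, Array: [{ field1: "a" }, { field1: "b" }, { field1: "a" }] },
  { _id: 2, Array: [{ field1: "b" }] }
];

// $unwind: one document per array element, parent fields copied along.
const unwound = docs.flatMap(d => d.Array.map(element => ({ ...d, Array: element })));

// $match: keep only the documents whose (now single) element matches.
const matched = unwound.filter(d => d.Array.field1 === "a");

// $group with { _id: "$_id", total: { $sum: 1 } }: count documents per key.
const totals = new Map();
for (const d of matched) {
  totals.set(d._id, (totals.get(d._id) || 0) + 1);
}
const result = [...totals].map(([_id, total]) => ({ _id, total }));

console.log(result); // [ { _id: 1, total: 2 } ]
```

Note that document 2 disappears from the output entirely, because nothing is left to group once $match has removed all of its elements. The $filter and $reduce forms would instead report a total of 0 for it.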

If we wanted the remaining "array" back, we could $push and rebuild the array with only the remaining members:

  { "$group": {
    "_id": "$_id",
    "Array": { "$push": "$Array" },
    "total": { "$sum": 1 }
  }}

But of course, instead of using $size in another pipeline stage, we can simply "count" as we already did with $sum.


answered Oct 02 '22 20:10

Neil Lunn
