I have a database of students and their contact details. I'm trying to find out the postcode that houses the most students. The documents for the students look something like this...
{studentcode: 'smi0001', firstname: 'bob', surname: 'smith', postcode: 2001}
I thought I could use the aggregation framework to find out the postcode with the most students by doing something like...
db.students.aggregate({$project: { postcode: 1 }, $group: {_id: '$postcode', students: {$sum: 1}}})
this works as expected (returns postcodes as _id
and the number of students in each postcode as 'students', but if I add $sort
to the pipeline it seems to try sorting by the whole student collection instead of the results of the $group
operation.
what I'm trying look like...
db.students.aggregate({$project: { postcode: 1 }, $group: {_id: '$postcode', students: {$sum: 1}}, $sort: {_id: -1}})
but it returns the whole collection and disregards the $project
and $group
... Am I missing something? I thought I'd just be able to sort by descending number of students and return the first item. Thanks in advance for any help.
In MongoDB, the $sort stage is used to sort all the documents in the aggregation pipeline and pass a sorted order to the next stage of the pipeline. Lets take a closer look at the above syntax: The $sort stage accepts a document that defines the field or fields that will be used for sorting.
To sort the whole array by value, or to sort by array elements that are not documents, identify the input array and specify 1 for an ascending sort or -1 for descending sort in the sortBy parameter.
To sort documents in MongoDB, you need to use sort() method. The method accepts a document containing a list of fields along with their sorting order. To specify sorting order 1 and -1 are used. 1 is used for ascending order while -1 is used for descending order.
You almost had it...
db.test.aggregate( {$group: {_id: '$postcode', students: {$sum: 1}}}, {$sort: {_id: -1}} );
gives (I added some test data matching your sample):
{ "result" : [ { "_id" : 2003, "students" : 3 }, { "_id" : 2002, "students" : 1 }, { "_id" : 2001, "students" : 2 } ], "ok" : 1 }
You had an outer {}
around everything, which was causing some confusion. The group and sort weren't working as separate operations in the pipeline.
You didn't really need the project for this case.
Update You probably want to sort by "students", like so, to get the biggest zipcodes (by population) first:
db.test.aggregate( {$group: {_id: '$postcode', students: {$sum: 1}}}, {$sort: {students: -1}} );
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With