Efficiently sorting the results of a mongodb geospatial query

Question

I have a very large collection of documents like:

{ loc: [10.32, 24.34], relevance: 0.434 }

and want to be able to efficiently run a query like:

 { "loc": {"$geoWithin":{"$box":[[-103,10.1],[-80.43,30.232]]}} }

with arbitrary boxes.

Adding a 2d index on loc makes this very fast and efficient. However, I now also want to get only the most relevant documents:

.sort({ relevance: -1 })

This slows everything to a crawl: any particular box can contain a huge number of results, and I only need the top 10 or so.

Any advice or help is greatly appreciated!

Sean Reilly · Accepted Answer

Have you tried using the aggregation framework?

A two-stage pipeline might work:

a $match stage that uses your existing $geoWithin query.
a $sort stage that sorts by relevance: -1

Here's an example of what it might look like:

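db.foo.aggregate([
    // Stage 1: keep only documents inside the box (the same $geoWithin query as in the question)
    {$match: { "loc": {"$geoWithin":{"$box":[[-103,10.1],[-80.43,30.232]]}} }},
    // Stage 2: order the matches by relevance, highest first
    {$sort: {relevance: -1}}
]);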

I'm not sure how it will perform. However, even if it's poor with MongoDB 2.4, it might be dramatically different in 2.6/2.5, as 2.6 will include improved aggregation sort performance.
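
Since the question only needs the top 10 or so, a $limit stage placed directly after the $sort may help, because the server then only has to keep track of the best few documents rather than fully sorting every match. The sketch below is an illustration under assumptions, not a measured solution: the helper name topRelevantInBox and its parameters are hypothetical, and foo is just the collection name used in the example above.

```javascript
// Hypothetical helper (not part of MongoDB): builds an aggregation pipeline
// that returns the n most relevant documents inside a bounding box.
function topRelevantInBox(bottomLeft, upperRight, n) {
  return [
    // $match comes first so that an index on loc can be used for the box query
    { $match: { loc: { $geoWithin: { $box: [bottomLeft, upperRight] } } } },
    // Highest relevance first
    { $sort: { relevance: -1 } },
    // Keep only the first n documents of the sorted result
    { $limit: n }
  ];
}

// Same box as in the question, top 10 results
const pipeline = topRelevantInBox([-103, 10.1], [-80.43, 30.232], 10);

// In the mongo shell this would be run as:
//   db.foo.aggregate(pipeline)
```

Whether this is fast enough still depends on how many documents fall inside the box, so it is worth timing it against real data.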

Tags:

mongodb

Heptic
