Limit ElasticSearch aggregation to top n query results

Tags: search, aggregation, elasticsearch

I have a set of 2.8 million docs with sets of tags that I'm querying with ElasticSearch, but many of these docs can be grouped together by one ID. I want to query my data using the tags, and then aggregate them by the ID that repeats. Often my search results have tens of thousands of documents, but I only want to aggregate the top 100 results of the search. How can I constrain an aggregation to only the top 100 results from a query?

asked Mar 06 '15 09:03

Patrick Pan

1 Answer

Sampler aggregation:

A filtering aggregation used to limit any sub aggregations' processing to a sample of the top-scoring documents.

    "aggs": {
        "bestDocs": {
            "sampler": {
                "shard_size": 100
            },
            "aggs": {
                "bestBuckets": {
                    "terms": {
                        "field": "id"
                    }
                }
            }
        }
    }

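This limits the sub-aggregation to the 100 top-scoring documents on each shard and then buckets them by ID. Note that shard_size is applied per shard, so the sample is exactly the top 100 results only when the index has a single shard; with several shards, up to 100 documents are sampled from each one.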
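For context, the aggregation fragment above sits inside a search request body next to the query. Here is a minimal sketch in Python that only builds and prints that body (no cluster is contacted). It assumes the tags live in a field named `tags` and the grouping key is `id`; `build_top_n_request` and the example tag values are hypothetical names used for illustration.

```python
import json


def build_top_n_request(tags, shard_size=100, id_field="id", tags_field="tags"):
    """Build a search body whose terms aggregation only sees the sampled top hits.

    Field names are assumptions; adjust them to the actual mapping.
    """
    return {
        "size": 0,  # return only aggregation results, no individual hits
        "query": {
            "bool": {
                # one match clause per tag, so documents are scored and ranked
                "should": [{"match": {tags_field: tag}} for tag in tags]
            }
        },
        "aggs": {
            "bestDocs": {
                # keep only the top-scoring documents (per shard) for sub-aggregations
                "sampler": {"shard_size": shard_size},
                "aggs": {
                    "bestBuckets": {"terms": {"field": id_field}}
                },
            }
        },
    }


body = build_top_n_request(["python", "search"])
print(json.dumps(body, indent=2))
```

The printed JSON can be sent as the body of a `_search` request. Because `shard_size` applies per shard, an index with several primary shards may contribute up to that many documents from each shard.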

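Optionally, you can use the field or script setting together with max_docs_per_value to cap how many of the sampled documents on any one shard share a common value. In more recent Elasticsearch versions these diversity settings belong to the separate diversified_sampler aggregation rather than to sampler itself.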
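A sketch of the diversity settings, again only building the JSON in Python. It assumes an Elasticsearch version that provides the `diversified_sampler` aggregation; `build_diversified_aggs` and the `source` field are hypothetical and stand in for whatever field should be de-duplicated on.

```python
import json


def build_diversified_aggs(diversity_field, shard_size=100,
                           max_docs_per_value=3, id_field="id"):
    """Build an aggs section that caps how many sampled docs share one value.

    `diversity_field` is an assumed field name; it is not part of the question.
    """
    return {
        "aggs": {
            "bestDocs": {
                "diversified_sampler": {
                    "shard_size": shard_size,
                    "field": diversity_field,  # value used to limit repeats
                    "max_docs_per_value": max_docs_per_value,
                },
                "aggs": {
                    "bestBuckets": {"terms": {"field": id_field}}
                },
            }
        }
    }


aggs = build_diversified_aggs("source")  # "source" is a hypothetical field
print(json.dumps(aggs, indent=2))
```

With these settings, each shard's sample holds at most `max_docs_per_value` documents for any single value of the chosen field, which keeps one dominant value from filling the whole sample.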

answered Oct 15 '22 08:10

Rahul
