ElasticSearch: aggregation on _score field?

Tags:

elasticsearch

I would like to use the stats or extended_stats aggregation on the _score field, but I can't find any examples of this being done (i.e., it seems you can only use aggregations with actual document fields).

Is it possible to request aggregations on calculated "metadata" fields for each hit in an ElasticSearch query response (e.g., _score, _type, _shard, etc.)?

I'm assuming the answer is 'no' since fields like _score aren't indexed...

664

asked Jul 03 '14 15:07

Clint Harris

1 Answer

Note: The original answer below is outdated for recent versions of Elasticsearch. The equivalent script using Groovy scripting would be:

{
    ...,
    "aggregations" : {
        "grades_stats" : { 
            "stats" : { 
                "script" : "_score" 
            } 
        }
    }
}

To make this work, you need to either enable dynamic scripting or, better, store the script in a file and execute it by name, which is more secure because dynamic scripting stays disabled.
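For context, here is a minimal Python sketch of how the full request body might be assembled around the aggregation. The `match` query and the `title` field are hypothetical placeholders for whatever the elided `...` part of the request contains, and `build_score_stats_request` is an illustrative helper, not part of any Elasticsearch client. The exact script syntax accepted depends on your Elasticsearch version.

```python
import json


def build_score_stats_request(query, agg_name="grades_stats", script="_score"):
    """Return a search body that runs a stats aggregation over a score script.

    `query` should be a scoring query, since _score is computed per hit
    from the query that matched it.
    """
    return {
        "query": query,
        "aggregations": {agg_name: {"stats": {"script": script}}},
    }


# Hypothetical match query on a hypothetical "title" field.
body = build_score_stats_request({"match": {"title": "elasticsearch"}})
print(json.dumps(body, indent=4))
```

The resulting JSON can be sent as the body of a search request with whichever HTTP client or Elasticsearch client you already use.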

Original answer: You can use a script and refer to the score using doc.score. More details are available in ElasticSearch's scripting documentation.

A sample stats aggregation could be:

{
    ...,
    "aggregations" : {
        "grades_stats" : { 
            "stats" : { 
                "script" : "doc.score" 
            } 
        }
    }
}

And the results would look like:

"aggregations": {
    "grades_stats": {
        "count": 165,
        "min": 0.46667441725730896,
        "max": 3.1525731086730957,
        "avg": 0.8296855776598959,
        "sum": 136.89812031388283
    }
}
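To make the meaning of these five numbers concrete, the following Python sketch computes the same quantities over a plain list of scores. The sample scores are made up for illustration. The last lines check that the response above is internally consistent, i.e. that avg equals sum divided by count.

```python
import math


def stats(values):
    """Compute the five numbers reported by a stats aggregation."""
    values = list(values)
    total = sum(values)
    return {
        "count": len(values),
        "min": min(values),
        "max": max(values),
        "avg": total / len(values),
        "sum": total,
    }


# Hypothetical scores, for illustration only.
sample = stats([0.5, 1.0, 3.0])
print(sample)

# Consistency check on the response shown above: avg should equal sum / count.
reported = {"count": 165, "avg": 0.8296855776598959, "sum": 136.89812031388283}
consistent = math.isclose(reported["sum"] / reported["count"], reported["avg"])
print(consistent)
```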

A histogram may also be a useful aggregation:

"aggs": {
    "grades_histogram": {
        "histogram": {
            "script": "doc.score * 10",
            "interval": 3
        }
    }
}

Histogram results:

"aggregations": {
    "grades_histogram": {
        "buckets": [
            {
               "key": 3,
               "doc_count": 15
            },
            {
               "key": 6,
               "doc_count": 103
            },
            {
               "key": 9,
               "doc_count": 46
            },
            {
               "key": 30,
               "doc_count": 1
            }
        ]
    }
}
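The bucket keys follow from rounding each scripted value down to a multiple of the interval. The Python sketch below illustrates that rule using the minimum and maximum scores from the stats example, assuming the histogram ran over the same hits; this seems plausible because the bucket counts (15 + 103 + 46 + 1) add up to the stats count of 165. The `histogram_key` helper is illustrative, not an Elasticsearch API.

```python
import math


def histogram_key(value, interval, offset=0):
    """Bucket key for a value: round down to the nearest multiple of interval."""
    return math.floor((value - offset) / interval) * interval + offset


interval = 3
min_score = 0.46667441725730896  # "min" from the stats example
max_score = 3.1525731086730957   # "max" from the stats example

# The histogram script is "doc.score * 10", so scale the scores the same way.
low = histogram_key(min_score * 10, interval)
high = histogram_key(max_score * 10, interval)
print(low, high)  # prints "3 30", the first and last bucket keys above
```

Multiplying the score by 10 before bucketing spreads the narrow score range over more buckets than the raw score with the same interval would.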

140

answered Sep 20 '22 06:09

mas2df
