How to get specific _source fields in aggregation

Tags:

I am exploring ElasticSearch, to be used in an application, which will handle large volumes of data and generate some statistical results over them. My requirement is to retrieve certain statistics for a particular field. For example, for a given field, I would like to retrieve its unique values and document frequency of each value, along-with the length of the value. The value lengths are indexed along-with each document. So far, I have experimented with Terms Aggregation, with the following query:

{
  "size": 0,
  "query": {
  "match_all": {}
},
 "aggs": {
 "type_count": {
   "terms": {
     "field": "val.keyword",
     "size": 100
   }
  }
 }
}

The query returns all the values in the field val with the number of documents in which each value occurs. I would like the field val_len to be returned as well. Is it possible to achieve this using ElasticSearch? In other words, is it possible to include specific _source fields in buckets? I have looked through the documentation available online, but I haven't found a solution yet. Hoping somebody could point me in the right direction. Thanks in advance!

I tried to include _source in the following manners:

 "aggs": {
    "type_count": {
     "terms": {
        "field": "val.keyword",
        "size": 100        
      },
        "_source":["val_len"]
    }
  }

and

"aggs": {
 "type_count": {
   "terms": {
     "field": "val.keyword",
     "size": 100,
      "_source":["val_len"]
    }     
  }
}

But I guess this isn't the right way, because both gave me parsing errors.

548

asked Feb 12 '19 11:02

Poonam Anthony

1 Answers

You need to use another sub-aggregation called top_hits, like this:

"aggs": {
 "type_count": {
   "terms": {
     "field": "val.keyword",
     "size": 100
    },
    "aggs": {
      "hits": {
        "top_hits": {
          "_source":["val_len"],
          "size": 1
        }
      }
    }
  }
}

Another way of doing it is to use another avg sub-aggregation so you can sort on it, too

"aggs": {
 "type_count": {
   "terms": {
     "field": "val.keyword",
     "size": 100,
     "order": {
       "length": "desc"
     }
    },
    "aggs": {
      "length": {
        "avg": {
          "field": "val_len"
        }
      }
    }
  }
}

171

answered Oct 02 '22 17:10

Val

Related questions
                            
                                Elasticsearch: getting the tf-idf of every term in a given document
                            
                                Can I disable the bootstrap checks in Elasticsearch 5.4?
                            
                                Elasticsearch equal SQL %Like%
                            
                                how can I query last month data in elasticsearch
                            
                                Spring boot with micrometer Elasticsearch registry indexes only empty documents
                            
                                Elasticsearch: How to use two different multiple matching fields?
                            
                                Failed to derive xcontent from org.elasticsearch.common.bytes.BytesArray@0
                            
                                Elasticsearch highlight: how to get entire text of the field in Java client
                            
                                Recognising timestamps in Kibana and ElasticSearch
                            
                                Elasticsearch Aggregation by Day of Week and Hour of Day
                            
                                Geo Location Radius Search Using PHP and MySQL
                            
                                Notification System on ELK [closed]
                            
                                ElasticSearch and Nest: Why amd I missing the id field on a query?
                            
                                Connection refused - connect(2) for "localhost" port 9200 with DigitalOcean
                            
                                Error when trying to use Elasticsearch Transport Client: dependencies not loaded to class path
                            
                                Matching arrays in elastic search
                            
                                Elasticsarch C# Nest [5.x] attributes
                            
                                What is the best way to sync Postgres and ElasticSearch?
                            
                                Saving date in microsecond format in ElasticSearch
                            
                                Kibana Windows zip distribution takes too long to unzip

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to get specific _source fields in aggregation

Tags:

elasticsearch

elasticsearch-aggregation

Poonam Anthony

People also ask

1 Answers

Val

Recent Activity

Donate For Us