Aggregate only matched nested object values in ElasticSearch

Tags:

elasticsearch

I need to sum only the values on the nested objects that match the query. It looks like ElasticSearch determines the documents matching the query and then sums across all of the nested objects. From the below outline I want to search on nestedobjects.objtype="A" and get back the sum of objvalue only for matching nestedobjects, I want to get the value 4. is this possible? If so, how?

Here is the mapping

{
  "myindex": {
    "mappings": {
      "mytype": {
        "properties": {
           "nestedobjects": {
             "type": "nested",
             "include_in_parent": true,
             "properties": {
               "objtype": {
                 "type": "string"
               },
               "objvalue": {
                 "type": "integer"
               }
             }
           }
         }
       }
     }
   }
 }

Here are my documents

PUT /myindex/mytype/1
{
  "nestedobjects": [
    { "objtype": "A", "objvalue": 1 },
    { "objtype": "B", "objvalue": 2 }
  ]
}
PUT /myindex/mytype/2
{
  "nestedobjects": [
    { "objtype": "A", "objvalue": 3 },
    { "objtype": "B", "objvalue": 3 }
  ]
}

Here is my query code.

POST allscriptshl7/_search?search_type=count
{
  "query": {
    "filtered": {
      "query": {
        "query_string": {
          "query": "nestedobjects.objtype:A"
        }
      }
    }
  },
  "aggregations": {
    "my_agg": {
      "sum": {
        "field": "nestedobjects.objvalue"
      }
    }
  }
}

878

asked Oct 01 '15 18:10

user481779

1 Answers

Since both (outer) documents match the condition that one of their inner documents match the query, both outer documents are returned, and the aggregation is calculated against all of the inner documents belonging to those outer documents. Whew.

Anyway, this seems to do what you're wanting, I think, using filter aggregation:

POST /myindex/_search?search_type=count
{
   "aggs": {
      "nested_nestedobjects": {
         "nested": {
            "path": "nestedobjects"
         },
         "aggs": {
            "filtered_nestedobjects": {
               "filter": {
                  "term": {
                     "nestedobjects.objtype": "a"
                  }
               },
               "aggs": {
                  "my_agg": {
                     "sum": {
                        "field": "nestedobjects.objvalue"
                     }
                  }
               }
            }
         }
      }
   }
}
...
{
   "took": 4,
   "timed_out": false,
   "_shards": {
      "total": 5,
      "successful": 5,
      "failed": 0
   },
   "hits": {
      "total": 2,
      "max_score": 0,
      "hits": []
   },
   "aggregations": {
      "nested_nestedobjects": {
         "doc_count": 4,
         "filtered_nestedobjects": {
            "doc_count": 2,
            "my_agg": {
               "value": 4,
               "value_as_string": "4.0"
            }
         }
      }
   }
}

Here is some code I used to test it:

http://sense.qbox.io/gist/c1494619ff1bd0394d61f3d5a16cb9dfc229113a

Very well-structured question, by the way.

180

answered Oct 24 '22 18:10

Sloan Ahrens

Related questions
                            
                                Using filter beside query_string in Elastic Search
                            
                                What are aliases in elasticsearch for?
                            
                                How to deploy AWS elasticsearch using serverless.yml
                            
                                Logstash with Elasticsearch
                            
                                How to find Index by Alias in Elasticsearch java api?
                            
                                Scroll example in ElasticSearch NEST API
                            
                                Elasticsearch Marvel - Turn off logging
                            
                                ElasticSearch index exists not working / reliable
                            
                                Elasticsearch store field vs _source
                            
                                Get the number of fields on an index
                            
                                Highlight whole content in Elasticsearch for multivalue fields
                            
                                ElasticSearch calculate percentage for each bucket from total
                            
                                How can I do scripted aggregation in Kibana + Elasticsearch?
                            
                                Locality-sensitive hashing - Elasticsearch
                            
                                How to index source code with ElasticSearch
                            
                                Elasticsearch delete duplicates
                            
                                Bulk Update on ElasticSearch using NEST

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With