How to find out result of elasticsearch parsing a query_string?

Tags:

Is there a way to find out via the elasticsearch API how a query string query is actually parsed? You can do that manually by looking at the lucene query syntax, but it would be really nice if you could look at some representation of the actual results the parser has.

223

asked Aug 23 '13 10:08

Hans-Peter Störr

1 Answers

As javanna mentioned in comments there's _validate api. Here's what works on my local elastic (version 1.6):

curl -XGET 'http://localhost:9201/pl/_validate/query?explain&pretty' -d'
{
  "query": {
      "query_string": {
      "query": "a OR (b AND c) OR (d AND NOT(e or f))",
      "default_field": "t"
    }
  }
}
'

pl is name of index on my cluster. Different index could have different analyzers, that's why query validation is executed in a scope of an index.

The result of the above curl is following:

{
  "valid" : true,
  "_shards" : {
    "total" : 1,
    "successful" : 1,
    "failed" : 0
  },
  "explanations" : [ {
    "index" : "pl",
    "valid" : true,
    "explanation" : "filtered(t:a (+t:b +t:c) (+t:d -(t:e t:or t:f)))->cache(org.elasticsearch.index.search.nested.NonNestedDocsFilter@ce2d82f1)"
  } ]
}

I made one OR lowercase on purpose and as you can see in explanation, it is interpreted as a token and not as a operator.

As for interpretation of the explanation. Format is similar to +- operators of query string query:

( and ) characters start and end bool query
+ prefix means clause that will be in must
- prefix means clause that will be in must_not
no prefix means that it will be in should (with default_operator equal to OR)

So above will be equivalent to following:

{
  "bool" : {
    "should" : [
      {
        "term" : { "t" : "a" }
      },
      {
        "bool": {
          "must": [
            {
              "term" : { "t" : "b" }
            },
            {
              "term" : { "t" : "c" }
            }
          ]
        }
      },
      {
        "bool": {
          "must": {
              "term" : { "t" : "d" }
          },
          "must_not": {
            "bool": {
              "should": [
                {
                  "term" : { "t" : "e" }
                },
                {
                  "term" : { "t" : "or" }
                },
                {
                  "term" : { "t" : "f" }
                }
              ]
            }
          }
        }
      }
    ]
  }
}

I used _validate api quite heavily to debug complex filtered queries with many conditions. It is especially useful if you want to check how analyzer tokenized input like an url or if some filter is cached.

There's also an awesome parameter rewrite that I was not aware of until now, which causes the explanation to be even more detailed showing the actual Lucene query that will be executed.

186

answered Oct 19 '22 05:10

slawek

Related questions
                            
                                Can Lucene return several search results from a single indexed file?
                            
                                Sort different groups using different sort orders in solr
                            
                                Why does Lucene.NET cause OutOfMemoryException when indexing large files?
                            
                                Compass Lucene hits
                            
                                Keeping query statistics using lucene
                            
                                Alternative IndexProvider for Neo4J 1.9.1
                            
                                AND query in elasticsearch with curl
                            
                                Solr Custom Similarity - Using a field from the indexed document
                            
                                How do I estimate the size of a Lucene index?
                            
                                Lucene search and underscores
                            
                                Indexing and Searching Over Word Level Annotation Layers in Lucene
                            
                                Is it possible to compile and use xapian, clucene or lucy on iOS?
                            
                                Can Apache Solr Handle TeraByte Large Data
                            
                                Store complex (i.e. label + id) meta data in SOLR document
                            
                                How to do grouping in Lucene search results?
                            
                                How to index BigDecimal values in Lucene 3.0.1
                            
                                Best way to filter fields stored in a remote database in solr/lucene?
                            
                                How to do a RavenDB query over multiple (complex structure) fields and return the matched values?
                            
                                Configure DataImportHandler in SolrCloud with ZooKeeper

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to find out result of elasticsearch parsing a query_string?

Tags:

lucene

elasticsearch

Hans-Peter Störr

People also ask

1 Answers

slawek

Recent Activity

Donate For Us