Elastic search query using match_phrase_prefix and fuzziness at the same time?

Tags:

I am new to elastic search, so I am struggling a bit to find the optimal query for our data.

Imagine I want to match the following word "Handelsstandens Boldklub".

Currently, I'm using the following query:

{
    query: {
      bool: {
        should: [
          {
            match: {
              name: {
                query: query, slop: 5, type: "phrase_prefix"
              }
            }
          },
          {
            match: {
              name: {
                query: query,
                fuzziness: "AUTO",
                operator: "and"
              }
            }
          }
        ]
      }
    }
  }

It currently list the word if I am searching for "Hand", but if I search for "Handle" the word will no longer be listed as I did a typo. However if I reach to the end with "Handlesstandens" it will be listed again, as the fuzziness will catch the typo, but only when I have typed the whole word.

Is it somehow possible to do phrase_prefix and fuzziness at the same time? So in the above case, if I make a typo on the way, it will still list the word?

So in this case, if I search for "Handle", it will still match the word "Handelsstandens Boldklub".

Or what other workarounds are there to achieve the above experience? I like the phrase_prefix matching as its also supports sloppy matching (hence I can search for "Boldklub han" and it will list the result)

Or can the above be achieved by using the completion suggester?

842

asked Aug 24 '16 09:08

Henrik Holm

1 Answers

Okay, so after investigating elasticsearch even further, I came to the conclusion that I should use ngrams.

Here is a really good explaniation of what it does and how it works. https://qbox.io/blog/an-introduction-to-ngrams-in-elasticsearch

Here is the settings and mapping I used: (This is elasticsearch-rails syntax)

settings analysis: {
  filter: {
    ngram_filter: {
      type: "ngram",
      min_gram: "2",
      max_gram: "20"
    }
  },
  analyzer: {
    ngram_analyzer: {
      type: "custom",
      tokenizer: "standard",
      filter: ["lowercase", "ngram_filter"]
    }
  }
} do
  mappings do
    indexes :name, type: "string", analyzer: "ngram_analyzer"
    indexes :country_id, type: "integer"
  end
end

And the query: (This query actually search in two different indexes at the same time)

{
    query: {
      bool: {
        should: [
          {
            bool: {
              must: [
                { match: { "club.country_id": country.id } },
                { match: { name: query } }
              ]
            }
          },
          {
            bool: {
              must: [
                { match: { country_id: country.id } },
                { match: { name: query } }
              ]
            }
          }
        ],
        minimum_should_match: 1
      }
    }
  }

But basically you should just do a match or multi match query, depending on how many fields you want to search in.

I hope someone find it helpful, as I was personally thinking to much in terms of fuzziness instead of ngrams (Didn't know about before). This led me in the wrong direction.

194

answered Oct 17 '22 19:10

Henrik Holm

Related questions
                            
                                In ElasticSearch, how does sort interact with function_score?
                            
                                Passing dynamic value to script query in Elastic Search
                            
                                how to configure Jira Dashboard in Kibana
                            
                                Elasticsearch document id type integer vs string : Is there any performace difference?
                            
                                ElasticSearch: compare dotted version strings
                            
                                Elasticsearch NoNodeAvailableException None of the configured nodes are available
                            
                                Laravel Scout - observe relations
                            
                                ElasticSearch as EventStore
                            
                                ElasticSearch - different result ordering for simple request and aggregation request (NEST)
                            
                                elasticsearch doc['...'] Arrays and order
                            
                                JestClient is throwing SocketTimeoutException after being idle for sometime
                            
                                Elasticsearch - Analyser creating the right tokens but query is not matching
                            
                                Mocking elasticsearch-py calls
                            
                                making a calculation with the elements of an elasticsearch json object, of a contract bridge score, using Python
                            
                                compute geo distance in elasticsearch
                            
                                Searching subtitle data in elasticsearch
                            
                                Update/delete existing log entry with logstash
                            
                                elasticsearch multi_match vs should
                            
                                Configure sink elasticsearch apache-flume
                            
                                Why is mongoosastic populate / elastic search not populating one of my references? I'm getting an empty object

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Elastic search query using match_phrase_prefix and fuzziness at the same time?

Tags:

autocomplete

elasticsearch

fuzzy-search

match-phrase

Henrik Holm

People also ask

1 Answers

Henrik Holm

Recent Activity

Donate For Us