How can I add fuzziness to a multi_match query? So if someone is to search for 'basball' it would still find 'baseball' articles. Currently my query looks like this: <pre class="prettyprint"><code>POST /newspaper/articles/_search { "query": { "function_score": { "query": { "multi_match": { "query": "baseball", "type": "phrase", "fields": [ "subject^3", "section^2.5", "article^2", "tags^1.5", "notes^1" ] } } } } } </code></pre> One option I was looking at is to do something like this, just don't know if this is the best option. It's important to keep the sorting based on the scoring: <pre class="prettyprint"><code> "query" : { "query_string" : { "query" : "subject:basball^3 section:basball^2.5 article:basball^2", "fuzzy_prefix_length" : 1 } } </code></pre> Suggestions?

To add fuzziness to a multiquery you need to add the fuzziness property as described here: <pre class="prettyprint"><code>{ "query": { "function_score": { "query": { "multi_match": { "query": "baseball", "type": "phrase", "fields": [ "subject^3", "section^2.5", "article^2", "tags^1.5", "notes^1" ], "fuzziness" : "AUTO", "prefix_length" : 2 } } } } } </code></pre> Please notice that prefix_length explained in the doc as: The number of initial characters which will not be “fuzzified”. This helps to reduce the number of terms which must be examined. Defaults to 0. To check the possible values of fuzziness please visit the ES docs.

ElasticSearch multi_match query over multiple fields with Fuzziness

Tags:

elasticsearch

fuzzy-search

How can I add fuzziness to a multi_match query? So if someone is to search for 'basball' it would still find 'baseball' articles. Currently my query looks like this:

POST /newspaper/articles/_search
{
    "query": {
        "function_score": {
            "query": {
                "multi_match": {
                    "query": "baseball",
                    "type": "phrase",
                    "fields": [
                        "subject^3", 
                        "section^2.5", 
                        "article^2", 
                        "tags^1.5",
                        "notes^1"
                    ]
                }
            }
        }
    }
}

One option I was looking at is to do something like this, just don't know if this is the best option. It's important to keep the sorting based on the scoring:

   "query" : { 
      "query_string" : { 
         "query" : "subject:basball^3 section:basball^2.5 article:basball^2", 
         "fuzzy_prefix_length" : 1 
      } 
   }

Suggestions?

1000

asked Apr 14 '15 16:04

Funtriaco Prado

1 Answers

To add fuzziness to a multiquery you need to add the fuzziness property as described here:

{
    "query": {
        "function_score": {
            "query": {
                "multi_match": {
                    "query": "baseball",
                    "type": "phrase",
                    "fields": [
                        "subject^3", 
                        "section^2.5", 
                        "article^2", 
                        "tags^1.5",
                        "notes^1"
                    ],
                    "fuzziness" : "AUTO",
                    "prefix_length" : 2

                }
            }
        }
    }
}

Please notice that prefix_length explained in the doc as:

The number of initial characters which will not be “fuzzified”. This helps to reduce the number of terms which must be examined. Defaults to 0.

To check the possible values of fuzziness please visit the ES docs.

answered Sep 20 '22 15:09

nan-ead

Related questions
                            
                                Spring Data Elasticsearch id vs. _id
                            
                                Change dynamically elasticsearch synonyms
                            
                                Exact search in array object type using elasticsearch
                            
                                NEST: How to query against multiple indices and handle different subclasses (document types)?
                            
                                Set update_all_types to true on ElasticSearch
                            
                                Elasticsearch - Filter where (one of nested array) and (all of nested array)
                            
                                Elasticsearch date format
                            
                                How to do a wildcard or regex match on _id in elasticsearch?
                            
                                Elasticsearch can't update non dynamic settings
                            
                                Elasticsearch - How to get popular words list of documents
                            
                                ElasticSearch: search inside the array of objects
                            
                                Case insensitivity does not work
                            
                                Querying Elasticsearch by combining a range and a term match json format
                            
                                How to perform token authentication in elasticsearch?
                            
                                Unknown key for a START_ARRAY in [fields]. in elasticsearch
                            
                                Document Versioning Elasticsearch: How do I compare different document versions?
                            
                                aggregate a field in elasticsearch-dsl using python
                            
                                null_value mapping in Elasticsearch
                            
                                ElasticSearch returning only documents with distinct value
                            
                                ElasticSearch: How to configure logging.yml

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With