How to calculate the score based on number of query terms in elasticsearch?

Tags:

tire

I want the queries to return a score that gets calculated like:

occurrence of each query term in title + description / number of query terms

for example

EbSearch.add [ 
new_job( id: 1, title: "Java Programmierer", 
description: "Java Programmierer")
]

res = EbSearch.search("Java Programmierer").results.first.score.should == 4

at the moment it outputs 8, because it does the query for each term and sums it up. I could just divide afterwards, but I don't have the analyzed query terms, so compounds could mess up the score.

The query is structured like below:

search = Tire.search index_name do
  query do 
    dis_max do 
       query { string query, fields: ['title^3', 'description.with_synonyms^0.5'], use_dis_max: false, default_operator: "OR" }  
       query { string query, fields: ['title^3', 'description.without_synonyms'], use_dis_max: false, default_operator: "OR"}
    end
  end
end

Any idea how i could solve this problem is greatly appreciated.

EDIT

I realized that i provided not enough context.

Here are some other snippets I already worked out. I wrote a custom SimilarityProvider to disable idf and normalization. https://gist.github.com/outsmartin/6114175

The complete Tire code is found here https://gist.github.com/6114186. It is a little bit more complicated then the example, but it should be understandable.

578

asked Jul 23 '13 16:07

1 Answers

You can easily get a list of analyzed terms for your query using analyze command. However, I have to mention that Elasticsearch scoring is much more complicated than it might seem when you run your tests on tiny indices. You can find the formula that Elasticsearch is using in Lucene documentation and you can use explain command to see how this formula is getting applied to your results. I would also suggest testing and tuning your scoring algorithm on an index with a single shard or using dfs_query_then_fetch search type, which produces more precise results on small indices.

answered Nov 15 '22 07:11

imotov

Related questions
                            
                                How to implement ACL on an ElasticSearch-based system?
                            
                                Storing nested objects in elastic search
                            
                                How to tune Elasticsearch to make it indexing fast?
                            
                                Using AWS4 Signature via Postman for CRUD Elastic operations
                            
                                Why elastic-search container memory usage keeps increasing with little use?
                            
                                Elasticsearch: Can it be used to avoid writing your own NLP? (e.g. Re-invent the wheel)
                            
                                Unable to search a query with symbols in elasticsearch
                            
                                How to percolate simple_query_string/query_string query
                            
                                How to combine completion, suggestion and match phrase across multiple text fields?
                            
                                elastic search update Service software release in AWS console
                            
                                Best practices for data storage with Elasticsearch and Kubernetes
                            
                                A better approach to exclude large list of items in Elasticsearch
                            
                                CouchDB, Elastic Search, and River Plugin not operating correctly
                            
                                elastic search double facet
                            
                                Is there a way to remove the calculation of length norms for fields in elastic search?
                            
                                How to kill the thread of searching request on elasticsearch cluster? Is there some API to do this?
                            
                                Is there a graphic tool to display (and maybe change) elasticsearch mappings?
                            
                                How can I check indices.memory.index_buffer_size parameter is effectively working in elasticsearch?
                            
                                Elasticsearch not returning an exact match first
                            
                                How to reindex ElasticSearch quickly?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to calculate the score based on number of query terms in elasticsearch?

Tags:

elasticsearch

tire

outsmartin

People also ask

1 Answers

imotov

Recent Activity

Donate For Us