Disable IDF calculation

Tags:

elasticsearch

In my particular use case, the IDF factor that is calculated as part of the TF-IDF algorithm skews the scoring of my queries. I want the queries to take only the term frequency into account. Is it possible to disable the IDF factor, i.e. set it to 1, for a particular index? I have looked into the similarity module (in version 0.90.X) but haven't found anything that helps; the same goes for the function_score query. Do I need to write a custom Similarity class in Java, or is there a plugin for what I'm trying to achieve?


asked Jan 19 '14 20:01

GlurG

1 Answer

What about the constant_score query?

See http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/ignoring-tfidf.html

Don't hesitate to add ?explain=true to the search request to see how each score is computed.

As you can see here, without constant_score:

[Screenshot: With IDF]

And with a constant_score query (which wraps your real query):

[Screenshot: Without IDF]

Screenshots made with https://beemapp.me
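
Since the screenshots are not reproduced here, the following is a minimal sketch of what such a request body could look like, written in Python only to build and print the JSON. The field name `title` and the search term are hypothetical, and the body assumes the `filter` form of constant_score. Note that constant_score gives every matching document the same score (the boost), so it drops the term frequency along with the IDF.

```python
import json


def constant_score_body(field, term, boost=1.0, explain=True):
    """Build a search body that wraps a term filter in constant_score.

    `field` and `term` are placeholders for your own mapping and query.
    """
    return {
        # Ask Elasticsearch to return a score explanation for each hit.
        "explain": explain,
        "query": {
            "constant_score": {
                # The wrapped filter decides which documents match;
                # it contributes nothing to the score.
                "filter": {"term": {field: term}},
                # Every matching document gets this score.
                "boost": boost,
            }
        },
    }


# Hypothetical field and term, for illustration only.
body = constant_score_body("title", "elasticsearch")
print(json.dumps(body, indent=2))
```

The printed JSON can be sent as the body of a `_search` request; comparing the explanation output with and without the constant_score wrapper shows whether the idf component is still present.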


answered Oct 17 '22 17:10

Thomas Decaux
