Elastic Search limit results

Tags:

elasticsearch

In MySQL I can do something like:

  SELECT id FROM table WHERE field = 'foo' LIMIT 5

If the table has 10,000 rows, then this query is way way faster than if I left out the LIMIT part.

In ElasticSearch, I've got the following:

Click to copy

 {
    "query":{
       "fuzzy_like_this_field":{
          "body":{
             "like_text":"REALLY LONG (snip) TEXT HERE",
             "max_query_terms":1,
             "min_similarity":0.95,
             "ignore_tf":true
          }
       }
    }
 }

When I run this search, it takes a few seconds, whereas mysql can return results for the same query in far, far less time.

If I pass in the size parameter (set to 1), it successfully only returns 1 result, but the query itself isn't any faster than if I had set the size to unlimited and returned all the results. I suspect the query is being run in its entirety and only 1 result is being returned after the query is done processing. This means the "size" attribute is useless for my purposes.

Is there any way to have my search stop searching as soon as it finds a single record that matches the fuzzy search, rather than processing every record in the index before returning a response? Am I misunderstanding something more fundamental about this?

Thanks in advance.

738

asked Dec 20 '11 23:12

Jemaclus

1 Answers

You are correct the query is being ran entirely. Queries by default return data sorted by score, so your query is going to score each document. The docs state that the fuzzy query isn't going to scale well, so might want to consider other queries.

A limit filter might give you similar behavior to what your looking for.

A limit filter limits the number of documents (per shard) to execute on

To replicate mysql field='foo' try using a term filter. You should use filters when you don't care about scoring, they are faster and cache-able.

answered Oct 13 '22 00:10

Andy

Related questions
                            
                                Best approch of Elastic Search time based feeds module?
                            
                                Searchkick index is empty after reindexing from model
                            
                                Preferred method of indexing bulk data into ElasticSearch?
                            
                                How can I integrate Tomcat6's catalina.out file with Logstash + ElasticSearch + Kibana?
                            
                                Kibana + Elasticsearch without Logstash possible?
                            
                                Indexing/Searching "complex" JSON in elasticsearch
                            
                                script_score the script could not be loaded scripts of type [inline], operation [search] and lang [groovy] are disabled
                            
                                logstash output to elasticsearch index and mapping
                            
                                Elastic Search Give an error No alive nodes found in your cluster
                            
                                can I prioritize more exact matches when using ngram filter in search results?
                            
                                Weighted random sampling in Elasticsearch
                            
                                Best structure for storing tree in Elasticsearch?
                            
                                Elasticsearch Rails as_indexed_json vs mappings
                            
                                How to apply synonyms at query time instead of index time in Elasticsearch
                            
                                Elasticsearch : Completion suggester not working with whitespace Analyzer
                            
                                ElasticSearch: Specifying types in bulk requests is deprecated
                            
                                JSONField workaround on elasticsearch : MapperParsingException
                            
                                Indexing Mysql Database with elasticsearch
                            
                                ElasticSearch:filtering documents based on field length?
                            
                                What does Elasticsearch's auto_generate_phrase_queries do?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With