Which is better scrolls or search_after in elasticsearch to simulate random pagination?

Tags:

elasticsearch

I want to randomly jump to a page of results from elasticsearch. There are three ways to paginate in elasticsearch:

from/size - I can't use this because of the maximum depth limit of 10000.
scroll API - I can use this but it has a cost of memory usage (keeping the search context alive) associated with it.
search_after - I can also use this even it is less expensive than scrolls as it is stateless.

I know that anyway, Elasticsearch will sequentially read the data. Let's say if I wanted to get 99th page then elastic is going to read all 98 results to get the 99th result.

I can do one thing i.e. to reduce the data which I will sequentially get before the targeted data, in this case I will reduce the data returned for 98 pages and for the 99th one I will get the complete data.

My main question is "What if I don't have memory concerns then which approach would be faster to sequentially get 98 pages ?" (search_after or scrolls)

If I use scrolls I will be clearing it after every usage.

406

asked Jun 22 '18 07:06

TechnocratSid

1 Answers

If you don't have memory concerns, then the simplest option is to increase the index setting index.max_result_window from 10000 to the number you require.

See https://www.elastic.co/guide/en/elasticsearch/reference/current/index-modules.html#dynamic-index-settings

197

answered Oct 15 '22 00:10

Adam T

Related questions
                            
                                Renaming fields in elasticsearch
                            
                                Preserving order of terms in ElasticSearch query
                            
                                How to run rake in ruby-on-rails application in production?
                            
                                How do I list all stored scripts on an Elasticsearch cluster?
                            
                                Elasticsearch More Like this no result
                            
                                ElasticSearch query_string fails to parse query with some characters
                            
                                master_not_discovered_exception ElasticSearch single node
                            
                                How do I set the path.repo in Docker compose 3?
                            
                                Elasticsearch: HOW-TO delete a (cluster) setting
                            
                                Elastic NEST using Term filter on text field with inner keyword field
                            
                                Error: The 'elasticsearch' backend requires the installation of 'requests'. How do I fix it?
                            
                                How to make our customised dashboard as default dashboard on kibana
                            
                                ElasticSearch for Time Series Data [closed]
                            
                                What does Elasticsearch automatic slicing do?
                            
                                Elasticsearch Scan&scroll with JEST API
                            
                                Multiple should queries with must query
                            
                                Elasticsearch 2.0: how to delete by query in Java
                            
                                Where Elasticsearch store the data on Mac
                            
                                Should I choose datatype of keyword or long / integer for document personId in Elasticsearch?
                            
                                How to change Elasticsearch network host

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With