Elasticsearch get matching documents after specific document id

Tags:

offset

elasticsearch

When I search for documents, I take the first 10 and pass them to the view. When the user scrolls to the end of the list, the next 10 elements should be displayed.

I know the id of the last displayed document, and now I need to get the next 10. I could simply run the exact same search with an offset of 10, but it would be much better to run the same query, pass in the id of the last retrieved document, and get back the matching documents that come after the document with that id.

Is that possible with Elasticsearch?

=== UPDATE

I want to explain my issue in more detail, because it seems the description above is not clear enough. Sorry for that.

The case:

You have a kind of feed that grows every second. When a user opens the feed, he gets the 10 most recent entries; when he scrolls down, he wants to get the next 10 entries.

Because the feed grows every second, a usual offset / limit (from / size in Elasticsearch) can't solve this problem: you would either repeat entries that were already displayed or show entirely newer ones, depending on how much time passes between the first request (first 10 entries) and the request for the next entries.

The request for the next 10 elements AFTER the already displayed entries sends the backend the id of the last entry that was displayed. The backend then knows to ignore every entry up to and including that one.

At the moment I'm handling this in code: I request the list of all matching entries from Elasticsearch and iterate over it. This way I can do everything I want (no surprise) and extract the needed chunk of entries.

My question is: is there a built-in solution for this in Elasticsearch? Solving the problem my way is not the fastest.
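To make the drift concrete, here is a minimal sketch that simulates the situation with a plain Python list instead of a real index. The `page` helper and the integer entry ids are made up for illustration; the point is only that a fixed offset shifts once new entries are prepended.

```python
def page(feed, offset, size):
    """Return one page of a newest-first feed using plain offset/limit."""
    return feed[offset:offset + size]


# Newest-first feed of hypothetical entry ids: 20, 19, ..., 1
feed = list(range(20, 0, -1))
first_page = page(feed, 0, 10)

# Three new entries arrive before the user scrolls down.
feed = [23, 22, 21] + feed
second_page = page(feed, 10, 10)

# Entries that show up on both pages because the offset no longer lines up.
repeated = [entry for entry in second_page if entry in first_page]
print(repeated)
```

In this simulation the second page starts three entries too early, so the last three entries of the first page are shown again.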

493

asked Nov 08 '13 05:11

maddin2code

2 Answers

You just have to build your query DSL and paginate with from and size:

{ "size": 10, "from" : YOUR_OFFSET }
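As a rough sketch of how YOUR_OFFSET could be derived, the helper below builds the request body for a zero-based page number. The function name `offset_page_body` and its parameters are hypothetical; only `size`, `from` and `query` are actual search body fields.

```python
def offset_page_body(query, page_number, size=10):
    """Build a from/size search body for a zero-based page number."""
    if page_number < 0 or size <= 0:
        raise ValueError("page_number must be >= 0 and size must be > 0")
    return {
        "size": size,
        # Skip all hits that belong to earlier pages.
        "from": page_number * size,
        "query": query,
    }


# Third page (zero-based page 2) of a hypothetical title search.
body = offset_page_body({"match": {"title": "elasticsearch"}}, 2)
print(body)
```

Keep in mind that the offset is applied to the result set as it exists at query time, so this alone does not compensate for entries added between two requests. By default, from + size is also limited to 10,000 by the index.max_result_window setting.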

104

answered Oct 26 '22 07:10

remiheens

It's an old topic, but it seems that the Search After API, available since Elasticsearch 5.0, does exactly what is needed. Provide the id of your last document and its timestamp, for example:

GET twitter/tweet/_search
{
  "size": 10,
  "query": {
    "match": {
      "title": "elasticsearch"
    }
  },
  "search_after": [
    1463538857,
    "tweet#654323"
  ],
  "sort": [
    {
      "date": "asc"
    },
    {
      "_uid": "desc"
    }
  ]
}

Source: https://www.elastic.co/guide/en/elasticsearch/reference/current/search-request-search-after.html
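On the client side, the values for search_after do not have to be assembled by hand: when a search specifies a sort, each returned hit carries a "sort" array with its sort values, and that array can be passed back unchanged. Below is a minimal sketch of that step. The helper name `next_page_body` and the sample hit are assumptions for illustration, and the sort fields simply mirror the example above.

```python
import json


def next_page_body(query, sort, last_hit, size=10):
    """Build the search body for the page that follows `last_hit`.

    `last_hit` is the last hit of the previous response. It must contain
    the "sort" array that Elasticsearch returns when a sort is specified.
    """
    if "sort" not in last_hit:
        raise ValueError("the previous search must specify a sort")
    return {
        "size": size,
        "query": query,
        # The sort must be identical to the one used for the previous page.
        "sort": sort,
        # Resume right after the last hit; no "from" offset is used.
        "search_after": last_hit["sort"],
    }


# Hypothetical last hit of the previous page.
last_hit = {"_id": "654323", "sort": [1463538857, "tweet#654323"]}
body = next_page_body(
    {"match": {"title": "elasticsearch"}},
    [{"date": "asc"}, {"_uid": "desc"}],
    last_hit,
)
print(json.dumps(body, indent=2))
```

The second sort field acts as a tiebreaker so that documents with the same timestamp still have a well-defined order.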

answered Oct 26 '22 07:10

MastaP
