Elasticsearch delete duplicates

Some of the records in my index are duplicates, identified by a numeric field recordid.

Elasticsearch has delete-by-query. Can I use it to delete one of each pair of duplicate records?

Or is there some other way to achieve this?

FUD asked Jul 19 '14


2 Answers

Yes, you can find duplicated documents with an aggregation query:

curl -XPOST http://localhost:9200/your_index/_search -d '
{
  "size": 0,
  "aggs": {
    "duplicateCount": {
      "terms": {
        "field": "recordid",
        "min_doc_count": 2,
        "size": 10
      },
      "aggs": {
        "duplicateDocuments": {
          "top_hits": {
            "size": 10
          }
        }
      }
    }
  }
}'
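Assuming a response shaped like the aggregation above (buckets named duplicateCount, hits named duplicateDocuments), here is a small sketch of how you might collect the _ids to delete, keeping the first hit in each bucket. The function name is hypothetical, not part of any library:

```python
def ids_to_delete(agg_response):
    """From a duplicateCount/duplicateDocuments aggregation response,
    return the _id of every duplicate except the first hit per bucket."""
    ids = []
    buckets = agg_response["aggregations"]["duplicateCount"]["buckets"]
    for bucket in buckets:
        hits = bucket["duplicateDocuments"]["hits"]["hits"]
        # keep the first document, mark the rest for deletion
        for hit in hits[1:]:
            ids.append(hit["_id"])
    return ids
```

Note that the terms aggregation returns at most "size" buckets per request, so on a large index you would re-run the query until no buckets with min_doc_count >= 2 remain.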

then delete the duplicated documents, preferably with a bulk request. Have a look at es-deduplicator for automated duplicate removal (disclaimer: I'm the author of that script).
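The body of a bulk request is newline-delimited JSON, one action per line. A sketch of building a delete body from a list of ids (the index name your_index is a placeholder):

```python
import json

def bulk_delete_body(index, ids):
    """Build an NDJSON body for the Elasticsearch _bulk endpoint that
    deletes the given document ids. Each action is one JSON line and
    the body must end with a trailing newline."""
    lines = [json.dumps({"delete": {"_index": index, "_id": doc_id}})
             for doc_id in ids]
    return "\n".join(lines) + "\n"
```

The resulting body would be POSTed to http://localhost:9200/_bulk (recent Elasticsearch versions expect the Content-Type header application/x-ndjson).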

NOTE: Aggregation queries can be very expensive and may even crash your nodes (if your index is too large and the number of data nodes too small).

Tombart answered Sep 19 '22

Elasticsearch recommends using the scroll/scan API to find all matching ids and then issuing a bulk request to delete them.
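The scroll/scan approach streams every document once, so duplicates can be detected by remembering which recordid values have already been seen. A sketch of that core step, operating on hit dicts as the scroll API returns them (the function name is hypothetical):

```python
def duplicate_ids(hits):
    """Given hits streamed from the scroll/scan API (dicts with an _id
    and a _source containing recordid), yield the _id of every hit whose
    recordid was already seen, i.e. every duplicate after the first
    occurrence."""
    seen = set()
    for hit in hits:
        rid = hit["_source"]["recordid"]
        if rid in seen:
            yield hit["_id"]
        else:
            seen.add(rid)
```

The yielded ids can then be fed to a bulk delete request in batches. Unlike the aggregation approach, this needs only one pass over the index, at the cost of holding all distinct recordid values in memory.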


Andy answered Sep 18 '22