
Delete documents of type in Elasticsearch

I want to delete all the documents indexed within a type in Elasticsearch, using the HTTP/REST API, but I don't want to delete the mapping for this type.

How can I build the query in the URL to do this?

user2816801 asked Sep 25 '13 19:09




4 Answers

Index and mapping state before executing the command (screenshots taken from the elasticsearch-head plugin web interface):

[three screenshots]

Command:

curl -XDELETE 'http://localhost:9200/publishercategoryeu/autocomplete/_query' -d '
{
  "query": {
    "bool": {
      "must": [
        {
          "match_all": {}
        }
      ]
    }
  }
}
'

Result:

[screenshot]

Index and mapping state after executing the command:

[three screenshots]

As we can see, all the documents indexed within the type (mapping) were deleted without deleting the index or the type (mapping) itself.

csonuryilmaz answered Oct 04 '22 00:10


A simple delete by query with a match_all query should do the trick. More information is available in the delete by query API documentation.

Alternatively, you can delete the whole type and make use of the template API. Just drop a file containing your template into your config/templates/ folder, and you'll never lose it. The mapping will indeed be lost when you delete the type, but the template will be reused as soon as you index something again. More information is available in the template API documentation.

EDIT: new delete API: https://www.elastic.co/guide/en/elasticsearch/reference/current/docs-delete.html

DeH answered Oct 04 '22 01:10


With the following command in the elasticsearch head plugin I was able to delete all documents of type logs from the logstash index without deleting the mapping:

{"query":{"match_all":{}}}

[Screenshot: Deleting documents with Elasticsearch head plugin]

To free space on the disk you must also optimize the index (Actions->Optimize for index logstash in the head plugin) after deleting the documents.
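The same kind of optimization can also be requested over HTTP instead of through the head plugin. The following is a minimal sketch, assuming a node at http://localhost:9200 and the `_optimize` endpoint of Elasticsearch 1.x (renamed `_forcemerge` from 2.1 onward); `expunge_deletes_url` is a hypothetical helper name, not part of any tool. It only builds the URL and does not contact Elasticsearch.

```shell
#!/bin/sh
# Hypothetical helper: print the URL that asks Elasticsearch to merge away
# segments holding deleted documents for one index.
# The third argument defaults to _optimize (Elasticsearch 1.x);
# pass _forcemerge for 2.1 and later.
expunge_deletes_url() {
    base=$1
    idx=$2
    endpoint=${3:-_optimize}
    printf '%s/%s/%s?only_expunge_deletes=true\n' "$base" "$idx" "$endpoint"
}

# Print the URL for the logstash index on a local node (assumed address).
expunge_deletes_url 'http://localhost:9200' logstash
```

To send the request, pass the printed URL to `curl -XPOST`.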

asmaier answered Oct 03 '22 23:10


Previous answers will not work with the most recent version of Elasticsearch. "Delete by query" was removed from the core as of Elasticsearch 2.0. The Elasticsearch documentation says that it can cause an OutOfMemoryError during concurrent indexing and can cause the primary and replica shards to become inconsistent. If you want, you can follow the history of the issue on GitHub.

It now takes multiple steps in order to delete all documents from a type.

  1. Find the ids of all the documents that you need to delete. The most efficient way to perform this operation is to use the scroll/scan API to find all the matching ids for a given type.

  2. Issue a bulk request to delete the documents by id. An example is provided below.

    curl -XPOST 'http://localhost:9200/_bulk' -d '
    { "delete": { "_index": "index", "_type": "type", "_id": "1" } }
    { "delete": { "_index": "index", "_type": "type", "_id": "2" } }
    '
    

Note that if you provide a text file as input to curl, you must use the --data-binary flag instead of plain -d, because -d strips the newlines that the bulk format relies on.
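Step 2 can be scripted once the ids from step 1 are available. The following is a minimal sketch, assuming the ids have been collected one per line and contain no characters that need JSON escaping; `bulk_delete_body` is a hypothetical helper name, and `index` and `type` are the placeholder names from the example above. It only builds the request body and does not contact Elasticsearch.

```shell
#!/bin/sh
# Hypothetical helper: print one bulk "delete" action line per id read from stdin.
# Usage: bulk_delete_body INDEX TYPE < ids.txt
# Assumes ids contain no double quotes or backslashes (no JSON escaping is done).
bulk_delete_body() {
    idx=$1
    typ=$2
    # The "|| [ -n ... ]" part keeps a last id that lacks a trailing newline.
    while IFS= read -r id || [ -n "$id" ]; do
        # Skip blank lines so the body contains only action lines.
        [ -n "$id" ] || continue
        printf '{ "delete": { "_index": "%s", "_type": "%s", "_id": "%s" } }\n' \
            "$idx" "$typ" "$id"
    done
}

# Example: build the body for ids 1 and 2.
printf '1\n2\n' | bulk_delete_body index type
```

Redirecting the output to a file, for example `bulk_delete.ndjson` (an arbitrary name), lets it be sent with `curl -XPOST 'http://localhost:9200/_bulk' --data-binary @bulk_delete.ndjson`. Every line the helper prints ends with a newline, which is what the bulk format expects.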

Jadiel de Armas answered Oct 04 '22 01:10