How to export pandas data to elasticsearch?

2 Answers

The following script works for localhost:

import numpy as np
import pandas as pd

df = pd.DataFrame(np.random.randint(0,100,size=(100, 4)), columns=list('ABCD'))

INDEX="dataframe"
TYPE= "record"

def rec_to_actions(df):
    import json
    for record in df.to_dict(orient="records"):
        yield ('{ "index" : { "_index" : "%s", "_type" : "%s" }}'% (INDEX, TYPE))
        yield (json.dumps(record, default=int))

from elasticsearch import Elasticsearch
e = Elasticsearch() # no args, connect to localhost:9200
if not e.indices.exists(INDEX):
    raise RuntimeError('index does not exists, use `curl -X PUT "localhost:9200/%s"` and try again'%INDEX)

r = e.bulk(rec_to_actions(df)) # return a dict

print(not r["errors"])

Verify using curl -g 'http://localhost:9200/dataframe/_search?q=A:[29%20TO%2039]'

There are many little things that can be added to suit different needs but main is there.

103

answered Sep 16 '22 12:09

Setop

I'm not aware of any to_elastic method integrated in pandas. You can always raise an issue on the pandas github repo or create a pull request.

However, there is espandas which allows to import a pandas DataFrame to elasticsearch. The following example from the README has been tested with Elasticsearch 6.2.1.

import pandas as pd
import numpy as np
from espandas import Espandas

df = (100 * pd.DataFrame(np.round(np.random.rand(100, 5), 2))).astype(int)
df.columns = ['A', 'B', 'C', 'D', 'E']
df['indexId'] = (df.index + 100).astype(str)

INDEX = 'foo_index'
TYPE = 'bar_type'
esp = Espandas()
esp.es_write(df, INDEX, TYPE)

Retrieving the mappings with GET foo_index/_mappings:

{
  "foo_index": {
    "mappings": {
      "bar_type": {
        "properties": {
          "A": {
            "type": "long"
          },
          "B": {
            "type": "long"
          },
          "C": {
            "type": "long"
          },
          "D": {
            "type": "long"
          },
          "E": {
            "type": "long"
          },
          "indexId": {
            "type": "text",
            "fields": {
              "keyword": {
                "type": "keyword",
                "ignore_above": 256
              }
            }
          }
        }
      }
    }
  }
}

answered Sep 16 '22 12:09

Jan Trienes

Related questions
                            
                                pandas read csv ignore ending semicolon of last column
                            
                                python: convert numerical data in pandas dataframe to floats in the presence of strings
                            
                                Plotting datetimeindex on x-axis with matplotlib creates wrong ticks in pandas 0.15 in contrast to 0.14
                            
                                Hash each value in a pandas data frame
                            
                                Selecting rows from a Dataframe based on values in multiple columns in pandas
                            
                                Python pandas merge keyerror
                            
                                pandas.read_html not support decimal comma
                            
                                pandas.concat of multiple data frames using only common columns
                            
                                Renaming the column names of pandas dataframe is not working as expected - python
                            
                                Subtracting columns based on key column in pandas dataframe
                            
                                Compare column names of Pandas Dataframe
                            
                                Divide DataFrame by first row
                            
                                Python Pandas - Removing Rows From A DataFrame Based on a Previously Obtained Subset
                            
                                How do I refer to the index of my Pandas dataframe?
                            
                                Apply styles while exporting to 'xlsx' in pandas with XlsxWriter
                            
                                Pandas: Always selecting the first sheet/tab in an Excel Sheet
                            
                                No module named 'pandas' in Pycharm
                            
                                Split pandas column and add last element to a new column
                            
                                Sort all columns of a dataframe
                            
                                pandas apply function with arguments

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to export pandas data to elasticsearch?

Tags:

pandas

elasticsearch

shantanuo

People also ask

2 Answers

Setop

Jan Trienes

Recent Activity

Donate For Us