Reading pandas dataframe that contains dictionaries in cells from csv

Tags:

I saved a pandas dataframe that looks like the following as a csv file.

    a
0 {'word': 5.7}
1 {'khfds': 8.34}

When I attempt to read the dataframe as shown below, I receive the following error.

df = pd.read_csv('foo.csv', index_col=0, dtype={'str': 'dict'})

TypeError: data type "dict" not understood

The heart of my question is how do I read the csv file to recover the dataframe in the same form as when it was created. I also have tried reading without the dtype={} as well as replacing 'dict' with alternatives such as 'dictionary', 'object', and 'str'.

686

asked Jun 07 '18 00:06

TommyTorty10

1 Answers

CSV files may only contain text, so dictionaries are out of scope. Therefore, you need to read the text literally to convert to dict. One way is using ast.literal_eval:

import pandas as pd
from ast import literal_eval
from io import StringIO

mystr = StringIO("""a
{'word': 5.7}
{'khfds': 8.34}""")

df = pd.read_csv(mystr)

df['a'] = df['a'].apply(literal_eval)

print(df['a'].apply(lambda x: type(x)))

0    <class 'dict'>
1    <class 'dict'>
Name: a, dtype: object

However, I strongly recommend you do not use Pandas specifically to store pointers to dictionaries. Pandas works best with contiguous memory blocks, e.g. separate numeric data into numeric series.

answered Oct 01 '22 20:10

jpp

Related questions
                            
                                Moving Collections between axes
                            
                                How to chain multiple command line responses in Python?
                            
                                Euclidean distance, different results between Scipy, pure Python, and Java
                            
                                Scipy randint vs numpy randint
                            
                                Pandas alternative to apply - to create new column based on multiple columns
                            
                                R_ext/eventloop.h: No such file error while installing rpy2 using pip
                            
                                Flask: serve assets without leading slash using url_for
                            
                                Django on GAE - How to automatically 'migrate' on deploy?
                            
                                Error trying to use cvtColor with cv2.COLOR_YUV2BGR_Y422 - error: (-215) scn == 2 && depth == 0 in function cv::cvtColor
                            
                                How can I manage a queue of requests in my Flask service?
                            
                                Polymorphism and pybind11
                            
                                Is there a “breadth-first” search option available in os.walk() or equivalent Python function?
                            
                                Replicate curl negotiate connection (using kerberos auth) in Python
                            
                                Unable to use multiple proxies within Scrapy spider
                            
                                ImportError: cannot import name 'string_int_label_map_pb2'
                            
                                CFFI fails in Python (Linux) virtual environment -- attempting to install cryptography package in venv
                            
                                DeviceCheck: Unable to verify authorization token
                            
                                "ValueError: Trying to share variable $var, but specified dtype float32 and found dtype float64_ref" when trying to use get_variable
                            
                                Why can't I access builtins if I use a custom dict as a function's globals?
                            
                                pandas rolling() function with monthly offset

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Reading pandas dataframe that contains dictionaries in cells from csv

Tags:

python

dictionary

pandas

dataframe

csv

TommyTorty10

People also ask

1 Answers

jpp

Recent Activity

Donate For Us