pandas read json not working on MultiIndex

Tags:

I'm trying to read in a dataframe created via df.to_json() via pd.read_json but I'm getting a ValueError. I think it may have to do with the fact that the index is a MultiIndex but I'm not sure how to deal with that.

The original dataframe of 55k rows is called psi and I created test.json via:

psi.head().to_json('test.json')

Hereis the output of print psi.head().to_string() if you want to use that.

When I do it on this small set of data (5 rows), I get a ValueError.

! wget --no-check-certificate https://gist.githubusercontent.com/olgabot/9897953/raw/c270d8cf1b736676783cc1372b4f8106810a14c5/test.json
import pandas as pd
pd.read_json('test.json')

Here's the full stack:

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-14-1de2f0e65268> in <module>()
      1 get_ipython().system(u' wget https://gist.githubusercontent.com/olgabot/9897953/raw/c270d8cf1b736676783cc1372b4f8106810a14c5/test.json'>)
      2 import pandas as pd
----> 3 pd.read_json('test.json')

/home/obot/virtualenvs/envy/lib/python2.7/site-packages/pandas/io/json.pyc in read_json(path_or_buf, orient, typ, dtype, convert_axes, convert_dates, keep_default_dates, numpy, precise_float, date_unit)
    196         obj = FrameParser(json, orient, dtype, convert_axes, convert_dates,
    197                           keep_default_dates, numpy, precise_float,
--> 198                           date_unit).parse()
    199 
    200     if typ == 'series' or obj is None:

/home/obot/virtualenvs/envy/lib/python2.7/site-packages/pandas/io/json.pyc in parse(self)
    264 
    265         else:
--> 266             self._parse_no_numpy()
    267 
    268         if self.obj is None:

/home/obot/virtualenvs/envy/lib/python2.7/site-packages/pandas/io/json.pyc in _parse_no_numpy(self)
    481         if orient == "columns":
    482             self.obj = DataFrame(
--> 483                 loads(json, precise_float=self.precise_float), dtype=None)
    484         elif orient == "split":
    485             decoded = dict((str(k), v)

ValueError: No ':' found when decoding object value

> /home/obot/virtualenvs/envy/lib/python2.7/site-packages/pandas/io/json.py(483)_parse_no_numpy()
    482             self.obj = DataFrame(
--> 483                 loads(json, precise_float=self.precise_float), dtype=None)
    484         elif orient == "split":

But when I do it on the whole dataframe (55k rows) then I get an invalid pointer error and the IPython kernel dies. Any ideas?

EDIT: added how the json was generated in the first place.

261

asked Mar 31 '14 17:03

Olga Botvinnik

1 Answers

This is not implemented ATM, see the issue here: https://github.com/pydata/pandas/issues/4889.

You can simply reset the index first, e.g

df.reset_index().to_json(...)

and it will work.

135

answered Oct 27 '22 15:10

Jeff

Related questions
                            
                                Why __instancecheck__ is not always called depending on argument?
                            
                                How to Mock a missing attribute
                            
                                Flask-Admin Blueprint creation during Testing
                            
                                Efficient manipulation of a list of cartesian coordinates in Python
                            
                                pandas: fill a column with some numpy arrays
                            
                                How to set ffmpeg for matplotlib in mac os x
                            
                                Python socket server/client programming
                            
                                How to cache a Django Model in Memory [duplicate]
                            
                                Python PIL, preserve quality when resizing and saving [duplicate]
                            
                                Python sort a List by length of value in tuple
                            
                                Installing Python module with pip
                            
                                Django REST Framework, pre_save() and serializer.is_valid(), how do they work?
                            
                                Root logger in dictconfig
                            
                                scipy p-value returns 0.0
                            
                                Python swapping lists
                            
                                What's the difference between scipy.special.binom and scipy.misc.comb?
                            
                                How to read the alpha channel of a TIFF image in Python OpenCV?
                            
                                python Convert Encoding:LookupError: unknown encoding: ansi
                            
                                In numpy, calculating a matrix where each cell contains the product of all the other entries in that row
                            
                                How to restart a python script after it finishes

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

pandas read json not working on MultiIndex

Tags:

python

json

pandas

Olga Botvinnik

People also ask

1 Answers

Jeff

Recent Activity

Donate For Us