Read Json with NaN into Python and Pandas

Tags:

I understand that NaN is not allowed in JSON files. I usually use

import pandas as pd 
pd.read_json('file.json')

to read in JSON into python. Looking through the documentation, I do not see an option to handle that value.

I have a JSON file, data.json, that looks like

Click to copy

[{"city": "Los Angeles","job":"chef","age":30},
 {"city": "New York","job":"driver","age":35},
 {"city": "San Jose","job":"pilot","age":NaN}]

How can I read this into python/pandas and handle the NaN values?

EDIT:

Amazing answer below!! Thanks fixxxer!! Just so it's documented, reading it in from a separate file

Click to copy

import pandas as pd
import json

text=open('data.json','r')
x=text.read()

y=json.loads(x)
data=pd.DataFrame(y)
data.head()

504

asked Apr 26 '15 08:04

Max

1 Answers

Read the json file into a variable:

Click to copy

x = '''[{"city": "Los Angeles","job":"chef","age":30},  {"city": "New York","job":"driver","age":35},  {"city": "San Jose","job":"pilot","age":NaN}]'''

Now, load it with json.loads

Click to copy

In [41]: import json

In [42]: y = json.loads(x)

In [43]: y
Out[43]: 
[{u'age': 30, u'city': u'Los Angeles', u'job': u'chef'},
 {u'age': 35, u'city': u'New York', u'job': u'driver'},
 {u'age': nan, u'city': u'San Jose', u'job': u'pilot'}]

And,

Click to copy

    In [44]: pd.DataFrame(y)
Out[44]: 
   age         city     job
0   30  Los Angeles    chef
1   35     New York  driver
2  NaN     San Jose   pilot

answered Sep 23 '22 19:09

fixxxer

Related questions
                            
                                Pandas backwards compatibility issue with pickle 0.14.1 and 0.15.2
                            
                                Having trouble implementing a readlink() function
                            
                                Are Mixin classes abstract base classes
                            
                                Why does pandas.DataFrame.update change the dtypes of the updated dataframe?
                            
                                python module not working in PyCharm with virtualenv
                            
                                How to read HDF5 files that have only datasets (no groups) using h5py?
                            
                                Apply a Python function to an std::vector via Cython (callback)
                            
                                Extending threading.Timer for returning value from function gives TypeError
                            
                                Compressing request body with python-requests?
                            
                                Editing workbooks with rich text in openpyxl
                            
                                What is the best practice for storing UI messaging strings in Python/Django?
                            
                                Embedding multiple gridspec layouts on a single matplotlib figure?
                            
                                numpy sort acting weirdly when sorting on a pandas DataFrame
                            
                                Efficient data structure keeping objects sorted on multiple keys
                            
                                Running PEP8 checks from Python
                            
                                Python - safe & elegant way to set a variable from function that may return None
                            
                                Multiprocessing with Qt works in windows but not linux
                            
                                Split a Python string with nested separated symbol
                            
                                How to extrapolate curves in Python?
                            
                                Python : overflow error long int too large to convert to float

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Read Json with NaN into Python and Pandas

Tags:

python

json

pandas

nan

Max

People also ask

1 Answers

fixxxer

Recent Activity

Donate For Us