I have a dataframe in pandas and my goal is to write each row of the dataframe as a new json file.
I'm a bit stuck right now. My intuition was to iterate over the rows of the dataframe (using df.iterrows) and use json.dumps to dump each row to a file, but so far I haven't gotten it to work.
Any thoughts?
Pandas DataFrames have a built-in to_json() method that converts the object to a JSON string, or writes it straight to a file if you pass a path. Its most important parameter is orient, which accepts 'columns', 'records', 'index', 'split', 'table', and 'values' and controls the layout of the resulting JSON.
The reverse direction is read_json(), which parses a JSON string (or file) into a DataFrame; it also takes an orient parameter specifying the format of the JSON.
Finally, to_json() has a boolean lines parameter: when orient is 'records', lines=True writes line-delimited JSON (one object per line). Any other orient raises a ValueError, since the other layouts are not list-like.
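As a minimal sketch of those pieces together (the toy frame and values here are just for illustration):

import pandas as pd
from io import StringIO

df = pd.DataFrame({"a": [1, 2], "b": ["x", "y"]})

# One JSON string for the whole frame; orient controls the layout.
df.to_json(orient="records")              # '[{"a":1,"b":"x"},{"a":2,"b":"y"}]'

# With orient="records", lines=True gives newline-delimited JSON, one object per row.
df.to_json(orient="records", lines=True)

# read_json goes the other way; wrapping the string in StringIO avoids the
# deprecation warning newer pandas versions emit for literal JSON strings.
round_trip = pd.read_json(StringIO(df.to_json(orient="records")), orient="records")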
Looping over indices is very inefficient.
A faster technique:
df['json'] = df.apply(lambda x: x.to_json(), axis=1)
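From there, if you do want one file per row, a sketch of writing each of those strings out (the row{}.json naming is just an example):

# df['json'] now holds one JSON string per row; write each to its own file.
for i, json_str in df['json'].items():
    with open("row{}.json".format(i), "w") as f:
        f.write(json_str)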
Pandas DataFrames have a to_json method that will do it for you: http://pandas.pydata.org/pandas-docs/stable/generated/pandas.DataFrame.to_json.html
If you want each row in its own file, you can iterate over the index (and use the index to help name the files):
for i in df.index:
    df.loc[i].to_json("row{}.json".format(i))
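For example, with a toy frame (names made up here), this writes row0.json and row1.json. The same pattern works with iterrows(), which the question mentioned, since each row comes back as a Series with its own to_json():

import pandas as pd

df = pd.DataFrame({"name": ["alice", "bob"], "score": [10, 20]})  # illustrative data

for i, row in df.iterrows():
    row.to_json("row{}.json".format(i))   # e.g. row0.json: {"name":"alice","score":10}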
Extending @MrE's answer: if you're looking to convert multiple columns from a single row into another column holding the content in JSON format (rather than separate JSON files as output), I've had speed issues while using:
df['json'] = df.apply(lambda x: x.to_json(), axis=1)
I've achieved significant speed improvements on a dataset of 175K records and 5 columns using this line of code:
df['json'] = df.to_json(orient='records', lines=True).splitlines()
Speed went from >1 min to 350 ms.
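If the goal is separate files rather than a column, the same trick still pays off, since the expensive serialization happens in a single to_json() call; a sketch (file naming is illustrative):

# splitlines() yields one JSON string per row; enumerate numbers the files
# positionally, which may differ from df.index.
for i, line in enumerate(df.to_json(orient="records", lines=True).splitlines()):
    with open("row{}.json".format(i), "w") as f:
        f.write(line)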
Using apply, this can be done as follows (note that the json module needs to be imported):

import json

def writejson(row):
    # row is a Series; "filename" and "json" are columns of the dataframe
    with open(row["filename"] + ".json", "w") as outfile:
        json.dump(row["json"], outfile, indent=2)

in_df.apply(writejson, axis=1)
This assumes the dataframe has a "filename" column holding the output filename for each row, and a "json" column with the content to dump.
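A toy frame showing the expected shape (column names follow the snippet above; the values are made up):

import pandas as pd

in_df = pd.DataFrame({
    "filename": ["first", "second"],   # becomes first.json, second.json
    "json": [{"a": 1}, {"a": 2}],      # payload handed to json.dump
})

in_df.apply(writejson, axis=1)         # writes the two files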