I am working with a df and using numpy to transform data - including setting blanks (or '') to NaN. But when I write the df to csv - the output contains the string 'nan' as oppose to being NULL. I have looked around but can't find a workable solution. Here's the basic issue: <pre class="prettyprint"><code>df index x y z 0 1 NaN 2 1 NaN 3 4 </code></pre> CSV output: <pre class="prettyprint"><code>index x y z 0 1 nan 2 1 nan 3 4 </code></pre> I have tried a few things to set 'nan' to NULL but the csv output results in a 'blank' rather than NULL: <pre class="prettyprint"><code>dfDemographics = dfDemographics.replace('nan', np.NaN) dfDemographics.replace(r'\s+( +\.)|#', np.nan, regex=True).replace('', np.nan) dfDemographics = dfDemographics.replace('nan', '') # of course, this wouldn't work, but tried it anyway. </code></pre> Any help would be appreciated.

Pandas to the rescue, use <code>na_rep</code> to fix your own representation for NaNs. <pre class="prettyprint"><code>df.to_csv('file.csv', na_rep='NULL') </code></pre> <code>file.csv</code> <pre class="prettyprint"><code>,index,x,y,z 0,0,1.0,NULL,2 1,1,NULL,3.0,4 </code></pre>

Pandas Changing the format of NaN values when saving to CSV

Tags:

I am working with a df and using numpy to transform data - including setting blanks (or '') to NaN. But when I write the df to csv - the output contains the string 'nan' as oppose to being NULL.

I have looked around but can't find a workable solution. Here's the basic issue:

df index x    y   z 0     1   NaN  2 1     NaN  3   4

CSV output:

index x    y   z 0     1   nan  2 1     nan  3   4

I have tried a few things to set 'nan' to NULL but the csv output results in a 'blank' rather than NULL:

dfDemographics = dfDemographics.replace('nan', np.NaN) dfDemographics.replace(r'\s+( +\.)|#', np.nan, regex=True).replace('',  np.nan) dfDemographics = dfDemographics.replace('nan', '')  # of course, this wouldn't work, but tried it anyway.

Any help would be appreciated.

832

asked Jun 16 '18 19:06

Jerry

1 Answers

Pandas to the rescue, use na_rep to fix your own representation for NaNs.

df.to_csv('file.csv', na_rep='NULL')

file.csv

,index,x,y,z 0,0,1.0,NULL,2 1,1,NULL,3.0,4

167

answered Oct 06 '22 00:10

cs95

Related questions
                            
                                How to make chrome devtools to recognise moment.js
                            
                                Flutter is not able to install the apk into the real device suddenly
                            
                                AWS GraphQL: Variable 'input' has coerced Null value for NonNull type 'Input!'
                            
                                TabBarView with variable height inside a ListView
                            
                                "Error: Arguments array must have arguments." AppModule
                            
                                Java 11 HttpClient not sending basic authentication
                            
                                float/double Math.Round in C# [duplicate]
                            
                                OneHotEncoder categorical_features deprecated, how to transform specific column
                            
                                Razor pages and webapi in the same project
                            
                                React Material-UI Modal TypeError: Cannot read property 'hasOwnProperty' of undefined
                            
                                iOS 13 Killing app because it never posted an incoming call to the system after receiving a PushKit VoIP callback
                            
                                How to best capture and log scp output?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With