I'm reading a CSV with float numbers like this: <pre class="prettyprint"><code>Bob,0.085 Alice,0.005 </code></pre> And import into a dataframe, and write this dataframe to a new place <pre class="prettyprint"><code>df = pd.read_csv(orig) df.to_csv(pandasfile) </code></pre> Now this <code>pandasfile</code> has: <pre class="prettyprint"><code>Bob,0.085000000000000006 Alice,0.0050000000000000001 </code></pre> What happen? maybe I have to cast to a different type like float32 or something? Im using pandas 0.9.0 and numpy 1.6.2.

As mentioned in the comments, it is a general floating point problem. However you can use the <code>float_format</code> key word of <code>to_csv</code> to hide it: <pre class="prettyprint"><code>df.to_csv('pandasfile.csv', float_format='%.3f') </code></pre> or, if you don't want 0.0001 to be rounded to zero: <pre class="prettyprint"><code>df.to_csv('pandasfile.csv', float_format='%g') </code></pre> will give you: <pre class="prettyprint"><code>Bob,0.085 Alice,0.005 </code></pre> in your output file. For an explanation of <code>%g</code>, see Format Specification Mini-Language.

float64 with pandas to

I'm reading a CSV with float numbers like this:

Bob,0.085 Alice,0.005

And import into a dataframe, and write this dataframe to a new place

df = pd.read_csv(orig) df.to_csv(pandasfile)

Now this pandasfile has:

Bob,0.085000000000000006 Alice,0.0050000000000000001

What happen? maybe I have to cast to a different type like float32 or something?

Im using pandas 0.9.0 and numpy 1.6.2.

What does to_csv do in pandas?

Pandas DataFrame to_csv() function converts DataFrame into CSV data. We can pass a file object to write the CSV data into a file. Otherwise, the CSV data is returned in the string format.

Does to_csv overwrite?

If the file already exists, it will be overwritten. If no path is given, then the Frame will be serialized into a string, and that string will be returned.

Does to_csv create directory?

to_csv does create the file if it doesn't exist as you said, but it does not create directories that don't exist. Ensure that the subdirectory you are trying to save your file within has been created first. This can easily be wrapped up in a function if you need to do this frequently.

As mentioned in the comments, it is a general floating point problem.

However you can use the float_format key word of to_csv to hide it:

df.to_csv('pandasfile.csv', float_format='%.3f')

or, if you don't want 0.0001 to be rounded to zero:

df.to_csv('pandasfile.csv', float_format='%g')

will give you:

Bob,0.085 Alice,0.005

in your output file.

For an explanation of %g, see Format Specification Mini-Language.

float64 with pandas to_csv

Tags:

python

pandas

numpy

avances123

People also ask

1 Answers

bmu

Recent Activity

Donate For Us