Pandas read csv file with float values results in weird rounding and decimal digits

I have a csv file containing numerical values such as 1524.449677. There are always exactly 6 decimal places.

When I import the csv file (and other columns) via pandas read_csv, the column automatically gets the datatype object. My issue is that the values are shown as 2470.6911370000003 which actually should be 2470.691137. Or the value 2484.30691 is shown as 2484.3069100000002.

This seems to be a datatype issue in some way. I tried to explicitly provide the data type when importing via read_csv by giving the dtype argument as {'columnname': np.float64}. Still the issue did not go away.

How can I get the values imported and shown exactly as they are in the source csv file?

asked Nov 18 '17 16:11

beta

1 Answer

Pandas uses a dedicated decimal-to-binary converter that trades accuracy for speed.

Passing float_precision='round_trip' to read_csv fixes this.

Check out this page for more detail on this.
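To see the option in isolation, here is a minimal sketch that parses a small in-memory CSV instead of a file. The column name `value` and the sample numbers are made up for illustration. The comparison relies on Python's built-in `float()`, which converts a decimal string to the nearest double.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for the real csv file.
csv_text = "value\n1524.449677\n2470.691137\n2484.306910\n"

# Parse with the slower but exact round-trip converter.
df_rt = pd.read_csv(io.StringIO(csv_text), float_precision="round_trip")

# Reference values: Python's float() picks the nearest double for each string.
expected = [float(s) for s in csv_text.split()[1:]]

print(df_rt["value"].dtype)
print(df_rt["value"].tolist() == expected)
print(df_rt["value"].tolist())
```

The values are still binary doubles, so they are not stored as exact decimals. The point is only that each one is the closest double to the text in the file, which is why it prints back in its short form.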

After processing your data, if you want to save it back to a csv file, you can pass float_format="%.nf" to to_csv, where n is the number of decimal places to write.
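A small sketch of the writing side, assuming six decimal places as in the question. The frame is built by hand with a made-up column name `value`. Calling `to_csv` without a path returns the CSV text as a string, which makes the result easy to inspect.

```python
import pandas as pd

# Hypothetical frame standing in for the processed data.
df_out = pd.DataFrame({"value": [1524.449677, 2470.691137, 2484.30691]})

# float_format is applied to float columns using %-style formatting.
csv_text = df_out.to_csv(index=False, float_format="%.6f")

lines = csv_text.splitlines()
print(lines)
```

A fixed format pads with trailing zeros, so 2484.30691 is written as 2484.306910 here.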

A full example:

import pandas as pd

df_in = pd.read_csv(source_file, float_precision='round_trip')
df_out = ...  # some processing of df_in
df_out.to_csv(target_file, float_format="%.3f")  # for 3 decimal places

answered Oct 16 '22 06:10

Paula Livingstone
