I am trying to put some numbers into numpy array <pre class="prettyprint"><code>>>> np.array([20000001]).astype('float32') array([ 20000000.], dtype=float32) </code></pre> where did 1 go?

You simply don't have enough precision. The <code>float32</code> has only approximately 7 digits of accuracy, whereas the <code>float64</code> has approximately 16 digits of accuracy. Thus, any time you convert to a <code>float32</code>, it's only guaranteed to be "correct" to within about a part in 10^7. So, for example, you can try this: <pre class="prettyprint"><code>>>> np.array([20000001]).astype('float64') array([ 20000001.]) </code></pre> That's the expected answer. (The <code>dtype=float64</code> is automatically omitted, because that's the default.) In fact, you can go further and find <pre class="prettyprint"><code>>>> np.array([2000000000000001]).astype('float64')[0] 2000000000000001.0 </code></pre> but <pre class="prettyprint"><code>>>> np.array([20000000000000001]).astype('float64')[0] 20000000000000000.0 </code></pre> At some point, no matter how high your precision, you'll always get to the point where <code>float</code>s drop the least significant digits. See here for more info on <code>float</code>s. On the other hand, python's <code>int</code> objects have many more digits they can keep track of. In python 3, it's practically unlimited. So <code>int</code>s have basically infinite precision. See here for more info on <code>int</code>s.

First of all, <code>float64</code> works in this case: <pre class="prettyprint"><code>>>> np.array([20000001]).astype('float32') array([ 20000000.], dtype=float32) >>> np.array([20000001]).astype('float64') array([ 20000001.]) </code></pre> How does a <code>float</code> work under the hood: <img src="https://i.stack.imgur.com/pWYApm.jpg" alt="enter image description here"> What's the difference between <code>float32</code> and <code>float64</code>?: <ul> <li>32bit (single precision float): 24 bit significand</li> <li>64bit (double precision float): 53 bit significand</li> </ul> With <code>float32</code>, you get 23 bits to represent the digits plus 1 bit to represent the sign. Lets view <code>20000001</code> in binary: <pre class="prettyprint"><code>0b 1 0011 0001 0010 1101 0000 0001 ----> 0b 1 0011 0001 0010 1101 0000 00 </code></pre> So the last two bits "01" will get cut off when converting from <code>int</code> to <code>float32</code>. Interestingly, converting <code>20000003</code> will get you <code>20000004</code>: <pre class="prettyprint"><code>>>> np.array([20000003]).astype('float32') array([ 20000004.], dtype=float32) </code></pre> And that is: <pre class="prettyprint"><code>0b 1 0011 0001 0010 1101 0000 0011 ----> 0b 1 0011 0001 0010 1101 0000 01 </code></pre>

Why numpy converts 20000001 int to float32 as 20000000.?

Tags:

python

numpy

I am trying to put some numbers into numpy array

>>> np.array([20000001]).astype('float32')
array([ 20000000.], dtype=float32)

where did 1 go?

245

asked Feb 15 '17 15:02

Bob

Video Answer

2 Answers

You simply don't have enough precision. The float32 has only approximately 7 digits of accuracy, whereas the float64 has approximately 16 digits of accuracy. Thus, any time you convert to a float32, it's only guaranteed to be "correct" to within about a part in 10^7. So, for example, you can try this:

>>> np.array([20000001]).astype('float64')
array([ 20000001.])

That's the expected answer. (The dtype=float64 is automatically omitted, because that's the default.) In fact, you can go further and find

>>> np.array([2000000000000001]).astype('float64')[0]
2000000000000001.0

but

>>> np.array([20000000000000001]).astype('float64')[0]
20000000000000000.0

At some point, no matter how high your precision, you'll always get to the point where floats drop the least significant digits. See here for more info on floats.

On the other hand, python's int objects have many more digits they can keep track of. In python 3, it's practically unlimited. So ints have basically infinite precision. See here for more info on ints.

125

answered Oct 06 '22 01:10

Mike

First of all, float64 works in this case:

>>> np.array([20000001]).astype('float32')
array([ 20000000.], dtype=float32)
>>> np.array([20000001]).astype('float64')
array([ 20000001.])

How does a float work under the hood:

enter image description here

What's the difference between float32 and float64?:

32bit (single precision float): 24 bit significand
64bit (double precision float): 53 bit significand

With float32, you get 23 bits to represent the digits plus 1 bit to represent the sign. Lets view 20000001 in binary:

0b 1 0011 0001 0010 1101 0000 0001  ---->
0b 1 0011 0001 0010 1101 0000 00

So the last two bits "01" will get cut off when converting from int to float32.

Interestingly, converting 20000003 will get you 20000004:

>>> np.array([20000003]).astype('float32')
array([ 20000004.], dtype=float32)

And that is:

0b 1 0011 0001 0010 1101 0000 0011  ---->
0b 1 0011 0001 0010 1101 0000 01

answered Oct 05 '22 23:10

greedy52

Related questions
                            
                                How to install data_files to absolute path?
                            
                                Named parameter with no default?
                            
                                how to reverse Python uuid5() to its value?
                            
                                Divide Column in Pandas Dataframe by Sum of Column
                            
                                Custom function which performs create and update on DRF modelViewSet
                            
                                Most pythonic way to plot multiple signals
                            
                                How to use async/await in python 3.5+
                            
                                Python Pandas - how to get top n values and the sum of all other values
                            
                                How to update a list of variables in python?
                            
                                Drawing convexHull in openCV2 Python
                            
                                MySQL: django.db.utils.OperationalError: (1698, "Access denied for user 'root'@'localhost'") with correct username and pw
                            
                                Python tkinter: What are the correct values for the anchor option in the message widget?
                            
                                How to install numpy to Python 3.5?
                            
                                Plotting a choropleth map (with geopandas) using a user_defined classification scheme
                            
                                Make a contour plot by using three 1D arrays in python
                            
                                matplotlib: Can I interrupt an `axhline` with text?
                            
                                Inserting a list holding multiple values in MySQL using pymysql
                            
                                Python - how to pass a dictionary into defaultdict as value and not as a reference
                            
                                Incrementing class variables dynamically in Python
                            
                                python: can statement be inside expression?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With