Numpy: Check if float array contains whole numbers

Tags:

python

floating-point

numpy

In Python, you can check whether a float holds an integer value with n.is_integer(), as described in this Q&A: How to check if a float value is a whole number.

Does numpy have a similar operation that can be applied to arrays? Something that would allow the following:

>>> x = np.array([1.0, 2.1, 3.0, 3.9])
>>> mask = np.is_integer(x)
>>> mask
array([True, False, True, False], dtype=bool)

It is possible to do something like

>>> mask = (x == np.floor(x))

or

>>> mask = (x == np.round(x))

but both call extra functions and create temporary arrays that could potentially be avoided.

Does numpy have a vectorized function that checks for fractional parts of floats in a way similar to Python's float.is_integer?

996

asked Jan 27 '16 15:01

Mad Physicist

2 Answers

From what I can tell, there is no such function that returns a boolean array indicating whether floats have a fractional part or not. The closest I can find is np.modf which returns the fractional and integer parts, but that creates two float arrays (at least temporarily), so it might not be best memory-wise.
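As a rough sketch of the np.modf route (the helper name is_integer_modf is made up for illustration), the fractional part can be compared against zero. One caveat: np.modf reports a zero fractional part for infinities, so the sketch masks out non-finite values to stay in line with float.is_integer.

```python
import numpy as np

def is_integer_modf(x):
    """Boolean mask of finite elements whose fractional part is zero."""
    x = np.asarray(x, dtype=np.float64)
    # np.modf returns (fractional_part, integral_part) as two float arrays.
    frac, _ = np.modf(x)
    # modf(inf) has a fractional part of 0, so exclude non-finite values
    # explicitly; NaN already compares unequal to 0.
    return (frac == 0) & np.isfinite(x)

x = np.array([1.0, 2.1, 3.0, 3.9])
mask = is_integer_modf(x)
```

This still allocates the two float arrays mentioned above, so it is a convenience rather than a memory saving.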

If you're happy working in place, you can try something like:

>>> np.mod(x, 1, out=x)
>>> mask = (x == 0)

This should save memory versus using round or floor (where you have to keep x around), but of course you lose the original x.
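If losing x is not acceptable, one possible middle ground is to write the remainder into a scratch buffer that you allocate once and reuse across calls. This is only a sketch: the function name is_integer_mod and the scratch parameter are assumptions for illustration, not part of NumPy.

```python
import numpy as np

def is_integer_mod(x, scratch=None):
    """Mask of whole-number elements; leaves x untouched."""
    x = np.asarray(x, dtype=np.float64)
    if scratch is None:
        # One float buffer the same shape as x; pass it in to reuse it.
        scratch = np.empty_like(x)
    # mod(inf, 1) is NaN and raises an "invalid value" warning; silence it,
    # since NaN == 0 is False, which is the answer we want anyway.
    with np.errstate(invalid="ignore"):
        np.mod(x, 1, out=scratch)
    return scratch == 0

x = np.array([1.0, 2.1, 3.0, 3.9])
buf = np.empty_like(x)
mask = is_integer_mod(x, scratch=buf)
```

The boolean result is still a new array, but the float temporary is paid for only once if the same buffer is reused.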

The other option is to request that it be implemented in NumPy, or to implement it yourself.

180

answered Nov 12 '22 16:11

hunse

While the accepted method of (x % 1) == 0 is quite adequate, it bothers me that there is no way to accomplish this natively in NumPy, especially given the existence of float.is_integer in plain Python.

I therefore did a bit of research on the floating point formats supported by NumPy (float16, float32, float64, and float128, which is actually extended precision), and on how to write a ufunc.

The result is that for IEEE 754 floats small enough to fit into a corresponding unsigned integer type (pretty much everything up to float64 on a normal machine), you can do the checks with some simple bit twiddling. For example, here is a C99 function that very quickly tells you if your float32 contains an integer value:

#include <stdint.h>

int is_integer(float n)
{
    uint32_t k = ((union { float n; uint32_t k; }){n}).k;

    // Zero when everything except sign bit is zero
    if((k & 0x7FFFFFFF) == 0) return 1;

    uint32_t exponent = k & 0x7F800000;

    // NaN or Inf when the exponent bits are all ones
    // Guaranteed fraction when the unbiased exponent < 0, i.e. |n| < 1
    if(exponent == 0x7F800000 || exponent < 0x3F800000) return 0;
    // Guaranteed integer when the unbiased exponent >= FLT_MANT_DIG - 1
    if(exponent >= 0x4B000000) return 1;
    // Otherwise, check that the significand bits past the exponent are zeros
    return (k & (0x7FFFFF >> ((exponent >> 23) - 0x7F))) == 0;
}
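For readers who want to experiment without compiling anything, the same bit tests can be sketched in NumPy by viewing the float32 buffer as uint32. This is an illustrative port, not the code from the linked repository. The name is_integer_f32 is made up here, and because it allocates several temporaries it demonstrates the logic rather than the speed.

```python
import numpy as np

def is_integer_f32(x):
    """Vectorized port of the float32 bit test above (illustrative only)."""
    x = np.asarray(x, dtype=np.float32)
    k = x.view(np.uint32)  # reinterpret the raw IEEE 754 bits

    # Zero when everything except the sign bit is zero
    is_zero = (k & np.uint32(0x7FFFFFFF)) == 0

    exponent = k & np.uint32(0x7F800000)
    finite = exponent != np.uint32(0x7F800000)        # not NaN or Inf
    at_least_one = exponent >= np.uint32(0x3F800000)  # |x| >= 1

    # Number of significand bits that sit above the binary point.
    # Clipping to [0, 23] keeps the shift valid for every input; at 23
    # the mask below becomes 0, which covers the "guaranteed integer" case.
    shift = np.clip((exponent >> np.uint32(23)).astype(np.int64) - 127, 0, 23)
    frac_mask = np.uint32(0x7FFFFF) >> shift.astype(np.uint32)
    no_fraction = (k & frac_mask) == 0

    return is_zero | (finite & at_least_one & no_fraction)

demo = is_integer_f32([1.0, 2.5, -3.0, 0.5, np.inf, np.nan])
```

A quick sanity check is to compare the result against float.is_integer applied element by element to the same float32 values.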

I went ahead and wrapped this function and its siblings in a ufunc, which can be found here: https://github.com/madphysicist/is_integer_ufunc. One nice feature is that this ufunc returns True for all integer types instead of raising an error. Another is that it runs anywhere from 5x to 40x faster than (x % 1) == 0, depending on dtype and input size.
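The "True for all integer types" behavior can be approximated in pure NumPy with a small dtype dispatch. The sketch below is not the linked ufunc, and is_integer_any_dtype is a hypothetical name; it falls back to the (x % 1) == 0 approach for floats, so it has none of the speed advantage.

```python
import numpy as np

def is_integer_any_dtype(x):
    """All True for integer dtypes; remainder test for float dtypes."""
    x = np.asarray(x)
    if np.issubdtype(x.dtype, np.integer):
        # Integer arrays hold whole numbers by construction.
        return np.ones(x.shape, dtype=bool)
    if np.issubdtype(x.dtype, np.floating):
        # mod(inf, 1) is NaN, which compares unequal to 0, so Inf and NaN
        # come out False; suppress the accompanying warning.
        with np.errstate(invalid="ignore"):
            return np.mod(x, 1) == 0
    raise TypeError(f"unsupported dtype: {x.dtype}")

ints = is_integer_any_dtype(np.array([1, 2, 3], dtype=np.int32))
floats = is_integer_any_dtype(np.array([1.0, 2.5, np.inf], dtype=np.float32))
```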

Based on the linked tutorial, you can install with python setup.py {build_ext --inplace, build, install}, depending on how badly you want it. Perhaps I should see if the NumPy community is interested in including this ufunc.

answered Nov 12 '22 15:11

Mad Physicist
