Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

check type within numpy array

I have different types of data. most of them are int and sometimes float. The int is different in size so 8/ 16/ 32 bits are the sizes.
For this situation I'm creating a numerical type converter. therefore i check the type by using isinstence(). This because I have read that isinstance() is less worse than type().

The point is that a lot of data i get is numpy arrays. I use spyder as IDE and then i see by the variables also a type. but when i type isinstance(var,'type i read') i get False.

I did some checks:

a = 2.17 
b = 3 
c = np.array(np.random.rand(2, 8))
d = np.array([1])

for there isinstance(var,type) i get:

isinstance(a, float)
True
isinstance(b, int)
True
isinstance(c, float)  # or isinstance(c, np.float64)
False
isinstance(d, int)  # or isinstance(c, np.int32)
False

c and d are True when i ask

isinstance(c, np.ndarray)
True
isinstance(d, np.ndarray)
True

i can check with step in the ndarray by

isinstance(c[i][j], np.float64)
True
isinstance(d[i], np.int32)
True

but this means that for every dimension i have to add a new index otherwise it is False again. I can check there type with dtype like c.dtype == 'float64'...

Oke so for what i have find and tried... My questions are basicly:

  • how is the var.dtype method compared to isinstance() and type() (worst/ better etc)?
  • if var.dtype is even worse as isinstance() is there some method in the isinstance() without all the manual indexing? (autoindexing etc)?
like image 397
Jan-Bert Avatar asked Oct 28 '16 19:10

Jan-Bert


People also ask

How can you identify the datatype of a given NumPy array?

The astype() function creates a copy of the array, and allows you to specify the data type as a parameter. The data type can be specified using a string, like 'f' for float, 'i' for integer etc. or you can use the data type directly like float for float and int for integer.

How do you find the data type of an array?

The data type of an array in Python can be found with the dtype() function. Within this dtype() function, you specify the array. Python will then return the data type of the array.

What is data type in NumPy?

A data type object (an instance of numpy. dtype class) describes how the bytes in the fixed-size block of memory corresponding to an array item should be interpreted. It describes the following aspects of the data: Type of the data (integer, float, Python object, etc.)

How do I change the datatype of a NumPy array?

We have a method called astype(data_type) to change the data type of a numpy array. If we have a numpy array of type float64, then we can change it to int32 by giving the data type to the astype() method of numpy array. We can check the type of numpy array using the dtype class.


4 Answers

An array is an object of type np.ndarray. Its values or elements are stored in a data buffer, which can be thought of as a contiguous block of memory bytes. The bytes in the data buffer do not have a type, because they are not Python objects.

The array has a dtype parameter, which is used to interpret those bytes. If dtype is int32 (there are various synonyms), 4 bytes are interpreted as an integer. Accessing an element, say c[0] gives a new object that depends on the dtype, e.g. an object type np.int32.

c[0].item will give an Python object of the corresponding type:

In [2102]: c=np.array([1]) In [2103]: c.dtype Out[2103]: dtype('int32') In [2104]: type(c) Out[2104]: numpy.ndarray In [2105]: type(c[0]) Out[2105]: numpy.int32 In [2107]: c[0].item() Out[2107]: 1 In [2108]: type(c[0].item()) Out[2108]: int 

(And c[0].dtype is the same as for c.dtype; you don't need to index individual elements of an array to check their dtype).

The same 4 bytes of this array can be viewed as dtype int8 - a single byte integer.

In [2112]: c.view('b') Out[2112]: array([1, 0, 0, 0], dtype=int8) 

A single element of this alternate view is np.int8, but when I take item(), I get a Python integer. There isn't a int8 Python numeric type.

In [2113]: type(c.view('b')[0]) Out[2113]: numpy.int8 In [2115]: type(c.view('b')[0].item()) Out[2115]: int 

A list contains pointers to Python objects, each of which has a type. So does an array of dtype=object. But the common numeric array does not contain Python integers or floats. It has a data buffer that can interpreted in various ways according to the dtype. Python integers don't come in different sizes, at least not to the same extent as numpy dtypes.

So the isinstance and type() stuff does not apply to the contents of an ndarray.

====================

From the comments I gather you are trying to convert integer arrays to float. You aren't converting scalars. If so then dtype is all that matters; an array always has a dtype. It's unclear whether you are ok with casting a np.float32 to np.float64.

I'd suggest studying, and experimenting with the np.can_cast function and the x.astype method.

x.astype(np.float64, copy=False) 

for example will convert all int dtypes to float, without copying the ones that are already float64. It may copy and convert np.float32 ones.

Look also at the casting parameter of these functions.

===========================

I found in scipy.optimize.minimize another testing tool

In [156]: np.typecodes Out[156]:  {'All': '?bhilqpBHILQPefdgFDGSUVOMm',  'AllFloat': 'efdgFDG',  'AllInteger': 'bBhHiIlLqQpP',  'Character': 'c',  'Complex': 'FDG',  'Datetime': 'Mm',  'Float': 'efdg',  'Integer': 'bhilqp',  'UnsignedInteger': 'BHILQP'} 

It can be used to check for integers with:

if x0.dtype.kind in np.typecodes["AllInteger"]:     x0 = np.asarray(x0, dtype=float) 
like image 59
hpaulj Avatar answered Oct 12 '22 17:10

hpaulj


To directly answer the question, you can do this:

isinstance(arr.flat[0], np.floating)

  • .flat will collapse any number of dimensions down, so you can then access the 0th element easily.
  • np.floating will match any numpy float type
like image 35
rasen58 Avatar answered Oct 12 '22 17:10

rasen58


A slight variation from @rasen58 and @hpaulj:

To check if an np array, c, has elements of type float, c.dtype == np.floating works for me.

like image 41
sleepearly Avatar answered Oct 12 '22 19:10

sleepearly


All entries in a numpy array are of the same type. The numpy type and the Python type are not the same thing. This can be a bit confusing, but the type numpy refers to is more like the types used by languages like C - you might say more low level closer the the machine.

You can not say which type is better, because it would be like comparing apple and oranges.

like image 27
J. P. Petersen Avatar answered Oct 12 '22 18:10

J. P. Petersen