I am a bit struggled with so many <code>int</code> data types in cython. <code>np.int, np.int_, np.int_t, int</code> I guess <code>int</code> in pure python is equivalent to <code>np.int_</code>, then where does <code>np.int</code> come from? I cannot find the document from numpy? Also, why does <code>np.int_</code> exist given we do already have <code>int</code>? In cython, I guess <code>int</code> becomes a C type when used as <code>cdef int</code> or <code>ndarray[int]</code>, and when used as <code>int()</code> it stays as the python caster? Is <code>np.int_</code> equivalent to <code>long</code> in C? so <code>cdef long</code> is the identical to <code>cdef np.int_</code>? Under what circumstances should I use <code>np.int_t</code> instead of <code>np.int</code>? e.g. <code>cdef np.int_t</code>, <code>ndarray[np.int_t]</code> ... Can someone briefly explain how the wrong use of those types would affect the performance of compiled cython code?

It's a bit complicated because the names have different meanings depending on the context. <h3><code>int</code></h3> <ol> <li> In Python The <code>int</code> is normally just a Python type, it's of arbitrary precision, meaning that you can store any conceivable integer inside it (as long as you have enough memory). <pre class="prettyprint"><code>>>> int(10**50) 100000000000000000000000000000000000000000000000000 </code></pre> </li> <li> However, when you use it as <code>dtype</code> for a NumPy array it will be interpreted as <code>np.int_</code> 1. Which is not of arbitrary precision, it will have the same size as C's <code>long</code>: <pre class="prettyprint"><code>>>> np.array(10**50, dtype=int) OverflowError: Python int too large to convert to C long </code></pre> That also means the following two are equivalent: <pre class="prettyprint"><code>np.array([1,2,3], dtype=int) np.array([1,2,3], dtype=np.int_) </code></pre> </li> <li> As Cython type identifier it has another meaning, here it stands for the c type <code>int</code>. It's of limited precision (typically 32bits). You can use it as Cython type, for example when defining variables with <code>cdef</code>: <pre class="prettyprint"><code>cdef int value = 100 # variable cdef int[:] arr = ... # memoryview </code></pre> As return value or argument value for <code>cdef</code> or <code>cpdef</code> functions: <pre class="prettyprint"><code>cdef int my_function(int argument1, int argument2): # ... </code></pre> As "generic" for <code>ndarray</code>: <pre class="prettyprint"><code>cimport numpy as cnp cdef cnp.ndarray[int, ndim=1] val = ... </code></pre> For type casting: <pre class="prettyprint"><code>avalue = <int>(another_value) </code></pre> And probably many more. </li> <li> In Cython but as Python type. You can still call <code>int</code> and you'll get a "Python int" (of arbitrary precision), or use it for <code>isinstance</code> or as <code>dtype</code> argument for <code>np.array</code>. Here the context is important, so converting to a Python <code>int</code> is different from converting to a C int: <pre class="prettyprint"><code>cdef object val = int(10) # Python int cdef int val = <int>(10) # C int </code></pre> </li> </ol> <h3><code>np.int</code></h3> Actually this is very easy. It's just an alias for <code>int</code>: <pre class="prettyprint"><code>>>> int is np.int True </code></pre> So everything from above applies to <code>np.int</code> as well. However you can't use it as a type-identifier except when you use it on the <code>cimport</code>ed package. In that case it represents the Python integer type. <pre class="prettyprint"><code>cimport numpy as cnp cpdef func(cnp.int obj): return obj </code></pre> This will expect <code>obj</code> to be a Python integer not a NumPy type: <pre class="prettyprint"><code>>>> func(np.int_(10)) TypeError: Argument 'obj' has incorrect type (expected int, got numpy.int32) >>> func(10) 10 </code></pre> My advise regarding <code>np.int</code>: Avoid it whenever possible. In Python code it's equivalent to <code>int</code> and in Cython code it's also equivalent to Pythons <code>int</code> but if used as type-identifier it will probably confuse you and everyone who reads the code! It certainly confused me... <h3><code>np.int_</code></h3> Actually it only has one meaning: It's a Python type that represents a scalar NumPy type. You use it like Pythons <code>int</code>: <pre class="prettyprint"><code>>>> np.int_(10) # looks like a normal Python integer 10 >>> type(np.int_(10)) # but isn't (output may vary depending on your system!) numpy.int32 </code></pre> Or you use it to specify the <code>dtype</code>, for example with <code>np.array</code>: <pre class="prettyprint"><code>>>> np.array([1,2,3], dtype=np.int_) array([1, 2, 3]) </code></pre> But you cannot use it as type-identifier in Cython. <h3><code>cnp.int_t</code></h3> It's the type-identifier version for <code>np.int_</code>. That means you can't use it as dtype argument. But you can use it as type for <code>cdef</code> declarations: <pre class="prettyprint"><code>cimport numpy as cnp import numpy as np cdef cnp.int_t[:] arr = np.array([1,2,3], dtype=np.int_) |---TYPE---| |---DTYPE---| </code></pre> This example (hopefully) shows that the type-identifier with the trailing <code>_t</code> actually represents the type of an array using the dtype without the trailing <code>t</code>. You can't interchange them in Cython code! <h3>Notes</h3> There are several more numeric types in NumPy I'll include a list containing the NumPy dtype and Cython type-identifier and the C type identifier that could also be used in Cython here. But it's basically taken from the NumPy documentation and the Cython NumPy <code>pxd</code> file: <pre class="prettyprint"><code>NumPy dtype Numpy Cython type C Cython type identifier np.bool_ None None np.int_ cnp.int_t long np.intc None int np.intp cnp.intp_t ssize_t np.int8 cnp.int8_t signed char np.int16 cnp.int16_t signed short np.int32 cnp.int32_t signed int np.int64 cnp.int64_t signed long long np.uint8 cnp.uint8_t unsigned char np.uint16 cnp.uint16_t unsigned short np.uint32 cnp.uint32_t unsigned int np.uint64 cnp.uint64_t unsigned long np.float_ cnp.float64_t double np.float32 cnp.float32_t float np.float64 cnp.float64_t double np.complex_ cnp.complex128_t double complex np.complex64 cnp.complex64_t float complex np.complex128 cnp.complex128_t double complex </code></pre> Actually there are Cython types for <code>np.bool_</code>: <code>cnp.npy_bool</code> and <code>bint</code> but both they can't be used for NumPy arrays currently. For scalars <code>cnp.npy_bool</code> will just be an unsigned integer while <code>bint</code> will be a boolean. Not sure what's going on there... <hr> 1 Taken From the NumPy documentation "Data type objects" <blockquote> <h3>Built-in Python types</h3> Several python types are equivalent to a corresponding array scalar when used to generate a dtype object: <pre class="prettyprint"><code>int np.int_ bool np.bool_ float np.float_ complex np.cfloat bytes np.bytes_ str np.bytes_ (Python2) or np.unicode_ (Python3) unicode np.unicode_ buffer np.void (all others) np.object_ </code></pre> </blockquote>

Difference between np.int, np.int_, int, and np.int_t in cython?

1 Answers

It's a bit complicated because the names have different meanings depending on the context.

`int`

In Python

The int is normally just a Python type, it's of arbitrary precision, meaning that you can store any conceivable integer inside it (as long as you have enough memory).
```
>>> int(10**50) 100000000000000000000000000000000000000000000000000 
```
However, when you use it as dtype for a NumPy array it will be interpreted as np.int_ ¹. Which is not of arbitrary precision, it will have the same size as C's long:
```
>>> np.array(10**50, dtype=int) OverflowError: Python int too large to convert to C long 
```
That also means the following two are equivalent:
```
np.array([1,2,3], dtype=int) np.array([1,2,3], dtype=np.int_) 
```
As Cython type identifier it has another meaning, here it stands for the c type int. It's of limited precision (typically 32bits). You can use it as Cython type, for example when defining variables with cdef:
```
cdef int value = 100 # variable cdef int[:] arr = ... # memoryview 
```
As return value or argument value for cdef or cpdef functions:
```
cdef int my_function(int argument1, int argument2): # ... 
```
As "generic" for ndarray:
```
cimport numpy as cnp cdef cnp.ndarray[int, ndim=1] val = ... 
```
For type casting:
```
avalue = <int>(another_value) 
```
And probably many more.
In Cython but as Python type. You can still call int and you'll get a "Python int" (of arbitrary precision), or use it for isinstance or as dtype argument for np.array. Here the context is important, so converting to a Python int is different from converting to a C int:
```
cdef object val = int(10) # Python int cdef int val = <int>(10) # C int 
```

`np.int`

Actually this is very easy. It's just an alias for int:

>>> int is np.int True

So everything from above applies to np.int as well. However you can't use it as a type-identifier except when you use it on the cimported package. In that case it represents the Python integer type.

cimport numpy as cnp  cpdef func(cnp.int obj):     return obj

This will expect obj to be a Python integer not a NumPy type:

>>> func(np.int_(10)) TypeError: Argument 'obj' has incorrect type (expected int, got numpy.int32) >>> func(10) 10

My advise regarding np.int: Avoid it whenever possible. In Python code it's equivalent to int and in Cython code it's also equivalent to Pythons int but if used as type-identifier it will probably confuse you and everyone who reads the code! It certainly confused me...

`np.int_`

Actually it only has one meaning: It's a Python type that represents a scalar NumPy type. You use it like Pythons int:

>>> np.int_(10)        # looks like a normal Python integer 10 >>> type(np.int_(10))  # but isn't (output may vary depending on your system!) numpy.int32

Or you use it to specify the dtype, for example with np.array:

>>> np.array([1,2,3], dtype=np.int_) array([1, 2, 3])

But you cannot use it as type-identifier in Cython.

`cnp.int_t`

It's the type-identifier version for np.int_. That means you can't use it as dtype argument. But you can use it as type for cdef declarations:

cimport numpy as cnp import numpy as np  cdef cnp.int_t[:] arr = np.array([1,2,3], dtype=np.int_)      |---TYPE---|                         |---DTYPE---|

This example (hopefully) shows that the type-identifier with the trailing _t actually represents the type of an array using the dtype without the trailing t. You can't interchange them in Cython code!

Notes

There are several more numeric types in NumPy I'll include a list containing the NumPy dtype and Cython type-identifier and the C type identifier that could also be used in Cython here. But it's basically taken from the NumPy documentation and the Cython NumPy pxd file:

NumPy dtype          Numpy Cython type         C Cython type identifier  np.bool_             None                      None np.int_              cnp.int_t                 long np.intc              None                      int        np.intp              cnp.intp_t                ssize_t np.int8              cnp.int8_t                signed char np.int16             cnp.int16_t               signed short np.int32             cnp.int32_t               signed int np.int64             cnp.int64_t               signed long long np.uint8             cnp.uint8_t               unsigned char np.uint16            cnp.uint16_t              unsigned short np.uint32            cnp.uint32_t              unsigned int np.uint64            cnp.uint64_t              unsigned long np.float_            cnp.float64_t             double np.float32           cnp.float32_t             float np.float64           cnp.float64_t             double np.complex_          cnp.complex128_t          double complex np.complex64         cnp.complex64_t           float complex np.complex128        cnp.complex128_t          double complex

Actually there are Cython types for np.bool_: cnp.npy_bool and bint but both they can't be used for NumPy arrays currently. For scalars cnp.npy_bool will just be an unsigned integer while bint will be a boolean. Not sure what's going on there...

¹ Taken From the NumPy documentation "Data type objects"

Built-in Python types

Several python types are equivalent to a corresponding array scalar when used to generate a dtype object:
int           np.int_ bool          np.bool_ float         np.float_ complex       np.cfloat bytes         np.bytes_ str           np.bytes_ (Python2) or np.unicode_ (Python3) unicode       np.unicode_ buffer        np.void (all others)  np.object_ 

198

answered Sep 19 '22 22:09

MSeifert

Related questions
                            
                                Error loading DLL in python, not a valid win32 application [duplicate]
                            
                                Set LD_LIBRARY_PATH before importing in python
                            
                                What does the standard Keras model output mean? What is epoch and loss in Keras?
                            
                                Optional dependencies in a pip requirements file
                            
                                How to set the pandas dataframe data left/right alignment?
                            
                                Python Multiprocessing Exit Elegantly How?
                            
                                What does "SSLError: [SSL] PEM lib (_ssl.c:2532)" mean using the Python ssl library?
                            
                                Python can't find my module
                            
                                Running interactive commands in Paramiko
                            
                                Very strange behavior of operator 'is' with methods
                            
                                How do I make coverage include not tested files?
                            
                                Concatenation of the result of a function with a mutable default argument
                            
                                Python decorator as a staticmethod
                            
                                What are the URL parameters? (element at position #3 in urlparse result)
                            
                                save numpy array in append mode
                            
                                Using .pth files
                            
                                How to mock.patch a class imported in another module
                            
                                Requirements.txt greater than equal to and then less than?
                            
                                Python Error: "ValueError: need more than 1 value to unpack"
                            
                                Optional parameters in functions and their mutable default values [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Difference between np.int, np.int_, int, and np.int_t in cython?

Tags:

python

c

numpy

cython

colinfang

People also ask