What is the difference between native int type and the numpy.int types?

1 Answers

There are several major differences. The first is that python integers are flexible-sized (at least in python 3.x). This means they can grow to accommodate any number of any size (within memory constraints, of course). The numpy integers, on the other hand, are fixed-sized. This means there is a maximum value they can hold. This is defined by the number of bytes in the integer (int32 vs. int64), with more bytes holding larger numbers, as well as whether the number is signed or unsigned (int32 vs. uint32), with unsigned being able to hold larger numbers but not able to hold negative number.

So, you might ask, why use the fixed-sized integers? The reason is that modern processors have built-in tools for doing math on fixed-size integers, so calculations on those are much, much, much faster. In fact, python uses fixed-sized integers behind-the-scenes when the number is small enough, only switching to the slower, flexible-sized integers when the number gets too large.

Another advantage of fixed-sized values is that they can be placed into consistently-sized adjacent memory blocks of the same type. This is the format that numpy arrays use to store data. The libraries that numpy relies on are able to do extremely fast computations on data in this format, in fact modern CPUs have built-in features for accelerating this sort of computation. With the variable-sized python integers, this sort of computation is impossible because there is no way to say how big the blocks should be and no consistentcy in the data format.

That being said, numpy is actually able to make arrays of python integers. But rather than arrays containing the values, instead they are arrays containing references to other pieces of memory holding the actual python integers. This cannot be accelerated in the same way, so even if all the python integers fit within the fixed integer size, it still won't be accelerated.

None of this is the case with Python 2. In Python 2, Python integers are fixed integers and thus can be directly translated into numpy integers. For variable-length integers, Python 2 had the long type. But this was confusing and it was decided this confusion wasn't worth the performance gains, especially when people who need performance would be using numpy or something like it anyway.

162

answered Sep 19 '22 20:09

TheBlackCat

Related questions
                            
                                Counting repeated characters in a string in Python
                            
                                Google App Engine Remote API does not work from local client
                            
                                Why use SQLAlchemy? Is it very convinent for coding? [closed]
                            
                                Are Boto3 Resources and Clients Equivalent? When Use One or Other?
                            
                                Separate SQLAlchemy models by file in Flask [duplicate]
                            
                                How to create and open a jupyter notebook ipynb file directly from terminal
                            
                                Worst Case Analysis for Regular Expressions
                            
                                What does the -> (dash-greater-than arrow symbol) mean in a Python method signature? [duplicate]
                            
                                Execute an installed Python package as a script?
                            
                                How can I install a conda environment when offline?
                            
                                What are the differences between lxml and ElementTree?
                            
                                Tilde sign in python dataframe
                            
                                Anaconda 4.7.5 - Warning about conda-build <3.18.3 and issues with python packages
                            
                                What is the return value of subprocess.call()?
                            
                                What scalability issues are associated with NetworkX?
                            
                                BdbQuit raised when debugging Python with pdb
                            
                                Is there a way to implement methods like __len__ or __eq__ as classmethods?
                            
                                What does Python optimization (-O or PYTHONOPTIMIZE) do?
                            
                                Get a dict of all variables currently in scope and their values
                            
                                Applying LIMIT and OFFSET to all queries in SQLAlchemy

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is the difference between native int type and the numpy.int types?

Tags:

python

numpy

Aguy

People also ask

1 Answers

TheBlackCat

Recent Activity

Donate For Us