Why is pow(a, d, n) so much faster than a**d % n?

People also ask

How is Python POW so fast?

A dedicated modular-exponentiation function like 3-argument pow() can leverage such tricks because it knows you're doing modular arithmetic. The Python parser can't recognize this given the bare expression a**d % n , so it will perform the full calculation (which will take much longer).

See the Wikipedia article on modular exponentiation. Basically, when you do a**d % n, you actually have to calculate a**d, which could be quite large. But there are ways of computing a**d % n without having to compute a**d itself, and that is what pow does. The ** operator can't do this because it can't "see into the future" to know that you are going to immediately take the modulus.

BrenBarn answered your main question. For your aside:

why is it almost twice as fast when run with Python 2 or 3 than PyPy, when usually PyPy is much faster?

If you read PyPy's performance page, this is exactly the kind of thing PyPy is not good at—in fact, the very first example they give:

Bad examples include doing computations with large longs – which is performed by unoptimizable support code.

Theoretically, turning a huge exponentiation followed by a mod into a modular exponentiation (at least after the first pass) is a transformation a JIT might be able to make… but not PyPy's JIT.

As a side note, if you need to do calculations with huge integers, you may want to look at third-party modules like gmpy, which can sometimes be much faster than CPython's native implementation in some cases outside the mainstream uses, and also has a lot of additional functionality that you'd otherwise have to write yourself, at the cost of being less convenient.

There are shortcuts to doing modular exponentiation: for instance, you can find a**(2i) mod n for every i from 1 to log(d) and multiply together (mod n) the intermediate results you need. A dedicated modular-exponentiation function like 3-argument pow() can leverage such tricks because it knows you're doing modular arithmetic. The Python parser can't recognize this given the bare expression a**d % n, so it will perform the full calculation (which will take much longer).

The way x = a**d % n is calculated is to raise a to the d power, then modulo that with n. Firstly, if a is large, this creates a huge number which is then truncated. However, x = pow(a, d, n) is most likely optimized so that only the last n digits are tracked, which are all that are required for calculating multiplication modulo a number.

Related questions
                            
                                Strip / trim all strings of a dataframe
                            
                                How do you send a HEAD HTTP request in Python 2?
                            
                                How to dynamically compose an OR query filter in Django?
                            
                                How to find the min/max value of a common key in a list of dicts?
                            
                                How to convert Nonetype to int or string?
                            
                                Python class definition syntax
                            
                                PEP 8, why no spaces around '=' in keyword argument or a default parameter value?
                            
                                Why is str.translate much faster in Python 3.5 compared to Python 3.4?
                            
                                Flask vs webapp2 for Google App Engine
                            
                                Difference between coroutine and future/task in Python 3.5?
                            
                                Google Authenticator implementation in Python
                            
                                Pros and cons to use Celery vs. RQ [closed]
                            
                                Java or Python for Natural Language Processing [closed]
                            
                                When should we call multiprocessing.Pool.join?
                            
                                How do "and" and "or" act with non-boolean values?
                            
                                Disable IPython Exit Confirmation
                            
                                List all the modules that are part of a python package?
                            
                                How to convert int to Enum in python?
                            
                                How to use Sphinx's autodoc to document a class's __init__(self) method?
                            
                                Import CSV file as a pandas DataFrame

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Why is pow(a, d, n) so much faster than a**d % n?

Tags:

performance

python

pypy

People also ask

Recent Activity

Donate For Us