 

Why is the modulus operator slow?

Paraphrasing from the book "Programming Pearls" (about the C language on older machines, since the book is from the late '90s):

Integer arithmetic operations (+, -, *) can take around 10 nanoseconds, whereas the % operator takes up to 100 nanoseconds.

  • Why is there that much difference?
  • How does a modulus operator work internally?
  • Is it the same as division (/) in terms of time?
AV94 asked Jan 16 '15

People also ask

Is modulus faster than if?

A modulo operation is comparatively slow. An if is most likely to be faster than a modulo, and it is often more readable.
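
For instance, a minimal sketch (in C) of the wrap-around-counter case this usually refers to; the RING_SIZE constant and function names are made up for illustration:

    #include <stddef.h>

    #define RING_SIZE 1000  /* hypothetical buffer size, for illustration only */

    /* Wrap an index with the % operator: costs an integer division per call. */
    size_t advance_mod(size_t i) {
        return (i + 1) % RING_SIZE;
    }

    /* Wrap an index with a comparison: a compare plus a conditional reset,
       which is typically cheaper than a division, and arguably clearer. */
    size_t advance_if(size_t i) {
        ++i;
        if (i == RING_SIZE)
            i = 0;
        return i;
    }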

Is modulo an expensive operation?

Division and modulus are more than twice as expensive as multiplication. Division by a power of two is the well-known exception (it reduces to a shift), but not much more can be done in general without side effects. If you can replace a division by a multiplication, you get a speed-up of more than two.
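
To illustrate the replace-division-by-multiplication idea (a sketch, not from the original snippet): when the divisor is a compile-time constant, a division can be rewritten as a multiplication by a fixed-point reciprocal, which is the transformation optimizing compilers already perform for constant divisors.

    #include <stdint.h>
    #include <stdio.h>

    /* Divide a 32-bit unsigned value by 10 via a fixed-point reciprocal:
       0xCCCCCCCD is ceil(2^35 / 10), so ((uint64_t)x * 0xCCCCCCCD) >> 35
       equals x / 10 for every uint32_t x. */
    static uint32_t div10(uint32_t x) {
        return (uint32_t)(((uint64_t)x * 0xCCCCCCCDu) >> 35);
    }

    int main(void) {
        for (uint32_t x = 0; x < 100000; ++x) {
            if (div10(x) != x / 10) {      /* sanity check against real division */
                printf("mismatch at %u\n", x);
                return 1;
            }
        }
        printf("div10 agrees with x / 10\n");
        return 0;
    }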

What does the modulus (%) operator do?

The modulus operator is one of the arithmetic operators in C and takes two integer operands. It divides the given numerator by the denominator and produces the remainder of that integer division; the remainder is therefore always an integer.
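
A small C example of that relationship (the values are arbitrary):

    #include <assert.h>
    #include <stdio.h>

    int main(void) {
        int a = 17, b = 5;

        int quotient  = a / b;   /* 3: integer division truncates toward zero */
        int remainder = a % b;   /* 2: what is left over after that division  */

        /* C guarantees (a / b) * b + (a % b) == a whenever b != 0. */
        assert(quotient * b + remainder == a);

        printf("%d / %d = %d, %d %% %d = %d\n", a, b, quotient, a, b, remainder);
        return 0;
    }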

What is the time complexity of modulus?

For fixed-size machine integers, a modulus operation takes constant time, O(1), even though that constant is much larger than for addition or multiplication. For arbitrary-precision integers, schoolbook division (and hence modulus) grows with the operand sizes: roughly O(n^2) for two n-digit operands, or O(M*N) for operands of M and N digits.
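
To get a feel for why the cost grows with operand size, here is a sketch (not from the original snippet) of the simplest arbitrary-precision case: a number too large for any machine type, held as a decimal string, reduced modulo a small integer in one pass over its digits.

    #include <stdio.h>

    /* Remainder of a large decimal number (given as a string because it may
       not fit in any machine integer) modulo a small integer m. One pass over
       the digits, so the cost grows linearly with the number of digits -
       unlike the fixed-width % operator, which is a single instruction. */
    static unsigned big_mod(const char *digits, unsigned m) {
        unsigned r = 0;
        for (const char *p = digits; *p != '\0'; ++p)
            r = (r * 10 + (unsigned)(*p - '0')) % m;
        return r;
    }

    int main(void) {
        /* 2^64 = 18446744073709551616, and 2^64 mod 7 == 2 */
        printf("%u\n", big_mod("18446744073709551616", 7));
        return 0;
    }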


1 Answer

The modulus/modulo operation is usually understood as the integer equivalent of the remainder operation - a side effect or counterpart to division.

Except for some degenerate cases (where the divisor is a power of the operating base - i.e. a power of 2 for most number formats) this is just as expensive as integer division!
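
For example (a sketch; compilers perform this rewrite automatically when the divisor is a constant power of two):

    #include <stdint.h>
    #include <stdio.h>

    int main(void) {
        uint32_t x = 1000003u;

        /* When the divisor is a power of two, the remainder is just the low
           bits of the operand, so % degenerates into a single AND (for
           unsigned operands; signed operands need a small adjustment). */
        uint32_t r_slow = x % 64;          /* general-purpose remainder         */
        uint32_t r_fast = x & (64 - 1);    /* identical result: keep 6 low bits */

        printf("%u %u\n", r_slow, r_fast); /* both print 3 */
        return 0;
    }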

So the question is really, why is integer division so expensive?

I don't have the time or expertise to analyze this mathematically, so I'm going to appeal to grade school maths:

Consider the number of lines of working out in the notebook (not including the inputs) required for:

  • Equality (Boolean operations): essentially none - in computer "big O" terms this is known as O(1)
  • Addition: two, working left to right - one line for the output and one line for the carry. This is an O(N) operation
  • Long multiplication: n*(n+1) + 2: two lines for each of the digit products (one for the total, one for the carry) plus a final total and carry. So O(N^2), but with a fixed N (32 or 64), and it can be pipelined in silicon to less than that (see the sketch just after this list)
  • Long division: unknown, it depends upon the argument size - it's a recursive descent, and some instances descend faster than others (1,000,000 / 500,000 requires fewer lines than 1,000 / 7). Each step is essentially a series of multiplications to isolate the closest factors (although multiple algorithms exist). Feels like O(N^3) with variable N
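
As a sketch of the long-multiplication count above (base-10 digit arrays of a small fixed size, purely for illustration - real big-number code works in binary words):

    #include <stdio.h>

    #define DIGITS 8   /* enough digits for this illustration */

    /* Schoolbook long multiplication on base-10 digit arrays (least
       significant digit first): every digit of a meets every digit of b,
       which is where the O(N^2) count of "lines in the notebook" comes from.
       Hardware multipliers form the same partial products, but in parallel. */
    static void long_multiply(const int a[DIGITS], const int b[DIGITS],
                              int out[2 * DIGITS]) {
        for (int i = 0; i < 2 * DIGITS; ++i)
            out[i] = 0;

        for (int i = 0; i < DIGITS; ++i) {          /* N ...                   */
            int carry = 0;
            for (int j = 0; j < DIGITS; ++j) {      /* ... times N digit steps */
                int t = out[i + j] + a[i] * b[j] + carry;
                out[i + j] = t % 10;
                carry = t / 10;
            }
            out[i + DIGITS] += carry;
        }
    }

    int main(void) {
        int a[DIGITS] = {4, 3, 2, 1, 0, 0, 0, 0};   /* 1234, least significant first */
        int b[DIGITS] = {8, 7, 6, 5, 0, 0, 0, 0};   /* 5678                          */
        int out[2 * DIGITS];

        long_multiply(a, b, out);
        for (int i = 2 * DIGITS - 1; i >= 0; --i)   /* print most significant first  */
            printf("%d", out[i]);
        printf("\n");                               /* 0000000007006652 (= 1234 * 5678) */
        return 0;
    }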

So in simple terms, this should give you a feel for why division, and hence modulo, is slower: computers still have to do long division in the same stepwise fashion that you did in grade school.
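
Here is a sketch of that stepwise process in C: schoolbook binary long division ("restoring" division), one trial subtraction per bit, working from the most significant bit down. Real dividers use refined variants of the same idea, but the dependent, bit-by-bit nature is the point.

    #include <stdint.h>
    #include <stdio.h>

    /* Restoring binary long division: bring down one bit of the dividend at a
       time, test whether the divisor fits, and subtract if it does. Each step
       depends on the previous one, which is essentially why a hardware
       divider needs many more cycles than an adder or a multiplier. */
    static uint32_t long_divide(uint32_t dividend, uint32_t divisor,
                                uint32_t *remainder) {
        uint32_t quotient = 0;
        uint64_t rem = 0;                                /* running partial remainder */

        for (int bit = 31; bit >= 0; --bit) {
            rem = (rem << 1) | ((dividend >> bit) & 1u); /* bring down the next bit   */
            if (rem >= divisor) {                        /* does the divisor fit?     */
                rem -= divisor;
                quotient |= 1u << bit;
            }
        }
        *remainder = (uint32_t)rem;   /* the modulus falls out of the same process */
        return quotient;
    }

    int main(void) {
        uint32_t r;
        uint32_t q = long_divide(1000u, 7u, &r);
        printf("1000 / 7 = %u remainder %u\n", q, r);    /* 142 remainder 6 */
        return 0;
    }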

If this makes no sense to you, you may have been brought up on school math a little more modern than mine (30+ years ago).


The Order/big-O notation used above, written O(something), expresses the complexity of a computation in terms of the size of its inputs, and so says something about its execution time: http://en.m.wikipedia.org/wiki/Big_O_notation

O(1) executes in constant (but possibly large) time. O(N) takes time proportional to the size of its data: if the data is 32 bits, it takes 32 times the O(1) time of one of its N steps. O(N^2) takes N times N (N squared) times the time of one of its N steps (or possibly M times N times N for some constant M). And so on.


In the working above I have used O(N) rather than O(N^2) for addition, since the 32 or 64 bits of the first input are calculated in parallel by the CPU. In a hypothetical 1-bit machine a 32-bit addition operation would be O(32^2) and change. The same order reduction applies to the other operations too.

Alex Brown answered Oct 10 '22