64/32-bit division on a processor with 32/16-bit division

Q: Why are 32-bit processors better than 16?

While a 16-bit processor can simulate 32-bit arithmetic using double-precision operands, 32-bit processors are much more efficient. While 16-bit processors can use segment registers to access more than 64K elements of memory, this technique becomes awkward and slow if it must be used frequently.

Q: What is the difference between 16 and 32-bit?

16-bit Windows applications were designed to run under Windows 3.0 and 3.1, while 32-bit Windows applications were designed for Windows 95, 98, NT, and 2000. They are written to two different Application Program Interfaces (APIs) called "Win16" and "Win32".

Tags:

My processor, a small 16-bit microcontroller with no FPU and integer math only has 16/16 division and 32/16 division which both take 18 cycles. At the moment I'm using a very slow software routine (~7,500 cycles) to do 64/32 division. Is there any way to use these division engines to calculate a 64/32 division? Similar to how I'm already using the 16x16 multiplier and adder to calculate 32x32 multiplies? I'm using C but can work with any general explanation on how it can be done... I'm hoping to target <200 cycles (if it's at all possible.)

515

asked Jan 23 '11 01:01

Thomas O

2 Answers

See "Hacker's Delight", multiword division (pages 140-145).

The basic concept (going back to Knuth) is to think of your problem in base-65536 terms. Then you have a 4 digit by 2 digit division problem, with 2/1 digit division as a primitive.

The C code is here: https://github.com/hcs0/Hackers-Delight/blob/master/divmnu.c.txt

178

answered Dec 22 '22 13:12

payne

My copy of Knuth (The Art of Computer Programming) is at work, so I can't check it until Monday, but that would be my first source. It has a whole section on arithmetic.

edit: your post about "16/16 division and 32/16 division which both take 18 cycles." -- dsPICs have a conditional subtract operation in assembly. Consider using this as your computational primitive.

Also note that if X = XH * 2³² + XL and D = DH * 2¹⁶ + DL, then if you are looking for

(Q,R) = X/D where X = Q * D + R

where Q = QH * 2¹⁶ + QL, R = RH * 2¹⁶ + RL, then

XH * 2³² + XL = DH * QH * 2³² + (DL * QH + DH * QL) * 2¹⁶ + (DL * QL) + RH * 2¹⁶ + RL

This suggests (by looking at terms that are the high 32 bits) to use the following procedure, akin to long division:

(QH, R0) = XH / (DH+1) -> XH = QH * (DH+1) + R0 [32/16 divide]
R1 = X - (QH * 2¹⁶) * D [requires a 16*32 multiply, a shift-left by 16, and a 64-bit subtract]
calculate R1' = R1 - D * 2¹⁶
while R1' >= 0, adjust QH upwards by 1, set R1 = R1', and goto step 3
(QL, R2) = (R1 >> 16) / (DH+1) -> R1 = QL * (DH+1) + R2 [32/16 divide]
R3 = R1 - (QL * D) [requires a 16*32 multiply and a 48-bit subtract]
calculate R3' = R3 - D
while R3' >= 0, adjust QL upwards by 1, set R3 = R3', and goto step 7

Your 32-bit quotient is the pair (QH,QL), and 32-bit remainder is R3.

(This assumes that the quotient is not larger than 32-bit, which you need to know ahead of time, and can easily check ahead of time.)

answered Dec 22 '22 13:12

Jason S

Related questions
                            
                                What is a monad-transformer as distinct from a monad?
                            
                                Matlab mex file is slow compared to its straight C equivalent
                            
                                Cross-browser solution for displaying MJPEG stream
                            
                                jQuery load body from external HTML
                            
                                MySQL column name standards / conventions [closed]
                            
                                4 points and Ellipse
                            
                                How do I change "Open" to "Select" in the NSOpenPanel?
                            
                                Remove attributes using HtmlAgilityPack
                            
                                Conversion of lambda expression to function pointer
                            
                                Should deleting a non-existent resource result in a 404 in RESTful Rails?
                            
                                The disadvantages of JavaScript prototype inheritance, what are they?
                            
                                Check what a running process is doing: print stack trace of an uninstrumented Python program

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With