Is it really efficient to use the Karatsuba algorithm for 64-bit x 64-bit multiplication?

I am working with AVX2 and need to compute a 64-bit x 64-bit -> 128-bit widening multiplication and obtain the high 64 bits as fast as possible. Since AVX2 has no such instruction, is it reasonable to use the Karatsuba algorithm to gain speed?

asked Jun 26 '15 09:06

Yigit Demirag

1 Answer

No. On modern architectures the crossover at which Karatsuba beats schoolbook multiplication is usually somewhere between 8 and 24 machine words (e.g. between 512 and 1536 bits on x86_64). For fixed sizes, the threshold is at the smaller end of that range, and the new ADCX/ADOX instructions likely bring it in somewhat further for scalar code, but 64x64 is still too small to benefit from Karatsuba.
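
For illustration, here is a minimal scalar sketch of the schoolbook approach for the high 64 bits, built from four 32x32 -> 64-bit partial products. The function name `mulhi_u64` is hypothetical, and the code is a sketch under the assumption that each partial product would map onto one `_mm256_mul_epu32` (`vpmuludq`) per vector in an AVX2 version, with the shifts and adds done lane-wise.

```cpp
#include <cstdint>

// Hypothetical helper: high 64 bits of the unsigned 64x64 -> 128-bit product,
// computed schoolbook-style from four 32x32 -> 64-bit partial products.
std::uint64_t mulhi_u64(std::uint64_t a, std::uint64_t b) {
    const std::uint64_t mask = 0xFFFFFFFFu;
    const std::uint64_t a_lo = a & mask, a_hi = a >> 32;
    const std::uint64_t b_lo = b & mask, b_hi = b >> 32;

    // Each product has 32-bit operands, so it fits in 64 bits.
    const std::uint64_t lo_lo = a_lo * b_lo;
    const std::uint64_t hi_lo = a_hi * b_lo;
    const std::uint64_t lo_hi = a_lo * b_hi;
    const std::uint64_t hi_hi = a_hi * b_hi;

    // Everything that lands in bits 32..63 of the full product.
    // Three values below 2^32 are added, so the sum cannot overflow 64 bits.
    const std::uint64_t cross = (lo_lo >> 32) + (hi_lo & mask) + (lo_hi & mask);

    // Bits 64..127: hi_hi, the upper halves of the middle terms,
    // and the carry out of the cross sum.
    return hi_hi + (hi_lo >> 32) + (lo_hi >> 32) + (cross >> 32);
}
```

A one-level Karatsuba split would replace the two middle products with a single product of (a_hi + a_lo) and (b_hi + b_lo), but those sums can be 33 bits wide, so that product no longer fits a 32x32 -> 64-bit multiply, and the extra additions and subtractions cost more than the one multiply saved.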

answered Nov 15 '22 12:11

Stephen Canon

Tags:

c++

performance

parallel-processing

simd

avx2
