I'm performing a fairly complex operation on some 3- and 4-dimensional tensors using numpy einsum.
My actual code is
np.einsum('oij,imj,mjkn,lnk,plk->op',phi,B,Suu,B,phi)
This does what I want it to do.
Using einsum_path, the result is:
>>> path = np.einsum_path('oij,imj,mjkn,lnk,plk->op',phi,B,Suu,B,phi)
>>> print(path[0])
['einsum_path', (0, 1), (0, 3), (0, 1), (0, 1)]
>>> print(path[1])
  Complete contraction:  oij,imj,mjkn,lnk,plk->op
         Naive scaling:  8
     Optimized scaling:  5
      Naive FLOP count:  2.668e+07
  Optimized FLOP count:  1.340e+05
   Theoretical speedup:  199.136
  Largest intermediate:  7.700e+02 elements
--------------------------------------------------------------------------
scaling                  current                                remaining
--------------------------------------------------------------------------
   4                imj,oij->moj                     mjkn,lnk,plk,moj->op
   5               moj,mjkn->nok                          lnk,plk,nok->op
   4                plk,lnk->npk                              nok,npk->op
   4                 npk,nok->op                                   op->op
This indicates a theoretical speedup of about 200x.
How can I use this result to speed up my code? How do I "implement" what einsum_path is telling me?
Do some time tests:
path = np.einsum_path('oij,imj,mjkn,lnk,plk->op',phi,B,Suu,B,phi)
np.einsum('oij,imj,mjkn,lnk,plk->op',phi,B,Suu,B,phi, optimize=False)
np.einsum('oij,imj,mjkn,lnk,plk->op',phi,B,Suu,B,phi, optimize=True)         
np.einsum('oij,imj,mjkn,lnk,plk->op',phi,B,Suu,B,phi, optimize=path[0])
In my testing the last two run at the same speed.  For a small problem optimize=False is faster, presumably because the analysis and rearranging take time.  For a large problem, with a larger theoretical speedup, the actual speedup for True can be larger than the theory.  Presumably memory management is slowing down the False case.
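A minimal, self-contained timing sketch of those three calls.  The array shapes here are my own assumptions, chosen only so the contraction is valid (o=p=10, i=l=4, j=k=5, m=n=6); substitute your real phi, B, and Suu.

```python
import numpy as np
import timeit

# Dummy operands (shapes are illustrative assumptions, not from the question)
rng = np.random.default_rng(0)
phi = rng.standard_normal((10, 4, 5))    # used as 'oij' and 'plk'
B   = rng.standard_normal((4, 6, 5))     # used as 'imj' and 'lnk'
Suu = rng.standard_normal((6, 5, 5, 6))  # 'mjkn'

subs = 'oij,imj,mjkn,lnk,plk->op'
# Precompute the contraction order once; path is the ['einsum_path', ...] list
path = np.einsum_path(subs, phi, B, Suu, B, phi)[0]

for label, opt in [('False', False), ('True', True), ('path', path)]:
    t = timeit.timeit(
        lambda: np.einsum(subs, phi, B, Suu, B, phi, optimize=opt),
        number=1000)
    print(f'optimize={label}: {t:.4f} s for 1000 calls')
```

All three variants return the same (10, 10) array; only the evaluation order, and hence the speed, differs.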
The theoretical speedup is just that, an estimate based only on the FLOP count.  It will hold only to the extent that FLOPs dominate the calculation.
You could also time the path calculation itself.  The size of the problem determines whether that time is a small or large part of the total.
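A sketch of timing the path search on its own, again with dummy arrays whose shapes are my own illustrative assumptions:

```python
import numpy as np
import timeit

# Dummy operands (shapes are illustrative assumptions, not from the question)
rng = np.random.default_rng(0)
phi = rng.standard_normal((10, 4, 5))
B   = rng.standard_normal((4, 6, 5))
Suu = rng.standard_normal((6, 5, 5, 6))
subs = 'oij,imj,mjkn,lnk,plk->op'

# How long does the path search itself take per call?
t = timeit.timeit(
    lambda: np.einsum_path(subs, phi, B, Suu, B, phi),
    number=100)
print(f'einsum_path: {t / 100:.2e} s per call')
```

If the contraction is evaluated many times with the same shapes, computing the path once and passing it via optimize= amortizes this cost to zero.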