The Assignment Problem, a NumPy function?

Since an assignment problem can be posed as a single cost matrix, I am wondering whether NumPy has a function to solve it. So far I have found none. Does anyone know whether NumPy/SciPy has an assignment-problem solver?
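To make "posed as a single matrix" concrete, here is a minimal sketch using only the standard library. The 3x3 cost matrix and the helper name `brute_force_assignment` are made up for illustration; entry `cost[i][j]` is assumed to be the cost of giving job `j` to worker `i`. It simply tries every permutation, so it is only practical for very small matrices, but it is handy as a reference when checking a faster solver.

```python
from itertools import permutations

# Hypothetical cost matrix: cost[i][j] = cost of assigning job j to worker i.
cost = [
    [4, 1, 3],
    [2, 0, 5],
    [3, 2, 2],
]


def brute_force_assignment(cost):
    """Return (columns, total) for the cheapest one-to-one assignment.

    columns[i] is the job given to worker i. Runs in O(n!) time.
    """
    n = len(cost)
    best_cols, best_total = None, float("inf")
    for cols in permutations(range(n)):
        total = sum(cost[i][cols[i]] for i in range(n))
        if total < best_total:
            best_cols, best_total = cols, total
    return best_cols, best_total


cols, total = brute_force_assignment(cost)
print(cols, total)
```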

Edit: In the meantime I have found a pure-Python (not NumPy/SciPy) implementation at http://software.clapper.org/munkres/. Still, I suppose a NumPy/SciPy implementation could be much faster, right?

782

asked Sep 09 '09 10:09

Paul

2 Answers

There is now a NumPy implementation of the Munkres algorithm in scikit-learn, under sklearn/utils/linear_assignment_.py; its only dependency is NumPy. I tried it on some matrices of roughly 20x20, and it seems to be about 4 times as fast as the one linked in the question: cProfile shows 2.517 seconds vs 9.821 seconds for 100 iterations.
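A comparison like this can be reproduced with a small cProfile harness along the following lines. This is only a sketch: the helper `solve_many` is a hypothetical name, the random 20x20 matrices are an assumption about the test data, and `scipy.optimize.linear_sum_assignment` is used as a stand-in solver. Pass in whichever solver you want to measure instead.

```python
import cProfile
import io
import pstats

import numpy as np
from scipy.optimize import linear_sum_assignment


def solve_many(solver, matrices):
    """Run `solver` on every cost matrix and collect the results."""
    return [solver(m) for m in matrices]


# 100 random 20x20 cost matrices (assumed test data, fixed seed).
rng = np.random.default_rng(0)
matrices = [rng.random((20, 20)) for _ in range(100)]

# Profile only the solving step, not the matrix generation.
profiler = cProfile.Profile()
profiler.enable()
results = solve_many(linear_sum_assignment, matrices)
profiler.disable()

# Print the five most expensive entries by cumulative time.
stream = io.StringIO()
pstats.Stats(profiler, stream=stream).sort_stats("cumulative").print_stats(5)
print(stream.getvalue())
```

Running the same harness once per solver, on the same `matrices` list, keeps the comparison fair.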

174

answered Oct 04 '22 08:10

Sean Johnson

I was hoping that the newer scipy.optimize.linear_sum_assignment would be fastest, but (perhaps not surprisingly) the Cython library (which does not have pip support) is significantly faster, at least for my use case.

UPDATE: With munkres v1.1.2 and scipy v1.5.0, I get the following results:

$ python -m timeit -s "from scipy.optimize import linear_sum_assignment; import numpy as np; np.random.seed(0); c = np.random.rand(20,30)" "a,b = linear_sum_assignment(c)"
10000 loops, best of 5: 32.8 usec per loop
$ python -m timeit -s "from munkres import Munkres; import numpy as np;  np.random.seed(0); c = np.random.rand(20,30); m = Munkres()" "a = m.compute(c)"
100 loops, best of 5: 2.41 msec per loop
$ python -m timeit -s "from scipy.optimize import linear_sum_assignment; import numpy as np; np.random.seed(0);" "c = np.random.rand(20,30); a,b = linear_sum_assignment(c)"
5000 loops, best of 5: 51.7 usec per loop
$ python -m timeit -s "from munkres import Munkres; import numpy as np;  np.random.seed(0)" "c = np.random.rand(20,30); m = Munkres(); a = m.compute(c)"
10 loops, best of 5: 26 msec per loop
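For readers who only want to see what calling the SciPy function looks like, here is a minimal sketch. The small cost matrix is made up for illustration; the rectangular 20x30 case mirrors the shape used in the timings above.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

# Hypothetical cost matrix: cost[i, j] = cost of assigning column j to row i.
cost = np.array([
    [9, 2, 7],
    [6, 4, 3],
    [5, 8, 1],
])

# Returns matching row and column indices of a minimum-cost assignment.
row_ind, col_ind = linear_sum_assignment(cost)
total = cost[row_ind, col_ind].sum()
print(list(zip(row_ind.tolist(), col_ind.tolist())), total)

# Rectangular input also works: each of the 20 rows gets a distinct column.
rect = np.random.default_rng(0).random((20, 30))
rect_rows, rect_cols = linear_sum_assignment(rect)
print(len(rect_rows), len(set(rect_cols.tolist())))
```

Indexing the cost matrix with the two returned index arrays, as in `cost[row_ind, col_ind]`, gives the individual costs of the chosen pairs.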

answered Oct 04 '22 07:10

Matthew

Tags: python, optimization, numpy, combinatorics, scipy
