I have two methods that rank a list of strings differently, as well as what we can consider the "right" ranking of the list (i.e. a gold standard).
In other words:
```python
ranked_list_of_strings_1 = method_1(list_of_strings)
ranked_list_of_strings_2 = method_2(list_of_strings)
correctly_ranked_list_of_strings  # Some permutation of list_of_strings
```
How can I determine which method is better, considering that method_1 and method_2 are black boxes? Are there any methods to measure this available in SciPy, scikit-learn, or similar libraries?
In my specific case, I actually have a dataframe, and each method outputs a score. What matters is not the difference in score between the methods and the true scores, but that the methods get the ranking right (higher score means higher ranking for all columns).
```
      strings  scores_method_1  scores_method_2  true_scores
5714   aeSeOg             0.54              0.1          0.8
5741   NQXACs             0.15              0.3          0.4
5768   zsFZQi             0.57              0.7          0.2
```
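For reference, here is a minimal sketch (assuming pandas; df is just an illustrative name) that reconstructs the example dataframe above, so the snippets further down have concrete data to work with:

```python
import pandas as pd

# Toy reconstruction of the example dataframe from the question.
df = pd.DataFrame(
    {
        "strings": ["aeSeOg", "NQXACs", "zsFZQi"],
        "scores_method_1": [0.54, 0.15, 0.57],
        "scores_method_2": [0.1, 0.3, 0.7],
        "true_scores": [0.8, 0.4, 0.2],
    },
    index=[5714, 5741, 5768],
)
```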
In statistics, a rank correlation is any of several statistics that measure an ordinal association: the relationship between rankings of different ordinal variables, or different rankings of the same variable, where a "ranking" is the assignment of the ordering labels "first", "second", "third", etc. to different observations of a particular variable.
To calculate the Kendall tau-b for the given data set, you can use the formula on the Wikipedia page. I count n0 = 10, n1 = 2, n2 = 1, nc = 2, nd = 6, so that τB = (nc − nd) / √((n0 − n1)(n0 − n2)) = (2 − 6) / √((10 − 2)(10 − 1)) = −4 / √72 ≈ −0.4714045.
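As a concrete sketch (assuming the question's dataframe is available as df, as reconstructed above), SciPy's scipy.stats.kendalltau can compare each method's scores against the true scores; the method with the higher tau agrees better with the gold ranking:

```python
from scipy.stats import kendalltau

# Rank-correlate each method's scores with the gold-standard scores.
# A tau closer to 1 means the method reproduces the true ordering more closely.
tau_1, p_1 = kendalltau(df["scores_method_1"], df["true_scores"])
tau_2, p_2 = kendalltau(df["scores_method_2"], df["true_scores"])

print(f"method_1: tau = {tau_1:.3f} (p = {p_1:.3f})")
print(f"method_2: tau = {tau_2:.3f} (p = {p_2:.3f})")
```

scipy.stats.spearmanr can be used in the same way if you prefer Spearman's rho.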
The scikit-learn library also seems to have an NDCG (and DCG) metric implemented now.
https://scikit-learn.org/stable/modules/generated/sklearn.metrics.ndcg_score.html#sklearn.metrics.ndcg_score
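A minimal sketch of how ndcg_score could be applied to the example data (assuming the df reconstructed earlier); the function expects 2D arrays with one row per query, so each column is wrapped in a single row and true_scores serves as the graded relevance:

```python
import numpy as np
from sklearn.metrics import ndcg_score

# ndcg_score expects arrays of shape (n_queries, n_items); here everything
# belongs to a single "query", so each column is wrapped in one row.
true_relevance = df["true_scores"].to_numpy()[np.newaxis, :]

ndcg_1 = ndcg_score(true_relevance, df["scores_method_1"].to_numpy()[np.newaxis, :])
ndcg_2 = ndcg_score(true_relevance, df["scores_method_2"].to_numpy()[np.newaxis, :])

print(f"method_1 NDCG: {ndcg_1:.3f}")
print(f"method_2 NDCG: {ndcg_2:.3f}")
```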
You're looking for Normalized Discounted Cumulative Gain (NDCG). It's a metric commonly used in search-engine ranking to test the quality of the result ordering.
The idea is that you test your ranking (in your case the two methods) against user feedback through clicks (in your case the true rank). NDCG will tell you the quality of your ranking relative to the truth.
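For intuition, the computation behind NDCG can be sketched in a few lines (the dcg and ndcg helpers below are illustrative names, not part of any library): order the items by the method's scores, accumulate the true relevances with a logarithmic position discount, and normalise by the DCG of the ideal ordering:

```python
import numpy as np

def dcg(relevances):
    # Sum of relevances, each discounted by log2(position + 1), positions 1-based.
    relevances = np.asarray(relevances, dtype=float)
    discounts = np.log2(np.arange(2, len(relevances) + 2))
    return float(np.sum(relevances / discounts))

def ndcg(true_scores, method_scores):
    # Order the true relevances by the method's predicted scores (best first),
    # then normalise by the DCG of the ideal (true-score) ordering.
    true_scores = np.asarray(true_scores, dtype=float)
    order = np.argsort(method_scores)[::-1]
    return dcg(true_scores[order]) / dcg(np.sort(true_scores)[::-1])
```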
Python has a module, RankEval, that implements this metric (and some others, if you want to try them). The repo is here, and there is a nice IPython notebook with examples.