Using NumPy to Find Median of Second Element of List of Tuples

Let's say I have a list of tuples, as follows:

list = [('a',1), ('b',3), ('c',5)]

My goal is to find the tuple whose second element is the median of all the second elements, and to return that tuple's first element. In the above case, I would want an output of b, since the median is 3. I tried using NumPy with the following code, to no avail:

import numpy as np

list = [('a',1), ('b',3), ('c',5)]
np.median(list, key=lambda x:x[1])

asked Aug 05 '15 15:08

Wally

2 Answers

You could calculate the median like this:

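np.median(list(dict(list_of_tuples).values()))
# Python 3: dict.values() returns a view, so it is wrapped in list() first.
# In Python 2.7, np.median(dict(list_of_tuples).values()) works directly.
# list_of_tuples is your list; naming it `list` would break the list() call.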

That converts your list to a dictionary first and then calculates the median of its values.

To get the actual key, you can do it like this (Python 3):

dl = dict(list_of_tuples)  # {'a': 1, 'b': 3, 'c': 5}

list(dl.keys())[list(dl.values()).index(np.median(list(dl.values())))]

which evaluates to 'b'. This assumes that the median is actually one of the values; if it is not, a ValueError is raised. You could therefore use a try/except like this, using the example from @Anand S Kumar's answer:

import numpy as np

l = [('a',1), ('b',3), ('c',5), ('d',22),('e',11),('f',3)]

# l = [('a',1), ('b',3), ('c',5)]

dl = dict(l)
values = list(dl.values())  # Python 3: turn the view into a list so it can be searched
med = np.median(values)
try:
    print(list(dl.keys())[values.index(med)])
except ValueError:
    print('The median is not in this list. Its value is', med)
    # min() over the keys returns the first key whose value is closest to the median
    print('The closest key is', min(dl, key=lambda k: abs(dl[k] - med)))

For the first list you will then obtain:

The median is not in this list. Its value is 4.0

The closest key is b

(The values 3 and 5 are equally close to 4.0, so 'c' and 'f' tie with 'b'; min returns the first one it encounters.)

for your example it just prints:

b
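The same idea can be sketched for Python 3 without the dictionary (so tuples that share a first element are not collapsed) and without the try/except: take the index of the value nearest to the median with np.argmin. The function name key_nearest_median is made up for illustration and is not part of NumPy, and the sketch assumes a non-empty list.

```python
import numpy as np

def key_nearest_median(pairs):
    """Return (label, median), where label is the first element of the
    tuple whose second element lies closest to the median.
    Hypothetical helper; assumes `pairs` is non-empty."""
    values = np.array([v for _, v in pairs], dtype=float)
    med = float(np.median(values))
    # argmin returns the first index on ties, so the earliest tuple wins
    idx = int(np.argmin(np.abs(values - med)))
    return pairs[idx][0], med

label, med = key_nearest_median([('a', 1), ('b', 3), ('c', 5)])
print(label, med)
```

When the median is one of the values, the distance is zero and that tuple is returned; otherwise the nearest one is, with ties going to whichever tuple appears first in the list.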

answered Sep 28 '22 10:09

Cleb

np.median does not accept an argument called key. Instead, you can use a list comprehension to take just the second element of each tuple. Example:

In [3]: l = [('a',1), ('b',3), ('c',5)]

In [4]: np.median([x[1] for x in l])
Out[4]: 3.0

In [5]: l = [('a',1), ('b',3), ('c',5), ('d',22),('e',11),('f',3)]

In [6]: np.median([x[1] for x in l])
Out[6]: 4.0

Also, unless this is only for the sake of the example, do not use list as a variable name; it shadows the built-in list.
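If NumPy is not strictly required, the key= idea from the question does work with the built-in sorted: sort the tuples by their second element and take the middle one. This is a minimal sketch, assuming that for an even number of tuples the lower of the two middle elements is acceptable (np.median would average them instead). median_label is a hypothetical helper name.

```python
from operator import itemgetter

def median_label(pairs):
    """Return the first element of the tuple that sits in the middle
    once the tuples are sorted by their second element.
    For an even count this picks the lower of the two middle tuples."""
    ordered = sorted(pairs, key=itemgetter(1))  # stable sort on the second element
    return ordered[(len(ordered) - 1) // 2][0]

l = [('a', 1), ('b', 3), ('c', 5)]
print(median_label(l))
```

Unlike the np.median approach, this always returns a label that exists in the data, at the cost of not reporting the averaged median for even-length input.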

answered Sep 28 '22 09:09

Anand S Kumar


Tags:

python

tuples

numpy