Pandas table lookup

Tags:

I have a pandas lookup table which looks like this

Grade   Lower_Boundary  Upper_Boundary
1   -110    -96
2   -96 -91
3   -91 -85
4   -85 -81
5   -81 -77
6   -77 -72
7   -72 -68
8   -68 -63
9   -63 -58
10  -58 -54
11  -54 -50
12  -50 -46
13  -46 -42
14  -42 -38
15  -38 -34
16  -34 -28
17  -28 -18
18  -18 -11
19  -11 -11
20  -11 -9

I have another pandas dataframe that looks contains score. I want to assign 'Grade' to the score column, by looking up the look up table. So based on which interval of lower and upper boundary the score falls, the grade should be assigned from that row in the lookup table. Is there a way to do it without typing a bunch of if then else statements? I am thinking just of excel's index match.

Score   Grade
-75 6
-75 6
-60 9
-66 8
-66 8
-98 1
-60 9
-82 4
-70 7
-60 9
-60 9
-60 9
-56 10
-70 7
-70 7
-70 7
-66 8
-56 10
-66 8
-66 8

423

asked Feb 17 '16 22:02

Zenvega

1 Answers

A one-line solution (I call your lookup table lookup):

df['Score'].apply(lambda score: lookup['Grade'][(lookup['Lower_Boundary'] <= score) & (lookup['Upper_Boundary'] > score)].values[0])

Explanation:

For a given score, here is how to find the grade:

score = -75
match = (lookup['Lower_Boundary'] <= score) & (lookup['Upper_Boundary'] > score)
grade = lookup['Grade'][match]

This return a series of length 1. You can get its value with, for instance:

grade.values[0]

All you need to do is apply the above to the score column. If you want a one-liner, use a lambda function:

df['Score'].apply(lambda score: lookup['Grade'][(lookup['Lower_Boundary'] <= score) & (lookup['Upper_Boundary'] > score)].values[0])

Otherwise the following would be more readable:

def lookup_grade(score):
    match = (lookup['Lower_Boundary'] <= score) & (lookup['Upper_Boundary'] > score)
    grade = lookup['Grade'][match]
    return grade.values[0]

df['Score'].apply(lookup_grade)

This approach would also make it easier to deal with cases when no match is found.

138

answered Oct 09 '22 11:10

IanS

Related questions
                            
                                Python Testing - Reset all mocks?
                            
                                Speeding up an iloc solution within a pandas dataframe
                            
                                TypeError: '_io.TextIOWrapper' object is not callable, creating text file error
                            
                                Parsing Yaml in Python: Detect duplicated keys
                            
                                Scipy.optimize.minimize method='SLSQP' ignores constraint
                            
                                Python-pandas Replace NA with the median or mean of a group in dataframe
                            
                                From string to sympy expression
                            
                                Generating LMDB for Caffe
                            
                                Default rounding mode in python, and how to specify it to another one?
                            
                                How do I change directory in python so it remains after running the script?
                            
                                How to write in .csv file from a generator in python
                            
                                Valid parameters for astype in NumPy
                            
                                How to loop through a column in Python?
                            
                                How does python assign values after assignment operator [duplicate]
                            
                                how to get insights for all campaigns in single query + Facebook marketing api
                            
                                Sharing Google sheet with service account email
                            
                                How to read images with different size in a TFRecord file
                            
                                Run shell command in pdb mode
                            
                                Pyodbc - print first 10 rows (python)
                            
                                Unsupported operand type(s) for +: 'float' and 'str' error [closed]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas table lookup

Tags:

python

lookup

pandas

Zenvega

People also ask

1 Answers

IanS

Recent Activity

Donate For Us