return default if pandas dataframe.loc location doesn't exist

Tags:

python

pandas

I find myself often having to check whether a column or row exists in a dataframe before trying to reference it. For example I end up adding a lot of code like:

    if 'mycol' in df.columns and 'myindex' in df.index:
        x = df.loc['myindex', 'mycol']
    else:
        x = mydefault

Is there any way to do this more nicely? For example on an arbitrary object I can do x = getattr(anobject, 'id', default) - is there anything similar to this in pandas? Really any way to achieve what I'm doing more gracefully?

asked May 01 '14 06:05

fantabolous

2 Answers

There is a get method for Series, which returns a default value when the label is not found. So you could do (with numpy imported as np):

    df.mycol.get(myindex, np.nan)

Example:

    In [117]: import numpy as np
              import pandas as pd
              df = pd.DataFrame({'dummy': np.arange(5), 'mycol': np.arange(5)})
              df
    Out[117]:
       dummy  mycol
    0      0      0
    1      1      1
    2      2      2
    3      3      3
    4      4      4

    [5 rows x 2 columns]

    In [118]: print(df.mycol.get(2, np.nan))
              print(df.mycol.get(5, np.nan))
    2
    nan
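Note that Series.get only covers a missing row label; df.mycol itself still fails if the column does not exist. As a rough sketch (not part of the original answer), the two lookups could be chained through DataFrame.get, which returns None for a missing column. The helper name lookup and its default argument are made up for illustration, and the example assumes an integer index with unique labels.

```python
import numpy as np
import pandas as pd


def lookup(df, row, col, default=np.nan):
    """Return the value at (row, col), or `default` if either label is missing.

    Hypothetical helper for illustration; not a pandas API.
    """
    column = df.get(col)  # DataFrame.get returns None when the column is absent
    if column is None:
        return default
    return column.get(row, default)  # Series.get returns `default` for a missing label


df = pd.DataFrame({'mycol': np.arange(5), 'dummy': np.arange(5)})

print(lookup(df, 2, 'mycol'))              # row and column both exist
print(lookup(df, 5, 'mycol'))              # missing row falls back to the default
print(lookup(df, 2, 'nocol', default=-1))  # missing column falls back to the default
```

This keeps the call site down to one line, much like getattr(anobject, 'id', default) in the question.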

answered Sep 21 '22 21:09

EdChum

Python has a mentality of asking for forgiveness instead of permission. You'll find a lot of posts on this matter.

In Python, catching exceptions is relatively inexpensive, so you're encouraged to do it. This is called the EAFP approach (easier to ask for forgiveness than permission).

For example:

    try:
        x = df.loc['myindex', 'mycol']
    except KeyError:
        x = mydefault
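If this pattern comes up a lot, the try/except can be wrapped once and reused. The following is only a sketch under that assumption; loc_or_default is a made-up name, not something pandas provides, and the small frame is sample data for the demonstration.

```python
import pandas as pd


def loc_or_default(df, row, col, default=None):
    """EAFP lookup: try df.loc first and fall back to `default` on KeyError.

    Hypothetical helper for illustration; not a pandas API.
    """
    try:
        return df.loc[row, col]
    except KeyError:
        # Raised when either the row label or the column label is missing
        return default


df = pd.DataFrame({'mycol': [10, 20]}, index=['a', 'b'])

print(loc_or_default(df, 'a', 'mycol'))              # existing cell
print(loc_or_default(df, 'z', 'mycol', default=-1))  # missing row
print(loc_or_default(df, 'a', 'nocol'))              # missing column, default is None
```

Catching only KeyError, rather than a bare except, means unrelated errors still surface.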

answered Sep 20 '22 21:09

FooBar
