If I have a DataFrame: <pre class="prettyprint"><code>myDF = DataFrame(data=[[11,11],[22,'2A'],[33,33]], columns = ['A','B']) </code></pre> Gives the following dataframe (Starting out on stackoverflow and don't have enough reputation for an image of the DataFrame) <pre class="prettyprint"><code> | A | B | 0 | 11 | 11 | 1 | 22 | 2A | 2 | 33 | 33 | </code></pre> If i want to convert column B to int values and drop values that can't be converted I have to do: <pre class="prettyprint"><code>def convertToInt(cell): try: return int(cell) except: return None myDF['B'] = myDF['B'].apply(convertToInt) </code></pre> If I only do: <blockquote> myDF['B'].apply(int) </blockquote> the error obviously is: <blockquote> C:\WinPython-32bit-2.7.5.3\python-2.7.5\lib\site-packages\pandas\lib.pyd in pandas.lib.map_infer (pandas\lib.c:42840)() ValueError: invalid literal for int() with base 10: '2A' </blockquote> Is there a way to add exception handling to myDF['B'].apply() Thank you in advance!

I had the same question, but for a more general case where it was hard to tell if the function would generate an exception (i.e. you couldn't explicitly check this condition with something as straightforward as <code>isdigit</code>). After thinking about it for a while, I came up with the solution of embedding the <code>try/except</code> syntax in a separate function. I'm posting a toy example in case it helps anyone. <pre class="prettyprint"><code>import pandas as pd import numpy as np x=pd.DataFrame(np.array([['a','a'], [1,2]])) def augment(x): try: return int(x)+1 except: return 'error:' + str(x) x[0].apply(lambda x: augment(x)) </code></pre>

Exception Handling in Pandas .apply() function

Tags:

python

exception-handling

pandas

If I have a DataFrame:

myDF = DataFrame(data=[[11,11],[22,'2A'],[33,33]], columns = ['A','B'])

Gives the following dataframe (Starting out on stackoverflow and don't have enough reputation for an image of the DataFrame)

   | A  | B  |  0  | 11 | 11 |  1  | 22 | 2A |  2  | 33 | 33 |

If i want to convert column B to int values and drop values that can't be converted I have to do:

def convertToInt(cell):     try:         return int(cell)     except:         return None myDF['B'] = myDF['B'].apply(convertToInt)

If I only do:

myDF['B'].apply(int)

the error obviously is:

C:\WinPython-32bit-2.7.5.3\python-2.7.5\lib\site-packages\pandas\lib.pyd in pandas.lib.map_infer (pandas\lib.c:42840)()

ValueError: invalid literal for int() with base 10: '2A'

Is there a way to add exception handling to myDF['B'].apply()

Thank you in advance!

643

asked Apr 03 '14 19:04

RukTech

1 Answers

I had the same question, but for a more general case where it was hard to tell if the function would generate an exception (i.e. you couldn't explicitly check this condition with something as straightforward as isdigit).

After thinking about it for a while, I came up with the solution of embedding the try/except syntax in a separate function. I'm posting a toy example in case it helps anyone.

import pandas as pd import numpy as np  x=pd.DataFrame(np.array([['a','a'], [1,2]]))  def augment(x):     try:         return int(x)+1     except:         return 'error:' + str(x)  x[0].apply(lambda x: augment(x))

197

answered Sep 19 '22 17:09

atkat12

Related questions
                            
                                why dict objects are unhashable in python?
                            
                                How to rearrange Pandas column sequence?
                            
                                Date Time Formats in Python
                            
                                How to get all combinations of length n in python
                            
                                Fetch an email with imaplib but do not mark it as SEEN
                            
                                How to split a sequence according to a predicate?
                            
                                Going to Python from R, what's the python equivalent of a data frame?
                            
                                Why is creating a class in Python so much slower than instantiating a class?
                            
                                Can i add help text in django model fields
                            
                                Loading text file containing both float and string using numpy.loadtxt
                            
                                Python: String Formatter Align center [duplicate]
                            
                                Understanding how to create a heap in Python
                            
                                suppress Scrapy Item printed in logs after pipeline
                            
                                pip/easy_install failure: failed to create process
                            
                                What is first class function in Python
                            
                                Tensorflow Confusion Matrix in TensorBoard
                            
                                Can I pass arguments to pytest fixtures?
                            
                                Sorting a defaultdict by value in python
                            
                                Python - dump dict as a json string
                            
                                Is it possible to remove a break point set with ipdb.set_trace()?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With