Creating New Column In Pandas Dataframe Using Regex [duplicate]

Tags:

I have a column in a pandas df of type object that I want to parse to get the first number in the string, and create a new column containing that number as an int.

For example:

Existing df

    col
    'foo 12 bar 8'
    'bar 3 foo'
    'bar 32bar 98'

Desired df

    col               col1
    'foo 12 bar 8'    12
    'bar 3 foo'       3
    'bar 32bar 98'    32

I have code that works on any individual cell in the column series

int(re.search(r'\d+', df.iloc[0]['col']).group())

The above code works fine and returns 12 as it should. But when I try to create a new column using the whole series:

df['col1'] = int(re.search(r'\d+', df['col']).group())

I get the following Error:

TypeError: expected string or bytes-like object

I tried wrapping a str() around df['col'] which got rid of the error but yielded all 0's in col1

I've also tried converting col to a list of strings and iterating through the list, which only yields the same error. Does anyone know what I'm doing wrong? Help would be much appreciated.

761

asked Sep 21 '17 18:09

Cam8593

1 Answers

This will do the trick:

search = []    
for values in df['col']:
    search.append(re.search(r'\d+', values).group())

df['col1'] = search

the output looks like this:

            col    col1
0  foo 12 bar 8      12
1     bar 3 foo       3
2  bar 32bar 98      32

140

answered Oct 02 '22 00:10

Albo

Related questions
                            
                                Matplotlib: create two subplots in line with two y axes each
                            
                                Python - mock imported dictionary
                            
                                Select data using a regular expression
                            
                                I can't access scrapyd port 6800 from browser
                            
                                How to replace multiple matches / groups with regexes?
                            
                                How to close kafka consumer once all messages are consumed?
                            
                                How do I load a caffe model and convert to a numpy array?
                            
                                Splitting dataframe column into equal windows in Pandas
                            
                                Error Keyerror 255 when executing pymysql.connect
                            
                                Numpy uint8_t arrays to vtkImageData
                            
                                MatPlotLib, datetimes, and TypeError: ufunc 'isfinite' not supported for the input types…
                            
                                Python Opencv - Cannot change pixel value of a picture
                            
                                Auto-generating username when adding a user with django
                            
                                Selecting string columns in pandas df (equivalent to df.select_dtypes)
                            
                                Is there a way to handle exceptions automatically with Python Click?
                            
                                what is the difference between "eval" and "int"
                            
                                python - what does yield (yield) do?
                            
                                NameError: global name 'flash' is not defined
                            
                                How to install Python using Windows Command Prompt
                            
                                Keras 2 fit_generator UserWarning: `steps_per_epoch` is not the same as the Keras 1 argument `samples_per_epoch`

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Creating New Column In Pandas Dataframe Using Regex [duplicate]

Tags:

python

regex

pandas

Cam8593

People also ask

1 Answers

Albo

Recent Activity

Donate For Us