Remove first x number of characters from each row in a column of a Python dataframe

Tags: python, string, replace, pandas, dataframe

I have a pandas dataframe with about 1,500 rows and 15 columns. In one specific column, I would like to remove the first 3 characters of each value. As a simple example, here is a dataframe:

import pandas as pd

d = {
    'Report Number': ['8761234567', '8679876543', '8994434555'],
    'Name': ['George', 'Bill', 'Sally'],
}

d = pd.DataFrame(d)

I would like to remove the first three characters from each field in the Report Number column of dataframe d.

995

asked Feb 20 '17 16:02

d84_n1nj4

2 Answers

Use the vectorised str methods to slice each string entry:

In [11]:
d['Report Number'] = d['Report Number'].str[3:]
d

Out[11]:
     Name Report Number
0  George       1234567
1    Bill       9876543
2   Sally       4434555
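The same slice can be written for an arbitrary number of leading characters. The sketch below is only illustrative: the variable `n` and the integer Series `nums` are assumptions that do not appear in the question. It also assumes that a numeric column would need converting with `astype(str)` first, since the `.str` accessor only works on string data.

```python
import pandas as pd

d = pd.DataFrame({
    'Report Number': ['8761234567', '8679876543', '8994434555'],
    'Name': ['George', 'Bill', 'Sally'],
})

n = 3  # hypothetical: number of leading characters to drop

# Slice every string in the column from position n onwards.
d['Report Number'] = d['Report Number'].str[n:]

# Assumption: if the report numbers were stored as integers,
# convert them to strings before using the .str accessor.
nums = pd.Series([8761234567, 8679876543, 8994434555])
trimmed = nums.astype(str).str[n:]
```

`d['Report Number'].str.slice(start=n)` should give the same result as `.str[n:]` if you prefer an explicit method call.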

104

answered Oct 07 '22 12:10

EdChum

It is worth noting that Pandas "vectorised" str methods are no more than Python-level loops.

Assuming clean data, you will often find a list comprehension more efficient:

# Python 3.6.0, Pandas 0.19.2

d = pd.concat([d]*10000, ignore_index=True)

%timeit d['Report Number'].str[3:]           # 12.1 ms per loop
%timeit [i[3:] for i in d['Report Number']]  # 5.78 ms per loop

Note these aren't equivalent, since the list comprehension does not deal with null data and other edge cases. For these situations, you may prefer the Pandas solution.
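To make the difference concrete, here is a minimal sketch using a hypothetical Series `s` with one missing value (not part of the question's data). It shows the Pandas slice passing the missing value through, the plain list comprehension failing, and one possible guarded comprehension that skips non-string entries.

```python
import pandas as pd

# Hypothetical data with a missing entry in the middle.
s = pd.Series(['8761234567', None, '8994434555'])

# The .str accessor leaves the missing value as missing.
vectorised = s.str[3:]

# The plain list comprehension cannot slice a missing value.
try:
    [i[3:] for i in s]
    failed = False
except TypeError:
    failed = True

# A guarded comprehension slices strings and passes anything else through.
guarded = [i[3:] if isinstance(i, str) else i for i in s]
```

The guard adds a per-element type check, so some of the speed advantage is likely to be lost; it is worth timing on your own data.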

answered Oct 07 '22 14:10

jpp
