Convert data on reading csv in pandas

Tags:

I'm reading a .csv file into a pandas dataframe. The .csv file contains several columns. Column 'A' contains a string '20-989-98766'. Is it possible to only read the last 5 characters '98766' from the string when loading the file?

Click to copy

df = pd.read_csv("test_data2.csv", column={'A':read the last 5 characters})

output:

Click to copy

A
98766
95476
.....

381

asked Apr 11 '17 15:04

magicsword

1 Answers

You can define a func and pass this as an arg to converters param for read_csv:

Click to copy

In [57]:
import io
import pandas as pd
def func(x):
    return x[-5:]
t="""column
'20-989-98766"""
df = pd.read_csv(io.StringIO(t), converters={'column': func})
df


Out[57]:
  column
0  98766

So here I define a func and pass this to converters in the form of a dict with your column name as the key, this will call the func on every row in your csv

so in your case the following should work:

Click to copy

df = pd.read_csv("test_data2.csv", converters={'A':func})

answered Oct 17 '22 14:10

EdChum

Related questions
                            
                                Use a custom failure message for `assertRaises()` in Python?
                            
                                How can I unit test the jinja2 template logic?
                            
                                What is the best method for using Datashader to plot data from a NumPy array?
                            
                                Prevent long lines getting wrapped in ruamel.yaml
                            
                                Python interface pattern and unit test code coverage
                            
                                Most important original feature(s) of Principal Component Analysis
                            
                                Conditional color with matplotlib scatter
                            
                                How can I access output embedding(output vector) in gensim word2vec?
                            
                                python3 + Pandas styles + Change alternate row color
                            
                                SSL Certification Error > hostname doesn't match
                            
                                Why is bokeh so much slower than matplotlib
                            
                                How to type hint a function that returns a function? [duplicate]
                            
                                How to add the second line of labels for axes
                            
                                Reindexing a pandas DataFrame using a dict (python3)
                            
                                Numpy reductions over successive non-contiguous slices
                            
                                Updating arrow position in matplotlib
                            
                                BigQuery invalid table name error when using Standard SQL in BigQuery API's
                            
                                Clear MatPlotLib figure in Jupyter Python notebook
                            
                                How to attach CSV file with MIME/SMTP and email?
                            
                                scikit-learn: cross_val_predict only works for partitions

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Convert data on reading csv in pandas

Tags:

python

pandas

magicsword

People also ask

1 Answers

EdChum

Recent Activity

Donate For Us