Pandas Read CSV file with variable rows to skip with special character at the beginning of row

Tags:

When reading a CSV file using pandas, read_csv method, how do I skip the lines if the number of lines are not known in advance ?

I have a CSV file which contains some meta-data at the beginning of the file and then contains the header and actual data.

The meta data always start with a # sign and it would always be at the top of CSV file.
The number of lines for meta data is not fixed.

Example for the file sample_file.csv:

# Meta-Data Line 1
# Meta-Data Line 2
# Meta-Data Line 3
col1,col2,col3
a,b,c
d,e,f
g,h,i

How would I use Pandas read_csv function and skiprows parameter to read the csv ?

df = pd.read_csv('sample_file.csv', skiprows=?)

Does Pandas 0.19.X or greater support this use case ?

443

asked Jan 30 '17 21:01

Spandan Brahmbhatt

1 Answers

comment is what you're searching for:

df = pd.read_csv('sample_file.csv', comment='#')

From the documentation:

comment : str, default None

Indicates remainder of line should not be parsed. If found at the beginning of a line, the line will be ignored altogether. This parameter must be a single character. Like empty lines (as long as skip_blank_lines=True), fully commented lines are ignored by the parameter header but not by skiprows. For example, if comment=’#’, parsing ‘#emptyna,b,cn1,2,3’ with header=0 will result in ‘a,b,c’ being treated as the header.

answered Oct 20 '22 11:10

Zeugma

Related questions
                            
                                I want to plot perpendicular vectors in Python
                            
                                Using SQLAlchemy how do I populate rows after creating the db using db.create_all()
                            
                                z-axis scaling and limits in a 3-D scatter plot in Matplotlib
                            
                                How can I determine the function in which a closure was created?
                            
                                When should I use type checking (if ever) in Python?
                            
                                How to send data to Flask via AJAX?
                            
                                Pandas - sort and head inside groupby
                            
                                Puzzle: how many ways can you hit a target with a laser beam within four reflective walls
                            
                                Django redis LPUSH / RPUSH
                            
                                Installing Keras package with conda install
                            
                                PyQt5 and Python 3.6 installation?
                            
                                Retrieve a number from a span tag, using Python requests and Beautiful Soup
                            
                                Function input() in pyspark
                            
                                Is LIBGDX Slower in python than Java
                            
                                Change execution concurrency of Airflow DAG
                            
                                High GPU Memory-Usage but zero volatile gpu-util
                            
                                Pytest: running tests multiple times with different input data
                            
                                scikit-learn - Convert pipeline prediction to original value/scale
                            
                                How to code a sequence to sequence RNN in keras?
                            
                                Testing the connection of Postgres-DB

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas Read CSV file with variable rows to skip with special character at the beginning of row

Tags:

python

pandas

csv

Spandan Brahmbhatt

People also ask

1 Answers

Zeugma

Recent Activity

Donate For Us