I'm having trouble figuring out how to skip n rows in a csv file but keep the header which is the 1 row. What I want to do is iterate but keep the header from the first row. <code>skiprows</code> makes the header the first row after the skipped rows. What is the best way of doing this? <pre class="prettyprint"><code>data = pd.read_csv('test.csv', sep='|', header=0, skiprows=10, nrows=10) </code></pre>

You can pass a list of row numbers to <code>skiprows</code> instead of an integer. By giving the function the integer 10, you're just skipping the first 10 lines. To keep the first row 0 (as the header) and then skip everything else up to row 10, you can write: <pre class="prettyprint"><code>pd.read_csv('test.csv', sep='|', skiprows=range(1, 10)) </code></pre> <hr> <h3>Other ways to skip rows using <code>read_csv</code> </h3> The two main ways to control which rows <code>read_csv</code> uses are the <code>header</code> or <code>skiprows</code> parameters. Supose we have the following CSV file with one column: <pre class="prettyprint"><code>a b c d e f </code></pre> In each of the examples below, this file is <code>f = io.StringIO("\n".join("abcdef"))</code>. <ul> <li> Read all lines as values (no header, defaults to integers) <pre class="prettyprint"><code>>>> pd.read_csv(f, header=None) 0 0 a 1 b 2 c 3 d 4 e 5 f </code></pre> </li> <li> Use a particular row as the header (skip all lines before that): <pre class="prettyprint"><code>>>> pd.read_csv(f, header=3) d 0 e 1 f </code></pre> </li> <li> Use a multiple rows as the header creating a MultiIndex (skip all lines before the last specified header line): <pre class="prettyprint"><code>>>> pd.read_csv(f, header=[2, 4]) c e 0 f </code></pre> </li> <li> Skip N rows from the start of the file (the first row that's not skipped is the header): <pre class="prettyprint"><code>>>> pd.read_csv(f, skiprows=3) d 0 e 1 f </code></pre> </li> <li> Skip one or more rows by giving the row indices (the first row that's not skipped is the header): <pre class="prettyprint"><code>>>> pd.read_csv(f, skiprows=[2, 4]) a 0 b 1 d 2 f </code></pre> </li> </ul>

Python Pandas read_csv skip rows but keep header

Tags:

python

pandas

csv

I'm having trouble figuring out how to skip n rows in a csv file but keep the header which is the 1 row.

What I want to do is iterate but keep the header from the first row. skiprows makes the header the first row after the skipped rows. What is the best way of doing this?

data = pd.read_csv('test.csv', sep='|', header=0, skiprows=10, nrows=10)

213

asked Dec 05 '14 22:12

mcd

1 Answers

You can pass a list of row numbers to skiprows instead of an integer.

By giving the function the integer 10, you're just skipping the first 10 lines.

To keep the first row 0 (as the header) and then skip everything else up to row 10, you can write:

pd.read_csv('test.csv', sep='|', skiprows=range(1, 10))

Other ways to skip rows using `read_csv`

The two main ways to control which rows read_csv uses are the header or skiprows parameters.

Supose we have the following CSV file with one column:

a b c d e f

In each of the examples below, this file is f = io.StringIO("\n".join("abcdef")).

Read all lines as values (no header, defaults to integers)

>>> pd.read_csv(f, header=None)    0 0  a 1  b 2  c 3  d 4  e 5  f

Use a particular row as the header (skip all lines before that):
```
>>> pd.read_csv(f, header=3)    d 0  e 1  f 
```

Use a multiple rows as the header creating a MultiIndex (skip all lines before the last specified header line):

>>> pd.read_csv(f, header=[2, 4])                                                                                                                                                                            c    e 0  f

Skip N rows from the start of the file (the first row that's not skipped is the header):

>>> pd.read_csv(f, skiprows=3)                                                                                                                                                                          d 0  e 1  f

Skip one or more rows by giving the row indices (the first row that's not skipped is the header):

>>> pd.read_csv(f, skiprows=[2, 4])                                                                                                                                                                          a 0  b 1  d 2  f

answered Oct 01 '22 11:10

Alex Riley

Related questions
                            
                                Matplotlib: how to set the current figure?
                            
                                Is it possible to use Python to write cross-platform apps for both iOS and Android?
                            
                                Flattening a list of NumPy arrays?
                            
                                Does the Python 3 interpreter have a JIT feature?
                            
                                Python method/function arguments starting with asterisk and dual asterisk [duplicate]
                            
                                Creating a new corpus with NLTK
                            
                                Will scikit-learn utilize GPU?
                            
                                making matplotlib scatter plots from dataframes in Python's pandas
                            
                                Where is Python's "best ASCII for this Unicode" database? [closed]
                            
                                django syncdb and an updated model
                            
                                How to decorate all functions of a class without typing it over and over for each method? [duplicate]
                            
                                Getting index of item while processing a list using map in python
                            
                                Is there an idiomatic file extension for Jinja templates?
                            
                                TypeError: 'zip' object is not subscriptable
                            
                                Boolean operators vs Bitwise operators
                            
                                str.format() raises KeyError
                            
                                How to get all methods of a python class with given decorator
                            
                                How can I schedule updates (f/e, to update a clock) in tkinter?
                            
                                Converting a string representation of a list into an actual list object [duplicate]
                            
                                Translate every element in numpy array according to key

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python Pandas read_csv skip rows but keep header

Tags:

python

pandas

csv

mcd

People also ask

1 Answers

Other ways to skip rows using `read_csv`

Alex Riley

Recent Activity

Donate For Us

Python Pandas read_csv skip rows but keep header

Tags:

python

pandas

csv

mcd

People also ask

1 Answers

Other ways to skip rows using read_csv

Alex Riley

Related questions

Recent Activity

Donate For Us

Other ways to skip rows using `read_csv`