In Python pandas, start row index from 1 instead of zero without creating additional column

Tags: python, indexing, pandas, dataframe

I know that I can reset the index like so:

df.reset_index(inplace=True)

but this starts the index from 0. I want it to start from 1. How do I do that without creating any extra columns, while keeping the index/reset_index functionality and options? I do not want to create a new dataframe, so inplace=True should still apply.

914

asked Aug 27 '15 12:08

Bram Vanroy

1 Answer

Just assign a new index array directly:

df.index = np.arange(1, len(df) + 1)

Example:

In [151]: df = pd.DataFrame({'a': np.random.randn(5)})
          df
Out[151]:
          a
0  0.443638
1  0.037882
2 -0.210275
3 -0.344092
4  0.997045

In [152]: df.index = np.arange(1, len(df) + 1)
          df
Out[152]:
          a
1  0.443638
2  0.037882
3 -0.210275
4 -0.344092
5  0.997045

Or, if the index is already 0-based, just add 1 to it:

df.index = df.index + 1
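Adding 1 only gives a 1-based index when the existing index is already the default 0-based range. A minimal sketch for the case where it is not, assuming the old index values can be discarded: reset_index(drop=True, inplace=True) rebuilds the 0-based index without inserting a column, and the shift is applied afterwards. The frame, its column name, and its starting index are made up for illustration.

```python
import pandas as pd

# Hypothetical frame whose index is not 0..n-1 (e.g. left over from filtering).
df = pd.DataFrame({"a": [30, 40, 50]}, index=[2, 3, 4])

# drop=True discards the old index instead of inserting it as a new column.
df.reset_index(drop=True, inplace=True)  # index is now 0, 1, 2
df.index += 1                            # index is now 1, 2, 3

print(df.index.tolist())    # [1, 2, 3]
print(df.columns.tolist())  # ['a']
```

This keeps the reset_index call and its options, and the same df object is modified throughout.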

TIMINGS

For some reason I couldn't time reset_index, but here are timings on a 100,000-row DataFrame:

In [160]: %timeit df.index = df.index + 1
The slowest run took 6.45 times longer than the fastest. This could mean that an intermediate result is being cached
10000 loops, best of 3: 107 µs per loop

In [161]: %timeit df.index = np.arange(1, len(df) + 1)
10000 loops, best of 3: 154 µs per loop

Without a timing for reset_index I can't say definitively, but adding 1 to each index value looks faster than assigning a new array (107 µs vs. 154 µs) if the index is already 0-based.
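The comparison can be rerun outside IPython with the standard timeit module. The sketch below is only one way to set it up, and the function names are illustrative. It also times reset_index with drop=True, on the assumption that dropping the old index is acceptable. A likely reason a bare reset_index(inplace=True) is awkward to time is that each call moves the old index into a new column, so repeated runs keep changing the frame. No timings are quoted here because they depend on the machine and the pandas version.

```python
import timeit

import numpy as np
import pandas as pd

N_ROWS = 100_000  # same size as the timings above

df = pd.DataFrame({"a": np.random.randn(N_ROWS)})
base = df.index  # keep the original 0-based index so every run does the same work


def add_one():
    df.index = base + 1


def assign_arange():
    df.index = np.arange(1, len(df) + 1)


def reset_then_shift():
    # drop=True avoids adding a column, so the call can be repeated safely.
    df.reset_index(drop=True, inplace=True)
    df.index += 1


LOOPS = 100
timings = {}
for fn in (add_one, assign_arange, reset_then_shift):
    best = min(timeit.repeat(fn, number=LOOPS, repeat=3))
    timings[fn.__name__] = best / LOOPS  # seconds per call

for name, seconds in timings.items():
    print(f"{name}: {seconds * 1e6:.1f} µs per call")
```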

194

answered Oct 02 '22 20:10

EdChum
