I want to know if there is a way to assign new .loc values to a dataframe in order to index this row. I was writing code where I was indexing rows by the .loc[], but now I have randomly shuffled the dataframe in to two sets and so when I index the row by .loc[], i get a key error as the row might be in the other dataset.
I want to be able to assign a new .loc[] index to the data right after I shuffle so I can still index as I always have.
Example, I have a dataframe:
length height... water type
4 15.85 14.7240 ... 0.173 orange
92 20.06 17.3565 ... 0.171 orange
155 22.71 15.8040 ... 0.169 apple
142 11.76 12.2355 ... 0.175 pear
91 20.33 16.0785 ... 0.175 apple
The index given is displayed on the left (i.e 4 to 91), I want to change these index values to what I want to assign them, which is in sequential order (i.e 0 to 4). So that when I call .loc[0] it will return the first row and not give me a KeyError as that row is in another dataset
Thanks.
From Pandas documentation:
>>> df = pd.DataFrame([('bird', 389.0),
... ('bird', 24.0),
... ('mammal', 80.5),
... ('mammal', np.nan)],
... index=['falcon', 'parrot', 'lion', 'monkey'],
... columns=('class', 'max_speed'))
>>> df
class max_speed
falcon bird 389.0
parrot bird 24.0
lion mammal 80.5
monkey mammal NaN
using reset_index with drop parameter:
>>> df.reset_index(drop=True)
class max_speed
0 bird 389.0
1 bird 24.0
2 mammal 80.5
3 mammal NaN
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With