Select and modify a slice in pandas dataframe by integer index

Tags:

I have a dataframe like the following:

df = pd.DataFrame([[1,2],[10,20],[10,2],[1,40]],columns = ['a','b'])
    a   b
0   1   2
1   10  20
2   10  2
3   1   40

I want to select the b column where a == 1, the following is a classic selecting:

df[df.a == 1].b
    a   b
0   1   2
3   1   40

Then I want to select the ith row of this subdataframe, which isn't the row with index i. There again are several ways, like the following:

df[df.a == 1].b.iloc[[1]]
Output: 
3    40
Name: b, dtype: int64

So far so good. The problem is when I try to modify the value I got there, indeed this selection method yields a copy of the slice of the dataframe, not the object itself. Therefore I can't modify it inplace.

test[test.a == 1].b.iloc[[1]] = 3
SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame

I don't know in which part the 'copy' problem lies, since the two following yield the same problem:

test.iloc[[3]].b = 3
test[test.a == 1].b = 3

So my question is this one: how can I change a value by both a mask selection (conditionally on the a column value) and a row selection (by the rank of the row in the subdataframe, not its index value)?

297

asked Jun 21 '17 07:06

ysearka

1 Answers

Use loc with the boolean mask and directly pass the index up:

In[178]:
df.loc[df.loc[df['a'] == 1,'b'].index[1], 'b'] = 3
df

Out[178]: 
    a   b
0   1   2
1  10  20
2  10   2
3   1   3

So here we mask the df using df['a'] == 1, this returns a boolean array and we mask the df and select just column 'b':

In[179]:
df.loc[df['a'] == 1,'b']

Out[179]: 
0    2
3    40
Name: b, dtype: int64

then just subscript the index directly:

In[180]:
df.loc[df['a'] == 1,'b'].index[1]

Out[180]: 3

We can then pass this index label back up to the top-level loc.

This test[test.a == 1].b.iloc[[1]] = 3 is chained indexing which is why the warning is raised.

189

answered Oct 14 '22 03:10

EdChum

Related questions
                            
                                Running Python scripts inside Android Studio [closed]
                            
                                mock xmlrpc.client method python
                            
                                How to pass arguments to pytest if pytest is run programmatically from another module?
                            
                                How to access numpy default global random number generator
                            
                                Pandas stacked area chart with zero values
                            
                                How to show the count values on the top of a bar in a countplot?
                            
                                How to use xml sax parser to read and write a large xml?
                            
                                Tweepy.cursor multiple / OR logic function for query terms
                            
                                Python, Bokeh: How to turn off auto-update of axes
                            
                                Declaring new variables inside class methods
                            
                                How to optimize a sklearn pipeline, using XGboost, for a different `eval_metric`?
                            
                                Unexpected 32-bit integer overflow in pandas/numpy int64 (python 3.6)
                            
                                import matplotlib failing on Heroku
                            
                                Save a pivottablejs figure to file
                            
                                Broadcast 1D array against 2D array for lexsort : Permutation for sorting each column independently when considering yet another vector
                            
                                how to pipe multiple sql- and py-scripts
                            
                                Adding to sqlalchemy mapping class non db attributes
                            
                                Windows 10 conda is not recognized as an internal or external command
                            
                                Passing a list as a url value to urlopen
                            
                                django.core.exceptions.ValidationError: ["'' is not a valid UUID."]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Select and modify a slice in pandas dataframe by integer index

Tags:

python

indexing

pandas

ysearka

People also ask

1 Answers

EdChum

Recent Activity

Donate For Us