I have big data set and there are tons of values which are way over average. For example, <pre class="prettyprint"><code> A B 1 'H' 10 2 'E' 10000 3 'L' 12 4 'L' 8 5 'O' 11 </code></pre> and I want to set <code>B2</code> cell as 0 and I tried this, <pre class="prettyprint"><code>df['B'] = df['B'].replace([df['B'] > 15], 0) </code></pre> But didn't get any luck. How can make my data frame like this, <pre class="prettyprint"><code> A B 1 'H' 10 2 'E' 0 3 'L' 12 4 'L' 8 5 'O' 11 </code></pre> Thank you!

You are really close - instead of <code>replace</code>, use <code>mask</code>: <pre class="prettyprint"><code>df['B'] = df['B'].mask(df['B'] > 15, 0) print (df) A B 1 'H' 10 2 'E' 0 3 'L' 12 4 'L' 8 5 'O' 11 </code></pre> Alternative: <pre class="prettyprint"><code>df['B'] = np.where(df['B'] > 15, 0, df['B']) print (df) A B 1 'H' 10 2 'E' 0 3 'L' 12 4 'L' 8 5 'O' 11 </code></pre> If you want replace some range: <pre class="prettyprint"><code>df['B'] = np.where(df['B'].between(8,11), 0, df['B']) print (df) A B 1 'H' 0 2 'E' 10000 3 'L' 12 4 'L' 0 5 'O' 0 </code></pre>

Replace a specific range of values in a pandas dataframe

    A         B
1  'H'       10
2  'E'    10000
3  'L'       12
4  'L'        8
5  'O'       11

and I want to set B2 cell as 0 and I tried this,

df['B'] = df['B'].replace([df['B'] > 15], 0)

But didn't get any luck. How can make my data frame like this,

    A         B
1  'H'       10
2  'E'        0
3  'L'       12
4  'L'        8
5  'O'       11

Thank you!

492

asked Sep 12 '17 05:09

jayko03

1 Answers

You are really close - instead of replace, use mask:

df['B'] = df['B'].mask(df['B'] > 15, 0)
print (df)
     A   B
1  'H'  10
2  'E'   0
3  'L'  12
4  'L'   8
5  'O'  11

Alternative:

df['B'] = np.where(df['B'] > 15, 0, df['B'])
print (df)
     A   B
1  'H'  10
2  'E'   0
3  'L'  12
4  'L'   8
5  'O'  11

If you want replace some range:

df['B'] = np.where(df['B'].between(8,11), 0, df['B'])
print (df)
     A      B
1  'H'      0
2  'E'  10000
3  'L'     12
4  'L'      0
5  'O'      0

141

answered Nov 15 '22 09:11

jezrael

Related questions
                            
                                scatter plots in seaborn/matplotlib with point size and color given by continuous dataframe column
                            
                                [python][selenium] on-screen position of element
                            
                                Multiple constructors in python
                            
                                python pandas resample count and sum
                            
                                Python Sqlite UPDATE multiple values
                            
                                How to change the values of a column based on two conditions in Python
                            
                                Loop through generator two items at a time
                            
                                Pymongo: How to check if the update was successful ?
                            
                                Use str.join with generator expression in python [duplicate]
                            
                                text caption not appearing matplotlib
                            
                                python how to get name of the enum
                            
                                Request body serialization differences when lambda function invoked via API Gateway v Lambda Console
                            
                                update frame in matplotlib with live camera preview
                            
                                What value does readline return when reaching the end of the file in Python?
                            
                                Django says that MySQL does not allow unique CharFields to have a max_length > 255, but it does
                            
                                Removing nan from list - Python
                            
                                mysql.connector.errors.ProgrammingError: 1064 (4200): You have an error in your SQL syntax;
                            
                                OpenCV Python: fast solution for 3-channel float32 image reading?
                            
                                Rowwise min() and max() fails for column with NaNs
                            
                                Reuse pytest fixtures

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Replace a specific range of values in a pandas dataframe

Tags:

python

pandas

dataframe

jayko03

People also ask

1 Answers

jezrael

Recent Activity

Donate For Us