Pandas dataframe: How to set values after an index to 0

Tags: python, pandas, dataframe

I have a pandas DataFrame in which each row contains a name followed by many numeric columns. For each row, there is a specific column position (calculated separately for every row), and I want to set all the values after that position in the row to 0.

I tried a few things and ended up with the following working code:

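for i in range(n):  # n is the number of rows, len(df)
    # position of the column whose label matches this row's 'match_this_value'
    index = np.where(df.columns == df['match_this_value'][i])[0].item()
    # put the day of 'take_this_value' at that position, then zero the rest of the row
    df.iloc[i, index] = df['take_this_value'][i].day
    df.iloc[i, (index+1):] = 0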

However, this is slow because my dataset is very large: it takes about 70 seconds on my sample dataset alone, and the full dataset is much larger. Is there a faster way to do this, ideally without looping through each row?

EDIT: Sorry, I should have specified how the index is calculated. For each row, the index is found with np.where by comparing the column labels of the DataFrame against the value in one specific column and taking the position of the match. So something like:

index = np.where(df.columns == df['match_this_value'][i])[0].item()

Once I have this index, I set the value at that column to a value taken from another column of the DataFrame (the day of take_this_value), and then set everything after it to 0, exactly as in the loop above.


asked Jun 21 '19 12:06

Mat R

1 Answer

You could do:


import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.randn(4, 4), columns=list('ABCD'))

#           A         B         C         D
# 0  0.750017  0.582230  1.411253 -0.379428
# 1 -0.747129  1.800677 -1.243459 -0.098760
# 2 -0.742997 -0.035036  1.012052 -0.767602
# 3 -0.694679  1.013968 -1.000412  0.752191

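# one cut-off column position per row (random here, just for illustration)
indexes = np.random.choice(range(df.shape[1]), df.shape[0])
# array([0, 3, 1, 1])
# column positions 0..ncols-1, repeated for every row
df_indexes = np.tile(range(df.shape[1]), (df.shape[0], 1))
# zero every cell whose column position is past that row's cut-off
df[df_indexes>indexes[:, None]] = 0
print(df)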
#           A         B         C        D
# 0  0.750017  0.000000  0.000000  0.00000
# 1 -0.747129  1.800677 -1.243459 -0.09876
# 2 -0.742997 -0.035036  0.000000  0.00000
# 3 -0.694679  1.013968  0.000000  0.00000

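Here, df_indexes>indexes[:, None] is a boolean mask with the same shape as the DataFrame. It is True wherever a cell's column position is greater than that row's index, so only the values after the index are set to 0. In your case, replace indexes with the array of "specific indexes" you calculate for each row.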
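
As a rough sketch of how the same mask idea might be applied to the loop in the question: the per-row positions can be looked up in one call with Index.get_indexer instead of np.where inside a loop, and the work can be done on a NumPy copy of the numeric columns. The function name zero_after_match, the value_cols argument and the sample data (columns d1 to d4) are made up for illustration; the sketch also assumes that match_this_value holds column labels and take_this_value is a datetime column, as the question's code suggests.

```python
import numpy as np
import pandas as pd


def zero_after_match(df, value_cols):
    """Hypothetical helper: vectorized version of the loop in the question.

    value_cols is the block of numeric columns to modify (an assumption;
    the question does not say where the helper columns sit).
    """
    out = df.copy()
    value_cols = pd.Index(value_cols)

    # Position of each row's matching column inside value_cols (-1 if missing).
    pos = value_cols.get_indexer(out["match_this_value"])
    if (pos < 0).any():
        raise ValueError("some rows have no matching column")

    # Work on a writable NumPy copy of the numeric block.
    values = out[value_cols].to_numpy(copy=True)

    # Put the day of take_this_value at the matched position of every row.
    values[np.arange(len(out)), pos] = out["take_this_value"].dt.day.to_numpy()

    # Broadcasting: True where a cell's column position is past the row's position.
    values[np.arange(values.shape[1]) > pos[:, None]] = 0

    out[value_cols] = values
    return out


# Made-up sample data for illustration only.
df = pd.DataFrame({
    "name": ["a", "b", "c"],
    "match_this_value": ["d2", "d4", "d1"],
    "take_this_value": pd.to_datetime(["2019-06-21", "2019-06-05", "2019-06-30"]),
    "d1": [1, 1, 1],
    "d2": [2, 2, 2],
    "d3": [3, 3, 3],
    "d4": [4, 4, 4],
})

result = zero_after_match(df, ["d1", "d2", "d3", "d4"])
print(result)
```

Unlike the original loop, this only touches the columns listed in value_cols, so helper columns such as take_this_value are never overwritten, and it raises an error for rows without a match instead of failing on .item().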


answered Nov 15 '22 00:11

Ayoub ZAROU
