Replace ones in binary columns with values from another column

Tags:

I have a data frame that looks like this:

df = pd.DataFrame({"value": [4, 5, 3], "item1": [0, 1, 0], "item2": [1, 0, 0], "item3": [0, 0, 1]})
df

  value item1   item2   item3
0   4   0      1         0
1   5   1      0         0
2   3   0      0         1

Basically what I want to do is replace the value of the one hot encoded elements with the value from the "value" column and then delete the "value" column. The resulting data frame should be like this:

Click to copy

df_out = pd.DataFrame({"item1": [0, 5, 0], "item2": [4, 0, 0], "item3": [0, 0, 3]})

   item1    item2   item3
0   0        4      0
1   5        0      0
2   0        0      3

746

asked Dec 05 '18 12:12

gorjan

2 Answers

Why not just multiply?

Click to copy

df.pop('value').values * df

   item1  item2  item3
0      0      5      0
1      4      0      0
2      0      0      3

DataFrame.pop has the nice effect of in-place removing and returning a column, so you can do this in a single step.

if the "item_*" columns have anything besides 1 in them, then you can multiply with bools:

Click to copy

df.pop('value').values * df.astype(bool)

   item1  item2  item3
0      0      5      0
1      4      0      0
2      0      0      3

If your DataFrame has other columns, then do this:

Click to copy

df
   value  name  item1  item2  item3
0      4  John      0      1      0
1      5  Mike      1      0      0
2      3  Stan      0      0      1

# cols = df.columns[df.columns.str.startswith('item')]
cols = df.filter(like='item').columns
df[cols] = df.pop('value').values * df[cols]

df
  name  item1  item2  item3
0  John      0      5      0
1  Mike      4      0      0
2  Stan      0      0      3

answered Sep 20 '22 17:09

cs95

You could do something like:

Click to copy

df = pd.DataFrame([df['value']*df['item1'],df['value']*df['item2'],df['value']*df['item3']])
df.columns = ['item1','item2','item3']

EDIT: As this answer will not scale well to many columns as @coldspeed comments, it should be done iterating a loop:

Click to copy

 cols = ['item1','item2','item3']
 for c in cols:
     df[c] *= df['value']
 df.drop('value',axis=1,inplace=True)

answered Sep 17 '22 17:09

horro

Related questions
                            
                                Heroku ---> Installing pip remote: AttributeError: module 'pip._vendor.requests' has no attribute 'Session'
                            
                                I get an error when return a queryset objects: Cannot resolve expression type, unknown output_field
                            
                                Converting pandas data frame with degree minute second (DMS) coordinates to decimal degrees
                            
                                Background color when cropping image with PIL
                            
                                How to use Paramiko getfo to download file from SFTP server to memory to process it
                            
                                Python Dash: Custom CSS
                            
                                How do I select only a specific digit from the MNIST dataset provided by Keras?
                            
                                No module named 'termcolor'
                            
                                How to draw bounding box on best matches?
                            
                                Pandas get_dummies on multiple columns
                            
                                Anaconda unicode error on command line startup on Windows
                            
                                (Easiest) Way to use Python 3.6 and 3.7 on same computer?
                            
                                RuntimeError: OrderedDict mutated during iteration (Python3)
                            
                                Tensorflow——keras model.save() raise NotImplementedError
                            
                                Evaluate Xpath2.0 in python
                            
                                Tensorflow v1.10+ why is an input serving receiver function needed when checkpoints are made without it?
                            
                                Python, run package with `python3.6 -m somepackge.run`
                            
                                ssl module in python is not available Windows 7
                            
                                Django compress error: Invalid input of type: 'CacheKey'
                            
                                Why is ‘==‘ coming before ‘in’ in Python?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Replace ones in binary columns with values from another column

Tags:

python

pandas

dataframe

gorjan

People also ask

2 Answers

cs95

horro

Recent Activity

Donate For Us