Create a new column based on previous row value and delete the current row

Q: How do I change the value of a row in pandas?

Updating Row Values Like updating the columns, the row value updating is also very simple. You have to locate the row value first and then, you can update that row with new values. You can use the pandas loc function to locate the rows. We have located row number 3, which has the details of the fruit, Strawberry.

Q: How to update a row in Google Sheets with new values?

You have to locate the row value first and then, you can update that row with new values. You can use the pandas loc function to locate the rows. We have located row number 3, which has the details of the fruit, Strawberry. Now, we have to update this row with a new fruit named Pineapple and its details.

Q: How to update multiple columns at the same time?

You can even update multiple column names at a single time. For that, you have to add other column names separated by a comma under the curl braces. #multile column update data.rename(columns = {'Fruit':'Fruit Name','Colour':'Color','Price':'Cost'}) Just like this, you can update all your columns at the same time. 3.

Tags:

python

for-loop

python-3.x

pandas

dataframe

I have an input dataframe which can be generated from the code given below

  df = pd.DataFrame({'subjectID' :[1,1,2,2],'keys': 
  ['H1Date','H1','H2Date','H2'],'Values': 
  ['10/30/2006',4,'8/21/2006',6.4]})

The input dataframe looks like as shown below

enter image description here

This is what I did

s1 = df.set_index('subjectID').stack().reset_index()

s1.rename(columns={0:'values'}, 
             inplace=True)
d1 = s1[s1['level_1'].str.contains('Date')]
d2 = s1[~s1['level_1'].str.contains('Date')]

d1['g'] = d1.groupby('subjectID').cumcount()
d2['g'] = d2.groupby('subjectID').cumcount()

d3 = pd.merge(d1,d2,on=["subjectID", 'g'],how='left').drop(['g','level_1_x','level_1_y'], axis=1)

Though it works, I am afraid that this may not be the best approach. As we might have more than 200 columns and 50k RECORDS. Any help to improve my code further is very helpful.

I expect my output dataframe to look like as shown below

enter image description here

594

asked Jun 29 '19 07:06

The Great

Video Answer

1 Answers

may be something like:

s=df.groupby(df['keys'].str.contains('Date').cumsum()).cumcount()+1

final=(df.assign(s=s.astype(str)).set_index(['subjectID','s']).
       unstack().sort_values(by='s',axis=1))
final.columns=final.columns.map(''.join)
print(final)

           keys1     Values1 keys2 Values2
subjectID                                  
1          H1Date  10/30/2006    H1       4
2          H2Date   8/21/2006    H2     6.4

167

answered Oct 26 '22 23:10

anky

Related questions
                            
                                Python - How to set French locale?
                            
                                AttributeError: module 'cv2.cv2' has no attribute 'freetype' in OpenCV
                            
                                Creating string art from image
                            
                                Pygame/MoviePy - The video displays with a terrible framerate and the window size is bigger than my screen
                            
                                How can I convert fastai image from open_image() format to opencv?
                            
                                Python 3 - ValueError: Found array with 0 sample(s) (shape=(0, 11)) while a minimum of 1 is required by MinMaxScaler
                            
                                Plotting histograms with Arabic characters
                            
                                Keras multi-label image classification with F1-score
                            
                                Use generic in type alias
                            
                                How to start a python operator boto3 AWS-glue task in airflow based on another AWS-glue task successful completion in Airflow?
                            
                                Style dash components with dark-theme bootstrap css
                            
                                POST request to API Prestashop with Python
                            
                                DRF change default viewset's lookup_field for custom action
                            
                                InvalidArgumentException: Message: invalid argument: user data directory is already in use error while initiating Chrome with ChromeDriver Selenium
                            
                                ModuleNotFoundError - Airflow error while import Python file
                            
                                How to properly return a list from a pytest fixture for use in parametrize?
                            
                                Lenient JSON Parser for Python
                            
                                Pandas dataframe: How to set values after an index to 0
                            
                                python signalR - 500 Server Error when trying to connect
                            
                                Loop over a tensor and apply function to each element

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With