Pandas - Applying Function to every other row

Tags:

python

pandas

I have a data frame with one row per team per game, and I am trying to put the home and away teams and their scores for each game on a single row. I have tried a lambda function, but without success. The data frame I currently have is the first one below, and I would like to create a dataset in the form of the second one. Thanks.

GameId      Team    Home    Score
1           Spirit  1       81
1           Rockers 0       66
2           Lightning   1   73
2           Flames  0       82


Game ID Home Team   Away Team   Home Score  Away Score
1       Spirit      Rockers     81          66
2       Lightning   Flames      73          82

asked Jun 12 '20 03:06

Kulwant

3 Answers

Try this:

Input:

import pandas as pd

raw_df = pd.DataFrame({"GameId": [1, 1, 2, 2],
                       "Team": ["Spirit", "Rockets", "Lighting", "Flames"],
                       "Home": [1, 0, 1, 0],
                       "Score": [81, 66, 73, 82]})
print(raw_df)

Output:

   GameId      Team  Home  Score
0       1    Spirit     1     81
1       1   Rockets     0     66
2       2  Lighting     1     73
3       2    Flames     0     82

Input:

# Replace the 1/0 flag with readable labels; they become part of the column names.
raw_df["Home"] = raw_df["Home"].map({1: "Home", 0: "Away"})

# Each (GameId, Home) pair occurs once, so "first" simply picks that single value.
result = raw_df.pivot_table(index="GameId",
                            columns="Home",
                            values=["Team", "Score"],
                            aggfunc="first")

# Order columns as Team before Score and Home before Away, then flatten the MultiIndex.
result = result.sort_index(axis="columns", level=[0, "Home"], ascending=False)
result.columns = [' '.join(reversed(col)) for col in result.columns]
print(result)

Output:

       Home Team Away Team  Home Score  Away Score
GameId                                            
1         Spirit   Rockets          81          66
2       Lighting    Flames          73          82

answered Oct 22 '22 13:10

Xu Qiushi

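import pandas as pd

df = pd.DataFrame({'GameId': [1, 1, 2, 2],
                   'Team': ['Spirit', 'Rockers', 'Lightning', 'Flames'],
                   'Home': [1, 0, 1, 0],
                   'Score': [81, 66, 73, 82]})
# Self-join on GameId: every row is paired with every row of the same game.
merged = pd.merge(df, df, on='GameId')
# Keep only the pairs that combine a home row with an away row.
merged = merged[merged['Home_x'] != merged['Home_y']]
# Keep the first pair per game (home team on the left when the home row comes first).
merged = merged.drop_duplicates(subset=['GameId'])
merged = merged[['GameId', 'Team_x', 'Team_y', 'Score_x', 'Score_y']]
merged.columns = ['GameId', 'Home Team', 'Away Team', 'Home Score', 'Away Score']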

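Explanation: pd.merge() performs a self join on GameId. Rows where both sides have the same Home flag (a team paired with itself) are then removed. Dropping duplicates on GameId keeps one row per game, after which the required columns are selected and renamed. Note that the home team ends up in the left-hand columns only because its row comes first within each game in the input.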
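If the input is not guaranteed to list the home team first, one option is to split the frame by the Home flag before merging. This is only a minimal sketch, assuming each game has exactly one home row and one away row; the names home, away, and wide are illustrative, and the sample data follows the question.

```python
import pandas as pd

df = pd.DataFrame({
    "GameId": [1, 1, 2, 2],
    "Team": ["Spirit", "Rockers", "Lightning", "Flames"],
    "Home": [1, 0, 1, 0],
    "Score": [81, 66, 73, 82],
})

# Select home and away rows separately, so row order in df does not matter.
home = df.loc[df["Home"] == 1, ["GameId", "Team", "Score"]].rename(
    columns={"Team": "Home Team", "Score": "Home Score"}
)
away = df.loc[df["Home"] == 0, ["GameId", "Team", "Score"]].rename(
    columns={"Team": "Away Team", "Score": "Away Score"}
)

# validate="one_to_one" raises if a game has more than one home or away row.
wide = home.merge(away, on="GameId", validate="one_to_one")
wide = wide[["GameId", "Home Team", "Away Team", "Home Score", "Away Score"]]
print(wide)
```

Because the home and away sides are named before the merge, no drop_duplicates step or positional column renaming is needed.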

answered Oct 22 '22 13:10

Mehul Gupta

First use .pivot, then use a list comprehension to rename the columns. Because Home is used as the column key when pivoting, each column label is a tuple such as ('Team', 1). [::-1] reverses each tuple so that, for example, Team Home becomes Home Team when the tuple is joined into a single string.

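# Pivot so each game is one row; column labels become tuples like ('Team', 1).
df = pd.pivot(df, columns='Home', values=['Team','Score'], index='GameId').reset_index()
# Reverse each tuple, label 1/0 as Home/Away, and strip the blank left by GameId's empty second level.
df.columns = [' '.join(str(s).replace('1', 'Home').replace('0', 'Away') for s in col[::-1]).strip() for col in df.columns]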

Output:

    GameId  Away Team   Home Team   Away Score  Home Score
0   1       Rockers     Spirit      66          81
1   2       Flames      Lightning   82          73
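Replacing the characters '1' and '0' would also rewrite those digits if they appeared in any other column label. A possible alternative, sketched here under the assumption that Home only ever holds 1 or 0, is to unpack each (value, flag) tuple and look the flag up in a dictionary. The names side and wide are illustrative, and the sample data follows the question.

```python
import pandas as pd

df = pd.DataFrame({
    "GameId": [1, 1, 2, 2],
    "Team": ["Spirit", "Rockers", "Lightning", "Flames"],
    "Home": [1, 0, 1, 0],
    "Score": [81, 66, 73, 82],
})

wide = df.pivot(index="GameId", columns="Home", values=["Team", "Score"])

# Each column label is a (value, flag) tuple such as ("Team", 1).
side = {1: "Home", 0: "Away"}
wide.columns = [f"{side[flag]} {value}" for value, flag in wide.columns]

# Put the columns in the order requested in the question.
wide = wide[["Home Team", "Away Team", "Home Score", "Away Score"]].reset_index()

# Pivoting text and numbers together yields object columns; restore integer scores.
wide = wide.astype({"Home Score": int, "Away Score": int})
print(wide)
```

The explicit column selection also gives the Home Team, Away Team, Home Score, Away Score order from the question instead of the pivot's default ordering.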

answered Oct 22 '22 15:10

David Erickson