In my code, <code>df</code> is defined like this <pre class="prettyprint"><code>df = pd.read_excel(io=file_name, sheet_name=sheet, sep='\s*,\s*') </code></pre> I have a <code>[86 rows x 1 columns]</code> dataframe <code>df</code> which looks like this on <code>print(df)</code> <pre class="prettyprint"><code> 0 Male 511 Female 461 Male 273 Female 217 Male 394 Female 337 Female 337 Male 337 ... </code></pre> I wish to write a code that would <code>merge</code> the <code>Male</code> and <code>Female</code> entries like this <pre class="prettyprint"><code> 0 1 2 3 ... Male 511 273 394 337 ... Female 461 217 337 337 ... </code></pre> The final task I need to do is to <code>.sum()</code> the male row and then the female row to get the total of each sex. I am new to python and pandas and I haven't been able to make much progress so far. Any help, tutorial, documentation would be great! Thank you! Edit: By <code>keys</code> I mean the indexes. I hope these labels of Male and Females can be used to 'club' these rows together, but I don't know how to. Edit: I have accomplished my last task directly via <pre class="prettyprint"><code>print(df.ix['Female'].sum()) print(df.ix['Male'].sum()) </code></pre> But I am yet to achieve my forst task. Any ideas?

Create <code>MultiIndex</code> by <code>GroupBy.cumcount</code> for new columns names created by reshaping by <code>unstack</code>: <pre class="prettyprint"><code>df.index = [df.index, df.groupby(level=0).cumcount()] print (df) 0 Male 0 511 Female 0 461 Male 1 273 Female 1 217 Male 2 394 Female 2 337 3 337 Male 3 337 </code></pre> <hr> <pre class="prettyprint"><code>df = df[0].unstack() print (df) 0 1 2 3 Female 461 217 337 337 Male 511 273 394 337 </code></pre> And then <code>sum</code> all rows by <code>axis=1</code>: <pre class="prettyprint"><code>print (df.sum(axis=1)) Female 1352 Male 1515 dtype: int64 </code></pre>

How to sum rows with the same keys?

Tags:

python

sorting

pandas

dataframe

In my code, df is defined like this

df = pd.read_excel(io=file_name, sheet_name=sheet, sep='\s*,\s*')

I have a [86 rows x 1 columns] dataframe df which looks like this on print(df)

          0
Male    511
Female  461
Male    273
Female  217
Male    394
Female  337
Female  337
Male    337
...

I wish to write a code that would merge the Male and Female entries like this

          0   1   2   3 ...
Male    511 273 394 337 ...
Female  461 217 337 337 ...

The final task I need to do is to .sum() the male row and then the female row to get the total of each sex. I am new to python and pandas and I haven't been able to make much progress so far. Any help, tutorial, documentation would be great! Thank you!

Edit: By keys I mean the indexes. I hope these labels of Male and Females can be used to 'club' these rows together, but I don't know how to.

Edit: I have accomplished my last task directly via

print(df.ix['Female'].sum())
print(df.ix['Male'].sum())

But I am yet to achieve my forst task. Any ideas?

562

asked Jun 08 '18 09:06

Vibhu

1 Answers

Create MultiIndex by GroupBy.cumcount for new columns names created by reshaping by unstack:

df.index = [df.index, df.groupby(level=0).cumcount()]

print (df)
            0
Male   0  511
Female 0  461
Male   1  273
Female 1  217
Male   2  394
Female 2  337
       3  337
Male   3  337

df = df[0].unstack()
print (df)
          0    1    2    3
Female  461  217  337  337
Male    511  273  394  337

And then sum all rows by axis=1:

print (df.sum(axis=1))

Female    1352
Male      1515
dtype: int64

177

answered Nov 07 '22 09:11

jezrael

Related questions
                            
                                Why is my decision tree creating a split that doesn't actually divide the samples?
                            
                                Cannot monkey patch module variable in Python unit tests
                            
                                In a Pandas categorical, what is format="table"?
                            
                                Python3.5 Asyncio - Preventing task exception from dumping to stdout?
                            
                                Problems with underscore in the domain name
                            
                                Scoping rules in python
                            
                                Monte Carlo Analysis Python Oil and Gas Volumetrics
                            
                                How to select dataframe columns using string keys when the column names are timestamps?
                            
                                What does the pymodbus "unit" parameter mean?
                            
                                Can you use python sockets for docker container communication?
                            
                                Writing Nested Dictionary to csv
                            
                                Retrieving column names from ref cursor with cx_Oracle
                            
                                Calling Google cloud Vision API's on numpy matrices
                            
                                Is it necessary to open a SFTPClient per one thread in Paramiko with multi-threading?
                            
                                Customize legend and color scale in interactive charts `altair`
                            
                                Convert dtype of a specific column in a numpy array [duplicate]
                            
                                Faster way to iterate all keys and values in redis db
                            
                                Algorithm to calculate 'initial lists' in O(m*log m)
                            
                                how to create upside down bar graphs with shared x-axis with matplotlib / seaborn and a pandas dataframe
                            
                                Does pytest have anything like google test's non-fatal EXPECT_* behavior?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With