How to sum elements in list of dictionaries if two key values are the same

Tags: python, dictionary, list

I have the following list of dictionaries:

dictionary = [{'Flow': 100, 'Location': 'USA', 'Name': 'A1'},
              {'Flow': 90, 'Location': 'Europe', 'Name': 'B1'},
              {'Flow': 20, 'Location': 'USA', 'Name': 'A1'},
              {'Flow': 70, 'Location': 'Europe', 'Name': 'B1'}]

I want to create a new list of dictionaries, with summed Flow values of all dictionaries where Location and Name are the same. My desired output would be:

new_dictionary = [{'Flow': 120, 'Location': 'USA', 'Name': 'A1'},
                  {'Flow': 160, 'Location': 'Europe', 'Name': 'B1'}]

How can I achieve this?

302

asked Aug 27 '18 05:08

user3200392

1 Answer

This is possible in plain Python, but it takes a little more code. Might I suggest using pandas? There it is a short chain of groupby, sum, and to_dict.

import pandas as pd

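# Group rows that share Location and Name, sum Flow per group,
# then turn each result row back into a dict.
(pd.DataFrame(dictionary)
   .groupby(['Location', 'Name'], as_index=False)
   .Flow.sum()
   .to_dict('records'))  # spell out 'records'; newer pandas rejects the 'r' shorthand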

[{'Flow': 160, 'Location': 'Europe', 'Name': 'B1'},
 {'Flow': 120, 'Location': 'USA', 'Name': 'A1'}]

To install, use pip install --user pandas.

Otherwise, you can apply a pseudo-generic group operation using itertools.groupby.

from itertools import groupby
from operator import itemgetter

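grouper = ['Location', 'Name']
key = itemgetter(*grouper)
# groupby only merges adjacent items, so sort by the same key first
# (note that this sorts the original list in place).
dictionary.sort(key=key)

# Rebuild each group's key fields, then add the summed Flow.
[{**dict(zip(grouper, k)), 'Flow': sum(map(itemgetter('Flow'), g))}
    for k, g in groupby(dictionary, key=key)]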

[{'Flow': 160, 'Location': 'Europe', 'Name': 'B1'},
 {'Flow': 120, 'Location': 'USA', 'Name': 'A1'}]
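
If you would rather avoid both pandas and the sort, a running total keyed on the (Location, Name) pair is another option. The following is only a minimal sketch: the helper name sum_flows and its group_keys / value_key parameters are made up for illustration, and it assumes every dictionary contains all three keys. On Python 3.7+ the groups come out in first-seen order, which happens to match the order in the question.

```python
from collections import defaultdict


def sum_flows(records, group_keys=("Location", "Name"), value_key="Flow"):
    """Sum value_key across records that share the same group_keys values.

    Hypothetical helper for illustration; assumes every record has all keys.
    """
    totals = defaultdict(int)  # (Location, Name) tuple -> running Flow total
    for record in records:
        group = tuple(record[k] for k in group_keys)
        totals[group] += record[value_key]
    # Rebuild one dict per group; dicts keep insertion order on Python 3.7+,
    # so groups appear in the order they were first seen.
    return [{**dict(zip(group_keys, group)), value_key: total}
            for group, total in totals.items()]


dictionary = [{'Flow': 100, 'Location': 'USA', 'Name': 'A1'},
              {'Flow': 90, 'Location': 'Europe', 'Name': 'B1'},
              {'Flow': 20, 'Location': 'USA', 'Name': 'A1'},
              {'Flow': 70, 'Location': 'Europe', 'Name': 'B1'}]

new_dictionary = sum_flows(dictionary)
print(new_dictionary)
# [{'Location': 'USA', 'Name': 'A1', 'Flow': 120},
#  {'Location': 'Europe', 'Name': 'B1', 'Flow': 160}]
```

Unlike the itertools.groupby version, this makes a single pass and leaves the input list untouched, at the cost of holding one total per group in memory.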

185

answered Oct 08 '22 15:10

cs95
