I am relatively new to python 2.7, and can't figure out the following despite searching extensively on StackOverflow: I have a <code>list</code> of <code>dict</code>s I wan't to combine, when key is the same, and add specific values (in the example <code>'price'</code>). Input: <pre class="prettyprint"><code>[{'id1': 'a', 'price': '2', 'color': 'green'}, {'id1': 'b', 'price': '5', 'color': 'red'}, {'id1': 'a', 'price': '2', 'color': 'green'}] </code></pre> Expected: <pre class="prettyprint"><code>[{'id1': 'a', 'price': '4', 'color': 'green'}, {'id1': 'b', 'price': '5', 'color': 'red'}] </code></pre>

Same idea as your question before the edit. <pre class="prettyprint"><code>>>> data = [{'id1': 'a', 'price': '2', 'color': 'green'}, ... {'id1': 'b', 'price': '5', 'color': 'red'}, ... {'id1': 'a', 'price': '2', 'color': 'green'}] </code></pre> Construct a temporary dictionary and accumulate values in it <pre class="prettyprint"><code>>>> temp = {} >>> for d in data: ... if d['id1'] not in temp: ... temp[d['id1']] = {} ... temp_d = temp[d['id1']] ... temp_d['price'] = temp_d.get('price', 0) + int(d['price']) ... temp_d.setdefault('colors', set()).add(d['color']) ... >>> temp {'a': {'colors': {'green'}, 'price': 4}, 'b': {'colors': {'red'}, 'price': 5}} </code></pre> Then using list comprehension and dictionary comprehension, reconstruct the list of dictionaries. <pre class="prettyprint"><code>>>> [{'id1': k, 'price': v['price'], 'colors': v['colors']} for k, v in temp.items()] [{'id1': 'a', 'colors': {'green'}, 'price': 4}, {'id1': 'b', 'colors': {'red'}, 'price': 5}] </code></pre> <hr> <pre class="prettyprint"><code>>>> data = [{'id1': 'a', 'price': '2'}, {'id1': 'b', 'price': '5'}, ... {'id1': 'a', 'price': '2'}] </code></pre> Create a temporary dictionary where we can accummulate the sum of prices against their ids, <pre class="prettyprint"><code>>>> temp = {} >>> for d in data: ... temp[d['id1']] = temp.get(d['id1'], 0) + int(d['price']) ... >>> temp {'a': 4, 'b': 5} </code></pre> Here we try to get the value of <code>d['id1']</code> from <code>temp</code> and if it is not found, 0 will be returned. We then add the <code>price</code> from the current dictionary and store the result back in the <code>temp</code> against the current id1. Then reconstruct the list of dictionaries, with list comprehension and dictionary comprehension, like this <pre class="prettyprint"><code>>>> [{'id1': k, 'price': temp[k]} for k in temp] [{'price': 4, 'id1': 'a'}, {'price': 5, 'id1': 'b'}] </code></pre>

Merge values of same key, in list of dicts

Tags:

python

dictionary

list

python-2.7

I am relatively new to python 2.7, and can't figure out the following despite searching extensively on StackOverflow:

I have a list of dicts I wan't to combine, when key is the same, and add specific values (in the example 'price').

Input:

[{'id1': 'a', 'price': '2', 'color': 'green'}, {'id1': 'b', 'price': '5', 'color': 'red'}, {'id1': 'a', 'price': '2', 'color': 'green'}]

Expected:

[{'id1': 'a', 'price': '4', 'color': 'green'}, {'id1': 'b', 'price': '5', 'color': 'red'}]

955

asked Jun 10 '15 12:06

DauleDK

2 Answers

Same idea as your question before the edit.

>>> data = [{'id1': 'a', 'price': '2', 'color': 'green'},
...         {'id1': 'b', 'price': '5', 'color': 'red'},
...         {'id1': 'a', 'price': '2', 'color': 'green'}]

Construct a temporary dictionary and accumulate values in it

>>> temp = {}
>>> for d in data:
...     if d['id1'] not in temp:
...         temp[d['id1']] = {}
...     temp_d = temp[d['id1']]
...     temp_d['price'] = temp_d.get('price', 0) + int(d['price'])
...     temp_d.setdefault('colors', set()).add(d['color'])
... 
>>> temp
{'a': {'colors': {'green'}, 'price': 4}, 'b': {'colors': {'red'}, 'price': 5}}

Then using list comprehension and dictionary comprehension, reconstruct the list of dictionaries.

>>> [{'id1': k, 'price': v['price'], 'colors': v['colors']} for k, v in temp.items()]
[{'id1': 'a', 'colors': {'green'}, 'price': 4}, {'id1': 'b', 'colors': {'red'}, 'price': 5}]

>>> data = [{'id1': 'a', 'price': '2'}, {'id1': 'b', 'price': '5'},
...         {'id1': 'a', 'price': '2'}]

Create a temporary dictionary where we can accummulate the sum of prices against their ids,

>>> temp = {}
>>> for d in data:
...     temp[d['id1']] = temp.get(d['id1'], 0) + int(d['price'])
... 
>>> temp
{'a': 4, 'b': 5}

Here we try to get the value of d['id1'] from temp and if it is not found, 0 will be returned. We then add the price from the current dictionary and store the result back in the temp against the current id1.

Then reconstruct the list of dictionaries, with list comprehension and dictionary comprehension, like this

>>> [{'id1': k, 'price': temp[k]} for k in temp]
[{'price': 4, 'id1': 'a'}, {'price': 5, 'id1': 'b'}]

answered Oct 15 '22 06:10

thefourtheye

I've managed to compact the code like this:

import itertools as it
from operator import itemgetter

grupos = it.groupby(sorted(data, key=itemgetter('id1')), key=itemgetter('id1'))
res = [{'id1': v, 'price': sum(int(dicc['price']) for dicc in diccs) } for v, diccs in 
grupos]
print(res)

the output is:

[{'id1': 'a', 'price': 4}, {'id1': 'b', 'price': 25}, {'id1': 'c', 'price': 2}, {'id1': 'd', 'price': 1}, {'id1': 'e', 'price': 20}]

answered Oct 15 '22 08:10

Lopabe

Related questions
                            
                                OOP - organising big classes [closed]
                            
                                How can I get Sphinx autosummary to display the docs for an instance attributes?
                            
                                Python ThreadPool from multiprocessing.pool cannot ultilize all CPUs
                            
                                not getting all cookie info using python requests module
                            
                                scitkit-learn query data dimension must match training data dimension
                            
                                BeautifulSoup scraping nested tables
                            
                                Flask deployement on lighttpd and raspberry pi
                            
                                Why does Rexster Server (and Titan) stop responding?
                            
                                Python & Matplot: How can I draw a simple shape by points?
                            
                                Fastest way to shift a Numpy array
                            
                                pandas .to_sql timing out with RDS
                            
                                String from input is limited?
                            
                                Python Requests - Auth Token
                            
                                How to do a simple Pika SelectConnection to send a message, in python?
                            
                                Getting the parameter names of scipy.stats distributions
                            
                                PHP's array_slice vs Python's splitting arrays
                            
                                What is the legality of scraping youtube data? [closed]
                            
                                Python decorate methods with variable number of positional args and optional arg
                            
                                How to hide the python console window in Pyinstaller
                            
                                OpenCV Opening/Closing shifts the positions of the pixels

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With