Python sum list of dicts by key with nested dicts

Tags:

I have a list of dicts and would like to design a function to output a new dict which contains the sum for each unique key across all the dicts in the list.

For the list:

[
    {
         'apples': 1,
         'oranges': 1,
         'grapes': 2
    },
    {
         'apples': 3,
         'oranges': 5,
         'grapes': 8
    },
    {
         'apples': 13,
         'oranges': 21,
         'grapes': 34
    }
]

So far so good, this can be done with a counter:

def sumDicts(listToProcess):
    c = Counter()
    for entry in listToProcess:
        c.update(entry)
    return (dict(c))

Which correctly returns:

{'apples': 17, 'grapes': 44, 'oranges': 27}

The trouble comes when the dicts in my list start to contain nested dicts:

[
    {
        'fruits': {
            'apples': 1,
            'oranges': 1,
            'grapes': 2
            },
        'vegetables': {
            'carrots': 6,
            'beans': 3,
            'peas': 2
        },
        'grains': 4,
        'meats': 1  
    },
    {
        'fruits': {
            'apples': 3,
            'oranges': 5,
            'grapes': 8
            },
        'vegetables': {
            'carrots': 7,
            'beans': 4,
            'peas': 3
        },
        'grains': 3,
        'meats': 2  
    },
    {
        'fruits': {
            'apples': 13,
            'oranges': 21,
            'grapes': 34
            },
        'vegetables': {
            'carrots': 8,
            'beans': 5,
            'peas': 4
        },
        'grains': 2,
        'meats': 3
    },
]

Now the same function will give a TypeError because the counter can't add two Dicts.

The desired result would be:

{
    'fruits': {
        'apples': 17,
        'oranges': 27,
        'grapes': 44
        },
    'vegetables': {
        'carrots': 21,
        'beans': 12,
        'peas': 9
    },
    'grains': 9,
    'meats': 6  
}

Any ideas on how to do this in a reasonably efficient, Pythonic, generalizable way?

Thanks!

996

asked Jan 20 '19 08:01

Ben Franske

1 Answers

I would do this by performing a recursive merge on a recursively defined collections.defaultdict object.

from collections import defaultdict

def merge(d, new_d):
    for k, v in new_d.items():
        if isinstance(v, dict):
            merge(d[k], v)
        else: 
            d[k] = d.setdefault(k, 0) + v

# https://stackoverflow.com/a/19189356/4909087    
nested = lambda: defaultdict(nested)
d = nested()

for subd in data:
    merge(d, subd)

Using default_to_regular to convert it back, we have:

default_to_regular(d)
# {
#     "fruits": {
#         "apples": 17,
#         "oranges": 27,
#         "grapes": 44
#     },
#     "vegetables": {
#         "carrots": 21,
#         "beans": 12,
#         "peas": 9
#     },
#     "grains": 9,
#     "meats": 6
# }

answered Oct 10 '22 03:10

cs95

Related questions
                            
                                Python dataclasses: What type to use if __post_init__ performs type conversion?
                            
                                Error when checking target: expected dense_3 to have shape (2,) but got array with shape (1,)
                            
                                Tensorflow: Tensor to numpy array conversion without running any session
                            
                                Python, speech_recognition tool does not recognize .wav file
                            
                                Pandas/Numpy NaN None comparison
                            
                                How to define a global error handler in gRPC python
                            
                                Creating a list of empty lists in numba
                            
                                Why does the UnboundLocalError occur on the second variable of the flat comprehension?
                            
                                Pytest select tests based on mark.parameterize value?
                            
                                ValueError: The computed initial MA coefficients are not invertible You should induce invertibility
                            
                                How can I plot a heatmap on a sphere given a list of latitudes and longitudes?
                            
                                Configuring Visual Studio Code for remote Python interpreter via SSH
                            
                                The axis argument to unique is not supported for dtype object
                            
                                How to make a tkinter canvas background transparent?
                            
                                Is there a builtin way to define a function that takes either 1 argument or 3?
                            
                                multivariable linearization in python: 'Pow' object has no attribute 'sqrt'
                            
                                How to make Pycharm run all python unit tests recursively from tests folder
                            
                                I can't import tensorflow-gpu
                            
                                Find the current line number of a running python process
                            
                                Airflow : ExternalTaskSensor doesn't trigger the task

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python sum list of dicts by key with nested dicts

Tags:

python

dictionary

recursion

Ben Franske

People also ask

1 Answers

cs95

Recent Activity

Donate For Us