item frequency in a python list of dictionaries

Tags:

python

dictionary

Ok, so I have a list of dicts:

[{'name': 'johnny', 'surname': 'smith', 'age': 53},
 {'name': 'johnny', 'surname': 'ryan', 'age': 13},
 {'name': 'jakob', 'surname': 'smith', 'age': 27},
 {'name': 'aaron', 'surname': 'specter', 'age': 22},
 {'name': 'max', 'surname': 'headroom', 'age': 108},
]

and I want the 'frequency' of the items within each column. So for this I'd get something like:

{'name': {'johnny': 2, 'jakob': 1, 'aaron': 1, 'max': 1}, 
'surname': {'smith': 2, 'ryan': 1, 'specter': 1, 'headroom': 1}, 
'age': {53:1, 13:1, 27: 1. 22:1, 108:1}}

Any modules out there that can do stuff like this?

955

asked Jun 28 '09 20:06

dochead

2 Answers

collections.defaultdict from the standard library to the rescue:

from collections import defaultdict

LofD = [{'name': 'johnny', 'surname': 'smith', 'age': 53},
 {'name': 'johnny', 'surname': 'ryan', 'age': 13},
 {'name': 'jakob', 'surname': 'smith', 'age': 27},
 {'name': 'aaron', 'surname': 'specter', 'age': 22},
 {'name': 'max', 'surname': 'headroom', 'age': 108},
]

def counters():
  return defaultdict(int)

def freqs(LofD):
  r = defaultdict(counters)
  for d in LofD:
    for k, v in d.items():
      r[k][v] += 1
  return dict((k, dict(v)) for k, v in r.items())

print freqs(LofD)

emits

{'age': {27: 1, 108: 1, 53: 1, 22: 1, 13: 1}, 'surname': {'headroom': 1, 'smith': 2, 'specter': 1, 'ryan': 1}, 'name': {'jakob': 1, 'max': 1, 'aaron': 1, 'johnny': 2}}

as desired (order of keys apart, of course -- it's irrelevant in a dict).

112

answered Sep 17 '22 16:09

Alex Martelli

items = [{'name': 'johnny', 'surname': 'smith', 'age': 53},  {'name': 'johnny', 'surname': 'ryan', 'age': 13},  {'name': 'jakob', 'surname': 'smith', 'age': 27},  {'name': 'aaron', 'surname': 'specter', 'age': 22},  {'name': 'max', 'surname': 'headroom', 'age': 108}]

global_dict = {}

for item in items:
    for key, value in item.items():
        if not global_dict.has_key(key):
            global_dict[key] = {}

        if not global_dict[key].has_key(value):
            global_dict[key][value] = 0

        global_dict[key][value] += 1

print global_dict

Simplest solution and actually tested.

answered Sep 17 '22 16:09

tefozi

Related questions
                            
                                How to filter s3 objects by last modified date with Boto3
                            
                                Create blob container in azure storage if it does not exists
                            
                                "exec: "python": executable file not found in $PATH
                            
                                python, Windows 10: launching an application on a specific virtual desktop environment (work-spaces)
                            
                                Download attachment from mail using python
                            
                                Convert one-hot encoded data-frame columns into one column
                            
                                can't install pip anymore with python 2.7?
                            
                                How to keep the only the top N values in a dataframe
                            
                                Installing scipy and scikit-learn on apple m1
                            
                                best way to iterate through elements of pandas Series
                            
                                Comparison of Python and Perl solutions to Wide Finder challenge
                            
                                AJAX console window with ANSI/VT100 support?
                            
                                Python globals, locals, and UnboundLocalError
                            
                                What is the correct way to backup ZODB blobs?
                            
                                How to externally populate a Django model?
                            
                                How to configure IPython to use gvim on Windows?
                            
                                How to compare value of 2 fields in Django QuerySet?
                            
                                A Python walker that can ignore directories
                            
                                Why am I getting "'ResultSet' has no attribute 'findAll'" using BeautifulSoup in Python?
                            
                                Processing pairs of values from two sequences in Clojure

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With