I have a defaultdict that looks like this: <pre class="prettyprint"><code>my_dict = defaultdict(dict) </code></pre> which will print out: <pre class="prettyprint"><code>defaultdict(<class 'dict'>, {}) </code></pre> I also have two lists, which look like this: <pre class="prettyprint"><code>list1 = ["W", "IY", "W"] list2 = ["w", "ee", "w"] </code></pre> I would like to create a default dict which looks like this: <pre class="prettyprint"><code>defaultdict(<class 'dict'>, {'W': {'w': 2}, 'IY': {'ee': 1}} </code></pre> which has list1 within a dictionary as keys, with the keys as the next list with a separate dictionary, counting the instances of list2 as values. So far I have this: <pre class="prettyprint"><code>from collections import defaultdict d = defaultdict(dict) list1 = ["W", "IY", "W"] list2 = ["w", "ee", "w"] for char in list1: d[char] += 1 </code></pre> I know that this is not correct, as the defaultdict(dict) cannot be treated this way. Is there a way a I could do this? Any help would be greatly appreciated :)

Here is a solution using <code>collections.Counter</code>. <pre class="prettyprint"><code>import collections d = collections.defaultdict(collections.Counter) list1 = ["O", "TH", "O", "O"] list2 = ["o", "th", "o", "o1"] for key, value in zip(list1, list2): d[key].update([value]) >>> d defaultdict(<class 'collections.Counter'>, {'TH': Counter({'th': 1}), 'O': Counter({'o': 2, 'o1': 1})}) >>> </code></pre> While this doesn't strictly follow your requirements, <code>collections.Counter</code> inherits from <code>dict</code> so it has all of <code>dict</code>'s attributes

EDITED based on the comment on my original answer. You'll need a mapping of all possible phonemes to all possible spellings (graphemes). <pre class="prettyprint"><code>phonemes = {TH : [th], O : [o], OH : [oh, oo]} for char in set(list1): if char not in d: d[char] = {char.lower() : {phone : list2.count(phone) for phone in phonemes[char]}} </code></pre>

Adding keys to defaultdict(dict)

Tags:

python

dictionary

I have a defaultdict that looks like this:

my_dict = defaultdict(dict)

which will print out:

defaultdict(<class 'dict'>, {})

I also have two lists, which look like this:

list1 =  ["W", "IY", "W"]
list2 =  ["w", "ee", "w"]

I would like to create a default dict which looks like this:

defaultdict(<class 'dict'>, {'W': {'w': 2}, 'IY': {'ee': 1}}

which has list1 within a dictionary as keys, with the keys as the next list with a separate dictionary, counting the instances of list2 as values.

So far I have this:

from collections import defaultdict

d = defaultdict(dict)

list1 = ["W", "IY", "W"]
list2 = ["w", "ee", "w"]

for char in list1:
    d[char] += 1

I know that this is not correct, as the defaultdict(dict) cannot be treated this way. Is there a way a I could do this? Any help would be greatly appreciated :)

550

asked Apr 29 '16 02:04

RoadRunner

3 Answers

Here is a solution using collections.Counter.

import collections
d = collections.defaultdict(collections.Counter)

list1 = ["O", "TH", "O", "O"]
list2 = ["o", "th", "o", "o1"]

for key, value in zip(list1, list2):
    d[key].update([value])

>>> d
defaultdict(<class 'collections.Counter'>, {'TH': Counter({'th': 1}), 'O': Counter({'o': 2, 'o1': 1})})
>>>

While this doesn't strictly follow your requirements, collections.Counter inherits from dict so it has all of dict's attributes

118

answered Oct 17 '22 18:10

wwii

You can also use a nested defaultdict and zip like so:

d = defaultdict(lambda: defaultdict(int))
for k, v in zip(list1, list2):
    d[k][v] += 1
# d['TH']['th']: 1
# d['O']['o']: 2

or, if you want to keep your data structure:

d = defaultdict(dict)
for k, v in zip(list1, list2):
    d[k][v] = d[k].get(v, 0) + 1  
    # use dict.get(key, default=None) and specify an appropriate default value (0)

Using dict.get(key, default=None) allows you to access key-values of a common dict much like those a defaultdict, however, updating is a little more clunky.

answered Oct 17 '22 20:10

user2390182

EDITED based on the comment on my original answer.

You'll need a mapping of all possible phonemes to all possible spellings (graphemes).

phonemes = {TH : [th], O : [o], OH : [oh, oo]}

for char in set(list1):
    if char not in d:
        d[char] = {char.lower() : {phone : list2.count(phone) for phone in phonemes[char]}}

answered Oct 17 '22 18:10

aberger

Related questions
                            
                                How to extract schema for avro file in python
                            
                                Counting relationships in SQLAlchemy
                            
                                How to Find Documents That are in the same Cluster with KMeans
                            
                                name 'get_config' is not defined
                            
                                how to close pandas dataframe plot
                            
                                Pylint warning: Possible unbalanced tuple unpacking with sequence
                            
                                How do chained comparisons in Python actually work?
                            
                                Why use re.match(), when re.search() can do the same thing?
                            
                                Get row numbers of rows matching a condition in numpy
                            
                                Python win32gui SetAsForegroundWindow function not working properly
                            
                                How to programmatically count the number of files in an archive using python
                            
                                Data type of pandas column changes to object when it's passed to a function via apply?
                            
                                How to select a list of rows by name in Pandas dataframe
                            
                                How to correctly use auto_created attribute in django?
                            
                                Is there a chain calling method in Python?
                            
                                Python multiprocessing - Why is using functools.partial slower than default arguments?
                            
                                Equivalent to get_contents_to_file in boto3
                            
                                Python Pandas: pivot only certain columns in the DataFrame while keeping others
                            
                                Python send control + Q then control + A (special keys)
                            
                                How to test a Django model with pytest?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With