How to make values in a list of dictionaries unique?

I have a list of dictionaries in Python, which looks like the following:

d = [{'feature_a':1, 'feature_b':'Jul', 'feature_c':100}, {'feature_a':2, 'feature_b':'Jul', 'feature_c':150}, {'feature_a':1, 'feature_b':'Mar', 'feature_c':110}, ...]

I want to keep each combination of feature_a, feature_b and feature_c unique.

For example, if we have 3 entries with the same feature_a and feature_b, and their feature_c values are 100, 100 and 150, then after the operation only the entries with 100 and 150 should remain.

How can I achieve this?

UPDATE:

OK, thanks for Anand's excellent answer, it works perfectly. However, I have a further question.

Suppose we have a new feature_d and the list looks like:

d = [{'feature_a':1, 'feature_b':'Jul', 'feature_c':100, 'feature_d':'A'}, {'feature_a':2, 'feature_b':'Jul', 'feature_c':150, 'feature_d':'B'}, {'feature_a':1, 'feature_b':'Mar', 'feature_c':110, 'feature_d':'F'}, ...]

and I only want to deduplicate on feature_a, feature_b and feature_c, leaving feature_d out of the comparison. How can I achieve this?

Many thanks.

If the order of the initial d list is not important, you can take the .items() of each dictionary and convert it into a frozenset(), which is hashable as long as every value in the dictionary is hashable. Then put all of those frozensets into a set() or frozenset() to drop the duplicates, and convert each frozenset() back to a dictionary. Example -

uniq_d = list(map(dict, frozenset(frozenset(i.items()) for i in d)))

Sets do not allow duplicate elements, but you end up losing the order of the list. For Python 2.x, the list(...) is not needed, as map() returns a list.

Example/Demo -

>>> import pprint
>>> pprint.pprint(d)
[{'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 100},
 {'feature_a': 2, 'feature_b': 'Jul', 'feature_c': 150},
 {'feature_a': 1, 'feature_b': 'Mar', 'feature_c': 110},
 {'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 100},
 {'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 150}]
>>> uniq_d = list(map(dict, frozenset(frozenset(i.items()) for i in d)))
>>> pprint.pprint(uniq_d)
[{'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 100},
 {'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 150},
 {'feature_a': 1, 'feature_b': 'Mar', 'feature_c': 110},
 {'feature_a': 2, 'feature_b': 'Jul', 'feature_c': 150}]
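
The set-based one-liner above discards the original order. If order matters, one possible variation is to track the frozensets already seen and keep the first occurrence of each. This is only a minimal sketch: the function name unique_dicts is hypothetical (it is not part of the original answer), and it assumes every value in the dictionaries is hashable.

```python
def unique_dicts(dicts):
    """Return the distinct dictionaries from dicts, keeping first-seen order."""
    seen = set()
    result = []
    for item in dicts:
        key = frozenset(item.items())  # hashable snapshot of the whole dict
        if key not in seen:
            seen.add(key)
            result.append(item)
    return result


d = [
    {'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 100},
    {'feature_a': 2, 'feature_b': 'Jul', 'feature_c': 150},
    {'feature_a': 1, 'feature_b': 'Mar', 'feature_c': 110},
    {'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 100},
    {'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 150},
]
uniq_d = unique_dicts(d)
print(len(uniq_d))  # 4
```

Here the fourth dictionary is dropped because it repeats the first one exactly, and the remaining four stay in their original positions.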

For the new requirement -

However, what if I have another feature_d but I only want to dedup on feature_a, _b and _c?

If two entries have the same feature_a, _b and _c, they are considered duplicates, no matter what is in feature_d.

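A simple way to do this is to use a set and a new list: build a key from only the features you care about, append the dictionary to the new list if that key has not been seen yet, and then add the key to the set. The first dictionary with a given key is the one that is kept. Example -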

seen_set = set()
new_d = []
for i in d:
    # Build the key from only the features that define a duplicate.
    key = (i['feature_a'], i['feature_b'], i['feature_c'])
    if key not in seen_set:
        new_d.append(i)  # the first dictionary seen with this key is kept
        seen_set.add(key)

Example/Demo -

>>> d = [{'feature_a':1, 'feature_b':'Jul', 'feature_c':100, 'feature_d':'A'},
...  {'feature_a':2, 'feature_b':'Jul', 'feature_c':150, 'feature_d': 'B'},
...  {'feature_a':1, 'feature_b':'Mar', 'feature_c':110, 'feature_d':'F'},
...  {'feature_a':1, 'feature_b':'Mar', 'feature_c':110, 'feature_d':'G'}]
>>> seen_set = set()
>>> new_d = []
>>> for i in d:
...     if tuple([i['feature_a'],i['feature_b'],i['feature_c']]) not in seen_set:
...         new_d.append(i)
...         seen_set.add(tuple([i['feature_a'],i['feature_b'],i['feature_c']]))
...
>>> pprint.pprint(new_d)
[{'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 100, 'feature_d': 'A'},
 {'feature_a': 2, 'feature_b': 'Jul', 'feature_c': 150, 'feature_d': 'B'},
 {'feature_a': 1, 'feature_b': 'Mar', 'feature_c': 110, 'feature_d': 'F'}]
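
If the set of features to compare may change, the same loop can be wrapped in a small helper that takes the key names as a parameter. The sketch below is an illustration rather than part of the original answer: the name dedupe_by_keys and its parameters are hypothetical, and it assumes every record contains all the listed keys with hashable values.

```python
from operator import itemgetter


def dedupe_by_keys(records, keys):
    """Keep the first record for each distinct combination of the given keys."""
    # itemgetter returns a tuple when two or more keys are given,
    # and the bare value when only one key is given.
    get_key = itemgetter(*keys)
    seen = set()
    result = []
    for record in records:
        key = get_key(record)
        if key not in seen:
            seen.add(key)
            result.append(record)
    return result


d = [
    {'feature_a': 1, 'feature_b': 'Jul', 'feature_c': 100, 'feature_d': 'A'},
    {'feature_a': 2, 'feature_b': 'Jul', 'feature_c': 150, 'feature_d': 'B'},
    {'feature_a': 1, 'feature_b': 'Mar', 'feature_c': 110, 'feature_d': 'F'},
    {'feature_a': 1, 'feature_b': 'Mar', 'feature_c': 110, 'feature_d': 'G'},
]
new_d = dedupe_by_keys(d, ['feature_a', 'feature_b', 'feature_c'])
print([r['feature_d'] for r in new_d])  # ['A', 'B', 'F']
```

As in the loop above, the record with feature_d 'G' is dropped because an earlier record already has the same feature_a, feature_b and feature_c.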

Tags: python, unique

ChangeMyName

1 Answer

Anand S Kumar
