How to retrieve minimum unique values from list?

Tags:

python

I have a list of dictionaries. I want to keep only one result for each unique api, chosen by priority: 0 first, then 1, then 2 (that is, the lowest result wins). How should I approach this?

Data:

[
{'api':'test1', 'result': 0},
{'api':'test2', 'result': 1},
{'api':'test3', 'result': 2},
{'api':'test3', 'result': 0},
{'api':'test3', 'result': 1},
]

Expected output:

[
{'api':'test1', 'result': 0},
{'api':'test2', 'result': 1},
{'api':'test3', 'result': 0},
]

asked Dec 31 '20 09:12

UnKnown

3 Answers

Given your input data, you can do a classic SQL-style group-by and take the minimum result in each group:

from itertools import groupby

# groupby only groups adjacent items, so sort by api first
# (skip this line if your data is already sorted by api)
data = sorted(data, key=lambda x: x['api'])

res = [
    {'api': g, 'result': min(v, key=lambda x: x['result'])['result']} 
    for g, v in groupby(data, lambda x: x['api'])
]

Outputs:

[{'api': 'test1', 'result': 0}, {'api': 'test2', 'result': 1}, {'api': 'test3', 'result': 0}]
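If the dictionaries may carry other keys that should survive, one possible variation is to sort by `(api, result)` so that the first item of each group is already the lowest one, and keep that whole dictionary. This is only a minimal sketch; the function name `min_per_api` is hypothetical and not part of the answer above.

```python
from itertools import groupby
from operator import itemgetter

def min_per_api(rows):
    # Sort by api, then by result, so each group starts with its lowest result.
    ordered = sorted(rows, key=itemgetter('api', 'result'))
    # groupby only merges adjacent items, so it relies on the sort above.
    return [next(group) for _, group in groupby(ordered, key=itemgetter('api'))]

data = [
    {'api': 'test1', 'result': 0},
    {'api': 'test2', 'result': 1},
    {'api': 'test3', 'result': 2},
    {'api': 'test3', 'result': 0},
    {'api': 'test3', 'result': 1},
]

print(min_per_api(data))
```

Unlike the comprehension above, this returns the original dictionaries rather than rebuilding them from `api` and `result` alone.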

answered Oct 27 '22 14:10

Grzegorz Skibinski

You can pass through the list once and keep the lowest-valued item seen so far for each group. This needs only a single pass and no sorting, and it stores just one item per unique key.

def get_min_unique(items, id_key, value_key):
    lowest = {}
    for item in items:
        key = item[id_key]
        # keep the item if its key is new or its value beats the current best
        if key not in lowest or lowest[key][value_key] > item[value_key]:
            lowest[key] = item
    return list(lowest.values())

For example with your own data:

data = [
  {'api':'test1', 'result': 0},
  {'api':'test2', 'result': 1},
  {'api':'test3', 'result': 2},
  {'api':'test3', 'result': 0},
  {'api':'test3', 'result': 1},
]

assert get_min_unique(data, 'api', 'result') == [
  {'api': 'test1', 'result': 0},
  {'api': 'test2', 'result': 1},
  {'api': 'test3', 'result': 0},
]
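Two details of this approach may be worth seeing in action: the comparison is strict, so the first item seen wins a tie, and the results come back in the order each key was first seen (dicts preserve insertion order on Python 3.7+). The sketch below repeats the function so it runs on its own; the `source` field and the `rows` data are hypothetical, added only to tell tied rows apart.

```python
def get_min_unique(items, id_key, value_key):
    lowest = {}
    for item in items:
        key = item[id_key]
        if key not in lowest or lowest[key][value_key] > item[value_key]:
            lowest[key] = item
    return list(lowest.values())

# 'source' is a hypothetical extra field used only to distinguish the rows.
rows = [
    {'api': 'b', 'result': 1, 'source': 'first'},
    {'api': 'a', 'result': 2, 'source': 'first'},
    {'api': 'b', 'result': 1, 'source': 'second'},  # tie: the earlier 'b' is kept
    {'api': 'a', 'result': 0, 'source': 'second'},  # lower: replaces the earlier 'a'
]

picked = get_min_unique(rows, 'api', 'result')
print(picked)
```

Here `b` still comes before `a` in the output, because that is the order in which the keys first appeared in the input.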

answered Oct 27 '22 15:10

Cireo

data = [
    {'api': 'test1', 'result': 0},
    {'api': 'test3', 'result': 2},
    {'api': 'test2', 'result': 1},
    {'api': 'test3', 'result': 1},
    {'api': 'test3', 'result': 0}
]

def find(data):
    step1 = sorted(data, key=lambda k: k['result'])
    print('step1', step1)

    step2 = {}
    for each in step1:
        if each['api'] not in step2:
            step2[each['api']] = each
    print('step2', step2)

    step3 = list(step2.values())
    print('step3', step3)
    print('\n')
    return step3

find(data)

Try this; it prints:

step1 [{'api': 'test1', 'result': 0}, {'api': 'test3', 'result': 0}, {'api': 'test2', 'result': 1}, {'api': 'test3', 'result': 1}, {'api': 'test3', 'result': 2}]
step2 {'test1': {'api': 'test1', 'result': 0}, 'test3': {'api': 'test3', 'result': 0}, 'test2': {'api': 'test2', 'result': 1}}
step3 [{'api': 'test1', 'result': 0}, {'api': 'test3', 'result': 0}, {'api': 'test2', 'result': 1}]

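Sort everything by result first, then keep the first entry seen for each "api", and that is your result. Note that the output is in sorted-by-result order, so test3 appears before test2 here.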
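For reference, the same sort-then-keep-first idea can be written more compactly without the debug prints. This is a minimal sketch, not the answer's exact code: the name `find_min_per_api` is hypothetical, and the final sort by api is an added assumption to reproduce the ordering shown in the question's expected output.

```python
def find_min_per_api(rows):
    best = {}
    # Visit rows from lowest to highest result; setdefault stores a row
    # only if its api has not been seen yet, so the lowest result wins.
    for row in sorted(rows, key=lambda r: r['result']):
        best.setdefault(row['api'], row)
    # Assumption: order the final list by api to match the expected output.
    return sorted(best.values(), key=lambda r: r['api'])

data = [
    {'api': 'test1', 'result': 0},
    {'api': 'test3', 'result': 2},
    {'api': 'test2', 'result': 1},
    {'api': 'test3', 'result': 1},
    {'api': 'test3', 'result': 0},
]

print(find_min_per_api(data))
```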

answered Oct 27 '22 15:10

BananZ
