Going through 2 lists with array data

Tags:

This one is causing me a headache, and I am having trouble to find a solution with a for-loop.

Basically, my data looks like this:

short_list = [ [1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12] ]
long_list  = [ [1, 2, 3, 4, 5], [2, 3, 4, 5, 6], [6, 7, 8, 9, 10], [9, 10, 11, 12, 13] ]

I would need to know how many times each number from each row in the short_list appears in each row of the long_list, and the comparison is NOT needed when both list indices are the same, because they come from the same data set.

Example: I need to know the occurrence of each number in [1, 2, 3] in the long_list rows [2, 3, 4, 5, 6], [6, 7, 8, 9, 10] and [9, 10, 11, 12, 13]. And then continue with the next data row in short_list, etc.

571

asked Feb 06 '18 11:02

El Fred

2 Answers

Here's one way to do it. It's straight off the top of my head, so there is probably a much better way to do it.

from collections import defaultdict

short_list = [ [1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12] ]
long_list  = [ [1, 2, 3, 4, 5], [2, 3, 4, 5, 6], [6, 7, 8, 9, 10], [9, 10, 11, 12, 13] ]

occurrences = defaultdict(int)

for i, sl in enumerate(short_list):
    for j, ll in enumerate(long_list):
        if i != j:
            for n in sl:
                occurrences[n] += ll.count(n)

>>> occurrences
defaultdict(<class 'int'>, {1: 0, 2: 1, 3: 1, 4: 1, 5: 1, 6: 1, 7: 0, 8: 0, 9: 1, 10: 1, 11: 0, 12: 0})

Note that enumerate() is used to provide indices while iterating. The indices are compared to ensure that sub-lists at the same relative position are not compared.

The result is a dictionary keyed by items from the short list with the values being the total count of that item in the long list sans the sublist with the same index.

190

answered Oct 30 '22 22:10

mhawke

This is a brute-force solution. I've amended the input data to make the results more interesting:

from collections import Counter
from toolz import concat

short_list = [ [1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12] ]
long_list  = [ [1, 2, 3, 4, 5], [2, 3, 4, 5, 6], [6, 7, 8, 9, 10], [2, 3, 11, 12, 13] ]

for idx, i in enumerate(short_list):
    long_list_filtered = (x for x in concat(long_list[:idx] + long_list[idx+1:]) if x in set(i)))
    print(idx, Counter(long_list_filtered))

# 0 Counter({2: 2, 3: 2})
# 1 Counter({4: 1, 5: 1, 6: 1})
# 2 Counter()
# 3 Counter({10: 1})

answered Oct 30 '22 22:10

jpp

Related questions
                            
                                When pip install 'pyrebase', I got error ' UnicodeDecodeError: 'cp949' codec can't decode byte 0xe2 in position 500: illegal multibyte sequence'
                            
                                setup.py: require a recent version of setuptools before trying to install
                            
                                Is python3.6 new change 'async for' not compatible with enumerate
                            
                                Using matplotlib in pygame
                            
                                Python does not use locale from environment
                            
                                How to get last day of each month in Pandas DataFrame index (using TimeGrouper)
                            
                                Why does *x, unpack map objects in python 3?
                            
                                how can i use pip search with my own nexus pypi repo?
                            
                                Given general 3D plane equation, how can I plot this in python matplotlib?
                            
                                Recursive and random grouping a list
                            
                                Pandas groupby sort within groups retaining multiple aggregates
                            
                                Collapse pandas multiindex to a single index
                            
                                Problems using Airflow v1.9 Python Operator
                            
                                Is there a way to implement a circular waiting indicator using PyQt?
                            
                                How to clear both clipboards securely in Gnome, from Python?
                            
                                Remove string characters from a given found substring until the end in Python
                            
                                How to remove rows of a DataFrame based off of data from another DataFrame?
                            
                                Adding items in the middle of a list in python [duplicate]
                            
                                Replace diagonals of a 2D array with python [duplicate]
                            
                                Python: find_element_by_css_selector

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Going through 2 lists with array data

Tags:

python

arrays

loops

list

El Fred

People also ask

2 Answers

mhawke

jpp

Recent Activity

Donate For Us