Given a list containing a known pattern surrounded by noise, is there an elegant way to get all items that equal the pattern. See below for my crude code. <pre class="prettyprint"><code>list_with_noise = [7,2,1,2,3,4,2,1,2,3,4,9,9,1,2,3,4,7,4,3,1,2,3,5] known_pattern = [1,2,3,4] res = [] for i in list_with_noise: for j in known_pattern: if i == j: res.append(i) continue print res </code></pre> we would get <code>2, 1, 2, 3, 4, 2, 1, 2, 3, 4, 1, 2, 3, 4, 4, 3</code> bonus: avoid appending i if the full pattern is not present (ie., allow 1,2,3,4 but not 1,2,3) examples: <pre class="prettyprint"><code>find_sublists_in_list([7,2,1,2,3,4,2,1,2,3,4,9,9,1,2,3,4,7,4,3,1,2,3,5],[1,2,3,4]) [1,2,3,4],[1,2,3,4],[1,2,3,4] find_sublists_in_list([7,2,1,2,3,2,1,2,3,6,9,9,1,2,3,4,7,4,3,1,2,6],[1,2,3,4]) [1,2,3],[1,2,3],[1,2,3] </code></pre> The lists contain named tuples.

I know this question is 5 months old and already "accepted", but googling a very similar problem brought me to this question and all the answers seem to have a couple of rather significant problems, plus I'm bored and want to try my hand at a SO answer, so I'm just going to rattle off what I've found. The first part of the question, as I understand it, is pretty trivial: just return the original list with all the elements not in the "pattern" filtered out. Following that thinking, the first code I thought of used the filter() function: <pre class="prettyprint"><code>def subfinder(mylist, pattern): return list(filter(lambda x: x in pattern, mylist)) </code></pre> I would say that this solution is definitely more succinct than the original solution, but it's not any faster, or at least not appreciably, and I try to avoid lambda expressions if there's not a very good reason for using them. In fact, the best solution I could come up with involved a simple list comprehension: <pre class="prettyprint"><code>def subfinder(mylist, pattern): pattern = set(pattern) return [x for x in mylist if x in pattern] </code></pre> This solution is both more elegant and significantly faster than the original: the comprehension is about 120% faster than the original, while casting the pattern into a set first bumps that up to a whopping 320% faster in my tests. Now for the bonus: I'll just jump right into it, my solution is as follows: <pre class="prettyprint"><code>def subfinder(mylist, pattern): matches = [] for i in range(len(mylist)): if mylist[i] == pattern[0] and mylist[i:i+len(pattern)] == pattern: matches.append(pattern) return matches </code></pre> This is a variation of Steven Rumbalski's "inefficient one liner", that, with the addition of the "mylist[i] == pattern[0]" check and thanks to python's short-circuit evaluation, is significantly faster than both the original statement and the itertools version (and every other offered solution as far as I can tell) and it even supports overlapping patterns. So there you go.

elegant find sub-list in list

Tags:

python

list

design-patterns

Given a list containing a known pattern surrounded by noise, is there an elegant way to get all items that equal the pattern. See below for my crude code.

list_with_noise = [7,2,1,2,3,4,2,1,2,3,4,9,9,1,2,3,4,7,4,3,1,2,3,5] known_pattern = [1,2,3,4] res = []   for i in list_with_noise:     for j in known_pattern:         if i == j:             res.append(i)             continue  print res

we would get 2, 1, 2, 3, 4, 2, 1, 2, 3, 4, 1, 2, 3, 4, 4, 3

bonus: avoid appending i if the full pattern is not present (ie., allow 1,2,3,4 but not 1,2,3)

examples:

find_sublists_in_list([7,2,1,2,3,4,2,1,2,3,4,9,9,1,2,3,4,7,4,3,1,2,3,5],[1,2,3,4])  [1,2,3,4],[1,2,3,4],[1,2,3,4]   find_sublists_in_list([7,2,1,2,3,2,1,2,3,6,9,9,1,2,3,4,7,4,3,1,2,6],[1,2,3,4])  [1,2,3],[1,2,3],[1,2,3]

The lists contain named tuples.

926

asked Apr 11 '12 13:04

Django Doctor

1 Answers

I know this question is 5 months old and already "accepted", but googling a very similar problem brought me to this question and all the answers seem to have a couple of rather significant problems, plus I'm bored and want to try my hand at a SO answer, so I'm just going to rattle off what I've found.

The first part of the question, as I understand it, is pretty trivial: just return the original list with all the elements not in the "pattern" filtered out. Following that thinking, the first code I thought of used the filter() function:

def subfinder(mylist, pattern):     return list(filter(lambda x: x in pattern, mylist))

I would say that this solution is definitely more succinct than the original solution, but it's not any faster, or at least not appreciably, and I try to avoid lambda expressions if there's not a very good reason for using them. In fact, the best solution I could come up with involved a simple list comprehension:

def subfinder(mylist, pattern):     pattern = set(pattern)     return [x for x in mylist if x in pattern]

This solution is both more elegant and significantly faster than the original: the comprehension is about 120% faster than the original, while casting the pattern into a set first bumps that up to a whopping 320% faster in my tests.

Now for the bonus: I'll just jump right into it, my solution is as follows:

def subfinder(mylist, pattern):     matches = []     for i in range(len(mylist)):         if mylist[i] == pattern[0] and mylist[i:i+len(pattern)] == pattern:             matches.append(pattern)     return matches

This is a variation of Steven Rumbalski's "inefficient one liner", that, with the addition of the "mylist[i] == pattern[0]" check and thanks to python's short-circuit evaluation, is significantly faster than both the original statement and the itertools version (and every other offered solution as far as I can tell) and it even supports overlapping patterns. So there you go.

answered Oct 02 '22 07:10

mintchkin

Related questions
                            
                                How can I use valgrind with Python C++ extensions?
                            
                                Does Python do slice-by-reference on strings?
                            
                                Removing entries from a dictionary based on values
                            
                                Load CSV to Pandas MultiIndex DataFrame
                            
                                Failed to install package Beautiful Soup. Error Message is "SyntaxError: Missing parentheses in call to 'print'"
                            
                                Using a sparse matrix versus numpy array
                            
                                Why does Python copy NumPy arrays where the length of the dimensions are the same?
                            
                                Does python logging.handlers.RotatingFileHandler allow creation of a group writable log file?
                            
                                CMake output name for dynamic-loaded library?
                            
                                Nested SSH session with Paramiko
                            
                                Django templates: overriding blocks of included children templates through an extended template
                            
                                How to upload a file to S3 without creating a temporary local file
                            
                                Django: Access given field's choices tuple
                            
                                I am getting the error 'redefined-outer-name'
                            
                                Official abbreviation for: import scipy as sp/sc
                            
                                How to use tf.while_loop() in tensorflow
                            
                                Python pandas groupby aggregate on multiple columns, then pivot
                            
                                Django -- User.DoesNotExist does not exist?
                            
                                How to plot 1-d data at given y-value with pylab
                            
                                Pluck in Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With