What's the most Pythonic way to identify consecutive duplicates in a list?

Tags:

I've got a list of integers and I want to be able to identify contiguous blocks of duplicates: that is, I want to produce an order-preserving list of duples where each duples contains (int_in_question, number of occurrences).

For example, if I have a list like:

[0, 0, 0, 3, 3, 2, 5, 2, 6, 6]

I want the result to be:

[(0, 3), (3, 2), (2, 1), (5, 1), (2, 1), (6, 2)]

I have a fairly simple way of doing this with a for-loop, a temp, and a counter:

result_list = [] current = source_list[0] count = 0 for value in source_list:     if value == current:         count += 1     else:         result_list.append((current, count))         current = value         count = 1 result_list.append((current, count))

But I really like python's functional programming idioms, and I'd like to be able to do this with a simple generator expression. However I find it difficult to keep sub-counts when working with generators. I have a feeling a two-step process might get me there, but for now I'm stumped.

Is there a particularly elegant/pythonic way to do this, especially with generators?

865

asked Jun 15 '11 02:06

machine yearning

1 Answers

>>> from itertools import groupby >>> L = [0, 0, 0, 3, 3, 2, 5, 2, 6, 6] >>> grouped_L = [(k, sum(1 for i in g)) for k,g in groupby(L)] >>> # Or (k, len(list(g))), but that creates an intermediate list >>> grouped_L [(0, 3), (3, 2), (2, 1), (5, 1), (2, 1), (6, 2)]

Batteries included, as they say.

Suggestion for using sum and generator expression from JBernardo; see comment.

166

answered Oct 14 '22 07:10

jscs

Related questions
                            
                                HashSet conversion to List
                            
                                Pandas Series of lists to one series
                            
                                Replace individual list elements in Haskell?
                            
                                How do I split a string into a list?
                            
                                How can I make multiple empty lists in python?
                            
                                How can I format a list to print each element on a separate line in python? [duplicate]
                            
                                Why didn't Stream have a toList() method?
                            
                                Is there a more efficient way to replace NULL with NA in a list?
                            
                                Is list[i:j] guaranteed to be an empty list if list[j] precedes list[i]?
                            
                                numpy-equivalent of list.pop?
                            
                                Iterate over pairs in a list (circular fashion) in Python
                            
                                Pandas drop_duplicates method not working on dataframe containing lists
                            
                                Can't modify list elements in a loop [duplicate]
                            
                                Difference in LinkedList, queue vs list
                            
                                Appending to the same list from different processes using multiprocessing
                            
                                Is List<T> thread-safe for reading?
                            
                                Creating an Observable List/Collection
                            
                                List comprehension vs generator expression's weird timeit results?
                            
                                join two lists of dictionaries on a single key
                            
                                Is it better to use List or Collection?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What's the most Pythonic way to identify consecutive duplicates in a list?

Tags:

python

generator

list

duplicates

machine yearning

People also ask

1 Answers

jscs

Recent Activity

Donate For Us