I have this list (python): <pre class="prettyprint"><code>[[item1],[item2],[item3],[/],[item4],[item5],[item6],[/]...] </code></pre> I want to separate these into chunks and the elements that will go into each chunk are the elements before the separator "/". So my chunks would look like: <pre class="prettyprint"><code>chunk1 = [[item1],[item2],[item3]] chunk2 = [[item4],[item5],[item6]] </code></pre> I've tried and tried, nothing efficient came to mind. Tried looping through it with a for and and if element[x] == '/' then get some positions. It's very dirty and doesn't properly work. Any help would be appreciated.

This can also be done as (assuming empty chunks are not desired and l is the list to be "chunked"): <pre class="prettyprint"><code>chunks, last_chunk = [], [] for x in l: if x == '/': if last_chunk: chunks.append(last_chunk) last_chunk = [] else: last_chunk.append(x) if last_chunk: chunks.append(last_chunk) </code></pre>

The usual approach for collecting contiguous chunks is to use <code>itertools.groupby</code>, for example: <pre class="prettyprint"><code>>>> from itertools import groupby >>> blist = ['item1', 'item2', 'item3', '/', 'item4', 'item5', 'item6', '/'] >>> chunks = (list(g) for k,g in groupby(blist, key=lambda x: x != '/') if k) >>> for chunk in chunks: ... print(chunk) ... ['item1', 'item2', 'item3'] ['item4', 'item5', 'item6'] </code></pre> (Your representation of your list <code>[item1],[item2],[item3],[/],</code> makes it look like each of your elements in the list is actually a list, in which case the same approach will work, you simply need to compare against <code>['/']</code> or whatever your separator is.)

Split a list into chunks determined by a separator

Tags:

python

split

I have this list (python):

[[item1],[item2],[item3],[/],[item4],[item5],[item6],[/]...]

I want to separate these into chunks and the elements that will go into each chunk are the elements before the separator "/".

So my chunks would look like:

chunk1 = [[item1],[item2],[item3]]
chunk2 = [[item4],[item5],[item6]]

I've tried and tried, nothing efficient came to mind. Tried looping through it with a for and and if element[x] == '/' then get some positions. It's very dirty and doesn't properly work.

Any help would be appreciated.

887

asked Jun 14 '15 02:06

Benjamin

3 Answers

I wrote something simpler for you to understand - Basically look out for '/', if it's not there keep appending to chunks. itertools.groupby would be worth learning, but something simpler that one understands first is a good idea to start with.

l = ['i1', 'i2', 'i3', '/', 'i4', 'i5', 'i6', '/']

chunks = []
x = 0
chunks.append([])   # create an empty chunk to which we'd append in the loop
for i in l:
    if i != '/':
        chunks[x].append(i)
    else:
        x += 1
        chunks.append([])

print chunks

If your elements are strings, there's a faster way to do what I have done in python - basically - first create a ' ' (space) separated string and then, first split by '/' and then by ' ' again.

l = ['i1', 'i2', 'i3', '/', 'i4', 'i5', 'i6', '/']

s = " ".join(l)  # first create a string, joining by a <space> it could be anything

chunks2 = [x.split() for x in s.split("/")]
print chunks2

151

answered Sep 21 '22 07:09

gabhijit

This can also be done as (assuming empty chunks are not desired and l is the list to be "chunked"):

chunks, last_chunk = [], []
for x in l:
    if x == '/':
         if last_chunk:
             chunks.append(last_chunk)
             last_chunk = []
    else:
         last_chunk.append(x)
if last_chunk:
    chunks.append(last_chunk)

answered Sep 21 '22 07:09

dcg

The usual approach for collecting contiguous chunks is to use itertools.groupby, for example:

>>> from itertools import groupby
>>> blist = ['item1', 'item2', 'item3', '/', 'item4', 'item5', 'item6', '/']
>>> chunks = (list(g) for k,g in groupby(blist, key=lambda x: x != '/') if k)
>>> for chunk in chunks:
...     print(chunk)
...     
['item1', 'item2', 'item3']
['item4', 'item5', 'item6']

(Your representation of your list [item1],[item2],[item3],[/], makes it look like each of your elements in the list is actually a list, in which case the same approach will work, you simply need to compare against ['/'] or whatever your separator is.)

answered Sep 20 '22 07:09

DSM

Related questions
                            
                                relation "account_emailaddress" does not exist - django error
                            
                                Use numpy to multiply a matrix across an array of points?
                            
                                How to reverse a priority queue in Python without using classes?
                            
                                Celery tasks not throwing exception in Django Tests
                            
                                How to create mosaic plot from Pandas dataframe with Statsmodels library?
                            
                                How to get tweets of a particular hashtag in a location in a tweepy?
                            
                                Unexpected result -- numpy fromfunction with constant functions
                            
                                Is there a JavaScript equivalent to Python's for loops?
                            
                                python-requests - user-agent is being overriden
                            
                                Face pattern for boxes in boxplots
                            
                                How to run a function periodically with Flask and Celery?
                            
                                Python 3.4.3 subprocess.Popen get output of command without piping?
                            
                                pip doesn't see setuptools
                            
                                How to point LLVM_CONFIG environment variable to the path for llvm-config
                            
                                Store Numpy array index in variable
                            
                                Draw a curve connecting two points instead of a straight line
                            
                                Building Dynamic HTML Email Content with Python
                            
                                Connecting to MySQL database via SSH
                            
                                How can I strip namespaces out of an lxml tree?
                            
                                Can't use a string pattern on a bytes-like object - python's re error

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With