python remove all whitespace from entries in a list

Question

While calling readlines() on a .srt file, I got a list of strings with lots of leading and trailing whitespace, like below:

def read_lines(infile):
    with open(infile) as f:
        r = f.readlines()
    return r

I got this list

['\xef\xbb\xbf1\n', '00:00:00,000 --> 00:00:03,000\n', "[D. Evans] Now that you've written your first Python program,\n", '\n', '2\n', '00:00:03,000 --> 00:00:06,000\n', 'you might be wondering why we need to invent new languages like Python\n', '\n']

I have included only a few elements for brevity. How do I clean this list so that all the surrounding whitespace is removed and I get only the relevant elements, like this:

 ['1','00:00:00,000 --> 00:00:03,000',"[D. Evans] Now that you've written your first Python program"...]

Jordan · Accepted Answer

You can strip each line. Running it as a generator could also save you some memory if you're working on a big file.

Also, it looks like you're working with a UTF-8 file that starts with a BOM (the '\xef\xbb\xbf' at the start of the first entry), which is sort of silly, or at least unnecessary, so you need to open it with a BOM-aware codec.

import codecs

def strip_it_good(file):
    with codecs.open(file, "r", "utf-8-sig") as f:
        for line in f:
            yield line.strip()
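
If you also want to drop the empty entries that come from the blank lines between subtitle blocks, something along these lines might work. This is only a sketch: the names stripped_lines and keep_blank are made up for illustration, the sample bytes are invented test data, and skipping blank lines is my assumption about what you want rather than something the question states.

```python
import io
import os
import tempfile


def stripped_lines(path, keep_blank=False):
    """Yield each line of a UTF-8 file with surrounding whitespace removed."""
    # "utf-8-sig" decodes UTF-8 and drops a leading BOM if one is present.
    with io.open(path, "r", encoding="utf-8-sig") as f:
        for line in f:
            line = line.strip()
            # Skip lines that are empty after stripping unless asked to keep them.
            if line or keep_blank:
                yield line


# Build a tiny sample file that starts with a BOM, similar to the .srt in the question.
sample = b"\xef\xbb\xbf1\r\n00:00:00,000 --> 00:00:03,000\r\nfirst caption \r\n\r\n2\r\n"
fd, sample_path = tempfile.mkstemp(suffix=".srt")
with os.fdopen(fd, "wb") as tmp:
    tmp.write(sample)

cleaned = list(stripped_lines(sample_path))
os.remove(sample_path)
print(cleaned)
```

If the lines are already in a list r, as in the question, a list comprehension such as [s.strip() for s in r if s.strip()] gives the same kind of result, although it will not remove the BOM from the first entry.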

Tags:

python

string

list

damon
