In Python, how do I split a string and keep the separators?

>>> import re
>>> re.split(r'(\W)', 'foo/bar spam\neggs')
['foo', '/', 'bar', ' ', 'spam', '\n', 'eggs']
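
The parentheses are what make the difference: when the pattern contains a capturing group, re.split() also returns the text matched by that group. A minimal sketch for comparison (the variable names are only for illustration); note that two separators in a row leave an empty string between them:

```python
import re

text = 'foo/bar spam\neggs'

# Without a capturing group the separators are dropped.
without_group = re.split(r'\W', text)
print(without_group)   # ['foo', 'bar', 'spam', 'eggs']

# With a capturing group the separators are returned as list items.
with_group = re.split(r'(\W)', text)
print(with_group)      # ['foo', '/', 'bar', ' ', 'spam', '\n', 'eggs']

# Two separators in a row leave an empty string between them.
adjacent = re.split(r'(\W)', 'a, b')
print(adjacent)        # ['a', ',', '', ' ', 'b']

# Joining the pieces gives back the original string.
assert ''.join(with_group) == text
```

Because nothing is discarded, joining the result always reproduces the input, which is a handy check when tokenizing.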

If you are splitting on newlines, use splitlines(True), i.e. splitlines(keepends=True).

>>> 'line 1\nline 2\nline without newline'.splitlines(True)
['line 1\n', 'line 2\n', 'line without newline']

(Not a general solution, but adding this here in case someone comes here not realizing this method existed.)

Another example: split on non-alphanumeric characters and keep the separators.

import re
a = "foo,bar@candy*ice%cream"
re.split('([^a-zA-Z0-9])', a)

output:

['foo', ',', 'bar', '@', 'candy', '*', 'ice', '%', 'cream']

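explanation:

re.split('([^a-zA-Z0-9])', a)

()          <- capturing group: the matched separators are kept in the result
[]          <- character class: matches a single character from the set inside
^a-zA-Z0-9  <- negated set: any character except ASCII letters (upper/lower case) and digits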
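
If the separators are a known list of literal strings rather than a character class, the same capturing-group idea can be combined with re.escape(). The following is only a sketch: split_keep is a hypothetical helper name, not part of the standard library, and it assumes the list of separators is non-empty.

```python
import re

def split_keep(text, separators):
    """Split text on any of the literal separators and keep them as items.

    split_keep is a hypothetical helper, not a standard-library function.
    Assumes separators is a non-empty list of non-empty strings.
    """
    # Try longer separators first so that e.g. '==' wins over '='.
    ordered = sorted(separators, key=len, reverse=True)
    # re.escape() makes characters such as '*' or '.' match literally.
    pattern = '(' + '|'.join(re.escape(s) for s in ordered) + ')'
    return re.split(pattern, text)

print(split_keep('foo,bar@candy*ice%cream', [',', '@', '*', '%']))
# ['foo', ',', 'bar', '@', 'candy', '*', 'ice', '%', 'cream']
print(split_keep('a==b=c', ['=', '==']))
# ['a', '==', 'b', '=', 'c']
```

Escaping matters here because '*' would otherwise be read as a regex quantifier rather than a literal character.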

If you have only one separator, you can use list comprehensions:

text = 'foo,bar,baz,qux'
sep = ','

Appending/prepending separator:

result = [x+sep for x in text.split(sep)]
#['foo,', 'bar,', 'baz,', 'qux,']
# to get rid of the trailing separator (slicing also works for multi-character separators)
result[-1] = result[-1][:-len(sep)]
#['foo,', 'bar,', 'baz,', 'qux']

result = [sep+x for x in text.split(sep)]
#[',foo', ',bar', ',baz', ',qux']
# to get rid of the leading separator
result[0] = result[0][len(sep):]
#['foo', ',bar', ',baz', ',qux']

Separator as its own element:

result = [u for x in text.split(sep) for u in (x, sep)]
#['foo', ',', 'bar', ',', 'baz', ',', 'qux', ',']
result = result[:-1]   # to get rid of the trailing separator
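
The cleanup step in each variant above can be avoided by treating the first or last piece separately. A possible sketch, where the three function names are made up for this example and sep is assumed to be a non-empty string (str.split() raises ValueError for an empty separator):

```python
def split_keep_trailing(text, sep):
    """Attach the separator to the end of every piece except the last."""
    parts = text.split(sep)
    return [p + sep for p in parts[:-1]] + [parts[-1]]

def split_keep_leading(text, sep):
    """Attach the separator to the start of every piece except the first."""
    parts = text.split(sep)
    return [parts[0]] + [sep + p for p in parts[1:]]

def split_interleaved(text, sep):
    """Return the pieces with the separator as its own element in between."""
    parts = text.split(sep)
    result = [parts[0]]
    for part in parts[1:]:
        result.extend((sep, part))
    return result

text = 'foo,bar,baz,qux'
print(split_keep_trailing(text, ','))  # ['foo,', 'bar,', 'baz,', 'qux']
print(split_keep_leading(text, ','))   # ['foo', ',bar', ',baz', ',qux']
print(split_interleaved(text, ','))    # ['foo', ',', 'bar', ',', 'baz', ',', 'qux']
```

All three also work with multi-character separators, and joining any of the results with ''.join() gives back the original string.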

Tags:

python

regex
