How to split a string at line breaks in python?

Question

I want to copy some tabular data from Excel into a python array. That is, user willselect a range in an Excel table, press "Copy" (CTRL+C) so that the range will be copied to clipboard. Then I will get this clipboard data into a python array (list). I use win32clipboard from pywin32 to get clipboard data into an array:

import win32clipboard

def getClip():
    win32clipboard.OpenClipboard()
    data = win32clipboard.GetClipboardData()
    win32clipboard.CloseClipboard()
    return data

I copy the following range A1:B5 from Excel:

enter image description here

When I use the function above, I get a string like:

How to split this string into a list, so that the list will look like:

[(365,179), (96, -90), (48, -138), (12, -174), (30, -156)]

I use split method, but it doesn't give me what I want.

data.split("
")

['365	179
', '96	-90
', '48	-138
', '12	-174
', '30	-156
', '']

poke · Accepted Answer

There’s actually a str.splitlines method which will split the string by line breaks, regardless of which line breaks are used. So this works on Unix systems with just an , on Windows with and even on old Mac systems where the line break was just an .

>>> s = '365	179
96	-90
48	-138
12	-174
30	-156
'
>>> s.splitlines()
['365	179', '96	-90', '48	-138', '12	-174', '30	-156']

Once you have this result, you can split by tabs to get the individual cells. So you essentially have to call cell.split(' ') on each cell. This is best done with a list comprehension:

>>> [row.split('	') for row in s.splitlines()]
[['365', '179'], ['96', '-90'], ['48', '-138'], ['12', '-174'], ['30', '-156']]

As an alternative, you could also use map to apply the splitting operation on each cell:

>>> list(map(lambda cell: cell.split('	'), s.splitlines()))
[['365', '179'], ['96', '-90'], ['48', '-138'], ['12', '-174'], ['30', '-156']]

As the copied data in the clipboard will always have the rows separated by newlines, and the columns separated by tabs, this solution is also safe to use for any range of cells you copied.

If you further want to convert integers or float to its correct datatypes in Python, I guess you could add some more conversion logic by calling int() on all cells that only have digits in them, float() on all cells that have digits and the dot in them ., leaving the rest as strings:

>>> def convert (cell):
        try:
            return int(cell)
        except ValueError:
            try:
                return float(cell)
            except ValueError:
                return cell
>>> [tuple(map(convert, row.split('	'))) for row in s.splitlines()]
[(365, 179), (96, -90), (48, -138), (12, -174), (30, -156)]

For a different string:

>>> s = 'Foo	bar
123.45	42
-85	3.14'
>>> [tuple(map(convert, row.split('	'))) for row in s.splitlines()]
[('Foo', 'bar'), (123.45, 42), (-85, 3.14)]

Ashwini Chaudhary · Answer

>>> s = '365	179
96	-90
48	-138
12	-174
30	-156
'
>>> [map(int, x.split('	')) for x in s.rstrip().split('
')]
[[365, 179], [96, -90], [48, -138], [12, -174], [30, -156]]

Using the code from my other answer, you can also handle other types as well:

from ast import literal_eval
def solve(x):
    try:
        return literal_eval(x)
    except (ValueError, SyntaxError):
        return x

s = '365	Foo
Bar	-90.01
48	spam
12e10	-174
30	-156
'
print [map(solve, x.split('	')) for x in s.rstrip().split('
')]
#[[365, 'Foo'], ['Bar', -90.01], [48, 'spam'], [120000000000.0, -174], [30, -156]]

How to split a string at line breaks in python?

Tags:

python

arrays

list

clipboard

pywin32

alwbtc

2 Answers

poke

Ashwini Chaudhary

Recent Activity

Donate For Us

How to split a string at line breaks in python?

Tags:

python

arrays

list

clipboard

pywin32

alwbtc

2 Answers

poke

Ashwini Chaudhary

Related questions

Recent Activity

Donate For Us