python reading text file

Tags:

python

I have a text file and need each column read into a dictionary or list. The format is:

N       ID   REMAIN        VERS          
2 2343333   bana           twelve    
3 3549287   moredp       twelve        
3 9383737   hinsila           twelve           
3 8272655   hinsila           eight

I have tried:

crs = open("file.txt", "r")
for columns in ( raw.strip().split() for raw in crs ):  
    print columns[0]

Result = 'Out of index error'

Also tried:

crs = csv.reader(open("file.txt", "r"), delimiter=',', quotechar='|', skipinitialspace=True)
for row in crs:
    for columns in row:
        print columns[3]

Which seems to read each char as a column, instead of each 'word'

I would like to get the four columns, ie:

2
2343333
bana
twelve

into separate dictionaries or lists

Any help is great, thanks!

205

asked Sep 20 '11 12:09

Kilizo

2 Answers

This works fine for me:

>>> crs = open("file.txt", "r")
>>> for columns in ( raw.strip().split() for raw in crs ):  
...     print columns[0]
... 
N
2
3
3
3

If you want to convert columns to rows, use zip.

>>> crs = open("file.txt", "r")
>>> rows = (row.strip().split() for row in crs)
>>> zip(*rows)
[('N', '2', '3', '3', '3'), 
 ('ID', '2343333', '3549287', '9383737', '8272655'), 
 ('REMAIN', 'bana', 'moredp', 'hinsila', 'hinsila'), 
 ('VERS', 'twelve', 'twelve', 'twelve', 'eight')]

If you have blank lines, filter them before using zip.

>>> crs = open("file.txt", "r")
>>> rows = (row.strip().split() for row in crs)
>>> zip(*(row for row in rows if row))
[('N', '2', '3', '3', '3'), 
 ('ID', '2343333', '3549287', '9383737', '8272655'), 
 ('REMAIN', 'bana', 'moredp', 'hinsila', 'hinsila'), 
 ('VERS', 'twelve', 'twelve', 'twelve', 'eight')]
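
For anyone on Python 3, where zip returns an iterator rather than a list, the same transposition could look roughly like this. It is only a sketch: the helper name read_columns and the inline sample text are illustrative assumptions, not part of the answer above.

```python
import io

# Inline stand-in for file.txt (assumed sample, including one blank line).
SAMPLE = """\
N       ID   REMAIN        VERS
2 2343333   bana           twelve

3 3549287   moredp       twelve
"""

def read_columns(lines):
    """Transpose whitespace-separated rows into a list of column tuples."""
    rows = (line.split() for line in lines)
    # A blank line splits to an empty list, so "if row" drops it.
    return list(zip(*(row for row in rows if row)))

columns = read_columns(io.StringIO(SAMPLE))
print(columns[0])  # ('N', '2', '3')
```

With a real file, passing the open file object should work the same way, e.g. read_columns(f) inside a "with open("file.txt") as f:" block. Note that zip stops at the shortest row, so a row with a missing field would silently shorten every column.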

131

answered Oct 12 '22 22:10

senderle

>>> import csv
>>> with open("file.txt") as f:
...    c = csv.reader(f, delimiter=' ', skipinitialspace=True)
...    for line in c:
...        print(line)
... 
['N', 'ID', 'REMAIN', 'VERS', ''] # the '' comes from the trailing spaces at the end of the line
['2', '2343333', 'bana', 'twelve', '']
['3', '3549287', 'moredp', 'twelve', '']
['3', '9383737', 'hinsila', 'twelve', '']
['3', '8272655', 'hinsila', 'eight', '']
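
If the trailing '' is unwanted, one option is to drop empty fields from each row as it is read. The following is a minimal sketch, assuming no column is ever legitimately empty; read_rows and the sample string are hypothetical names introduced for illustration.

```python
import csv
import io

# Two sample lines with trailing spaces, mirroring the layout in the question.
SAMPLE = (
    "N       ID   REMAIN        VERS   \n"
    "2 2343333   bana           twelve    \n"
)

def read_rows(f):
    """Yield the non-empty fields of each row in a space-delimited file object."""
    reader = csv.reader(f, delimiter=' ', skipinitialspace=True)
    for row in reader:
        fields = [field for field in row if field]
        if fields:  # skip rows that were blank
            yield fields

rows = list(read_rows(io.StringIO(SAMPLE)))
print(rows)
```

The filter also removes empty strings that were real data, so it is only safe when every column always has a value.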

Or, the old-fashioned way:

>>> with open("file.txt") as f:
...     [line.split() for line in f]
...
[['N', 'ID', 'REMAIN', 'VERS'],
 ['2', '2343333', 'bana', 'twelve'],
 ['3', '3549287', 'moredp', 'twelve'],
 ['3', '9383737', 'hinsila', 'twelve'],
 ['3', '8272655', 'hinsila', 'eight']]

And for getting column values (here l is the list of rows produced above):

>>> l
[['N', 'ID', 'REMAIN', 'VERS'],
 ['2', '2343333', 'bana', 'twelve'],
 ['3', '3549287', 'moredp', 'twelve'],
 ['3', '9383737', 'hinsila', 'twelve'],
 ['3', '8272655', 'hinsila', 'eight']]
>>> {l[0][i]: [line[i] for line in l[1:]]  for i in range(len(l[0]))}
{'ID': ['2343333', '3549287', '9383737', '8272655'],
 'N': ['2', '3', '3', '3'],
 'REMAIN': ['bana', 'moredp', 'hinsila', 'hinsila'],
 'VERS': ['twelve', 'twelve', 'twelve', 'eight']}
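
The same header-to-column mapping can also be written with zip instead of index arithmetic. This is a sketch under the assumption that the first row is the header and every row has the same number of fields; columns_by_header is a hypothetical helper name.

```python
def columns_by_header(rows):
    """Map each header name to the list of values in that column."""
    header, *data = rows
    # zip(*data) transposes the data rows into columns;
    # pairing them with the header gives (name, column) pairs.
    return {name: list(values) for name, values in zip(header, zip(*data))}

rows = [
    ['N', 'ID', 'REMAIN', 'VERS'],
    ['2', '2343333', 'bana', 'twelve'],
    ['3', '3549287', 'moredp', 'twelve'],
    ['3', '9383737', 'hinsila', 'twelve'],
    ['3', '8272655', 'hinsila', 'eight'],
]

table = columns_by_header(rows)
print(table['REMAIN'])  # ['bana', 'moredp', 'hinsila', 'hinsila']
```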

answered Oct 12 '22 20:10

utdemir
