Find number of columns in csv file

Tags:

My program needs to read csv files which may have 1,2 or 3 columns, and it needs to modify its behaviour accordingly. Is there a simple way to check the number of columns without "consuming" a row before the iterator runs? The following code is the most elegant I could manage, but I would prefer to run the check before the for loop starts:

import csv f = 'testfile.csv' d = '\t'  reader = csv.reader(f,delimiter=d) for row in reader:     if reader.line_num == 1: fields = len(row)     if len(row) != fields:         raise CSVError("Number of fields should be %s: %s" % (fields,str(row)))     if fields == 1:         pass     elif fields == 2:         pass     elif fields == 3:         pass     else:         raise CSVError("Too many columns in input file.")

Edit: I should have included more information about my data. If there is only one field, it must contain a name in scientific notation. If there are two fields, the first must contain a name, and the second a linking code. If there are three fields, the additional field contains a flag which specifies whether the name is currently valid. Therefore if any row has 1, 2 or 3 columns, all must have the same.

905

asked Jul 03 '12 11:07

rudivonstaden

2 Answers

You can use itertools.tee

itertools.tee(iterable[, n=2])
Return n independent iterators from a single iterable.

eg.

reader1, reader2 = itertools.tee(csv.reader(f, delimiter=d)) columns = len(next(reader1)) del reader1 for row in reader2:     ...

Note that it's important to delete the reference to reader1 when you are finished with it - otherwise tee will have to store all the rows in memory in case you ever call next(reader1) again

187

answered Sep 22 '22 15:09

John La Rooy

This seems to work as well:

import csv  datafilename = 'testfile.csv' d = '\t' f = open(datafilename,'r')  reader = csv.reader(f,delimiter=d) ncol = len(next(reader)) # Read first line and count columns f.seek(0)              # go back to beginning of file for row in reader:     pass #do stuff

answered Sep 18 '22 15:09

mgilson

Related questions
                            
                                Python - appending to same file from multiple threads
                            
                                Multiple imshow-subplots, each with colorbar
                            
                                How do I generate and open an Outlook email with Python (but do not send)
                            
                                Insert element in Python list after every nth element
                            
                                Airflow - run task regardless of upstream success/fail
                            
                                Python: ufunc 'add' did not contain a loop with signature matching types dtype('S21') dtype('S21') dtype('S21')
                            
                                multiprocess or threading in python?
                            
                                What is a good size (in bytes) for a log file?
                            
                                What are Python metaclasses useful for?
                            
                                Testing Equivalence of xml.etree.ElementTree
                            
                                Uploading large files with Python/Django
                            
                                Why would shutil.copy() raise a permission exception when cp doesn't?
                            
                                install filter on logging level in python using dictConfig
                            
                                Sending messages with Telegram - APIs or CLI?
                            
                                Opening a .ipynb.txt File
                            
                                parametrize and running a single test in pytest
                            
                                How can you test that two dictionaries are equal with pytest in python
                            
                                Why 1//0.01 == 99 in Python?
                            
                                Can I use a class attribute as a default value for an instance method?
                            
                                How to make a list of n numbers in Python and randomly select any number?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With