What is the pythonic way to read CSV file data as rows of namedtuples?

What is the best way to take a data file that contains a header row and read this row into a named tuple so that the data rows can be accessed by header name?

I was attempting something like this:

import csv
from collections import namedtuple

with open('data_file.txt', mode="r") as infile:
    reader = csv.reader(infile)
    Data = namedtuple("Data", ", ".join(i for i in reader[0]))
    next(reader)
    for row in reader:
        data = Data(*row)

The reader object is not subscriptable, so the above code throws a TypeError. What is the pythonic way to read a file header into a namedtuple?

733

asked Jan 25 '12 17:01

2 Answers

Use:

Data = namedtuple("Data", next(reader))

and omit the line:

next(reader)

Combining this with an iterative version based on martineau's comment below, the example becomes, for Python 2:

import csv
from collections import namedtuple
from itertools import imap

with open("data_file.txt", mode="rb") as infile:
    reader = csv.reader(infile)
    Data = namedtuple("Data", next(reader))  # get names from column headers
    for data in imap(Data._make, reader):
        print data.foo
        # ...further processing of a line...

and for Python 3:

import csv
from collections import namedtuple

with open("data_file.txt", newline="") as infile:
    reader = csv.reader(infile)
    Data = namedtuple("Data", next(reader))  # get names from column headers
    for data in map(Data._make, reader):
        print(data.foo)
        # ...further processing of a line...
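
The examples above assume every column header is a valid Python identifier; otherwise namedtuple raises a ValueError. Below is a minimal Python 3 sketch of one way to handle that, using namedtuple's rename=True option, which replaces invalid or duplicate names with positional ones such as _1. The helper name read_rows and the sample data are assumptions made for illustration, with an in-memory string standing in for data_file.txt.

```python
import csv
import io
from collections import namedtuple


def read_rows(infile):
    """Yield each data row of a CSV file object as a namedtuple.

    read_rows is a hypothetical helper name, not part of the csv module.
    """
    reader = csv.reader(infile)
    # rename=True swaps invalid or duplicate field names for _0, _1, ...
    Data = namedtuple("Data", next(reader), rename=True)
    for row in reader:
        yield Data._make(row)


# Made-up sample data standing in for data_file.txt.
sample = io.StringIO("foo,unit price,class\n1,2.50,a\n3,4.75,b\n")
rows = list(read_rows(sample))
print(rows[0]._fields)  # ('foo', '_1', '_2')
print(rows[0].foo, rows[0]._1)  # 1 2.50
```

Here "unit price" contains a space and "class" is a keyword, so both are renamed by position while "foo" is kept. All values are still strings, as with any csv.reader.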

113

answered Oct 26 '22 19:10

Please have a look at csv.DictReader. It reads the column names from the first row, as you're looking for, and then lets you access each column in a row by name through a dictionary.

If for some reason you still need to access the rows as a collections.namedtuple, it should be easy to transform the dictionaries to named tuples as follows:

import collections
import csv

with open('data_file.txt') as infile:
    reader = csv.DictReader(infile)
    # reader.fieldnames reads the header row on first access
    Data = collections.namedtuple('Data', reader.fieldnames)
    tuples = [Data(**row) for row in reader]
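
For comparison, here is a small sketch of using csv.DictReader on its own, without converting to named tuples. The sample data is made up, and an in-memory string stands in for data_file.txt.

```python
import csv
import io

# Made-up sample data standing in for data_file.txt.
sample = io.StringIO("foo,bar\n1,2\n3,4\n")

reader = csv.DictReader(sample)
# Each row maps header names to string values.
foo_values = [row["foo"] for row in reader]
print(reader.fieldnames)  # ['foo', 'bar']
print(foo_values)  # ['1', '3']
```

Dictionary keys can be any header string, whereas the Data(**row) conversion only works when every header is a valid identifier.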

answered Oct 26 '22 19:10

jcollado

Tags: python, csv, namedtuple

drbunsen

Sven Marnach

jcollado
