I have a class that has a few fields and is initialized from an iterable (such as a row from csv.reader). The __init__ performs type conversion from strings to numbers for some of them:
class PerformanceTestResult(object):
    def __init__(self, csv_row):
        # csv_row[0] is just an ordinal number of the test - skip that
        self.name = csv_row[1]          # Name of the performance test
        self.samples = int(csv_row[2])  # Number of measurement samples taken
        self.min = int(csv_row[3])      # Minimum runtime (ms)
        self.max = int(csv_row[4])      # Maximum runtime (ms)
        self.mean = int(csv_row[5])     # Mean (average) runtime (ms)
        self.sd = float(csv_row[6])     # Standard deviation (ms)
I'm thinking about converting it to just a namedtuple, as there is not much else to it. But I would like to maintain the type conversion during initialization. Is there a way to do this with namedtuple? (I haven't noticed an __init__ method in the verbose output from the namedtuple factory method, which gives me pause about how the default initializer works.)
The named tuple container datatype is an alternative to the built-in tuple. It enhances standard tuples so that their elements can be accessed by attribute name as well as by positional index. Named tuples are created with the namedtuple factory in Python's standard library collections module.
A named tuple can return its values keyed by field name via the _asdict() method, which returns a dict (an OrderedDict before Python 3.8).
Since a named tuple is a tuple, and tuples are immutable, it is impossible to change the value of a field in place. Instead, the _replace() method returns a new named tuple with the given fields replaced. (Despite the leading underscore, _asdict() and _replace() are part of the documented public API; the underscore only avoids collisions with user-defined field names.)
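As a quick illustration of both methods (using a hypothetical Point tuple, not the one from the question):

```python
from collections import namedtuple

Point = namedtuple('Point', ['x', 'y'])
p = Point(1, 2)

# _asdict() returns the fields as a mapping keyed by field name
# (a plain dict on Python 3.8+, an OrderedDict on earlier versions)
print(p._asdict())   # {'x': 1, 'y': 2}

# _replace() does not mutate p; it returns a new named tuple
q = p._replace(x=10)
print(q)             # Point(x=10, y=2)
print(p)             # Point(x=1, y=2) - the original is unchanged
```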
Instead of passing in the csv_row as-is, like you currently do, you could unpack it with the unpacking operator *. For example:
>>> def f(a, b):
...     return a + b
...
>>> csv_row = [1, 2]
>>> f(*csv_row)  # instead of your current f(csv_row)
3
This also works with namedtuple, since the order of the arguments is preserved upon unpacking:
>>> from collections import namedtuple
>>> PerformanceTestResult = namedtuple('PerformanceTestResult', [
... 'name',
... 'samples',
... 'min',
... 'max',
... 'mean',
... 'sd',
... ])
>>> test_row = ['test', '123', 2, 5, 3, None] # from your csv file
>>> ptr = PerformanceTestResult(*test_row)
>>> ptr
PerformanceTestResult(name='test', samples='123', min=2, max=5, mean=3, sd=None)
Not only does this allow you to use namedtuple, which seems like a really good idea here, but it also removes the need for PerformanceTestResult to know anything about the CSV file! Abstraction is good: now you can use this same class regardless of where the data comes from and in what format.
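For instance (a sketch; the dict and list sources below are invented just to show that the class itself is source-agnostic):

```python
from collections import namedtuple

PerformanceTestResult = namedtuple('PerformanceTestResult', [
    'name', 'samples', 'min', 'max', 'mean', 'sd'])

# The same class now accepts data from any source, not just CSV rows:
from_list = PerformanceTestResult(*['test', 123, 2, 5, 3, 0.5])
from_dict = PerformanceTestResult(**{'name': 'test', 'samples': 123,
                                     'min': 2, 'max': 5, 'mean': 3, 'sd': 0.5})
print(from_list == from_dict)  # True
```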
If you need the int() and float() conversions, you'll have to write a separate conversion function. You can either build it into PerformanceTestResult by subclassing:
_PerformanceTestResult = namedtuple('PerformanceTestResult', [...])

class PerformanceTestResult(_PerformanceTestResult):
    @classmethod
    def from_csv(cls, row):
        # row[0] is the ordinal number of the test - skip it
        return cls(
            row[1],
            int(row[2]),
            int(row[3]),
            int(row[4]),
            int(row[5]),
            float(row[6]),
        )
Which can be used like so:
>>> ptr = PerformanceTestResult.from_csv(your_csv_row)
Or you can create a separate conversion function:
def parse_csv_row(row):
    # row[0] is the ordinal number of the test - skip it
    return (row[1], int(row[2]), ...)
And now use this to convert the row before unpacking:
>>> ptr = PerformanceTestResult(*parse_csv_row(your_csv_row))
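Putting it together, a minimal end-to-end sketch (the sample CSV row below is invented; the column layout is taken from your __init__):

```python
import csv
import io
from collections import namedtuple

PerformanceTestResult = namedtuple('PerformanceTestResult', [
    'name', 'samples', 'min', 'max', 'mean', 'sd'])

def parse_csv_row(row):
    # row[0] is the ordinal number of the test - skip it
    return (row[1], int(row[2]), int(row[3]),
            int(row[4]), int(row[5]), float(row[6]))

csv_data = io.StringIO("1,SortStrings,20,105,120,110,3.5\n")
results = [PerformanceTestResult(*parse_csv_row(row))
           for row in csv.reader(csv_data)]
print(results[0])
# PerformanceTestResult(name='SortStrings', samples=20, min=105, max=120, mean=110, sd=3.5)
```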