I'm starting to use pytest to add unit tests to a piece of software that can analyse different kinds of datasets.
I wrote a set of test functions that I would like to apply to different datasets. One complication is that the datasets are quite big, so I would like to do:
load dataset 1, run the tests on it, release the memory,
load dataset 2, run the tests on it, release the memory,
and so on.
Right now I'm able to test one dataset by using a fixture:
@pytest.fixture(scope="module")
def data():
    return load_dataset1()
and then passing data to each test function. I know that I can pass the params keyword to pytest.fixture, but how can I implement the sequential loading of the different datasets, without holding all of them in RAM at the same time?
By using the @Factory and @DataProvider annotations of TestNG you can execute the same test case multiple times with different data.
Use params as you mentioned:
@pytest.fixture(scope='module', params=[load_dataset1, load_dataset2])
def data(request):
    loader = request.param
    dataset = loader()
    return dataset
Use fixture finalization if you want to do fixture-specific cleanup:
@pytest.fixture(scope='module', params=[load_dataset1, load_dataset2])
def data(request):
    loader = request.param
    dataset = loader()

    def fin():
        # finalize dataset-related resources
        pass

    request.addfinalizer(fin)
    return dataset
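Finalizers registered with request.addfinalizer run when the fixture's scope ends (here, after the last test in the module for each dataset), in reverse registration order. A plain-Python sketch of that semantics, with FakeRequest as an assumed stand-in for pytest's real request object:

```python
class FakeRequest:
    """Minimal stand-in for pytest's request, modelling addfinalizer."""

    def __init__(self):
        self._finalizers = []

    def addfinalizer(self, fin):
        self._finalizers.append(fin)

    def finish(self):
        # pytest calls finalizers when the scope ends, last-in-first-out.
        while self._finalizers:
            self._finalizers.pop()()

events = []
request = FakeRequest()
request.addfinalizer(lambda: events.append("close file"))
request.addfinalizer(lambda: events.append("free dataset"))
request.finish()
# events == ["free dataset", "close file"]: reverse registration order
```

The LIFO order means a finalizer registered after a resource was acquired can safely rely on earlier-acquired resources still being alive when it runs.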