Get apply's function input dataframe with mocking

Tags:

I have the following functions

def main():
    (
        pd.DataFrame({'a': [1, 2, float('NaN')], 'b': [1.0, 2, 3]})
        .dropna(subset=['a'])
        .assign(
            b=lambda x: x['b'] * 2
        )
        .apply(do_something_with_each_row, axis='columns')
    )

def do_something_with_each_row(one_row):
    # do_something_with_row
    print(one_row)

In my test, I want to look at the dataframe built after all chained operations and check if everything is fine with it before calling do_something_with_each_row. This last function does not return a dataframe (it just iterates over all rows similarly to iterrow).

I tried to mock the apply function like this:

# need pytest-mock and pytest
import pandas as pd


def test_not_working(mocker):
    mocked_apply = mocker.patch.object(pd.Dataframe, 'apply')
    main()

but in this case, I don't get the access to the dataframe which is input to apply to test its content.

I also tried to mock the do_something_with_each_row:

# need pytest-mock and pytest
import pandas as pd


def test_not_working_again(mocker):
    mocked_to_something = mocker.patch('path.to.file.do_something_with_each_row')
    main()

but this time I have all the calls with row arguments but they all have None values.

How could I get the dataframe for which apply function is called and check that it is indeed same as the following:

pd.Dataframe({'a': [1, 2], 'b': [2.0, 4]})

I am working with the 0.24.2 pandas version, an upgrade to pandas 1.0.5 does not change the matter.

I tried search in pandas issues but didn't find anything about this subject.

302

asked Jun 11 '20 11:06

ndclt

1 Answers

If I understood your question correctly this is one of the ways to get the behavior you want:

def test_i_think_this_is_what_you_asked(mocker):
    original_apply = pd.DataFrame.apply
    def mocked_apply(self, *args, **kw):
        assert len(self) == 2 # self is the pd.DataFrame at the time apply is called
        assert self.a[0] == 1
        assert self.a[1] == 3 # this will fail cause the value is 2
        assert self.b[0] == 2.0
        assert self.b[1] == 4.0
        return original_apply(self, *args, **kw)
    mocker.patch.object(pd.DataFrame, 'apply', side_effect=mocked_apply, autospec=True)
    main()

170

answered Oct 18 '22 01:10

Alexander Pivovarov

Related questions
                            
                                Qt.ScrollBarAsNeeded not showing scrollbar when it's actually needed
                            
                                Ttk Theme Settings
                            
                                Seaborn bug? Inconsistent in heatmap plotting
                            
                                Did I reinvent the wheel with this deduplicating function?
                            
                                Split file to chunk
                            
                                Check if page is vertical using PyPDF2?
                            
                                Practical Use of Reversed Set Operators in Python
                            
                                Split queue into train/test set
                            
                                Dynamic database connection Flask-SQLAlchemy
                            
                                Why doesn't Keras need the gradient of a custom loss function?
                            
                                Module can't be found when called from outside
                            
                                How to handle multiple results from a coroutine function?
                            
                                Changing Proxy Settings without Closing the Driver in Selenium/Splinter
                            
                                Pandas: join on partial string match, like Excel VLOOKUP
                            
                                How python can get difference between all pairs of rows under multiple columns
                            
                                How can I get PyCharm to recognize a custom property decorator?
                            
                                Python Splitting a Generator Yield into Two Parts
                            
                                Selenium & Heroku: urllib3.exceptions.ProtocolError: ('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))
                            
                                os.path.basename() is inconsistent and I'm not sure why
                            
                                Tensorboard: AttributeError: 'Model' object has no attribute '_get_distribution_strategy'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Get apply's function input dataframe with mocking

Tags:

python-3.x

pandas

pytest

pytest-mock

ndclt

People also ask

1 Answers

Alexander Pivovarov

Recent Activity

Donate For Us