guidelines on using pandas inplace keyword argument

Tags:

What is the guideline for using inplace?

For example,

df = df.reset_index()

df.reset_index(inplace=True)

Same same but different?

956

asked Dec 16 '15 19:12

user3659451

1 Answers

In terms of the resulting DataFrame df, the two approaches are the same. The difference lies in the (maximum) memory usage, since the in-place version does not create a copy of the DataFrame.

Consider this setup:

import numpy as np
import pandas as pd

def make_data():
    return pd.DataFrame(np.random.rand(1000000, 100))

def func_copy():
    df = make_data()
    df = df.reset_index()
    
def func_inplace():
    df = make_data()
    df.reset_index(inplace=True)

We can use the memory_profiler library to perform some benchmarking for the memory usage:

%load_ext memory_profiler

%memit func_copy()
# peak memory: 1602.66 MiB, increment: 1548.66 MiB

%memit func_inplace()
# peak memory: 817.02 MiB, increment: 762.94 MiB

As expected, the in-place version is more memory efficient.

On the other hand, there also seems to be a non-trivial difference in running time between the approaches when the data size is large enough (e.g. in the above example):

%timeit func_copy()
1 loops, best of 3: 2.56 s per loop

%timeit func_inplace()
1 loops, best of 3: 1.35 s per loop

These differences may or may not be significant depending on the use case (e.g. adhoc exploratory analysis vs. production code), data size and the hardware resource available. In general, it might be a good idea to use the in-place version whenever possible for better memory and run time efficiency.

106

answered Sep 21 '22 04:09

YS-L

Related questions
                            
                                imported modules becomes None when replacing current module in sys.modules using a class object
                            
                                Why was reload removed from python builtins in the switch to python3?
                            
                                scikit-learn: fitting data into chunks vs fitting it all at once
                            
                                Segfault when import_array not in same translation unit
                            
                                Cannot import pyodbc on Mac
                            
                                Anyone successfully bundled data files into a single file with Pyinstaller?
                            
                                Storing day and month without year in Python
                            
                                Global query timeout in MySQL 5.6
                            
                                How to convert a pygame Surface to a PIL Image?
                            
                                serializing to JSON that would retain hebrew charcters
                            
                                Python doctest exceptions
                            
                                Creation and validation of directory using try/except or if else? [duplicate]
                            
                                UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-3: ordinal not in range(128)
                            
                                Python Processes not joining
                            
                                401 error while using tweepy
                            
                                How do I write a decorator to wrap something in a context manager, that takes parameters?
                            
                                When to use '.flat', '.flatiter' or '.flatten()'
                            
                                Compiled Python binary reports wrong version
                            
                                Assign 0 to certain words when the words are not present
                            
                                Python enum34 access by name

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

guidelines on using pandas inplace keyword argument

Tags:

pandas

in-place

python-2.7

user3659451

People also ask

1 Answers

YS-L

Recent Activity

Donate For Us