How to calculate differences across n columns in pandas rather than rows

Tags:

I am playing around with data and need to look at differences across columns (as well as rows) in a fairly large dataframe. The easiest way for rows is clearly the diff() method, but I cannot find the equivalent for columns?

My current solution to obtain a dataframe with the columns differenced for via

df.transpose().diff().transpose()

Is there a more efficient alternative? Or is this such odd usage of pandas that this was just never requested/ considered useful? :)

Thanks,

975

asked Mar 23 '15 19:03

John Smizz

1 Answers

Pandas DataFrames are excellent for manipulating table-like data whose columns have different dtypes.

If subtracting across columns and rows both make sense, then it means all the values are the same kind of quantity. That might be an indication that you should be using a NumPy array instead of a Pandas DataFrame.

In any case, you can use arr = df.values to extract a NumPy array of the underlying data from the DataFrame. If all the columns share the same dtype, then the NumPy array will have the same dtype. (When the columns have different dtypes, df.values has object dtype).

Then you can compute the differences along rows or columns using np.diff(arr, axis=...):

import numpy as np
import pandas as pd

df = pd.DataFrame(np.arange(12).reshape(3,4), columns=list('ABCD'))
#    A  B   C   D
# 0  0  1   2   3
# 1  4  5   6   7
# 2  8  9  10  11

np.diff(df.values, axis=0)    # difference of the rows
# array([[4, 4, 4, 4],
#        [4, 4, 4, 4]])

np.diff(df.values, axis=1)    # difference of the columns
# array([[1, 1, 1],
#        [1, 1, 1],
#        [1, 1, 1]])

138

answered Nov 14 '22 23:11

unutbu

Related questions
                            
                                Python : issue running XLRD
                            
                                Pandas + scikit-learn K-means not working properly - treats all of dataframe rows as one big multi-dimensional example
                            
                                How does one get an Enum's members into the global namespace?
                            
                                ValueError: readline of closed file in Python
                            
                                Python subclassing a class with custom __new__
                            
                                How to enable debug mode as an option using Python's logging module
                            
                                Serialize a string with pyyaml without an ellipsis
                            
                                How do I use a constant LETTER in sympy?
                            
                                Can't import pandas into pycharm interpreter, despite changing pyCharm python interpreter path
                            
                                Trailing spaces removed on Python heredoc lines in PyCharm
                            
                                Python subclassing process with parameter
                            
                                Print original exception in excepthook
                            
                                How to pause Multiprocessing Process in Python?
                            
                                How do I escape colons in an attribute name with Python's ElementTree?
                            
                                Python String Templating with Case Sensitivity
                            
                                Python Beginner - where comes <bound method ... of <... object at 0x0000000005EAAEB8>> from?
                            
                                Does asyncio support running a subprocess from a non-main thread?
                            
                                Django admin list display optimize queryset
                            
                                Shift time series with missing dates in Pandas
                            
                                Which $TERM to use to have both 256 colors and mouse move events in python curses?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to calculate differences across n columns in pandas rather than rows

Tags:

python

pandas

numpy

John Smizz

People also ask

1 Answers

unutbu

Recent Activity

Donate For Us