numpy diff on a pandas Series

Tags:

I want to use numpy.diff on a pandas Series. Am I right that this is a bug? Or am I doing it wrong?

In [163]: s = Series(np.arange(10))

In [164]: np.diff(s)
Out[164]: 
0   NaN
1     0
2     0
3     0
4     0
5     0
6     0
7     0
8     0
9   NaN

In [165]: np.diff(np.arange(10))
Out[165]: array([1, 1, 1, 1, 1, 1, 1, 1, 1])

I am using pandas 0.9.1rc1, numpy 1.6.1.

435

asked Dec 03 '12 18:12

Dan Allan

1 Answers

Pandas implements diff like so:

In [3]: s = pd.Series(np.arange(10))

In [4]: s.diff()
Out[4]:
0   NaN
1     1
2     1
3     1
4     1
5     1
6     1
7     1
8     1
9     1

Using np.diff directly:

In [7]: np.diff(s.values)
Out[7]: array([1, 1, 1, 1, 1, 1, 1, 1, 1])

In [8]: np.diff(np.array(s))
Out[8]: array([1, 1, 1, 1, 1, 1, 1, 1, 1])

So why doesn't np.diff(s) work? Because np is taking np.asanyarray() of the series before finding the diff. Like so:

In [25]: a = np.asanyarray(s)

In [26]: a 
Out[26]:
0    0
1    1
2    2
3    3
4    4
5    5
6    6
7    7
8    8
9    9

In [27]: np.diff(a)
Out[27]:
0   NaN
1     0
2     0
3     0
4     0
5     0
6     0
7     0
8     0
9   NaN

answered Oct 06 '22 05:10

Aman

Related questions
                            
                                Django form field label translations
                            
                                Fastest way to store large files in Python
                            
                                Using a session cookie from selenium in urllib2
                            
                                How to know who is importing me in python?
                            
                                Getting all visible text from a webpage using Selenium
                            
                                Python Nose: Log tests results to a file with Multiprocess Plugin
                            
                                How can I parse free-text time intervals in Python, ranging from years to seconds?
                            
                                How to print Numpy arrays without any extra notation (square brackets [ ] and spaces between elements)?
                            
                                How do you change the code example font size in LaTeX PDF output with Sphinx?
                            
                                Python/iptables: Capturing all UDP packets and their original destination
                            
                                Subprocess Popen not working with pythonw.exe
                            
                                Installed Python Modules - Python can't find them
                            
                                No cv.Point in Python OpenCV on latest stable Debian
                            
                                Blank label_suffix across entire Django project
                            
                                Trying to serve django static files on development server - not found
                            
                                What is the simple way to merge named tuples in Python?
                            
                                How to run selenium web driver behind a proxy server which needs authentication in python
                            
                                Trouble in parsing date using dateutil
                            
                                how to assign list of values to a key using OrderedDict in python
                            
                                Flask-WTFform: Flash does not display errors

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

numpy diff on a pandas Series

Tags:

python

pandas

numpy

Dan Allan

People also ask

1 Answers

Aman

Recent Activity

Donate For Us