Python and Pandas - Moving Average Crossover

Tags: python, pandas, numpy

There is a Pandas DataFrame object with some stock data. SMAs are moving averages calculated from previous 45/15 days.

Date      Price   SMA_45      SMA_15
20150127  102.75  113         106
20150128  103.05  100         106
20150129  105.10  112         105
20150130  105.35  111         105
20150202  107.15  111         105
20150203  111.95  110         105
20150204  111.90  110         106

I want to find all dates on which SMA_15 and SMA_45 intersect.

Can it be done efficiently using Pandas or Numpy? How?

EDIT:

What I mean by 'intersection':

The data row, when:

- The long SMA(45) value was bigger than the short SMA(15) value for longer than the short SMA period (15) and then became smaller.
- The long SMA(45) value was smaller than the short SMA(15) value for longer than the short SMA period (15) and then became bigger.

asked Feb 05 '15 13:02

chilliq

2 Answers

I'm taking a crossover to mean the point where the SMA lines, as functions of time, intersect, as depicted on this Investopedia page (http://www.investopedia.com/terms/s/sma.asp).

Since the SMAs represent continuous functions, there is a crossing when, for a given row, SMA_15 is less than SMA_45 and the previous SMA_15 is greater than the previous SMA_45, or vice versa. Using non-strict comparisons (<= and >=) also counts rows where the two lines merely touch.

In code, that could be expressed as

previous_15 = df['SMA_15'].shift(1)
previous_45 = df['SMA_45'].shift(1)
crossing = (((df['SMA_15'] <= df['SMA_45']) & (previous_15 >= previous_45))
            | ((df['SMA_15'] >= df['SMA_45']) & (previous_15 <= previous_45)))

If we change your data to

Date      Price   SMA_45      SMA_15
20150127  102.75  113         106
20150128  103.05  100         106
20150129  105.10  112         105
20150130  105.35  111         105
20150202  107.15  111         105
20150203  111.95  110         105
20150204  111.90  110         106

so that there are crossings,

then

import pandas as pd

df = pd.read_table('data', sep=r'\s+')  # raw string so \s is passed to the regex engine unchanged
previous_15 = df['SMA_15'].shift(1)
previous_45 = df['SMA_45'].shift(1)
crossing = (((df['SMA_15'] <= df['SMA_45']) & (previous_15 >= previous_45))
            | ((df['SMA_15'] >= df['SMA_45']) & (previous_15 <= previous_45)))
crossing_dates = df.loc[crossing, 'Date']
print(crossing_dates)

yields

1    20150128
2    20150129
Name: Date, dtype: int64
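
The question's edit additionally asks that the old ordering had held for longer than the short SMA period before it flipped, which the expression above does not check. A minimal sketch of one way to add that filter follows. The function name `persistent_crossings`, its `min_run` parameter, and the toy data are hypothetical and introduced only for illustration. It treats rows where the two averages are exactly equal as "short not above long", which may not be what you want.

```python
import pandas as pd


def persistent_crossings(df, short_col="SMA_15", long_col="SMA_45", min_run=15):
    """Return a boolean Series marking flips that end a run longer than min_run rows."""
    # True where the short SMA is above the long SMA.
    short_above = df[short_col] > df[long_col]
    # A flip is a row whose ordering differs from the previous row's (row 0 never flips).
    flipped = short_above.astype(int).diff().fillna(0).ne(0)
    # Label each stretch of unchanged ordering, then count rows within each stretch.
    run_id = flipped.cumsum()
    run_length = short_above.groupby(run_id).cumcount() + 1
    # At a flip, the previous row holds the full length of the run that just ended.
    return flipped & (run_length.shift(1) > min_run)


# Toy data (hypothetical), with min_run=2 so the effect is visible on ten rows.
toy = pd.DataFrame({
    "SMA_15": [1, 1, 1, 3, 3, 1, 1, 1, 1, 3],
    "SMA_45": [2] * 10,
})
mask = persistent_crossings(toy, min_run=2)
flagged = mask[mask].index.tolist()
print(flagged)  # [3, 9]: the flip at row 5 is skipped because the prior run lasted only 2 rows
```

On real data you would probably call it with the default `min_run=15` and select the dates with `df.loc[mask, 'Date']`. Note that the first run is cut off by the start of the data, so its measured length is only a lower bound.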

answered Oct 12 '22 11:10

unutbu

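The following method gives similar results but takes less time than the previous method:

import numpy as np

df['position'] = df['SMA_15'] > df['SMA_45']  # True where the short SMA is above the long SMA
df['pre_position'] = df['position'].shift(1)  # the previous row's position (NaN for the first row)
df.dropna(inplace=True)  # drops rows containing NaN, including the first row
df['crossover'] = np.where(df['position'] == df['pre_position'], False, True)  # True where the position changed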

Time taken for this approach: 2.7 ms ± 310 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

Time taken for previous approach: 3.46 ms ± 307 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)
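
The timings above do not say what data they were measured on, and they will vary with data size, hardware, and pandas version. The following is a rough sketch of how the two approaches could be compared on your own machine. The synthetic random-walk data and the function names `crossing_shift` and `crossing_position` are assumptions made for illustration, and `crossing_position` is a non-mutating variant of the idea above rather than the exact code.

```python
import timeit

import numpy as np
import pandas as pd

# Synthetic data (an assumption for illustration, not the data behind the quoted timings).
rng = np.random.default_rng(0)
price = pd.Series(100 + rng.normal(size=5000).cumsum())
df = pd.DataFrame({
    "Price": price,
    "SMA_45": price.rolling(45).mean(),
    "SMA_15": price.rolling(15).mean(),
}).dropna().reset_index(drop=True)


def crossing_shift(df):
    # First answer: compare each row's ordering with the previous row's ordering.
    previous_15 = df["SMA_15"].shift(1)
    previous_45 = df["SMA_45"].shift(1)
    return (((df["SMA_15"] <= df["SMA_45"]) & (previous_15 >= previous_45))
            | ((df["SMA_15"] >= df["SMA_45"]) & (previous_15 <= previous_45)))


def crossing_position(df):
    # Second answer's idea, without adding columns to df or dropping rows.
    position = df["SMA_15"] > df["SMA_45"]
    return position.astype(int).diff().fillna(0).ne(0)


# The two should flag the same rows as long as the averages are never exactly equal.
same_rows = bool((crossing_shift(df) == crossing_position(df)).all())
print("same rows flagged:", same_rows)

runs = 100
for func in (crossing_shift, crossing_position):
    seconds = timeit.timeit(lambda: func(df), number=runs)
    print(f"{func.__name__}: {1000 * seconds / runs:.3f} ms per call")
```

The two approaches can disagree on rows where the averages are exactly equal, because the first uses non-strict comparisons and the second uses a strict one.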

answered Oct 12 '22 11:10

Jeril
