 

Why is the Mean Absolute Percentage Error (MAPE) extremely high?

I obtained this code from machinelearningmastery.

I modified the model.compile() call to add the mape metric so I could track the Mean Absolute Percentage Error. After running the code, the MAPE at every epoch is enormous, which seems wrong for a percentage metric. Am I missing something obvious, or is this output correct? The output looks like:

Epoch 91/100
0s - loss: 0.0103 - mean_absolute_percentage_error: 1764997.4502
Epoch 92/100
0s - loss: 0.0103 - mean_absolute_percentage_error: 1765653.4924
Epoch 93/100
0s - loss: 0.0102 - mean_absolute_percentage_error: 1766505.5107
Epoch 94/100
0s - loss: 0.0102 - mean_absolute_percentage_error: 1766814.5450
Epoch 95/100
0s - loss: 0.0102 - mean_absolute_percentage_error: 1767510.8146
Epoch 96/100
0s - loss: 0.0101 - mean_absolute_percentage_error: 1767686.9054
Epoch 97/100
0s - loss: 0.0101 - mean_absolute_percentage_error: 1767076.2169
Epoch 98/100
0s - loss: 0.0100 - mean_absolute_percentage_error: 1767014.8481
Epoch 99/100
0s - loss: 0.0100 - mean_absolute_percentage_error: 1766592.8125
Epoch 100/100
0s - loss: 0.0100 - mean_absolute_percentage_error: 1766348.6332

The code I ran (omitting the prediction part) is as follows:

import numpy
from numpy import array
import matplotlib.pyplot as plt
from pandas import read_csv
import math
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import LSTM
from sklearn.preprocessing import MinMaxScaler
from sklearn.metrics import mean_squared_error

# convert an array of values into a dataset matrix
def create_dataset(dataset, look_back=1):
        dataX, dataY = [], []
        for i in range(len(dataset)-look_back-1):
                a = dataset[i:(i+look_back), 0]
                dataX.append(a)
                dataY.append(dataset[i + look_back, 0])
        return numpy.array(dataX), numpy.array(dataY)
# fix random seed for reproducibility
numpy.random.seed(7)
# load the dataset
dataframe = read_csv('airlinepassdata.csv', usecols=[1], engine='python', skipfooter=3)
dataset = dataframe.values

#dataset = array([0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0])
dataset = dataset.astype('float32')
# normalize the dataset
scaler = MinMaxScaler(feature_range=(0, 1))
dataset = scaler.fit_transform(dataset)
# split into train and test sets
train_size = int(len(dataset) * 0.67)
test_size = len(dataset) - train_size
train, test = dataset[0:train_size,:], dataset[train_size:len(dataset),:]
# reshape into X=t and Y=t+1
look_back = 1
trainX, trainY = create_dataset(train, look_back)
testX, testY = create_dataset(test, look_back)
# reshape input to be [samples, time steps, features]
trainX = numpy.reshape(trainX, (trainX.shape[0], 1, trainX.shape[1]))
testX = numpy.reshape(testX, (testX.shape[0], 1, testX.shape[1]))
# create and fit the LSTM network
model = Sequential()
model.add(LSTM(4, input_shape=(1, look_back)))
model.add(Dense(1))
model.compile(loss='mse', optimizer='adam', metrics=['mape'])
model.fit(trainX, trainY, nb_epoch=100, batch_size=50, verbose=2)
asked Apr 09 '18 by Srivatsa Sharma G



1 Answer

I solved this by setting the fuzz factor epsilon to one with keras.backend.set_epsilon(1) before calling compile().
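For reference, a minimal sketch of how the fix slots into the question's script (assuming the standard Keras backend import; only the epsilon call is new, the rest is taken from the question):

from keras import backend as K

# Raise the fuzz factor so tiny |y_true| values are no longer divided by 1e-7
K.set_epsilon(1)

model.compile(loss='mse', optimizer='adam', metrics=['mape'])
model.fit(trainX, trainY, nb_epoch=100, batch_size=50, verbose=2)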

The hint was in the Keras source code:

def mean_absolute_percentage_error(y_true, y_pred):
    diff = K.abs((y_true - y_pred) / K.clip(K.abs(y_true),
                                            K.epsilon(),
                                            None))
    return 100. * K.mean(diff, axis=-1)

The reason is that the MinMaxScaler squashes the data into [0, 1], so the smallest targets in y_true become 0 (or very close to it). K.abs(y_true) then falls below the default fuzz factor of 1e-7, the denominator is clipped to that tiny value, and the resulting percentages explode into the millions. Setting epsilon to 1 clips the denominator to at least 1 instead, so the metric stays bounded (for these normalized values it effectively becomes 100 times the mean absolute error).
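To see the magnitude involved, here is a small standalone numpy sketch (with made-up values standing in for the scaled targets) that reproduces Keras' MAPE formula by hand:

import numpy as np

eps = 1e-7  # Keras' default fuzz factor
y_true = np.array([0.0, 0.02, 0.5])    # MinMax-scaled targets; the series minimum becomes exactly 0
y_pred = np.array([0.01, 0.03, 0.48])  # reasonably close predictions

# Same computation as Keras' mean_absolute_percentage_error
diff = np.abs((y_true - y_pred) / np.clip(np.abs(y_true), eps, None))
print(100. * diff.mean())  # ~3.3 million

The first term alone contributes 0.01 / 1e-7 = 100,000 to the mean, which matches the scale of the numbers in the training log above.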

answered Oct 24 '22 by Guile