I'm trying out the Keras package in R by doing this tutorial about forecasting the temperature. However, the tutorial has no explanation on how to predict with the trained RNN model and I wonder how to do this. To train a model I used the following code copied from the tutorial: <pre class="prettyprint"><code>dir.create("~/Downloads/jena_climate", recursive = TRUE) download.file( "https://s3.amazonaws.com/keras-datasets/jena_climate_2009_2016.csv.zip", "~/Downloads/jena_climate/jena_climate_2009_2016.csv.zip" ) unzip( "~/Downloads/jena_climate/jena_climate_2009_2016.csv.zip", exdir = "~/Downloads/jena_climate" ) library(readr) data_dir <- "~/Downloads/jena_climate" fname <- file.path(data_dir, "jena_climate_2009_2016.csv") data <- read_csv(fname) data <- data.matrix(data[,-1]) train_data <- data[1:200000,] mean <- apply(train_data, 2, mean) std <- apply(train_data, 2, sd) data <- scale(data, center = mean, scale = std) generator <- function(data, lookback, delay, min_index, max_index, shuffle = FALSE, batch_size = 128, step = 6) { if (is.null(max_index)) max_index <- nrow(data) - delay - 1 i <- min_index + lookback function() { if (shuffle) { rows <- sample(c((min_index+lookback):max_index), size = batch_size) } else { if (i + batch_size >= max_index) i <<- min_index + lookback rows <- c(i:min(i+batch_size, max_index)) i <<- i + length(rows) } samples <- array(0, dim = c(length(rows), lookback / step, dim(data)[[-1]])) targets <- array(0, dim = c(length(rows))) for (j in 1:length(rows)) { indices <- seq(rows[[j]] - lookback, rows[[j]], length.out = dim(samples)[[2]]) samples[j,,] <- data[indices,] targets[[j]] <- data[rows[[j]] + delay,2] } list(samples, targets) } } lookback <- 1440 step <- 6 delay <- 144 batch_size <- 128 train_gen <- generator( data, lookback = lookback, delay = delay, min_index = 1, max_index = 200000, shuffle = TRUE, step = step, batch_size = batch_size ) val_gen = generator( data, lookback = lookback, delay = delay, min_index = 200001, max_index = 300000, step = step, batch_size = batch_size ) test_gen <- generator( data, lookback = lookback, delay = delay, min_index = 300001, max_index = NULL, step = step, batch_size = batch_size ) # How many steps to draw from val_gen in order to see the entire validation set val_steps <- (300000 - 200001 - lookback) / batch_size # How many steps to draw from test_gen in order to see the entire test set test_steps <- (nrow(data) - 300001 - lookback) / batch_size library(keras) model <- keras_model_sequential() %>% layer_flatten(input_shape = c(lookback / step, dim(data)[-1])) %>% layer_dense(units = 32, activation = "relu") %>% layer_dense(units = 1) model %>% compile( optimizer = optimizer_rmsprop(), loss = "mae" ) history <- model %>% fit_generator( train_gen, steps_per_epoch = 500, epochs = 20, validation_data = val_gen, validation_steps = val_steps ) </code></pre> I tried to predict the temperature with the code below. If I am correct, this should give me the normalized predicted temperature for every batch. So when I denormalize the values and average them, I get the predicted temperature. Is this correct and if so for which time is then predicted (latest observation time + <code>delay</code>?) ? <pre class="prettyprint"><code>prediction.set <- test_gen()[[1]] prediction <- predict(model, prediction.set) </code></pre> Also, what is the correct way to use <code>keras::predict_generator()</code> and the <code>test_gen()</code> function? If I use the following code: <pre class="prettyprint"><code>model %>% predict_generator(generator = test_gen, steps = test_steps) </code></pre> it gives this error: <pre class="prettyprint"><code>error in py_call_impl(callable, dots$args, dots$keywords) : ValueError: Error when checking model input: the list of Numpy arrays that you are passing to your model is not the size the model expected. Expected to see 1 array(s), but instead got the following list of 2 arrays: [array([[[ 0.50394005, 0.6441838 , 0.5990761 , ..., 0.22060473, 0.2018686 , -1.7336458 ], [ 0.5475698 , 0.63853574, 0.5890239 , ..., -0.45618412, -0.45030192, -1.724062... </code></pre>

Note: my familiarity with syntax of R is very little, so unfortunately I can't give you an answer using R. Instead, I am using Python in my answer. I hope you could easily translate back, my words at least, to R. <hr> <blockquote> ... If I am correct, this should give me the normalized predicted temperature for every batch. </blockquote> Yes, that's right. The predictions would be normalized since you have trained it with normalized labels: <pre class="prettyprint lang-r prettyprint-override"><code>data <- scale(data, center = mean, scale = std) </code></pre> Therefore, you would need to denormalize the values using the computed mean and std to find the real predictions: <pre class="prettyprint lang-py prettyprint-override"><code>pred = model.predict(test_data) denorm_pred = pred * std + mean </code></pre> <blockquote> ... for which time is then predicted (latest observation time + delay?) </blockquote> That's right. Concretely, since in this particular dataset every ten minutes a new obeservation is recorded and you have set <code>delay=144</code>, it would mean that the predicted value is the temperature 24 hours ahead (i.e. 144 * 10 = 1440 minutes = 24 hours) from the last given observation. <blockquote> Also, what is the correct way to use <code>keras::predict_generator()</code> and the <code>test_gen()</code> function? </blockquote> <code>predict_generator</code> takes a generator that gives as output only test samples and not the labels (since we don't need labels when we are performing prediction; the labels are needed when training, i.e. <code>fit_generator()</code>, and when evaluating the model, i.e. <code>evaluate_generator()</code>). That's why the error mentions that you need to pass one array instead of two arrays. So you need to define a generator that only gives test samples or one alternative way, in Python, is to wrap your existing generator inside another function that gives only the input samples (I don't know whether you can do this in R or not): <pre class="prettyprint lang-py prettyprint-override"><code>def pred_generator(gen): for data, labels in gen: yield data # discards labels preds = model.predict_generator(pred_generator(test_generator), number_of_steps) </code></pre> You need to provide one other argument which is the number of steps of generator to cover all the samples in test data. Actually we have <code>num_steps = total_number_of_samples / batch_size</code>. For example, if you have 1000 samples and each time the generator generate 10 samples, you need to use generator for <code>1000 / 10 = 100</code> steps. Bonus: To see how good your model performs you can use <code>evaluate_generator</code> using the existing test generator (i.e. <code>test_gen</code>): <pre class="prettyprint lang-r prettyprint-override"><code>loss = model.evaluate_generator(test_gen, number_of_steps) </code></pre> The given <code>loss</code> is also normalized and to denormalize it (to get a better sense of prediction error) you just need to multiply it by <code>std</code> (you don't need to add <code>mean</code> since you are using <code>mae</code>, i.e. mean absolute error, as the loss function): <pre class="prettyprint lang-r prettyprint-override"><code>denorm_loss = loss * std </code></pre> This would tell you how much your predictions are off on average. For example, if you are predicting the temperature, a <code>denorm_loss</code> of 5 means that the predictions are on average 5 degrees off (i.e. are either less or more than the actual value). <hr> Update: For prediction, you can define a new generator using an existing generator in R like this: <pre class="prettyprint lang-r prettyprint-override"><code>pred_generator <- function(gen) { function() { # wrap it in a function to make it callable gen()[1] # call the given generator and get the first element (i.e. samples) } } preds <- model %>% predict_generator( generator = pred_generator(test_gen), # pass test_gen directly to pred_generator without calling it steps = test_steps ) evaluate_generator(model, test_gen, test_steps) </code></pre>

Understanding Keras prediction output of a rnn model in R

Tags:

r

machine-learning

keras

lstm

recurrent-neural-network

I'm trying out the Keras package in R by doing this tutorial about forecasting the temperature. However, the tutorial has no explanation on how to predict with the trained RNN model and I wonder how to do this. To train a model I used the following code copied from the tutorial:

dir.create("~/Downloads/jena_climate", recursive = TRUE)
download.file(
    "https://s3.amazonaws.com/keras-datasets/jena_climate_2009_2016.csv.zip",
      "~/Downloads/jena_climate/jena_climate_2009_2016.csv.zip"
    )
unzip(
  "~/Downloads/jena_climate/jena_climate_2009_2016.csv.zip",
  exdir = "~/Downloads/jena_climate"
)

library(readr)
data_dir <- "~/Downloads/jena_climate"
fname <- file.path(data_dir, "jena_climate_2009_2016.csv")
data <- read_csv(fname)

data <- data.matrix(data[,-1])

train_data <- data[1:200000,]
mean <- apply(train_data, 2, mean)
std <- apply(train_data, 2, sd)
data <- scale(data, center = mean, scale = std)

generator <- function(data, lookback, delay, min_index, max_index,
                      shuffle = FALSE, batch_size = 128, step = 6) {
  if (is.null(max_index))
    max_index <- nrow(data) - delay - 1
  i <- min_index + lookback
  function() {
    if (shuffle) {
      rows <- sample(c((min_index+lookback):max_index), size = batch_size)
    } else {
      if (i + batch_size >= max_index)
        i <<- min_index + lookback
      rows <- c(i:min(i+batch_size, max_index))
      i <<- i + length(rows)
    }

    samples <- array(0, dim = c(length(rows), 
                                lookback / step,
                                dim(data)[[-1]]))
    targets <- array(0, dim = c(length(rows)))

    for (j in 1:length(rows)) {
      indices <- seq(rows[[j]] - lookback, rows[[j]], 
                     length.out = dim(samples)[[2]])
      samples[j,,] <- data[indices,]
      targets[[j]] <- data[rows[[j]] + delay,2]
    }            

    list(samples, targets)
  }
}

lookback <- 1440
step <- 6
delay <- 144
batch_size <- 128

train_gen <- generator(
  data,
  lookback = lookback,
  delay = delay,
  min_index = 1,
  max_index = 200000,
  shuffle = TRUE,
  step = step, 
  batch_size = batch_size
)

val_gen = generator(
  data,
  lookback = lookback,
  delay = delay,
  min_index = 200001,
  max_index = 300000,
  step = step,
  batch_size = batch_size
)

test_gen <- generator(
  data,
  lookback = lookback,
  delay = delay,
  min_index = 300001,
  max_index = NULL,
  step = step,
  batch_size = batch_size
)

# How many steps to draw from val_gen in order to see the entire validation set
val_steps <- (300000 - 200001 - lookback) / batch_size

# How many steps to draw from test_gen in order to see the entire test set
test_steps <- (nrow(data) - 300001 - lookback) / batch_size

library(keras)

model <- keras_model_sequential() %>% 
  layer_flatten(input_shape = c(lookback / step, dim(data)[-1])) %>% 
  layer_dense(units = 32, activation = "relu") %>% 
  layer_dense(units = 1)

model %>% compile(
  optimizer = optimizer_rmsprop(),
  loss = "mae"
)

history <- model %>% fit_generator(
  train_gen,
  steps_per_epoch = 500,
  epochs = 20,
  validation_data = val_gen,
  validation_steps = val_steps
)

I tried to predict the temperature with the code below. If I am correct, this should give me the normalized predicted temperature for every batch. So when I denormalize the values and average them, I get the predicted temperature. Is this correct and if so for which time is then predicted (latest observation time + delay?) ?

prediction.set <- test_gen()[[1]]
prediction <- predict(model, prediction.set)

Also, what is the correct way to use keras::predict_generator() and the test_gen() function? If I use the following code:

model %>% predict_generator(generator = test_gen,
                            steps = test_steps)

it gives this error:

error in py_call_impl(callable, dots$args, dots$keywords) : 
 ValueError: Error when checking model input: the list of Numpy
 arrays that you are passing to your model is not the size the model expected. 
 Expected to see 1 array(s), but instead got the following list of 2 arrays: 
 [array([[[ 0.50394005,  0.6441838 ,  0.5990761 , ...,  0.22060473,
          0.2018686 , -1.7336458 ],
        [ 0.5475698 ,  0.63853574,  0.5890239 , ..., -0.45618412,
         -0.45030192, -1.724062...

467

asked Feb 28 '18 14:02

Sven

1 Answers

Note: my familiarity with syntax of R is very little, so unfortunately I can't give you an answer using R. Instead, I am using Python in my answer. I hope you could easily translate back, my words at least, to R.

... If I am correct, this should give me the normalized predicted temperature for every batch.

Yes, that's right. The predictions would be normalized since you have trained it with normalized labels:

data <- scale(data, center = mean, scale = std)

Therefore, you would need to denormalize the values using the computed mean and std to find the real predictions:

pred = model.predict(test_data)
denorm_pred = pred * std + mean

... for which time is then predicted (latest observation time + delay?)

That's right. Concretely, since in this particular dataset every ten minutes a new obeservation is recorded and you have set delay=144, it would mean that the predicted value is the temperature 24 hours ahead (i.e. 144 * 10 = 1440 minutes = 24 hours) from the last given observation.

Also, what is the correct way to use keras::predict_generator() and the test_gen() function?

predict_generator takes a generator that gives as output only test samples and not the labels (since we don't need labels when we are performing prediction; the labels are needed when training, i.e. fit_generator(), and when evaluating the model, i.e. evaluate_generator()). That's why the error mentions that you need to pass one array instead of two arrays. So you need to define a generator that only gives test samples or one alternative way, in Python, is to wrap your existing generator inside another function that gives only the input samples (I don't know whether you can do this in R or not):

def pred_generator(gen):
    for data, labels in gen:
        yield data  # discards labels

preds = model.predict_generator(pred_generator(test_generator), number_of_steps)

You need to provide one other argument which is the number of steps of generator to cover all the samples in test data. Actually we have num_steps = total_number_of_samples / batch_size. For example, if you have 1000 samples and each time the generator generate 10 samples, you need to use generator for 1000 / 10 = 100 steps.

Bonus: To see how good your model performs you can use evaluate_generator using the existing test generator (i.e. test_gen):

loss = model.evaluate_generator(test_gen, number_of_steps)

The given loss is also normalized and to denormalize it (to get a better sense of prediction error) you just need to multiply it by std (you don't need to add mean since you are using mae, i.e. mean absolute error, as the loss function):

denorm_loss = loss * std

This would tell you how much your predictions are off on average. For example, if you are predicting the temperature, a denorm_loss of 5 means that the predictions are on average 5 degrees off (i.e. are either less or more than the actual value).

Update: For prediction, you can define a new generator using an existing generator in R like this:

pred_generator <- function(gen) {
  function() { # wrap it in a function to make it callable
    gen()[1]  # call the given generator and get the first element (i.e. samples)
  }
}

preds <- model %>% 
  predict_generator(
    generator = pred_generator(test_gen), # pass test_gen directly to pred_generator without calling it
    steps = test_steps
  )

evaluate_generator(model, test_gen, test_steps)

148

answered Sep 24 '22 15:09

today

Related questions
                            
                                How to terminate a plotting job in Rstudio
                            
                                How to add an image as a header/footer in Markdown for a PDF document
                            
                                Combining multiple lists of variable names in data.table?
                            
                                Multi-steps forecasting with dplyr and do
                            
                                How to change and set Rcpp compile arguments
                            
                                Find matching intervals in data frame by range of two column values
                            
                                Reproduce Fisher linear discriminant figure
                            
                                rvest, html_nodes() error: cannot coerce type 'environment' to vector of type 'list'. Fails RScript, works in Session
                            
                                Debugging package::function() although lazy evaluation is used
                            
                                Split a matrix in blocks of size n with offset i (vectorized method)
                            
                                Start multiple h2o cluster from within R
                            
                                Tail recursion in R
                            
                                Is there a way to deal with nested data with sparklyr?
                            
                                Programmatically scraping a response header within R
                            
                                How to identify the function used by geom_smooth()
                            
                                sum non NA elements only, but if all NA then return NA
                            
                                Finding specific strings in an array using R
                            
                                R Shiny authentication using AWS Cognito
                            
                                fuzzy matching in R
                            
                                Stored Input values in shiny widgets?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With