Inside an autoregressive continuous problem, when the zeros take too much place, it is possible to treat the situation as a zero-inflated problem (i.e. ZIB). In other words, instead of working to fit <code>f(x)</code>, we want to fit <code>g(x)*f(x)</code> where <code>f(x)</code> is the function we want to approximate, i.e. <code>y</code>, and <code>g(x)</code> is a function which output a value between 0 and 1 depending if a value is zero or non-zero. Currently, I have two models. One model which gives me <code>g(x)</code> and another model which fits <code>g(x)*f(x)</code>. The first model gives me a set of weights. This is where I need your help. I can use the <code>sample_weights</code> arguments with <code>model.fit()</code>. As I work with tremendous amount of data, then I need to work with <code>model.fit_generator()</code>. However, <code>fit_generator()</code> does not have the argument <code>sample_weights</code>. Is there a work around to work with <code>sample_weights</code> inside <code>fit_generator()</code>? Otherwise, how can I fit <code>g(x)*f(x)</code> knowing that I have already a trained model for <code>g(x)</code>?

You can provide sample weights as the third element of the tuple returned by the generator. From Keras documentation on <code>fit_generator</code>: <blockquote> generator: A generator or an instance of <code>Sequence</code> (<code>keras.utils.Sequence</code>) object in order to avoid duplicate data when using multiprocessing. The output of the generator must be either <ul> <li>a tuple <code>(inputs, targets)</code> </li> <li>a tuple <code>(inputs, targets, sample_weights)</code>.</li> </ul> </blockquote> Update: Here is a rough sketch of a generator that returns the input samples and targets as well as the sample weights obtained from model <code>g(x)</code>: <pre class="prettyprint lang-py prettyprint-override"><code>def gen(args): while True: for i in range(num_batches): # get the i-th batch data inputs = ... targets = ... # get the sample weights weights = g.predict(inputs) yield inputs, targets, weights model.fit_generator(gen(args), steps_per_epoch=num_batches, ...) </code></pre>

Using sample_weights with fit_generator()

Tags:

generator

machine-learning

keras

time-series

autoregressive-models

Inside an autoregressive continuous problem, when the zeros take too much place, it is possible to treat the situation as a zero-inflated problem (i.e. ZIB). In other words, instead of working to fit f(x), we want to fit g(x)*f(x) where f(x) is the function we want to approximate, i.e. y, and g(x) is a function which output a value between 0 and 1 depending if a value is zero or non-zero.

Currently, I have two models. One model which gives me g(x) and another model which fits g(x)*f(x).

The first model gives me a set of weights. This is where I need your help. I can use the sample_weights arguments with model.fit(). As I work with tremendous amount of data, then I need to work with model.fit_generator(). However, fit_generator() does not have the argument sample_weights.

Is there a work around to work with sample_weights inside fit_generator()? Otherwise, how can I fit g(x)*f(x) knowing that I have already a trained model for g(x)?

886

asked Nov 17 '18 19:11

user1050421

1 Answers

You can provide sample weights as the third element of the tuple returned by the generator. From Keras documentation on fit_generator:

generator: A generator or an instance of Sequence (keras.utils.Sequence) object in order to avoid duplicate data when using multiprocessing. The output of the generator must be either

a tuple (inputs, targets)

a tuple (inputs, targets, sample_weights).

Update: Here is a rough sketch of a generator that returns the input samples and targets as well as the sample weights obtained from model g(x):

Click to copy

def gen(args):
    while True:
        for i in range(num_batches):
            # get the i-th batch data
            inputs = ...
            targets = ...
            
            # get the sample weights
            weights = g.predict(inputs)
            
            yield inputs, targets, weights
            
            
model.fit_generator(gen(args), steps_per_epoch=num_batches, ...)

answered Nov 24 '22 11:11

today

Related questions
                            
                                Multi-label feature selection using sklearn
                            
                                Keras Extremely High Loss
                            
                                Test data predictions yield random results when making predictions from a saved model
                            
                                L1 norm instead of L2 norm for cost function in regression model
                            
                                【CVAT】How to create multiple jobs in one task?
                            
                                Flutter TFLite Error: "metal_delegate.h" File Not Found
                            
                                What is evaluation of a cluster in WEKA?
                            
                                Using LIBSVM grid.py for unbalanced data?
                            
                                Vowpal Wabbit Logistic Regression
                            
                                Ground Truth and training data set
                            
                                Scikit Learn - Calculating TF-IDF from a corpus of arrays of features instead of from a corpus of raw documents
                            
                                Trouble understanding Convolutional Neural Network
                            
                                How to update an SVM model with new data
                            
                                Why xgboost.cv and sklearn.cross_val_score give different results?
                            
                                What is row slicing vs What is column slicing?
                            
                                How to list all classification/regression/clustering algorithms in scikit-learn?
                            
                                Keras Realtime Augmentation adding Noise and Contrast
                            
                                How to calculate the actual size of a .fit()-trained model in sklearn?
                            
                                How to visualize TensorFlow Estimator weights?
                            
                                How to do multi-class image classification in keras?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With