Keras - class_weight vs sample_weights in the fit_generator

Tags:

In Keras (using TensorFlow as a backend) I am building a model which is working with a huge dataset that is having highly imbalanced classes (labels). To be able to run the training process, I created a generator which feeds chunks of data to the fit_generator.

According to the documentation for the fit_generator, the output of the generator can either be the tuple (inputs, targets) or the tuple (inputs, targets, sample_weights). Having that in mind, here are a few questions:

My understanding is that the class_weight regards the weights of all classes for the entire dataset whereas the sample_weights regards the weights of all classes for each individual chunk created by the generator. Is that correct? If not, can someone elaborate on the matter?
Is it necessary to give both the class_weight to the fit_generator and then the sample_weights as an output for each chunk? If yes, then why? If not then which one is better to give?
If I should give the sample_weights for each chunk, how do I map the weights if some of the classes are missing from a specific chunk? Let me give an example. In my overall dataset, I have 7 possible classes (labels). Because these classes are highly imbalanced, when I create smaller chunks of data as an output from the fit_generator, some of the classes are missing from the specific chunk. How should I create the sample_weights for these chunks?

936

asked Apr 17 '17 20:04

RaduS

1 Answers

My understanding is that the class_weight regards the weights of all classes for the entire dataset whereas the sample_weights regards the weights of all classes for each individual chunk created by the generator. Is that correct? If not, can someone elaborate on the matter?

class_weight affects the relative weight of each class in the calculation of the objective function. sample_weights, as the name suggests, allows further control of the relative weight of samples that belong to the same class.

Is it necessary to give both the class_weight to the fit_generator and then the sample_weights as an output for each chunk? If yes, then why? If not then which one is better to give?

It depends on your application. Class weights are useful when training on highly skewed data sets; for example, a classifier to detect fraudulent transactions. Sample weights are useful when you don't have equal confidence in the samples in your batch. A common example is performing regression on measurements with variable uncertainty.

If I should give the sample_weights for each chunk, how do I map the weights if some of the classes are missing from a specific chunk? Let me give an example. In my overall dataset, I have 7 possible classes (labels). Because these classes are highly imbalanced, when I create smaller chunks of data as an output from the fit_generator, some of the classes are missing from the specific chunk. How should I create the sample_weights for these chunks?

This is not an issue. sample_weights is defined on a per-sample basis and is independent from the class. For this reason, the documentation states that (inputs, targets, sample_weights) should be the same length.

The function _weighted_masked_objective in engine/training.py has an example of sample_weights are being applied.

148

answered Nov 09 '22 02:11

dhinckley

Related questions
                            
                                How to use Batch Normalization correctly in tensorflow?
                            
                                How to deal with UserWarning: Converting sparse IndexedSlices to a dense Tensor of unknown shape
                            
                                Python TensorFlow: How to restart training with optimizer and import_meta_graph?
                            
                                When to use tensorflow datasets api versus pandas or numpy
                            
                                Keras inconsistent prediction time
                            
                                Restoring TensorFlow model
                            
                                How can I use Tensorflow with react-native? [closed]
                            
                                How to select batch size automatically to fit GPU?
                            
                                what is Device interconnect StreamExecutor with strength 1 edge matrix
                            
                                TensorFlow Object Detection API print objects found on image to console
                            
                                Does bias in the convolutional layer really make a difference to the test accuracy?
                            
                                Understanding COCO evaluation "maximum detections"
                            
                                Loss suddenly increases with Adam Optimizer in Tensorflow
                            
                                How to set the input of a Keras layer with a Tensorflow tensor?
                            
                                What's the best way to refresh TensorBoard after new events/logs were added?
                            
                                How to use tensorflow to implement deconvolution?
                            
                                TensorFlow strings: what they are and how to work with them
                            
                                Why was Eigen chosen for TensorFlow? [closed]
                            
                                ValueError when performing matmul with Tensorflow
                            
                                Add Tensorflow pre-processing to existing Keras model (for use in Tensorflow Serving)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Keras - class_weight vs sample_weights in the fit_generator

Tags:

tensorflow

keras

RaduS

People also ask

1 Answers

dhinckley

Recent Activity

Donate For Us