I have a multilayer perceptron with a sigmoid loss (tf.nn.sigmoid_cross_entropy_with_logits) and an Adam optimizer (tf.train.AdamOptimizer). My input data has several features, and some of the feature values are nan. When I replace the nan values with 0 I get a result, but when I do not replace them I get loss=nan.
What is the best way to handle nan values in TensorFlow, and how can I use my input data with nan values without replacing them with 0?
Most of the machine learning models you will want to use raise an error (or, as here, produce a nan loss) if you pass NaN values into them. The easiest fix is to fill the NaNs with 0, but this can reduce your model's accuracy significantly.
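A minimal sketch of that zero-filling step, assuming the features arrive as a NumPy array (the array X here is made up for illustration):

```python
import numpy as np

# Hypothetical feature matrix containing nan entries
X = np.array([[0.5, np.nan, 1.2],
              [np.nan, 0.3, 0.7]], dtype=np.float32)

# Replace every nan with 0.0 before feeding the data to the network
X_filled = np.nan_to_num(X)
```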
Pandas treats None and NaN as essentially interchangeable for indicating missing or null values. To support this convention, a Pandas DataFrame provides several useful functions for detecting, removing, and replacing null values, such as isnull() and notnull().
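For example, on a small made-up DataFrame:

```python
import numpy as np
import pandas as pd

# Hypothetical DataFrame mixing None and np.nan to mark missing values
df = pd.DataFrame({"a": [1.0, None, 3.0],
                   "b": [np.nan, 5.0, 6.0]})

print(df.isnull())   # True where a value is missing (None or NaN)
print(df.notnull())  # True where a real value is present
```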
There are two primary ways of handling missing values: deleting the missing values, or imputing them (see the sketch below).
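A sketch of both options on the same hypothetical DataFrame; using each column's mean as the imputation value is just one possible choice:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({"a": [1.0, None, 3.0],
                   "b": [np.nan, 5.0, 6.0]})

# Option 1: delete rows that contain any missing value
dropped = df.dropna()

# Option 2: impute, here with each column's mean
imputed = df.fillna(df.mean())
```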
How can I tell my network to ignore some input data, for example when the input data is nan?
This is very similar to adding a mask to your input data. You want your input data to pass through with the nans turned into zeros, but you also want to signal to the neural network where the nans were, so that it can ignore those entries and pay attention to everything else.
In this question about adding a mask I review how a mask can successfully be added to an image, and I also give a code demonstration for a non-image problem.
The code in that masking question shows that the neural net learns well when the mask is added and poorly when it is not.
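A minimal sketch of that idea, assuming the features come in as a NumPy array X (the array and the concatenation layout are my own illustration, not the code from the linked question): zero-fill the nans and append a binary mask so the network can see which values were real.

```python
import numpy as np

# Hypothetical raw features with nan entries
X = np.array([[0.5, np.nan, 1.2],
              [np.nan, 0.3, 0.7]], dtype=np.float32)

# Binary mask: 1.0 where a value was present, 0.0 where it was nan
mask = (~np.isnan(X)).astype(np.float32)

# Zero-fill so the network never sees a nan (which would make the loss nan)
X_filled = np.nan_to_num(X)

# Feed features and mask together; the input layer now has twice as many
# columns, and the mask tells the network which entries to trust
X_with_mask = np.concatenate([X_filled, mask], axis=1)
```

With this layout, the network's first layer simply takes twice as many input columns; the rest of the training setup (sigmoid cross-entropy loss and Adam optimizer) stays the same.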