TensorBoard doesn't show all data points

Tags:

I was running a very long training (reinforcement learning with 20M steps) and writing summary every 10k steps. In between step 4M and 6M, I saw 2 peaks in my TensorBoard scalar chart for game score, then I let it run and went to sleep. In the morning, it was running at about step 12M, but the peaks between step 4M and 6M that I saw earlier disappeared from the chart. I tried to zoom in and found out that TensorBoard (randomly?) skipped some of the data points. I also tried to export the data but some data point including the peaks are also missing in the exported .csv.

I looked for answers and found this in TensorFlow github page:

TensorBoard uses reservoir sampling to downsample your data so that it can be loaded into RAM. You can modify the number of elements it will keep per tag in tensorboard/backend/server.py.

Has anyone ever modified this server.py file? Where can I find the file and if I installed TensorFlow from source, do I have to recompile it after I modified the file?

861

asked Apr 30 '17 03:04

Kerawit Somchaipeng

1 Answers

You don't have to change the source code for this, there is a flag called --samples_per_plugin.

Quoting from the help command

--samples_per_plugin: An optional comma separated list of plugin_name=num_samples pairs to explicitly specify how many samples to keep per tag for that plugin. For unspecified plugins, TensorBoard randomly downsamples logged summaries to reasonable values to prevent out-of-memory errors for long running jobs. This flag allows fine control over that downsampling. Note that 0 means keep all samples of that type. For instance, "scalars=500,images=0" keeps 500 scalars and all images. Most users should not need to set this flag. (default: '')

So if you want to have a slider of 100 images, use:

tensorboard --samples_per_plugin images=100

189

answered Oct 16 '22 07:10

Phúc Lê

Related questions
                            
                                Replace nan values in tensorflow tensor
                            
                                Save and load model optimizer state
                            
                                How training and test data is split - Keras on Tensorflow
                            
                                Save Tensorflow graph for viewing in Tensorboard without summary operations
                            
                                What does this tensorflow message mean? Any side effect? Was the installation successful?
                            
                                ValueError: Duplicate plugins for name projector
                            
                                Converting from Pandas dataframe to TensorFlow tensor object
                            
                                Should I use @tf.function for all functions?
                            
                                When global_variables_initializer() is actually required
                            
                                What is the TensorFlow checkpoint meta file?
                            
                                Machine Learning (tensorflow / sklearn) in Django?
                            
                                ValueError: Output tensors to a Model must be the output of a TensorFlow `Layer`
                            
                                TensorFlow Variables and Constants
                            
                                In TensorFlow, what is the argument 'axis' in the function 'tf.one_hot'
                            
                                tensorflow: what's the difference between tf.nn.dropout and tf.layers.dropout
                            
                                What does the function control_dependencies do?
                            
                                How to perform k-fold cross validation with tensorflow?
                            
                                Output from TensorFlow `py_func` has unknown rank/shape
                            
                                Tensorflow: How does tf.get_variable work?
                            
                                Issue feeding a list into feed_dict in TensorFlow

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

TensorBoard doesn't show all data points

Tags:

tensorflow

tensorboard

Kerawit Somchaipeng

People also ask

1 Answers

Phúc Lê

Recent Activity

Donate For Us