What is the difference between tf.float16 and tf.bfloat16 as listed in https://www.tensorflow.org/versions/r0.12/api_docs/python/framework/tensor_types ?
Also, what do they mean by "quantized integer"?
The bfloat16 (Brain Floating Point) floating-point format is a computer number format occupying 16 bits in computer memory; it represents a wide dynamic range of numeric values by using a floating radix point.
There are historical reasons why there is no 2-byte float among the standard primitive types: the format is called half-precision floating point in IEEE lingo, and implementations exist, just not as a C standard primitive (which C++ inherits by extension).
Bfloat16 is a custom 16-bit floating point format for machine learning that consists of one sign bit, eight exponent bits, and seven mantissa bits. This is different from the industry-standard IEEE 16-bit floating point format, which was not designed with deep learning applications in mind.
A naive attempt at converting a float32 bit pattern to IEEE float16 by masking and shifting, such as

fltInt16 = (fltInt32 & 0x007FFFFF) >> 13;
fltInt16 |= (fltInt32 & 0x7C000000) >> 13;
fltInt16 |= (fltInt32 & 0x80000000) >> 16;

is not correct: the exponent cannot simply be shifted into place, because float32 stores an 8-bit exponent with bias 127 while IEEE float16 stores a 5-bit exponent with bias 15.
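For contrast, here is a minimal C++ sketch of what a float32-to-IEEE-float16 conversion has to do even in the easy case; the function name is illustrative, the mantissa is truncated rather than rounded to nearest, and subnormals, infinities, and NaNs are not handled:

#include <cstdint>
#include <cstdio>
#include <cstring>

// Sketch only: convert a float32 to IEEE float16, assuming the value is a
// normal number that fits the float16 range. The exponent must be re-biased
// (127 -> 15) and the mantissa cut from 23 bits to 10.
static uint16_t float32_to_float16_normal_only(float f) {
    uint32_t bits;
    std::memcpy(&bits, &f, sizeof bits);                   // view the float's raw bits
    uint16_t sign     = (bits >> 16) & 0x8000;             // bit 31 -> bit 15
    int32_t  exponent = ((bits >> 23) & 0xFF) - 127 + 15;  // re-bias the exponent
    uint16_t mantissa = (bits >> 13) & 0x03FF;             // keep the top 10 mantissa bits
    return sign | (uint16_t)(exponent << 10) | mantissa;   // assumes 0 < exponent < 31
}

int main() {
    printf("%04x\n", float32_to_float16_normal_only(1.5f));  // prints 3e00
    return 0;
}

Even this simplified version needs a re-bias step, which is exactly the complexity that bfloat16 avoids.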
bfloat16 is a TensorFlow-specific format that is different from IEEE's own float16, hence the new name. The b stands for (Google) Brain.

Basically, bfloat16 is a float32 truncated to its first 16 bits. So it has the same 8 bits for exponent and only 7 bits for mantissa. It is therefore easy to convert to and from float32, and because it has essentially the same range as float32, it minimizes the risk of NaNs or exploding/vanishing gradients when switching from float32.
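To make the truncation concrete, here is a minimal C++ sketch of a bfloat16 round trip; the helper names are illustrative and this is not TensorFlow's actual implementation (which may round to nearest rather than truncate):

#include <cstdint>
#include <cstdio>
#include <cstring>

// Sketch only: a bfloat16 value is just the upper 16 bits of a float32's bit
// pattern, so conversion in both directions is a single shift.
static uint16_t float32_to_bfloat16(float f) {
    uint32_t bits;
    std::memcpy(&bits, &f, sizeof bits);
    return (uint16_t)(bits >> 16);        // keep sign, 8 exponent bits, top 7 mantissa bits
}

static float bfloat16_to_float32(uint16_t b) {
    uint32_t bits = (uint32_t)b << 16;    // the discarded mantissa bits come back as zeros
    float f;
    std::memcpy(&f, &bits, sizeof f);
    return f;
}

int main() {
    float x = 3.14159f;
    float y = bfloat16_to_float32(float32_to_bfloat16(x));
    printf("%f -> %f\n", x, y);           // same range as float32, only ~2-3 significant digits
    return 0;
}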
From the sources:
// Compact 16-bit encoding of floating point numbers. This representation uses
// 1 bit for the sign, 8 bits for the exponent and 7 bits for the mantissa. It
// is assumed that floats are in IEEE 754 format so the representation is just
// bits 16-31 of a single precision float.
//
// NOTE: The IEEE floating point standard defines a float16 format that
// is different than this format (it has fewer bits of exponent and more
// bits of mantissa). We don't use that format here because conversion
// to/from 32-bit floats is more complex for that format, and the
// conversion for this format is very simple.
As for quantized integers, they are designed to replace floating-point values in trained networks to speed up processing. Basically, they are a sort of fixed-point encoding of real numbers, albeit with an operating range chosen to represent the observed distribution at any given point of the net.

More on quantization can be found in the TensorFlow documentation.
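As a hypothetical illustration of the idea (not TensorFlow's actual quantization API), here is a short C++ sketch of 8-bit linear quantization, where the scale is derived from the value range observed at one point of the network:

#include <cstdint>
#include <cstdio>

// Sketch only: map real values in an observed range [min, max] linearly onto
// the integers 0..255, so cheap integer arithmetic can stand in for float math.
struct QuantParams {
    float min;  // smallest value observed at this point of the network
    float max;  // largest value observed at this point of the network
};

static uint8_t quantize(float x, QuantParams p) {
    float scale = (p.max - p.min) / 255.0f;
    int q = (int)((x - p.min) / scale + 0.5f);  // round to the nearest level
    if (q < 0) q = 0;                           // clamp values outside the range
    if (q > 255) q = 255;
    return (uint8_t)q;
}

static float dequantize(uint8_t q, QuantParams p) {
    float scale = (p.max - p.min) / 255.0f;
    return p.min + q * scale;
}

int main() {
    QuantParams p{-6.0f, 6.0f};  // e.g. the range observed for one layer's activations
    uint8_t q = quantize(1.2345f, p);
    printf("quantized: %u, dequantized: %f\n", (unsigned)q, dequantize(q, p));
    return 0;
}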