I haven't been able to find a clear statement of whether tensorflow uses automatic or symbolic differentiation.
I skimmed the tensorflow paper and they mention automatic gradients, but it is unclear if they just mean symbolic gradients, as they also mention that it has that capability.
TensorFlow provides the tf.GradientTape API for automatic differentiation; that is, computing the gradient of a computation with respect to some inputs, usually tf.Variables.
The gradients are the partial derivatives of the loss with respect to each of the variables. TensorFlow pairs each gradient with the variable it is the gradient of, as members of a tuple inside a list. Displaying the shapes of the gradients and variables, as in the sketch below, confirms that each gradient has the same shape as its variable.
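A minimal sketch of this, assuming TF 2.x eager execution (the variables, names, and shapes here are illustrative, not from the original):

```python
import tensorflow as tf

# Illustrative variables; the original text refers to a model with several variables
w = tf.Variable(tf.random.normal((3, 2)), name='w')
b = tf.Variable(tf.zeros(2), name='b')
x = tf.constant([[1.0, 2.0, 3.0]])

with tf.GradientTape() as tape:
    y = x @ w + b
    loss = tf.reduce_mean(y ** 2)

# tape.gradient returns one gradient per variable, in the same order
grads = tape.gradient(loss, [w, b])
for grad, var in zip(grads, [w, b]):
    print(var.name, var.shape, grad.shape)  # each gradient has its variable's shape
```

Each (grad, var) pair produced by zip is the tuple-inside-a-list structure mentioned above.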
TensorFlow calculates derivatives using automatic differentiation. This is different from both symbolic differentiation and numerical differentiation (a.k.a. finite differences). It is less a clever piece of mathematics than a clever programming technique.
TF uses automatic differentiation, and more specifically reverse-mode automatic differentiation.
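To see what reverse mode means, here is a minimal hand-rolled sketch in plain Python (everything here is illustrative; real frameworks build and traverse this graph for you):

```python
class Node:
    """A value in the computation, plus links to its inputs."""
    def __init__(self, value, parents=()):
        self.value = value
        self.parents = parents  # pairs of (parent_node, local_gradient)
        self.grad = 0.0

def mul(a, b):
    return Node(a.value * b.value, [(a, b.value), (b, a.value)])

def add(a, b):
    return Node(a.value + b.value, [(a, 1.0), (b, 1.0)])

def backward(node, upstream=1.0):
    # Reverse pass: push gradients from the output back toward the inputs
    node.grad += upstream
    for parent, local_grad in node.parents:
        backward(parent, upstream * local_grad)

x = Node(3.0)
y = add(mul(x, x), x)  # y = x^2 + x
backward(y)
print(x.grad)          # dy/dx = 2x + 1 = 7.0
```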
There are three popular methods for computing derivatives:
Numerical differentiation relies on the definition of the derivative: f'(x) ≈ (f(x + h) - f(x)) / h, where you plug in a very small h and evaluate the function at two points. This is the most basic formula; in practice, people use other formulas which give smaller estimation error (see the sketch below for one). This way of calculating a derivative is suitable mostly if you do not know your function and can only sample it. It also requires a lot of computation for a high-dimensional function.
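A minimal sketch in plain Python, using the central-difference variant (one of those formulas with smaller estimation error; the names are illustrative):

```python
def numerical_derivative(f, x, h=1e-6):
    # Central difference: truncation error O(h^2) vs O(h) for the one-sided formula
    return (f(x + h) - f(x - h)) / (2 * h)

print(numerical_derivative(lambda x: x ** 3, 2.0))  # ~12.0 (exact: 3 * 2**2 = 12)
```

Note that each partial derivative of an n-dimensional function needs its own pair of evaluations, which is where the cost for high-dimensional functions comes from.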
Symbolic differentiation manipulates mathematical expressions. If you have ever used MATLAB or Mathematica, you have seen it in action: you type in an expression and get back its derivative as another expression.
For every elementary expression the system knows the derivative, and it uses various rules (product rule, chain rule) to calculate the derivative of the whole. It then simplifies the end result to obtain the final expression.
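As an illustration in Python, SymPy does the same thing (assuming SymPy is installed; the expression is arbitrary):

```python
import sympy

x = sympy.symbols('x')
expr = x ** 2 * sympy.sin(x)

# Product rule plus the known derivatives of x**2 and sin(x)
print(sympy.diff(expr, x))  # x**2*cos(x) + 2*x*sin(x)
```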
Automatic differentiation manipulates blocks of computer programs. A differentiator has rules for taking the derivative of each element of a program (when you define any op in core TF, you need to register a gradient for that op). It also uses the chain rule to break complex expressions into simpler ones. A sketch of how this looks in a real TF program, with some explanation, follows.
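This sketch uses tf.custom_gradient, the TF 2.x API for attaching your own gradient rule to a function; log1pexp is the usual numerically-stable example, not anything from the original question:

```python
import tensorflow as tf

@tf.custom_gradient
def log1pexp(x):
    e = tf.exp(x)
    def grad(upstream):
        # Hand-registered derivative of log(1 + e^x): e^x / (1 + e^x)
        return upstream * (1 - 1 / (1 + e))
    return tf.math.log(1 + e), grad

x = tf.constant(2.0)
with tf.GradientTape() as tape:
    tape.watch(x)           # constants are not watched automatically
    y = log1pexp(x)
print(tape.gradient(y, x))  # sigmoid(2.0) ≈ 0.8808
```

The tape applies the chain rule through whatever ops were executed, calling each op's registered gradient along the way.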
You might think that automatic differentiation is the same as symbolic differentiation (in one place they operate on math expressions, in the other on computer programs). And yes, they are sometimes very similar. But for control-flow statements (`if`, `while`, loops) the results can be very different:
Symbolic differentiation leads to inefficient code (unless carefully done) and faces the difficulty of converting a computer program into a single expression.
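To make the control-flow point concrete, a sketch assuming TF 2.x eager execution (the function is illustrative): the tape records only the branch that actually executed, so the program never has to be folded into one closed-form expression:

```python
import tensorflow as tf

def piecewise(x):
    # Data-dependent control flow: awkward to express as a single symbolic formula
    if x > 0:
        return x ** 2
    return 3 * x

x = tf.Variable(-1.5)
with tf.GradientTape() as tape:
    y = piecewise(x)
print(tape.gradient(y, x))  # 3.0: the gradient of the branch that ran
```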