how tensorflow handles complex gradient?

Tags:

Let z is a complex variable, C(z) is its conjugation. In complex analysis theory, the derivative of C(z) w.r.t z don't exist. But in tesnsorflow, we can calculate dC(z)/dz and the result is just 1. Here is an example:

x = tf.placeholder('complex64',(2,2))
y = tf.reduce_sum(tf.conj(x))
z = tf.gradients(y,x)
sess = tf.Session()
X = np.random.rand(2,2)+1.j*np.random.rand(2,2)
X = X.astype('complex64')
Z = sess.run(z,{x:X})[0]

The input X is

[[0.17014372+0.71475762j  0.57455420+0.00144318j]
 [0.57871044+0.61303568j  0.48074263+0.7623235j ]]

and the result Z is

[[1.-0.j  1.-0.j]
 [1.-0.j  1.-0.j]]

I don't understand why the gradient is set to be 1? And I want to know how tensorflow handles the complex gradients in general.

248

asked Feb 27 '17 06:02

zhd.zhang

1 Answers

How?

The equation used by Tensorflow for the gradient is:

$\nabla_z f = \left( \frac{\partial f}{\partial z} + \frac{\partial f*}{\partial z} \right)*=2\frac{\partial Real(f)}{\partial z*}$

Where the '*' means conjugate.

When using the definition of the partial derivatives wrt z and z* it uses Wirtinger Calculus. Wirtinger calculus enables to calculate the derivative wrt a complex variable for non-holomorphic functions. The Wirtinger definition is:

$\frac{\partial f}{\partial z} = \frac{1}{2}\left( \frac{\partial f}{\partial x} - j \frac{\partial f}{\partial y} \right)$

Why this definition?

When using for example Complex-Valued Neural Networks (CVNN) the gradients will be used over non-holomorphic, real-valued scalar function of one or several complex variables, tensorflow definition of a gradient can then be written as:

$2\frac{\partial f}{\partial z*} = \left( \frac{\partial f}{\partial x} + j \frac{\partial f}{\partial y} \right)$

This definition corresponds with the literature of CVNN like for example chapter 4 section 4.3 of this book or Amin et al. (between countless examples).

answered Nov 01 '22 08:11

Agustin Barrachina

Related questions
                            
                                TensorFlow placement algorithm
                            
                                Write Custom Python-Based Gradient Function for an Operation? (without C++ Implementation)
                            
                                Does the Inception Model have two softmax outputs?
                            
                                Tensorflow with gpu support installation error - the specified --crosstool_top is not a valid cc_toolchain_suite rule
                            
                                Oscillating accuracy of CNN training with Tensor Flow for MNIST handwritten digits
                            
                                open tensorflow graph from file
                            
                                tensorflow ValueError: Shape must be rank 1 but is rank 2
                            
                                Is there a way to efficiently vectorize Tensorflow ops on images?
                            
                                How to use Tensorflow's PTB model example?
                            
                                ConcatOp : Dimensions of inputs should match
                            
                                dynamic_partition with dynamic num_partitions
                            
                                Where does tensorflow log error?
                            
                                Tensorflow graph editor reroute complex network
                            
                                Slice multiple slices at once with tensorflow
                            
                                The order of pooling and normalization layer in convnet
                            
                                Adding matrices with different dimensions
                            
                                How can I use TensorFlow without CUDA on Linux?
                            
                                Tensorflow Type Error: Value passed to parameter 'shape' has DataType float32 not in list of allowed values: int32, int64
                            
                                TensorFlow 1.0 does not see GPU on Windows (but Theano does)
                            
                                C++ Eigen: dynamic tensor

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how tensorflow handles complex gradient?

Tags:

tensorflow

autodiff