I found that we can optimize a TensorFlow model in several ways. Please correct me if I am mistaken.
1- Using TF-TRT. This API is developed by TensorFlow and integrates TensorRT into TensorFlow; it is imported as:
from tensorflow.python.compiler.tensorrt import trt_convert as trt
This API can be applied to any TensorFlow model (both new and old versions) without conversion errors, because if the API does not support some new layers, it simply does not consider those layers for TensorRT engines; they remain in the TensorFlow graph and run on TensorFlow. Right?
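Here is roughly what I mean, as a minimal sketch (TF 1.x API); the file name frozen_graph.pb and the output node logits are placeholders I made up, not real names:

import tensorflow as tf
from tensorflow.python.compiler.tensorrt import trt_convert as trt

# Load a frozen GraphDef (placeholder path).
with tf.io.gfile.GFile("frozen_graph.pb", "rb") as f:
    frozen_graph = tf.compat.v1.GraphDef()
    frozen_graph.ParseFromString(f.read())

converter = trt.TrtGraphConverter(
    input_graph_def=frozen_graph,
    nodes_blacklist=["logits"],  # output nodes, kept in TensorFlow
    precision_mode="FP16")
trt_graph = converter.convert()  # unsupported layers stay as TensorFlow ops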
2- Using TensorRT. This API is developed by NVIDIA and is independent of the TensorFlow library (not integrated into TensorFlow); it is imported as:
import tensorrt as trt
If we want to use this API, we must first convert the TensorFlow graph to UFF using the UFF converter and then parse the UFF graph with this API. In this case, if the TensorFlow graph has unsupported layers, we must write a plugin or custom code for those layers. Right?
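For reference, this is roughly the workflow I mean, assuming TensorRT 6/7 where the UFF parser still exists (it is deprecated in newer releases); all file names, node names, and the input shape are placeholders:

import uff
import tensorrt as trt

# Step 1: convert the frozen TensorFlow graph to UFF.
uff.from_tensorflow_frozen_model(
    "frozen_graph.pb",
    output_nodes=["logits"],
    output_filename="model.uff")

# Step 2: parse the UFF file and build a TensorRT engine.
logger = trt.Logger(trt.Logger.WARNING)
with trt.Builder(logger) as builder, \
        builder.create_network() as network, \
        trt.UffParser() as parser:
    parser.register_input("input", (3, 300, 300))  # CHW shape, placeholder
    parser.register_output("logits")
    parser.parse("model.uff", network)
    builder.max_workspace_size = 1 << 30
    # Unsupported layers make parsing/building fail unless a plugin is provided.
    engine = builder.build_cuda_engine(network)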
3- When we work with TensorFlow models, I don't understand why we would use the UFF converter and then TensorRT when we can use the TF-TRT API directly. Have you tested whether the models optimized by these two methods reach the same performance? What is the advantage of the UFF converter method?
I have some questions about the two cases above:
4- I converted ssd_mobilenet_v2 using both cases. In case 1 I achieved a slight improvement in speed, but in case 2 I achieved a larger improvement. Why? My opinion is that in case 1 the API only converts the precision (FP32 to FP16) and merges layers where possible, whereas in case 2 the graph is first cleaned by UFF (removing redundant nodes such as Asserts and Identity) and then converted to a TensorRT graph. Right?
5- When we convert trained model files (.ckpt, .meta, ...) to a frozen inference graph (a .pb file), aren't those redundant layers removed from the graph? Or are only the loss states, optimizer states, and so on removed?
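For reference, this is how I freeze the graph, a minimal TF 1.x sketch where the checkpoint prefix model.ckpt and the output node logits are placeholders:

import tensorflow as tf

with tf.compat.v1.Session() as sess:
    saver = tf.compat.v1.train.import_meta_graph("model.ckpt.meta")
    saver.restore(sess, "model.ckpt")
    # Keeps only nodes reachable from the listed outputs: loss and optimizer
    # ops that feed nothing downstream are dropped and variables become
    # constants, but Identity/Assert nodes that are still reachable remain.
    frozen = tf.compat.v1.graph_util.convert_variables_to_constants(
        sess, sess.graph_def, ["logits"])

with tf.io.gfile.GFile("frozen_graph.pb", "wb") as f:
    f.write(frozen.SerializeToString())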
TensorFlow-TensorRT (TF-TRT) is an integration of TensorFlow and TensorRT that leverages inference optimization on NVIDIA GPUs within the TensorFlow ecosystem. It provides a simple API that delivers substantial performance gains on NVIDIA GPUs with minimal effort.
TF-TRT is part of the TensorFlow binary, which means that when you install tensorflow-gpu, you can use TF-TRT too. For more information about TF-TRT, see High performance inference with TensorRT Integration.
The uff package contains a set of utilities to convert trained models from various frameworks to a common format.
Note that in TensorFlow 2.x, TF-TRT only supports models saved in the TensorFlow SavedModel format. When we then call the converter's convert() method, TF-TRT converts the graph by replacing the TensorRT-compatible portions with TRTEngineOps.
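For example, the TF 2.x workflow looks roughly like this sketch; the SavedModel directories are placeholders:

from tensorflow.python.compiler.tensorrt import trt_convert as trt

params = trt.DEFAULT_TRT_CONVERSION_PARAMS._replace(
    precision_mode=trt.TrtPrecisionMode.FP16)
converter = trt.TrtGraphConverterV2(
    input_saved_model_dir="saved_model_dir",
    conversion_params=params)
converter.convert()                # replaces compatible subgraphs with TRTEngineOps
converter.save("saved_model_trt")  # writes the optimized SavedModel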
Duplicate post with answers here: https://github.com/NVIDIA/TensorRT/issues/341