The NVIDIA TAO Toolkit simplification tool that allows for usage of NVIDIA pre-trained models or user defined ONNX to be retrained with custom data (transfer learning) to create Computer Vision (CV) or/ and Conversational AI (Conv AI) Models. With minimal training, TAO toolkit allows you to: (https://docs.nvidia.com/tao/tao-toolkit/text/overview.html)
=========Direct Quote from NVIDIA toolkit manual=========
NVIDIA TAO Toolkit allows for 3 types of models to be trained.
General purpose model architecture
NVIDIA optimized pre-trained models for (CV and Conv AI)
User Defined ONNX model
The general purpose model architecture allows for pre-trained weights with user specified backbone (feature extraction) models to be trained with specific custom data. This method allows for far faster model training as it does not involve randomized weight initialization, but trained weights, which can be used as the starting point for the models.
Key words:
Available model architectures in relation to the available backbones.

The NVIDIA Optimized Pre-trained models are trained models that can be deployed from the box, or retrained with transfer learning for specific custom dataset.https://docs.nvidia.com/tao/tao-toolkit/index.html
These models are not are general and can be task specific, but since they are optimized both in terms of hyper-parameters and model efficiency, it is used a lot in custom tasks.
When a model is made using Machine Learning frameworks such as PyTorch or TensorFlow, user may save their model in ONNX format.
The TAO toolkit allows the ONNX format models to be compatible with TAO functionalities.

Term Explanations
ONNX
ONNX stands for ⇒ Open Neural Network Exchange
It is an open format for trained Machine Learning (ML) Models that allows the model to be used in various ML frameworks and tools.
Transfer Learning
Develop Model Approach
Select Source Task
Develop Source Model
Reuse Model
For using the model as the staring point for a model on the second task, this process may involve using all or parts of the model, depending on the modelling technique used.
Tune Model
Optionally, the model may need to be adapted or refined on the input output pair data available for the task of interest
Pre-trained Model Approach
Select Source Model
Pre-trained models from many research institutions such as NVIDIA
Reuse Model.
Tune Model
When to use Transfer Learning
The transfer learning provides 3 possible benefits
