Getting Started With NVIDIA TAO

NVIDIA TAO gives you a low-code, open-source AI framework for accelerating your vision AI model development thatâ€™s suitable for all skill levelsâ€”from beginners to expert data scientists. Now, you can use the power and efficiency of transfer learning to achieve state-of-the-art accuracy and production-class throughput in record time with adaptation and optimization.

Download TAO

NVIDIA TAO Version 5.5: Whatâ€™s New

This latest release of NVIDIA TAO delivers several foundation and multi-modal models to accelerate your AI development. These models and features unlock new potential and unleash tremendous productivity gains in vision AI.

Explore new foundation and multi-modal models:

Grounding-DINOâ€”Open vocabulary object detection with fine-tuning
Mask-Grounding-DINOâ€”Open vocabulary instance segmentation with fine-tuning
NV-CLIPâ€”Foundation model for image and text embedding
BEVFusionâ€”Sensor fusion model combining image and lidar data for 3D understanding with fine-tuning
SEGICâ€”In-context segmentation on any object based on visual prompting
Foundation Poseâ€”Six DoF object pose estimation for any novel objects
Mask2Formerâ€”State-of-the-art instance and panoptic segmentation model with fine-tuning

Automatically create label datasets for object detection and segmentation using text prompts.
Knowledge distillationâ€”Create smaller efficient and accurate networks from distilling knowledge of larger networks.

Production-Ready Vision AI

NVIDIA TAO is also available as a part of NVIDIA AI Enterprise, an end-to-end, secure, cloud-native AI software platform optimized to accelerate enterprises to the leading edge of AI.

With enterprise-grade security, stability, manageability, and support, NVIDIA AI Enterprise speeds time to value while mitigating the potential risks of open-source software. This ensures business continuity and a reliable platform for running mission-critical AI applications.

Benefits of using TAO with NVIDIA AI Enterprise include:

Access to exclusive foundation models for vision AI that can be fine-tuned for custom vision AI tasks
Validation and integration for NVIDIA AI open-source software
Access to AI solution workflows to speed time to production
Certifications to deploy AI everywhere
Enterprise-grade support, security, manageability, and API stability to mitigate potential risks of open source software

Learn More

Helpful Resources

New Foundational Models and Training Capabilities

Learn the groundbreaking features in TAO 5.5., including multimodal sensor fusion models, auto-labeling with text prompts, open-vocabulary detection, and more.

Read the Blog

Prompt-Based Auto Labeling

Learn how to use the prompt-based auto-labeling tool for object detection and segmentation to significantly reduce your time creating labeled dataset.

Watch the Video

AI Training With Multi-Modal Foundation Models

Learn how to create purpose-built AI using vision and language with multi-modal foundation models.

Watch the GTC Talk

Developer Starter Resources

Developer Blogs

Train Like an AI Pro Using the New AutoML Feature in TAO
Create Custom AI Models With TAO in Azure ML
Introductory TAO Whitepaper
Developing and Deploying AI-Powered Robots With NVIDIA Isaac Sim and NVIDIA TAO
Customize Action Recognition with TAO and Deploy with DeepStream
Training and Optimizing a 2D Pose Estimation Model with the NVIDIA TAO Toolkit, Part 1 | Part 2
Preparing State-of-the-Art Models for Classification and Object Detection with the NVIDIA TAO Toolkit

Training Notebooks & Containers

For vision AI training, access the collection of Jupyter Notebooks and training specs
For the computer vision models and container collection, Download From NGC
To try TAO on Google Colab:

Object Detection
Image Classification
Action Recognition

Sample Deployment Applications

To deploy TAO models using NVIDIA Triton?, go to TAO Triton repo
To deploy TAO models using NVIDIA DeepStream, check out following apps:

Deploy YoloV3/V4, EfficientDet, UNET, etc.
Deploy License Plate Detection and Recognition
Deploy 2D Body Pose Estimation, Gaze Estimation, Facial Landmark, HeartRateNet, Gesture, and Emotion

To convert TAO model (etlt) to an NVIDIA TensorRT? engine for deployment with DeepStream, select the appropriate TAO converter for your hardware and software stack.

Featured Video Tutorial

Additional Resources

Blogs & Tutorials

Transforming Industrial Defect Detection With NVIDIA TAO and Vision AI Models
Customizing AI Models: Train and Deploy Character Detection and Recognition Models With NVIDIA TAO and Triton Part 1 | Part 2 (New)
Improve Accuracy and Robustness of Vision AI Apps With Vision Transformers and NVIDIA TAO
Access the Latest in Vision AI Model Development Workflows With NVIDIA TAO Toolkit 5.0
Train Like an AI Pro Using the New AutoML Feature in TAO
Create Custom AI Models With TAO in Azure ML

Read More Developer Blogs

Featured Webinars

GTC 2024 Create Purpose-Built AI Using Vision and Language With Multi-Modal Foundation Models
Accelerate AI Model Creation Using AutoML in NVIDIA TAO 4.0
GTC 2023: AI Model Made Simple Using TAO
GTC 2023: Running TAO Toolkit API in NetsPresso for Effortless Vision AI Model Development and Optimization
GTC 2023: Solving Computer Vision Grand Challenges in One-Click
Top 5 Reasons to Use TAO Toolkit

Partner Resources

Data generation and labeling partners:

MLOps partner:

Success Stories

Rocketboots optimizes workforce management with vision AI
SenSen fastracks AI model development for truck management
Nota uses vision AI to make roadways safer
Onecup AI brings AI to automated precision ranching

Read More Solutions Using TAO

Product Support

TAO Forum

NVIDIA AI Enterprise

Ethical AI

NVIDIA platforms and application frameworks enable developers to build a wide array of AI applications. Consider potential algorithmic bias when choosing or creating the models being deployed. Also, work with the modelâ€™s developer to ensure that it meets the requirements for the relevant industry and use case; that the necessary instruction and documentation are provided to understand error rates, confidence intervals, and results; and that the model is being used under the conditions and in the manner intended.