INT8 – NVIDIA Technical Blog

News and tutorials for developers, data scientists, and IT admins 2025-03-27T16:00:00Z http://www.open-lab.net/blog/feed/

Neta Zmora: "Achieving FP32 Accuracy for INT8 Inference Using Quantization Aware Training with NVIDIA TensorRT" http://www.open-lab.net/blog/?p=34216 2023-06-12T21:09:34Z 2021-07-20T13:00:00Z

Join the NVIDIA Triton and NVIDIA TensorRT community to stay current on the latest product updates, bug fixes, content, best practices, and more. Deep learning is revolutionizing the way that industries are delivering products and services. These services include object detection, classification, and segmentation for computer vision, and text extraction, classification...

Dave Salvator: "Int4 Precision for AI Inference" http://www.open-lab.net/blog/?p=15821 2023-02-13T17:33:48Z 2019-11-06T18:00:57Z

INT4 Precision Can Bring an Additional 59% Speedup Compared to INT8. If there's one constant in AI and deep learning, it's never-ending optimization to wring...

If there's one constant in AI and deep learning, it's never-ending optimization to wring every possible bit of performance out of a given platform. Many inference applications benefit from reduced precision, whether it's mixed precision for recurrent neural networks (RNNs) or INT8 for convolutional neural networks (CNNs), where applications can get 3x+ speedups. NVIDIA's Turing architecture...

Dave Salvator: "MLPerf Inference: NVIDIA Innovations Bring Leading Performance" http://www.open-lab.net/blog/?p=15851 2023-07-05T19:38:49Z 2019-11-06T18:00:22Z

New TensorRT 6 Features Combine with Open-Source Plugins to Further Accelerate Inference. Inference is where AI goes to work. Identifying diseases. Answering...

Inference is where AI goes to work. Identifying diseases. Answering questions. Recommending products and services. The inference market is also diffuse: inference will happen everywhere from the data center to the edge to IoT devices, across multiple use cases including image, speech, and recommender systems, to name a few. As a result, creating a benchmark to measure the performance of these diverse platforms...

Gary Burnett: "Object Detection on GPUs in 10 Minutes" http://www.open-lab.net/blog/?p=15047 2022-08-21T23:39:32Z 2019-06-26T19:00:39Z

Object detection remains the primary driver for applications such as autonomous driving and intelligent video analytics. Object detection applications require substantial training using vast datasets to achieve high levels of accuracy. NVIDIA GPUs excel at the parallel compute performance required to train large networks in order to generate datasets for object detection inference.

Valerie Sarge: "Tips for Optimizing GPU Performance Using Tensor Cores" http://www.open-lab.net/blog/?p=14687 2023-07-27T20:01:41Z 2019-06-10T13:00:06Z

Our most popular question is "What can I do to get great GPU performance for deep learning?" We've recently published a detailed Deep Learning Performance Guide to help answer this question. The guide explains how GPUs process data and gives tips on how to design networks for better performance. We also take a close look at Tensor Core optimization to help improve performance. This post takes a...
