deep learning – NVIDIA Technical Blog

deep learning – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-03-21T20:30:26Z http://www.open-lab.net/blog/feed/ Szymon Karpi��ski <![CDATA[Fusing Epilog Operations with Matrix Multiplication Using nvmath-python]]> http://www.open-lab.net/blog/?p=92098 2024-11-21T21:07:24Z 2024-11-18T18:30:00Z

nvmath-python (Beta) is an open-source Python library, providing Python programmers with access to high-performance mathematical operations from NVIDIA CUDA-X...]]>

nvmath-python (Beta) is an open-source Python library, providing Python programmers with access to high-performance mathematical operations from NVIDIA CUDA-X... Code showing how to use epilogs with matrix multiplication in nvmath-python.

Code showing how to use epilogs with matrix multiplication in nvmath-python.

nvmath-python (Beta) is an open-source Python library, providing Python programmers with access to high-performance mathematical operations from NVIDIA CUDA-X math libraries. nvmath-python provides both low-level bindings to the underlying libraries and higher-level Pythonic abstractions. It is interoperable with existing Python packages, such as PyTorch and CuPy. In this post, I show how to��

]]> 1 Babak Hejazi <![CDATA[Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates]]> http://www.open-lab.net/blog/?p=83888 2024-07-16T17:19:07Z 2024-06-12T20:30:00Z

The latest release of NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...]]>

The latest release of NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...

cublas-compilation

The latest release of NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance computing (HPC) workloads. This post provides an overview of the following updates on cuBLAS matrix multiplications (matmuls) since version 12.0, and a walkthrough: Grouped GEMM APIs can be viewed as a generalization of the batched��

]]> 0 Matthew Nicely <![CDATA[Accelerating Transformers with NVIDIA cuDNN 9]]> http://www.open-lab.net/blog/?p=82592 2024-05-30T19:55:46Z 2024-05-24T16:00:00Z

The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library for accelerating deep learning primitives with state-of-the-art performance....]]>

The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library for accelerating deep learning primitives with state-of-the-art performance.... Decorative image of cuDNN attention.

Decorative image of cuDNN attention.

The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library for accelerating deep learning primitives with state-of-the-art performance. cuDNN is integrated with popular deep learning frameworks like PyTorch, TensorFlow, and XLA (Accelerated Linear Algebra). These frameworks abstract the complexities of direct GPU programming, enabling you to focus on designing and��

]]> 1 Tiffany Yeung <![CDATA[Explainer: What Is a Convolutional Neural Network?]]> http://www.open-lab.net/blog/?p=75991 2024-06-05T22:20:53Z 2024-04-12T19:00:00Z

A convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images.]]>

A convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images. Example of CNN for brain scan results.

Example of CNN for brain scan results.

A convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images.

]]> 0 Rishi Puri <![CDATA[Release: PyTorch Geometric Container for GNNs on NGC]]> http://www.open-lab.net/blog/?p=76597 2024-06-06T16:17:50Z 2024-01-17T23:05:40Z

The NVIDIA PyG container, now generally available, packages PyTorch Geometric with accelerations for GNN models, dataloading, and pre-processing using...]]>

The NVIDIA PyG container, now generally available, packages PyTorch Geometric with accelerations for GNN models, dataloading, and pre-processing using... PyG and Accelerated with NVIDIA logos on a white background.

PyG and Accelerated with NVIDIA logos on a white background.

The NVIDIA PyG container, now generally available, packages PyTorch Geometric with accelerations for GNN models, dataloading, and pre-processing using cuGraph-Ops, cuGraph, and cuDF from NVIDIA RAPIDS, all with an effortless out-of-the-box experience.

]]> 0 Michelle Horton <![CDATA[New Release: NVIDIA TAO 5.2]]> http://www.open-lab.net/blog/?p=75832 2023-12-20T21:39:11Z 2023-12-20T19:03:54Z

With the latest NVIDIA TAO 5.2, you can now run zero-shot inference for panoptic segmentation with ODISE, create custom 3D object pose models, and boost...]]>

With the latest NVIDIA TAO 5.2, you can now run zero-shot inference for panoptic segmentation with ODISE, create custom 3D object pose models, and boost... Decorative image.

Decorative image.

With the latest NVIDIA TAO 5.2, you can now run zero-shot inference for panoptic segmentation with ODISE, create custom 3D object pose models, and boost inference throughput for vision transformers using FasterViT. Download now.

]]> 0 Michelle Horton <![CDATA[New Course: Introduction to Transformer-Based Natural Language Processing]]> http://www.open-lab.net/blog/?p=74272 2023-12-14T19:27:35Z 2023-11-29T18:53:10Z

Learn how transformers are used as the building blocks of modern large language models in this new self-paced course.]]>

Learn how transformers are used as the building blocks of modern large language models in this new self-paced course.

Stylized image of a person interacting with an input screen that provides data to a glowing cube, which in turn provides an output screen.

Learn how transformers are used as the building blocks of modern large language models in this new self-paced course.

]]> 0 Tanya Lenz <![CDATA[New Workshop: Rapid Application Development Using Large Language Models]]> http://www.open-lab.net/blog/?p=72570 2023-11-16T19:36:02Z 2023-11-08T21:30:00Z

Interested in developing LLM-based applications? Get started with this exploration of the open-source ecosystem.]]>

Interested in developing LLM-based applications? Get started with this exploration of the open-source ecosystem.

rapid-application-development-graphic

Interested in developing LLM-based applications? Get started with this exploration of the open-source ecosystem.

]]> 0 Tanya Lenz <![CDATA[New Workshop: Generative AI with Diffusion Models]]> http://www.open-lab.net/blog/?p=72317 2023-11-02T19:19:18Z 2023-10-31T20:00:00Z

Take a deep dive into denoising diffusion models, from building a U-Net to training a text-to-image model.]]>

Take a deep dive into denoising diffusion models, from building a U-Net to training a text-to-image model. Toy Jensen in a lab.

Toy Jensen in a lab.

Take a deep dive into denoising diffusion models, from building a U-Net to training a text-to-image model.

]]> 0 Tanya Lenz <![CDATA[Webinar: Fast Track AI to the Edge with NVIDIA TAO and Edge Impulse]]> http://www.open-lab.net/blog/?p=71788 2023-11-02T18:14:37Z 2023-10-18T19:01:00Z

Discover the power of integrating NVIDIA TAO and Edge Impulse to accelerate AI deployment at the edge.]]>

Discover the power of integrating NVIDIA TAO and Edge Impulse to accelerate AI deployment at the edge.

tao-graphic

Discover the power of integrating NVIDIA TAO and Edge Impulse to accelerate AI deployment at the edge.

]]> 0 Tianna Nguy <![CDATA[New Self-Paced Course: RAPIDS Accelerator for Apache Spark]]> http://www.open-lab.net/blog/?p=71620 2023-11-02T18:14:38Z 2023-10-18T17:52:59Z

Dive into the RAPIDS Accelerator for Apache Spark toolset, including the workload qualification tool for estimating speedup on GPU and the profiling tool for...]]>

Dive into the RAPIDS Accelerator for Apache Spark toolset, including the workload qualification tool for estimating speedup on GPU and the profiling tool for... Photo at a skewed angle of person looking at a monitor that has graphics on it, against a grey background.

Photo at a skewed angle of person looking at a monitor that has graphics on it, against a grey background.

Dive into the RAPIDS Accelerator for Apache Spark toolset, including the workload qualification tool for estimating speedup on GPU and the profiling tool for tuning jobs.

]]> 4 Tanya Lenz <![CDATA[Workshop: Model Parallelism: Building and Deploying Large Neural Networks]]> http://www.open-lab.net/blog/?p=71572 2024-08-28T17:34:21Z 2023-10-12T17:00:00Z

Learn how to train the largest neural networks and deploy them to production.]]>

Learn how to train the largest neural networks and deploy them to production.

nvt-model-parallelism-graphic

Learn how to train the largest neural networks and deploy them to production.

]]> 0 Tanya Lenz <![CDATA[Event: NVIDIA Computer Vision Speaker Series]]> http://www.open-lab.net/blog/?p=71382 2023-11-02T18:14:46Z 2023-10-03T16:00:00Z

Discover how PepsiCo, Runway, SoftServe, and AWS used GPU-accelerated SDKs for their CV applications.]]>

Discover how PepsiCo, Runway, SoftServe, and AWS used GPU-accelerated SDKs for their CV applications.

computer-vision-speaker-series-graphic

Discover how PepsiCo, Runway, SoftServe, and AWS used GPU-accelerated SDKs for their CV applications.

]]> 0 Haggai Maron <![CDATA[Designing Deep Networks to Process Other Deep Networks]]> http://www.open-lab.net/blog/?p=68489 2023-08-24T18:03:38Z 2023-08-17T17:28:37Z

Deep neural networks (DNNs) are the go-to model for learning functions from data, such as image classifiers or language models. In recent years, deep models...]]>

Deep neural networks (DNNs) are the go-to model for learning functions from data, such as image classifiers or language models. In recent years, deep models...

network-graphic

Deep neural networks (DNNs) are the go-to model for learning functions from data, such as image classifiers or language models. In recent years, deep models have become popular for representing the data samples themselves. For example, a deep model can be trained to represent an image, a 3D object, or a scene, an approach called Implicit Neural Representations. (See also Neural Radiance Fields and��

]]> 1 Chintan Shah <![CDATA[Customizing AI Models: Train Character Detection and Recognition Models with NVIDIA TAO]]> http://www.open-lab.net/blog/?p=68713 2023-08-24T18:03:40Z 2023-08-15T15:00:00Z

Optical Character Detection (OCD) and Optical Character Recognition (OCR) are computer vision techniques used to extract text from images. Use cases vary across...]]>

Optical Character Detection (OCD) and Optical Character Recognition (OCR) are computer vision techniques used to extract text from images. Use cases vary across...

ocd-ocr-detail

Optical Character Detection (OCD) and Optical Character Recognition (OCR) are computer vision techniques used to extract text from images. Use cases vary across industries and include extracting data from scanned documents or forms with handwritten texts, automatically recognizing license plates, sorting boxes or objects in a fulfillment center based on serial numbers��

]]> 0 Chintan Shah <![CDATA[Customizing AI Models: Deploy a Character Detection and Recognition Model with NVIDIA Triton]]> http://www.open-lab.net/blog/?p=69017 2023-08-24T18:03:39Z 2023-08-15T15:00:00Z

NVIDIA Triton Inference Server streamlines and standardizes AI inference by enabling teams to deploy, run, and scale trained ML or DL models from any framework...]]>

NVIDIA Triton Inference Server streamlines and standardizes AI inference by enabling teams to deploy, run, and scale trained ML or DL models from any framework...

ocd-ocr-detail

NVIDIA Triton Inference Server streamlines and standardizes AI inference by enabling teams to deploy, run, and scale trained ML or DL models from any framework on any GPU- or CPU-based infrastructure. It helps developers deliver high-performance inference across cloud, on-premises, edge, and embedded devices. The nvOCDR library is integrated into Triton for inference.

]]> 0 Lee Yang <![CDATA[Distributed Deep Learning Made Easy with Spark 3.4]]> http://www.open-lab.net/blog/?p=66415 2024-06-06T16:23:05Z 2023-06-12T20:30:00Z

Apache Spark is an industry-leading platform for distributed extract, transform, and load (ETL) workloads on large-scale data. However, with the advent of deep...]]>

Apache Spark is an industry-leading platform for distributed extract, transform, and load (ETL) workloads on large-scale data. However, with the advent of deep... Deep learning abstract.

Deep learning abstract.

Apache Spark is an industry-leading platform for distributed extract, transform, and load (ETL) workloads on large-scale data. However, with the advent of deep learning (DL), many Spark practitioners have sought to add DL models to their data processing pipelines across a variety of use cases like sales predictions, content recommendations, sentiment analysis, and fraud detection. Yet��

]]> 0 Michelle Horton <![CDATA[Webinar: Accelerate AI Model Inference at Scale for Financial Services]]> http://www.open-lab.net/blog/?p=66045 2023-12-05T18:57:33Z 2023-06-01T17:04:32Z

Learn how AI is transforming financial services across use cases such as fraud detection, risk prediction models, contact centers, and more. ]]>

Learn how AI is transforming financial services across use cases such as fraud detection, risk prediction models, contact centers, and more. Illustration of a character sitting at a computer with a warning popup.

Illustration of a character sitting at a computer with a warning popup.

Learn how AI is transforming financial services across use cases such as fraud detection, risk prediction models, contact centers, and more.

]]> 0 Michelle Horton <![CDATA[Model Parallelism Virtual Workshop]]> http://www.open-lab.net/blog/?p=63251 2023-04-20T19:04:15Z 2023-04-19T21:01:02Z

Learn to build and deploy large neural networks to production with this virtual workshop on May 3 from the NVIDIA Deep Learning Institute.]]>

Learn to build and deploy large neural networks to production with this virtual workshop on May 3 from the NVIDIA Deep Learning Institute.

Workshop promo card with an abstract illustration of a neural network.

Learn to build and deploy large neural networks to production with this virtual workshop on May 3 from the NVIDIA Deep Learning Institute.

]]> 0 Shashank Gaur <![CDATA[Topic Modeling and Image Classification with Dataiku and NVIDIA Data Science]]> http://www.open-lab.net/blog/?p=62857 2023-11-03T07:15:04Z 2023-04-04T18:30:00Z

The Dataiku platform for everyday AI simplifies deep learning. Use cases are far-reaching, from image classification to object detection and natural language...]]>

The Dataiku platform for everyday AI simplifies deep learning. Use cases are far-reaching, from image classification to object detection and natural language...

Twitter topic model Dataiku diagram

The Dataiku platform for everyday AI simplifies deep learning. Use cases are far-reaching, from image classification to object detection and natural language processing (NLP). Dataiku helps you with labeling, model training, explainability, model deployment, and centralized management of code and code environments. This post dives into high-level Dataiku and NVIDIA integrations for image��

]]> 0 ��˳��97caoporen��