Computer Vision / Video Analytics – NVIDIA Technical Blog

News and tutorials for developers, data scientists, and IT admins 2025-04-29T19:05:40Z http://www.open-lab.net/blog/feed/

Davide Paglieri <![CDATA[Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM]]> http://www.open-lab.net/blog/?p=99202 2025-04-29T19:05:40Z 2025-04-24T17:00:00Z

This is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM...

]]>

Elias Wolfberg <![CDATA[AI-Generated Heat Maps Keep Seniors and their Privacy Safe]]> http://www.open-lab.net/blog/?p=98891 2025-04-17T19:35:21Z 2025-04-16T20:00:10Z

By 2030, more than one in five Americans will be 65 or older, becoming the United States' largest group of seniors ever. Silicon Valley-based startup Butlr...

]]>

Michelle Horton <![CDATA[AI Advances Parkinson's Detection Using Standard MRI Scans]]> http://www.open-lab.net/blog/?p=98636 2025-04-17T19:35:29Z 2025-04-11T16:58:59Z

A simple brain scan may soon be all that's needed to accurately diagnose Parkinson's disease, thanks to a new AI-powered tool. The advancement could help...

]]>

Anu Srivastava <![CDATA[NVIDIA Accelerates Inference on Meta Llama 4 Scout and Maverick]]> http://www.open-lab.net/blog/?p=98468 2025-04-22T23:57:03Z 2025-04-06T02:18:34Z

The newest generation of the popular Llama AI models is here with Llama 4 Scout and Llama 4 Maverick. Accelerated by NVIDIA open-source software, they can...

]]>

Ashley Goldstein <![CDATA[Simulating Robots in Industrial Facility Digital Twins]]> http://www.open-lab.net/blog/?p=98201 2025-04-23T00:00:10Z 2025-03-31T16:00:00Z

Industrial enterprises are embracing physical AI and autonomous systems to transform their operations. This involves deploying heterogeneous robot fleets that...

]]>

Shubham Agrawal <![CDATA[Build Real-Time Multimodal XR Apps with NVIDIA AI Blueprint for Video Search and Summarization]]> http://www.open-lab.net/blog/?p=96842 2025-03-12T22:08:59Z 2025-03-11T17:30:00Z

With the recent advancements in generative AI and vision foundational models, VLMs present a new wave of visual computing wherein the models are capable of...

]]>

Elias Wolfberg <![CDATA[AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale]]> http://www.open-lab.net/blog/?p=96671 2025-03-06T19:26:37Z 2025-03-03T17:48:01Z

In an effort to rein in illicit fishing, researchers have unveiled a new open-source AI model that can accurately identify what virtually all of the world's...

]]>

Anu Srivastava <![CDATA[Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs]]> http://www.open-lab.net/blog/?p=96519 2025-04-23T02:39:30Z 2025-02-26T22:05:00Z

Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...

]]>

Shubham Agrawal <![CDATA[Vision Language Model Prompt Engineering Guide for Image and Video Understanding]]> http://www.open-lab.net/blog/?p=96229 2025-04-23T02:38:32Z 2025-02-26T16:25:34Z

Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...

]]>

Vishesh Lokras <![CDATA[NVIDIA Video Codec SDK 13.0 Powered by NVIDIA Blackwell]]> http://www.open-lab.net/blog/?p=96377 2025-04-23T02:35:08Z 2025-02-24T22:55:30Z

The release of NVIDIA Video Codec SDK 13.0 marks a significant upgrade, adding support for the latest-generation NVIDIA Blackwell GPUs. This version brings a...

]]>

Ravi Chaudhary <![CDATA[Enabling Stereoscopic and 3D Views Using MV-HEVC in NVIDIA Video Codec SDK 13.0]]> http://www.open-lab.net/blog/?p=96366 2025-04-23T02:42:31Z 2025-02-24T22:32:34Z

NVIDIA announces the implementation of Multi-View High Efficiency Video Coding (MV-HEVC) encoder in the latest NVIDIA Video Codec SDK release, version 13.0....

]]>

Michelle Horton <![CDATA[AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025]]> http://www.open-lab.net/blog/?p=95520 2025-04-23T02:43:07Z 2025-02-20T17:44:00Z

From mitigating climate change to improving disaster response and environmental monitoring, AI is reshaping how we tackle critical global challenges....

]]>

Joanne Chang <![CDATA[Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025]]> http://www.open-lab.net/blog/?p=96193 2025-02-20T15:50:53Z 2025-02-20T17:00:00Z

Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.

]]>

Joanne Chang <![CDATA[Upcoming Webinar: Unlocking Video Analytics With AI Agents]]> http://www.open-lab.net/blog/?p=96135 2025-02-20T15:52:55Z 2025-02-13T22:05:57Z

Master prompt engineering, fine-tuning, and customization to build video analytics AI agents.

]]>

Pranav Marathe <![CDATA[Just Released: Tripy, a Python Programming Model For TensorRT]]> http://www.open-lab.net/blog/?p=95947 2025-02-10T17:08:43Z 2025-02-10T17:08:40Z

Experience high-performance inference, usability, intuitive APIs, easy debugging with eager mode, clear error messages, and more.

]]>

Brad Nemire <![CDATA[Featured Researcher and Educator Sessions at NVIDIA GTC 2025]]> http://www.open-lab.net/blog/?p=95817 2025-02-06T19:33:45Z 2025-02-05T23:03:06Z

Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.

]]>

Elias Wolfberg <![CDATA[New AI Model Offers Cellular-Level View of Cancerous Tumors]]> http://www.open-lab.net/blog/?p=95758 2025-04-23T02:48:10Z 2025-02-04T22:33:00Z

Researchers studying cancer unveiled a new AI model that provides cellular-level mapping and visualizations of cancer cells, which scientists hope can shed...

]]>

Michelle Horton <![CDATA[AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment]]> http://www.open-lab.net/blog/?p=95722 2025-04-23T02:48:13Z 2025-02-04T17:16:54Z

A new study and AI model from researchers at Stanford University is streamlining cancer diagnostics, treatment planning, and prognosis prediction. Named MUSK...

]]>

Michelle Horton <![CDATA[Advancing Rare Disease Detection with AI-Powered Cellular Profiling]]> http://www.open-lab.net/blog/?p=95498 2025-04-23T15:01:14Z 2025-01-29T20:45:46Z

Rare diseases are difficult to diagnose due to limitations in traditional genomic sequencing. Wolfgang Pernice, assistant professor at Columbia University, is...

]]>

Michelle Horton <![CDATA[Spinal Health Diagnostics Gets Deep Learning Automation]]> http://www.open-lab.net/blog/?p=95243 2025-04-23T15:02:48Z 2025-01-22T17:09:42Z

An advanced deep-learning model that automates X-ray analysis for faster and more accurate assessments could transform spinal health diagnostics. Capable of...

]]>

Elias Wolfberg <![CDATA[AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells]]> http://www.open-lab.net/blog/?p=95106 2025-04-23T15:03:07Z 2025-01-16T19:09:15Z

With as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL) have developed an AI...

]]>

Samuel Ochoa <![CDATA[Build a Video Search and Summarization Agent with NVIDIA AI Blueprint]]> http://www.open-lab.net/blog/?p=86011 2025-02-13T20:44:57Z 2025-01-07T04:20:00Z

This post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...

]]>

Elias Wolfberg <![CDATA[AI Vision Helps Green Recycling Plants]]> http://www.open-lab.net/blog/?p=94421 2025-01-07T20:18:07Z 2024-12-19T20:20:23Z

Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...

]]>

Michelle Horton <![CDATA[Time-Lapse AI Model Enhances IVF Embryo Selection]]> http://www.open-lab.net/blog/?p=93767 2024-12-18T16:38:55Z 2024-12-12T17:29:22Z

Researchers from Weill Cornell Medicine have developed an AI-powered model that could help couples undergoing in vitro fertilization (IVF) and guide...

]]>

Joanne Chang <![CDATA[Just Released: NVIDIA VILA VLM]]> http://www.open-lab.net/blog/?p=93512 2024-12-12T19:35:17Z 2024-12-09T17:09:10Z

Now available in preview, NVIDIA VILA is an advanced multimodal VLM that provides visual understanding of multi-images and video.

]]>

Michael Zephyr <![CDATA[Celebrating Open Science and Enterprise AI Innovation on MONAI's 5th Anniversary]]> http://www.open-lab.net/blog/?p=92886 2024-12-20T18:35:40Z 2024-12-05T22:13:17Z

As MONAI celebrates its fifth anniversary, we're witnessing the convergence of our vision for open medical AI with production-ready enterprise solutions. ...

]]>

Monika Jhuria <![CDATA[Scaling Action Recognition Models with Synthetic Data]]> http://www.open-lab.net/blog/?p=91593 2024-12-12T19:35:22Z 2024-12-03T18:36:55Z

Action recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...

]]>

Shubham Agrawal <![CDATA[Build an Agentic Video Workflow with Video Search and Summarization]]> http://www.open-lab.net/blog/?p=92834 2025-01-07T05:45:50Z 2024-12-03T18:30:00Z

Building a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...

]]>

Joanne Chang <![CDATA[Just Released: NVIDIA DeepStream 7.1]]> http://www.open-lab.net/blog/?p=92695 2024-12-12T19:46:55Z 2024-11-25T16:40:22Z

The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...

]]>

Shashank Maheshwari <![CDATA[NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM]]> http://www.open-lab.net/blog/?p=91283 2024-12-12T19:47:55Z 2024-11-21T22:01:16Z

NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...

]]>

Michelle Horton <![CDATA[AI Unlocks Early Clues to Alzheimer's Through Retinal Scans]]> http://www.open-lab.net/blog/?p=92565 2024-12-12T19:38:44Z 2024-11-21T16:40:39Z

Your eyes could hold the key to unlocking early detection of Alzheimer's and dementia, with a groundbreaking AI study. Called Eye-AD, the deep learning...

]]>

Michelle Horton <![CDATA[Deep Learning AI Model Identifies Breast Cancer Spread without Surgery]]> http://www.open-lab.net/blog/?p=91133 2024-12-20T18:48:46Z 2024-10-31T16:06:07Z

A new deep learning model could reduce the need for surgery when diagnosing whether cancer cells are spreading, including to nearby lymph nodes, also known as...

]]>

Elias Wolfberg <![CDATA[AI-Powered Devices Track Howls to Save Wolves]]> http://www.open-lab.net/blog/?p=91077 2024-10-31T16:21:07Z 2024-10-29T17:56:55Z

A new cell-phone-sized device, which can be deployed in vast, remote areas, is using AI to identify and geolocate wildlife to help conservationists track...

]]>

Hanson Xu <![CDATA[Federated Learning in Autonomous Vehicles Using Cross-Border Training]]> http://www.open-lab.net/blog/?p=90443 2025-02-05T20:08:58Z 2024-10-24T16:00:00Z

Federated learning is revolutionizing the development of autonomous vehicles (AVs), particularly in cross-country scenarios where diverse data sources and...

]]>

Bret Li <![CDATA[Optimizing the CV Pipeline in Automotive Vehicle Development Using the PVA Engine]]> http://www.open-lab.net/blog/?p=90646 2024-10-31T16:21:21Z 2024-10-23T13:00:00Z

In the field of automotive vehicle software development, more large-scale AI models are being integrated into autonomous vehicles. The models range from vision...

]]>

Paul Logan <![CDATA[Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs]]> http://www.open-lab.net/blog/?p=89719 2024-10-17T18:19:11Z 2024-10-07T23:03:48Z

Reality capture creates highly accurate, detailed, and immersive digital representations of environments. Innovations in site scanning and accelerated data...

]]>

William Raveane <![CDATA[Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries]]> http://www.open-lab.net/blog/?p=89831 2024-11-14T16:23:01Z 2024-10-07T21:11:06Z

Microsoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...

]]>

Tanya Lenz <![CDATA[Generate Image and Text Embeddings with NV-CLIP]]> http://www.open-lab.net/blog/?p=89773 2024-10-17T18:19:13Z 2024-10-07T20:00:00Z

NV-CLIP, a cutting-edge multimodal embeddings model for image and text, is now generally available.

]]>

Alexander Ladikos <![CDATA[Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan]]> http://www.open-lab.net/blog/?p=89703 2024-10-17T19:06:57Z 2024-10-07T12:00:00Z

Developers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...

]]>

Elias Wolfberg <![CDATA[AI Chatbot Delivers Multilingual Support to African Farmers]]> http://www.open-lab.net/blog/?p=89513 2024-10-17T19:07:10Z 2024-09-27T18:10:11Z

Some of Africa's most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot that gives detailed...

]]>

Michelle Horton <![CDATA[How AI and Robotics are Driving Agricultural Productivity and Sustainability]]> http://www.open-lab.net/blog/?p=89454 2024-10-17T19:07:15Z 2024-09-25T15:53:36Z

By 2030, John Deere aims for fully autonomous farming, addressing global challenges like labor shortages, sustainability, and food security. Their AI and...

]]>

Michał Szołucha <![CDATA[Improved Data Loading with Threads]]> http://www.open-lab.net/blog/?p=88657 2024-09-19T19:30:59Z 2024-09-13T16:00:00Z

Data loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...

]]>

Ricardo Monteiro <![CDATA[Enabling Customizable GPU-Accelerated Video Transcoding Pipelines]]> http://www.open-lab.net/blog/?p=88870 2024-09-19T19:31:10Z 2024-09-11T23:01:24Z

Today, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...

]]>

Elias Wolfberg <![CDATA[AI Tool Helps Farmers Combat Crop Loss and Climate Change]]> http://www.open-lab.net/blog/?p=88957 2025-01-07T20:27:37Z 2024-09-11T16:28:27Z

Machine learning algorithms are beginning to revolutionize modern agriculture. Enabling farmers to combat pests and diseases in real time, the technology is...

]]>

Michelle Horton <![CDATA[High-Tech AI Framework Transforms Global Marine Pollution Tracking]]> http://www.open-lab.net/blog/?p=88586 2024-10-21T16:26:32Z 2024-09-09T15:08:15Z

An AI-powered remote sensing study offers a dynamic new tool for global ocean cleanup efforts. Detailed in the ISPRS Journal of Photogrammetry and Remote...

]]>

Michelle Horton <![CDATA[AI-Powered Platform Advances Personalized Cancer Diagnostics and Treatments]]> http://www.open-lab.net/blog/?p=88574 2024-10-21T16:26:32Z 2024-09-05T17:27:27Z

A recent study introduced a cutting-edge AI-powered pathology platform that can help doctors diagnose and evaluate lung cancer in patients quickly and...

]]>

Dvir Samuel <![CDATA[Fast Inversion for Real-Time Image Editing with Text]]> http://www.open-lab.net/blog/?p=85619 2024-09-05T17:57:10Z 2024-08-30T16:00:04Z

Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...

]]>

Monika Jhuria <![CDATA[New Foundational Models and Training Capabilities with NVIDIA TAO 5.5]]> http://www.open-lab.net/blog/?p=87263 2024-09-09T19:37:08Z 2024-08-28T16:00:00Z

NVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...

]]>

Shuo Wang <![CDATA[Simplifying Camera Calibration to Enhance AI-Powered Multi-Camera Tracking]]> http://www.open-lab.net/blog/?p=87901 2024-09-05T17:57:21Z 2024-08-27T18:30:00Z

This post is the third in a series on building multi-camera tracking vision AI applications. We introduce the overall end-to-end workflow and fine-tuning...

]]>

Joanne Chang <![CDATA[Webinar: Build Visual AI Agents With Generative AI and NVIDIA NIM]]> http://www.open-lab.net/blog/?p=87551 2024-08-22T18:24:51Z 2024-08-19T15:00:00Z

Learn how to build high-performance solutions with NVIDIA visual AI agents that help streamline operations across a range of industries.

]]>

Michelle Horton <![CDATA[Interactive AI Tool Delivers Immersive Video Content to Blind and Low-Vision Viewers]]> http://www.open-lab.net/blog/?p=86936 2025-02-04T19:44:34Z 2024-08-12T15:54:26Z

New research aims to revolutionize video accessibility for blind or low-vision (BLV) viewers with an AI-powered system that gives users the ability to explore...

]]>

Michelle Horton <![CDATA[Real-Time AI Shark Detection is Boosting Beach Safety]]> http://www.open-lab.net/blog/?p=86892 2024-09-05T18:59:24Z 2024-08-06T19:01:54Z

California beaches are becoming safer with a new AI-powered shark detection system. Known as SharkEye, the technology identifies sharks near shorelines in real...

]]>

Ahmed Harouni <![CDATA[Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice]]> http://www.open-lab.net/blog/?p=85863 2024-08-08T18:48:29Z 2024-07-26T18:41:23Z

Over 300M computed tomography (CT) scans are performed globally, 85M in the US alone. Radiologists are looking for ways to speed up their workflow and generate...

]]>

Samuel Ochoa <![CDATA[Develop Generative AI-Powered Visual AI Agents for the Edge]]> http://www.open-lab.net/blog/?p=85444 2024-11-07T05:08:55Z 2024-07-17T15:00:00Z

An exciting breakthrough in AI technology, Vision Language Models (VLMs), offers a more dynamic and flexible method for video analysis. VLMs enable users to...

]]>

Sameer Satish Pusegaonkar <![CDATA[Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data]]> http://www.open-lab.net/blog/?p=84692 2024-07-25T18:19:09Z 2024-07-10T16:00:00Z

Large-scale, use-case-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That's because digital twins...

]]>

Min-Hung Chen https://minhungchen.netlify.app/ <![CDATA[Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning]]> http://www.open-lab.net/blog/?p=84454 2024-11-07T05:09:12Z 2024-06-28T15:00:00Z

Full fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...

]]>

Abhijit Patait <![CDATA[Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC]]> http://www.open-lab.net/blog/?p=82930 2024-09-22T15:09:03Z 2024-06-26T19:30:00Z

NVIDIA Video Codec SDK provides a comprehensive set of APIs for hardware-accelerated video encode and decode on Windows and Linux. The 12.2 release improves...

]]>

Nate Bradford <![CDATA[Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD]]> http://www.open-lab.net/blog/?p=84422 2024-07-10T15:28:34Z 2024-06-26T16:00:00Z

SyncTwin GmbH, a company that builds software to optimize production, intralogistics, and assembly, is on a mission to unlock industrial digital twins for small...

]]>

Elias Wolfberg <![CDATA[AI-Enhanced Navigation Charts Safer Waters for Massive Ships]]> http://www.open-lab.net/blog/?p=84076 2025-02-04T19:49:56Z 2024-06-25T16:00:00Z

Maritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...

]]>

Pengfei Guo <![CDATA[Addressing Medical Imaging Limitations with Synthetic Data Generation]]> http://www.open-lab.net/blog/?p=83468 2025-02-04T19:51:06Z 2024-06-24T17:50:59Z

Synthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...

]]>

Monika Jhuria <![CDATA[Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim]]> http://www.open-lab.net/blog/?p=83470 2024-07-30T22:15:36Z 2024-06-24T17:00:00Z

As vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...

]]>

Alvin Clark <![CDATA[Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0]]> http://www.open-lab.net/blog/?p=84266 2025-03-20T16:21:00Z 2024-06-18T19:53:22Z

Intelligent Transportation Systems (ITS) applications are becoming increasingly valuable and prevalent in modern urban environments. The benefits of using ITS...

]]>

Akhil Docca <![CDATA[Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab]]> http://www.open-lab.net/blog/?p=84120 2024-06-27T18:32:20Z 2024-06-17T13:00:00Z

The era of AI robots powered by physical AI has arrived. Physical AI models understand their environments and autonomously complete complex tasks in the...

]]>

Joanne Chang <![CDATA[MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development]]> http://www.open-lab.net/blog/?p=83754 2024-06-13T19:08:55Z 2024-06-06T18:16:31Z

MediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...

]]>

Meiran Peng <![CDATA[Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK]]> http://www.open-lab.net/blog/?p=83116 2024-06-13T19:06:01Z 2024-06-05T14:00:00Z

NVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...

]]>

Chintan Shah <![CDATA[Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA]]> http://www.open-lab.net/blog/?p=83182 2024-11-07T05:09:27Z 2024-06-04T20:24:32Z

NVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...

]]>

Monika Jhuria <![CDATA[Optimize Processes for Large Spaces with the Multi-Camera Tracking Workflow]]> http://www.open-lab.net/blog/?p=83108 2025-03-21T20:30:26Z 2024-06-02T12:30:00Z

This post is the first in a series on building multi-camera tracking vision AI applications. In this part, we introduce the overall end-to-end workflow,...

]]>

Jenny Plunkett <![CDATA[How to Train an Object Detection Model for Visual Inspection with Synthetic Data]]> http://www.open-lab.net/blog/?p=70820 2024-06-17T16:44:02Z 2024-05-31T22:30:00Z

AI is rapidly changing industrial visual inspection. In a factory setting, visual inspection is used for many issues, including detecting defects and missing or...

]]>

Amr Elmeleegy <![CDATA[Enhancing the Apparel Shopping Experience with AI, Emoji-Aware OCR, and Snapchat's Screenshop]]> http://www.open-lab.net/blog/?p=82250 2025-03-18T18:30:13Z 2024-05-17T17:33:20Z

Ever spotted someone in a photo wearing a cool shirt or some unique apparel and wondered where they got it? How much did it cost? Maybe you've even thought...

]]>

Carlos Garcia-Sierra <![CDATA[NVIDIA DeepStream 7.0 Milestone Release for Next-Gen Vision AI Development]]> http://www.open-lab.net/blog/?p=82050 2024-09-04T22:00:17Z 2024-05-14T22:50:34Z

NVIDIA DeepStream is a powerful SDK that unlocks GPU-accelerated building blocks to build end-to-end vision AI pipelines. With 40+ plugins available...

]]>

Paul Shin <![CDATA[Mitigating Occlusions in Visual Perception Using Single-View 3D Tracking in NVIDIA DeepStream]]> http://www.open-lab.net/blog/?p=81786 2024-05-15T17:15:51Z 2024-05-08T16:00:00Z

When it comes to perception for Intelligent Video Analytics (IVA) applications such as traffic monitoring, warehouse safety, and retail shopper analytics, one...

]]>

Yao (Jason) Lu <![CDATA[Visual Language Intelligence and Edge AI 2.0 with NVIDIA Cosmos Nemotron]]> http://www.open-lab.net/blog/?p=81534 2025-01-09T03:29:25Z 2024-05-03T15:00:00Z

Note: As of January 6, 2025, VILA is now part of the Cosmos Nemotron VLM family. NVIDIA is proud to announce the release of NVIDIA Cosmos Nemotron, a family of...

]]>

Yao (Jason) Lu <![CDATA[Visual Language Models on NVIDIA Hardware with VILA]]> http://www.open-lab.net/blog/?p=81571 2025-01-07T04:01:29Z 2024-05-03T15:00:00Z

Note: As of January 6, 2025, VILA is now part of the new Cosmos Nemotron vision language models. Visual language models have evolved significantly recently....

]]>

Tian Cao <![CDATA[Perception Model Training for Autonomous Vehicles with Tensor Parallelism]]> http://www.open-lab.net/blog/?p=81464 2024-05-02T19:01:07Z 2024-04-27T05:00:00Z

Due to the adoption of multicamera inputs and deep convolutional backbone networks, the GPU memory footprint for training autonomous driving perception models...

]]>

Vishwesh Nath <![CDATA[Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D]]> http://www.open-lab.net/blog/?p=81250 2024-05-07T16:54:01Z 2024-04-22T18:30:00Z

Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...

]]>

Mahesh Khadatare <![CDATA[Advancing Medical Image Decoding with GPU-Accelerated nvImageCodec]]> http://www.open-lab.net/blog/?p=81155 2024-04-18T20:14:59Z 2024-04-17T20:30:00Z

This post delves into the capabilities of decoding DICOM medical images within AWS HealthImaging using the nvJPEG2000 library. We'll guide you through the...

]]>

Tiffany Yeung <![CDATA[Explainer: What Is a Convolutional Neural Network?]]> http://www.open-lab.net/blog/?p=75991 2024-06-05T22:20:53Z 2024-04-12T19:00:00Z

A convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images.

]]>

Michelle Horton <![CDATA[Explainer: What Is Computer Vision?]]> http://www.open-lab.net/blog/?p=75988 2024-06-05T22:19:45Z 2024-03-22T19:00:00Z

Computer vision defines the field that enables devices to acquire, process, understand, and analyze digital images and videos and extract useful...

]]>

Mostafa Toloui <![CDATA[Developing Production-Ready AI Sensor Processing Applications with NVIDIA Holoscan 1.0]]> http://www.open-lab.net/blog/?p=79788 2024-04-09T23:45:13Z 2024-03-20T17:00:00Z

Edge AI developers are building AI applications and products for safety-critical and regulated use cases. With NVIDIA Holoscan 1.0, these applications can...

]]>

Michael Zephyr <![CDATA[Breaking Barriers in Healthcare with New Models for Generative AI and Cellular Imaging]]> http://www.open-lab.net/blog/?p=79523 2024-04-09T23:45:17Z 2024-03-19T15:00:00Z

Driving the future of healthcare imaging, NVIDIA MONAI microservices are creating unique state-of-the-art models and expanded modalities to meet the demands of...

]]>

Cem Moluluo <![CDATA[Calculating Video Quality Using NVIDIA GPUs and VMAF-CUDA]]> http://www.open-lab.net/blog/?p=77541 2024-04-09T23:45:26Z 2024-03-12T16:57:38Z

Video quality metrics are used to evaluate the fidelity of video content. They provide a consistent quantitative measurement to assess the performance of the...

]]>

Paul Springer <![CDATA[cuTENSOR 2.0: Applications and Performance]]> http://www.open-lab.net/blog/?p=77915 2024-04-09T23:45:28Z 2024-03-09T03:20:47Z

While part 1 focused on the usage of the new NVIDIA cuTENSOR 2.0 CUDA math library, this post introduces a variety of usage modes beyond that, specifically...

]]>

Paul Springer <![CDATA[cuTENSOR 2.0: A Comprehensive Guide for Accelerating Tensor Computations]]> http://www.open-lab.net/blog/?p=77913 2024-04-09T23:45:29Z 2024-03-09T03:20:45Z

NVIDIA cuTENSOR is a CUDA math library that provides optimized implementations of tensor operations where tensors are dense, multi-dimensional arrays or array...

]]>

Amr Elmeleegy <![CDATA[Generate Stunning Images with Stable Diffusion XL on the NVIDIA AI Inference Platform]]> http://www.open-lab.net/blog/?p=78388 2025-03-18T18:31:44Z 2024-03-07T19:05:46Z

Diffusion models are transforming creative workflows across industries. These models generate stunning images based on simple text or image inputs by...

]]>

Tanya Lenz <![CDATA[Featured Smart Spaces Sessions at NVIDIA GTC 2024]]> http://www.open-lab.net/blog/?p=78162 2024-04-09T23:45:34Z 2024-03-07T00:19:10Z

From cities and airports to Olympic Stadiums, AI is transforming public spaces into safer, smarter, and more sustainable environments.

]]>

Jeffrey Renfro <![CDATA[Spotlight: Honeywell Accelerates Industrial Process Simulation with NVIDIA cuDSS]]> http://www.open-lab.net/blog/?p=78496 2024-04-09T23:45:36Z 2024-03-05T19:00:00Z

For over a decade, traditional industrial process modeling and simulation approaches have struggled to fully leverage multicore CPUs or acceleration devices to...

]]>

Nate Bradford <![CDATA[Top Synthetic Data Generation Sessions at NVIDIA GTC 2024]]> http://www.open-lab.net/blog/?p=78671 2024-03-07T19:18:48Z 2024-02-29T23:31:18Z

Learn how synthetic data is supercharging 3D simulation and computer vision workflows, from visual inspection to autonomous machines.

]]>

Umair Iqbal <![CDATA[Detecting Real-Time Waste Contamination Using Edge Computing and Video Analytics]]> http://www.open-lab.net/blog/?p=76482 2024-03-07T19:33:06Z 2024-02-26T21:00:00Z

The past few decades have witnessed a surge in rates of waste generation, closely linked to economic development and urbanization. This escalation in waste...

]]>

Michelle Horton <![CDATA[Top Computer Vision/Video Analytics Sessions at NVIDIA GTC 2024]]> http://www.open-lab.net/blog/?p=78104 2024-02-22T19:58:48Z 2024-02-21T22:00:00Z

Discover the transformative power of computer vision and video analytics at GTC. Dive into cutting-edge techniques such as vision transformers, AI agents,...

]]>

Michelle Horton <![CDATA[Webinar: Accelerate Edge AI Development With NVIDIA Metropolis Microservices For Jetson]]> http://www.open-lab.net/blog/?p=78100 2024-02-22T19:58:51Z 2024-02-21T17:30:00Z

On March 5, 8am PT, learn how NVIDIA Metropolis microservices for Jetson Orin helps you modernize your app stack, streamline development and deployment, and...

]]>

Gal Chechik <![CDATA[Generative AI Research Spotlight: Personalizing Text-to-Image Models]]> http://www.open-lab.net/blog/?p=77308 2024-02-22T19:59:02Z 2024-02-06T23:41:01Z

Visual generative AI is the process of creating images from text prompts. The technology is based on vision-language foundation models that are pretrained on...

]]>

John Yang <![CDATA[Emulating the Attention Mechanism in Transformer Models with a Fully Convolutional Network]]> http://www.open-lab.net/blog/?p=75844 2024-02-08T18:51:54Z 2024-01-29T17:00:00Z

The past decade has seen a remarkable surge in the adoption of deep learning techniques for computer vision (CV) tasks. Convolutional neural networks (CNNs)...

]]>

Chintan Shah <![CDATA[Announcing NVIDIA Metropolis Microservices for Jetson for Rapid Edge AI Development]]> http://www.open-lab.net/blog/?p=76670 2024-06-17T16:38:04Z 2024-01-25T18:30:00Z

NVIDIA Metropolis Microservices for Jetson has been renamed to Jetson Platform Services, and is now part of NVIDIA JetPack SDK 6.0. Building vision AI...

]]>

Riccardo Mariani <![CDATA[Using the Power of AI to Make Factories Safer]]> http://www.open-lab.net/blog/?p=77101 2024-02-08T19:52:28Z 2024-01-24T17:00:00Z

As industrial automation increases, safety becomes a greater challenge and top priority for enterprises. Safety encompasses multiple aspects: System...

]]>

Samuel Ochoa <![CDATA[Bringing Generative AI to the Edge with NVIDIA Metropolis Microservices for Jetson]]> http://www.open-lab.net/blog/?p=76663 2024-06-17T16:36:14Z 2024-01-23T17:00:00Z

NVIDIA Metropolis Microservices for Jetson has been renamed to Jetson Platform Services, and is now part of NVIDIA JetPack SDK 6.0. NVIDIA Metropolis...

]]>

Bhanu Pisupati <![CDATA[Build Vision AI Applications at the Edge with NVIDIA Metropolis Microservices and APIs]]> http://www.open-lab.net/blog/?p=76684 2024-06-17T16:37:03Z 2024-01-23T17:00:00Z

NVIDIA Metropolis Microservices for Jetson has been renamed to Jetson Platform Services, and is now part of NVIDIA JetPack SDK 6.0. NVIDIA Metropolis...

]]>

Raffaello Bonghi <![CDATA[Benchmarking Camera Performance on Your Workstation with NVIDIA Isaac Sim]]> http://www.open-lab.net/blog/?p=76929 2024-10-23T21:12:50Z 2024-01-22T15:00:00Z

Robots are typically equipped with cameras. When designing a digital twin simulation, it's important to replicate its performance in a simulated environment...

]]>

Asawaree Bhide <![CDATA[Generate Synthetic Data for Deep Object Pose Estimation Training with NVIDIA Isaac ROS]]> http://www.open-lab.net/blog/?p=75640 2024-08-06T20:51:29Z 2024-01-18T21:45:18Z

For robotic agents to interact with objects in their environment, they must know the position and orientation of objects around them. This information describes...

]]>

Rishi Puri <![CDATA[Release: PyTorch Geometric Container for GNNs on NGC]]> http://www.open-lab.net/blog/?p=76597 2024-06-06T16:17:50Z 2024-01-17T23:05:40Z

The NVIDIA PyG container, now generally available, packages PyTorch Geometric with accelerations for GNN models, dataloading, and pre-processing using...

]]>

Marc-Michael Horstmann <![CDATA[Simulating Railroads with OpenUSD]]> http://www.open-lab.net/blog/?p=76567 2024-02-08T18:52:03Z 2024-01-17T21:00:00Z

Railroad simulation is important in modern transportation and logistics, providing a virtual testing ground for the intricate interplay of tracks, switches, and...

]]>
