Computer Vision / Video Analytics – NVIDIA Technical BlogNews and tutorials for developers, data scientists, and IT admins2025-04-29T19:05:40Zhttp://www.open-lab.net/blog/feed/Davide Paglieri<![CDATA[Benchmarking Agentic LLM and VLM Reasoning for Gaming with NVIDIA NIM]]>http://www.open-lab.net/blog/?p=992022025-04-29T19:05:40Z2025-04-24T17:00:00ZThis is the first post in the LLM Benchmarking series, which shows how to use GenAI-Perf to benchmark the Meta Llama 3 model when deployed with NVIDIA NIM.?...
]]>Elias Wolfberg<![CDATA[AI-Generated Heat Maps Keep Seniors and their Privacy Safe]]>http://www.open-lab.net/blog/?p=988912025-04-17T19:35:21Z2025-04-16T20:00:10ZBy 2030, more than one in five Americans will be 65 or older, becoming the United States�� largest group of seniors ever. Silicon Valley-based startup Butlr...
]]>Michelle Horton<![CDATA[AI Advances Parkinson��s Detection Using Standard MRI Scans]]>http://www.open-lab.net/blog/?p=986362025-04-17T19:35:29Z2025-04-11T16:58:59ZA simple brain scan may soon be all that's needed to accurately diagnose Parkinson��s disease, thanks to a new AI-powered tool. The advancement could help...
]]>Anu Srivastava<![CDATA[NVIDIA Accelerates Inference on Meta Llama 4 Scout and Maverick]]>http://www.open-lab.net/blog/?p=984682025-04-22T23:57:03Z2025-04-06T02:18:34ZThe newest generation of the popular Llama AI models is here with Llama 4 Scout and Llama 4 Maverick. Accelerated by NVIDIA open-source software, they can...
]]>1Ashley Goldstein<![CDATA[Simulating Robots in Industrial Facility Digital Twins]]>http://www.open-lab.net/blog/?p=982012025-04-23T00:00:10Z2025-03-31T16:00:00ZIndustrial enterprises are embracing physical AI and autonomous systems to transform their operations. This involves deploying heterogeneous robot fleets that...
]]>Shubham Agrawal<![CDATA[Build Real-Time Multimodal XR Apps with NVIDIA AI Blueprint for Video Search and Summarization]]>http://www.open-lab.net/blog/?p=968422025-03-12T22:08:59Z2025-03-11T17:30:00ZWith the recent advancements in generative AI and vision foundational models, VLMs present a new wave of visual computing wherein the models are capable of...
]]>Elias Wolfberg<![CDATA[AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale]]>http://www.open-lab.net/blog/?p=966712025-03-06T19:26:37Z2025-03-03T17:48:01ZIn an effort to rein in illicit fishing, researchers have unveiled a new open-source AI model that can accurately identify what virtually all of the world��s...
]]>Anu Srivastava<![CDATA[Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs]]>http://www.open-lab.net/blog/?p=965192025-04-23T02:39:30Z2025-02-26T22:05:00ZLarge language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...
]]>Shubham Agrawal<![CDATA[Vision Language Model Prompt Engineering Guide for Image and Video Understanding]]>http://www.open-lab.net/blog/?p=962292025-04-23T02:38:32Z2025-02-26T16:25:34ZVision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...
]]>Vishesh Lokras<![CDATA[NVIDIA Video Codec SDK 13.0 Powered by NVIDIA Blackwell]]>http://www.open-lab.net/blog/?p=963772025-04-23T02:35:08Z2025-02-24T22:55:30ZThe release of NVIDIA Video Codec SDK 13.0 marks a significant upgrade, adding support for the latest-generation NVIDIA Blackwell GPUs. This version brings a...
]]>Ravi Chaudhary<![CDATA[Enabling Stereoscopic and 3D Views Using MV-HEVC in NVIDIA Video Codec SDK 13.0]]>http://www.open-lab.net/blog/?p=963662025-04-23T02:42:31Z2025-02-24T22:32:34ZNVIDIA announces the implementation of Multi-View High Efficiency Video Coding (MV-HEVC) encoder in the latest NVIDIA Video Codec SDK release, version 13.0....
]]>Michelle Horton<![CDATA[AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025]]>http://www.open-lab.net/blog/?p=955202025-04-23T02:43:07Z2025-02-20T17:44:00ZFrom mitigating climate change to improving disaster response and environmental monitoring, AI is reshaping how we tackle critical global challenges....
]]>Joanne Chang<![CDATA[Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025]]>http://www.open-lab.net/blog/?p=961932025-02-20T15:50:53Z2025-02-20T17:00:00ZExplore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.
]]>Joanne Chang<![CDATA[Upcoming Webinar: Unlocking Video Analytics With AI Agents]]>http://www.open-lab.net/blog/?p=961352025-02-20T15:52:55Z2025-02-13T22:05:57ZMaster prompt engineering, fine-tuning, and customization to build video analytics AI agents.
]]>Pranav Marathe<![CDATA[Just Released: Tripy, a Python Programming Model For TensorRT]]>http://www.open-lab.net/blog/?p=959472025-02-10T17:08:43Z2025-02-10T17:08:40ZExperience high-performance inference, usability, intuitive APIs, easy debugging with eager mode, clear error messages, and more.
]]>Brad Nemire<![CDATA[Featured Researcher and Educator Sessions at NVIDIA GTC 2025]]>http://www.open-lab.net/blog/?p=958172025-02-06T19:33:45Z2025-02-05T23:03:06ZExplore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
]]>Elias Wolfberg<![CDATA[New AI Model Offers Cellular-Level View of Cancerous Tumors]]>http://www.open-lab.net/blog/?p=957582025-04-23T02:48:10Z2025-02-04T22:33:00ZResearchers studying cancer unveiled a new AI model that provides cellular-level mapping and visualizations of cancer cells, which scientists hope can shed...
]]>Michelle Horton<![CDATA[AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment]]>http://www.open-lab.net/blog/?p=957222025-04-23T02:48:13Z2025-02-04T17:16:54ZA new study and AI model from researchers at Stanford University is streamlining cancer diagnostics, treatment planning, and prognosis prediction. Named MUSK...
]]>1Michelle Horton<![CDATA[Advancing Rare Disease Detection with AI-Powered Cellular Profiling]]>http://www.open-lab.net/blog/?p=954982025-04-23T15:01:14Z2025-01-29T20:45:46ZRare diseases are difficult to diagnose due to limitations in traditional genomic sequencing. Wolfgang Pernice, assistant professor at Columbia University, is...
]]>Michelle Horton<![CDATA[Spinal Health Diagnostics Gets Deep Learning Automation]]>http://www.open-lab.net/blog/?p=952432025-04-23T15:02:48Z2025-01-22T17:09:42ZAn advanced deep-learning model that automates X-ray analysis for faster and more accurate assessments could transform spinal health diagnostics. Capable of...
]]>Elias Wolfberg<![CDATA[AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells]]>http://www.open-lab.net/blog/?p=951062025-04-23T15:03:07Z2025-01-16T19:09:15ZWith as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL), have developed an AI...
]]>Samuel Ochoa<![CDATA[Build a Video Search and Summarization Agent with NVIDIA AI Blueprint]]>http://www.open-lab.net/blog/?p=860112025-02-13T20:44:57Z2025-01-07T04:20:00ZThis post was originally published July 29, 2024 but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
]]>2Elias Wolfberg<![CDATA[AI Vision Helps Green Recycling Plants]]>http://www.open-lab.net/blog/?p=944212025-01-07T20:18:07Z2024-12-19T20:20:23ZEach year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
]]>Michelle Horton<![CDATA[Time-Lapse AI Model Enhances IVF Embryo Selection]]>http://www.open-lab.net/blog/?p=937672024-12-18T16:38:55Z2024-12-12T17:29:22ZResearchers from Weill Cornell Medicine have developed an AI-powered model that could help couples undergoing in vitro fertilization (IVF) and guide...
]]>Joanne Chang<![CDATA[Just Released: NVIDIA VILA VLM]]>http://www.open-lab.net/blog/?p=935122024-12-12T19:35:17Z2024-12-09T17:09:10ZNow available in preview, NVIDIA VILA is an advanced multimodal VLM that provides visual understanding of multi-images and video.
]]>Michael Zephyr<![CDATA[Celebrating Open Science and Enterprise AI Innovation on MONAI��s 5th Anniversary]]>http://www.open-lab.net/blog/?p=928862024-12-20T18:35:40Z2024-12-05T22:13:17ZAs MONAI celebrates its fifth anniversary, we're witnessing the convergence of our vision for open medical AI with production-ready enterprise solutions. ...
]]>Monika Jhuria<![CDATA[Scaling Action Recognition Models with Synthetic Data]]>http://www.open-lab.net/blog/?p=915932024-12-12T19:35:22Z2024-12-03T18:36:55ZAction recognition models such as PoseClassificationNet have been around for some time, helping systems identify and classify human actions like walking,...
]]>Shubham Agrawal<![CDATA[Build an Agentic Video Workflow with Video Search and Summarization]]>http://www.open-lab.net/blog/?p=928342025-01-07T05:45:50Z2024-12-03T18:30:00ZBuilding a question-answering chatbot with large language models (LLMs) is now a common workflow for text-based interactions. What about creating an AI system...
]]>Joanne Chang<![CDATA[Just Released: NVIDIA DeepStream 7.1]]>http://www.open-lab.net/blog/?p=926952024-12-12T19:46:55Z2024-11-25T16:40:22ZThe new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
]]>Shashank Maheshwari<![CDATA[NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM]]>http://www.open-lab.net/blog/?p=912832024-12-12T19:47:55Z2024-11-21T22:01:16ZNVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
]]>Michelle Horton<![CDATA[AI Unlocks Early Clues to Alzheimer��s Through Retinal Scans]]>http://www.open-lab.net/blog/?p=925652024-12-12T19:38:44Z2024-11-21T16:40:39ZYour eyes could hold the key to unlocking early detection of Alzheimer��s and dementia, with a groundbreaking AI study. Called Eye-AD, the deep learning...
]]>1Michelle Horton<![CDATA[Deep Learning AI Model Identifies Breast Cancer Spread without Surgery]]>http://www.open-lab.net/blog/?p=911332024-12-20T18:48:46Z2024-10-31T16:06:07ZA new deep learning model could reduce the need for surgery when diagnosing whether cancer cells are spreading, including to nearby lymph nodes��also known as...
]]>Elias Wolfberg<![CDATA[AI-Powered Devices Track Howls to Save Wolves]]>http://www.open-lab.net/blog/?p=910772024-10-31T16:21:07Z2024-10-29T17:56:55ZA new cell-phone-sized device��which can be deployed in vast, remote areas��is using AI to identify and geolocate wildlife to help conservationists track...
]]>Hanson Xu<![CDATA[Federated Learning in Autonomous Vehicles Using Cross-Border Training]]>http://www.open-lab.net/blog/?p=904432025-02-05T20:08:58Z2024-10-24T16:00:00ZFederated learning is revolutionizing the development of autonomous vehicles (AVs), particularly in cross-country scenarios where diverse data sources and...
]]>Bret Li<![CDATA[Optimizing the CV Pipeline in Automotive Vehicle Development Using the PVA Engine]]>http://www.open-lab.net/blog/?p=906462024-10-31T16:21:21Z2024-10-23T13:00:00ZIn the field of automotive vehicle software development, more large-scale AI models are being integrated into autonomous vehicles. The models range from vision...
]]>Paul Logan<![CDATA[Accelerating Reality Capture Workflows with AI and NVIDIA RTX GPUs]]>http://www.open-lab.net/blog/?p=897192024-10-17T18:19:11Z2024-10-07T23:03:48ZReality capture creates highly accurate, detailed, and immersive digital representations of environments. Innovations in site scanning and accelerated data...
]]>William Raveane<![CDATA[Optimizing Microsoft Bing Visual Search with NVIDIA Accelerated Libraries]]>http://www.open-lab.net/blog/?p=898312024-11-14T16:23:01Z2024-10-07T21:11:06ZMicrosoft Bing Visual Search enables people around the world to find content using photographs as queries. The heart of this capability is Microsoft's TuringMM...
]]>Tanya Lenz<![CDATA[Generate Image and Text Embeddings with NV-CLIP]]>http://www.open-lab.net/blog/?p=897732024-10-17T18:19:13Z2024-10-07T20:00:00ZNV-CLIP, a cutting-edge multimodal embeddings model for image and text, is now generally available.
]]>Alexander Ladikos<![CDATA[Real-Time Surgical Guidance by Fusing Multi-Modal Imaging with NVIDIA Holoscan]]>http://www.open-lab.net/blog/?p=897032024-10-17T19:06:57Z2024-10-07T12:00:00ZDevelopers in the fields of image-guided surgery and surgical vision face unique challenges in creating systems and applications that can significantly improve...
]]>Elias Wolfberg<![CDATA[AI Chatbot Delivers Multilingual Support to African Farmers]]>http://www.open-lab.net/blog/?p=895132024-10-17T19:07:10Z2024-09-27T18:10:11ZSome of Africa��s most resource-constrained farmers are gaining access to on-demand, AI-powered advice through a multimodal chatbot?that gives detailed...
]]>Michelle Horton<![CDATA[How AI and Robotics are Driving Agricultural Productivity and Sustainability]]>http://www.open-lab.net/blog/?p=894542024-10-17T19:07:15Z2024-09-25T15:53:36ZBy 2030, John Deere aims for fully autonomous farming, addressing global challenges like labor shortages, sustainability, and food security. Their AI and...
]]>Micha? Szo?ucha<![CDATA[Improved Data Loading with Threads]]>http://www.open-lab.net/blog/?p=886572024-09-19T19:30:59Z2024-09-13T16:00:00ZData loading is a critical aspect of deep learning workflows, whether you're focused on training or inference. However, it often presents a paradox: the need...
]]>Ricardo Monteiro<![CDATA[Enabling Customizable GPU-Accelerated Video Transcoding Pipelines]]>http://www.open-lab.net/blog/?p=888702024-09-19T19:31:10Z2024-09-11T23:01:24ZToday, over 80% of internet traffic is video. This content is generated by and consumed across various devices, including IoT gadgets, smartphones, computers,...
]]>Elias Wolfberg<![CDATA[AI Tool Helps Farmers Combat Crop Loss and Climate Change]]>http://www.open-lab.net/blog/?p=889572025-01-07T20:27:37Z2024-09-11T16:28:27ZMachine Learning algorithms are beginning to revolutionize modern agriculture. Enabling farmers to combat pests and diseases in real time, the technology is...
]]>Michelle Horton<![CDATA[High-Tech AI Framework Transforms Global Marine Pollution Tracking]]>http://www.open-lab.net/blog/?p=885862024-10-21T16:26:32Z2024-09-09T15:08:15ZAn AI-powered remote sensing study offers a dynamic new tool for global ocean cleanup efforts. Detailed in the ISPRS Journal of Photogrammetry and Remote...
]]>Michelle Horton<![CDATA[AI-Powered Platform Advances Personalized Cancer Diagnostics and Treatments]]>http://www.open-lab.net/blog/?p=885742024-10-21T16:26:32Z2024-09-05T17:27:27ZA recent study introduced a cutting-edge AI-powered pathology platform that can help doctors diagnose and evaluate lung cancer in patients quickly and...
]]>Dvir Samuel<![CDATA[Fast Inversion for Real-Time Image Editing with Text]]>http://www.open-lab.net/blog/?p=856192024-09-05T17:57:10Z2024-08-30T16:00:04ZText-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. They operate by mapping a random sample from a...
]]>Monika Jhuria<![CDATA[New Foundational Models and Training Capabilities with NVIDIA TAO 5.5]]>http://www.open-lab.net/blog/?p=872632024-09-09T19:37:08Z2024-08-28T16:00:00ZNVIDIA TAO is a framework designed to simplify and accelerate the development and deployment of AI models. It enables you to use pretrained models, fine-tune...
]]>Shuo Wang<![CDATA[Simplifying Camera Calibration to Enhance AI-Powered Multi-Camera Tracking]]>http://www.open-lab.net/blog/?p=879012024-09-05T17:57:21Z2024-08-27T18:30:00ZThis post is the third in a series on building multi-camera tracking vision AI applications. We introduce the overall end-to-end workflow and fine-tuning...
]]>Joanne Chang<![CDATA[Webinar: Build Visual AI Agents With Generative AI and NVIDIA NIM]]>http://www.open-lab.net/blog/?p=875512024-08-22T18:24:51Z2024-08-19T15:00:00ZLearn how to build high-performance solutions with NVIDIA visual AI agents that help streamline operations across a range of industries.
]]>Michelle Horton<![CDATA[Interactive AI Tool Delivers Immersive Video Content to Blind and Low-Vision Viewers]]>http://www.open-lab.net/blog/?p=869362025-02-04T19:44:34Z2024-08-12T15:54:26ZNew research aims to revolutionize video accessibility for blind or low-vision (BLV) viewers with an AI-powered system that gives users the ability to explore...
]]>Michelle Horton<![CDATA[??Real-Time AI Shark Detection is Boosting Beach Safety]]>http://www.open-lab.net/blog/?p=868922024-09-05T18:59:24Z2024-08-06T19:01:54ZCalifornia beaches are becoming safer with a new AI-powered shark detection system. Known as SharkEye, the technology identifies sharks near shorelines in real...
]]>Ahmed Harouni<![CDATA[Computed Tomography Organ and Disease Segmentation Using the NVIDIA VISTA-3D NIM Microservice]]>http://www.open-lab.net/blog/?p=858632024-08-08T18:48:29Z2024-07-26T18:41:23ZOver 300M computed tomography (CT) scans are performed globally, 85M in the US alone. Radiologists are looking for ways to speed up their workflow and generate...
]]>Samuel Ochoa<![CDATA[Develop Generative AI-Powered Visual AI Agents for the Edge]]>http://www.open-lab.net/blog/?p=854442024-11-07T05:08:55Z2024-07-17T15:00:00ZAn exciting breakthrough in AI technology��Vision Language Models (VLMs)��offers a more dynamic and flexible method for video analysis. VLMs enable users to...
]]>1Sameer Satish Pusegaonkar<![CDATA[Enhance Multi-Camera Tracking Accuracy by Fine-Tuning AI Models with Synthetic Data]]>http://www.open-lab.net/blog/?p=846922024-07-25T18:19:09Z2024-07-10T16:00:00ZLarge-scale, use�Ccase-specific synthetic data has become increasingly important in real-world computer vision and AI workflows. That��s because digital twins...
]]>2Min-Hung Chenhttps://minhungchen.netlify.app/<![CDATA[Introducing DoRA, a High-Performing Alternative to LoRA for Fine-Tuning]]>http://www.open-lab.net/blog/?p=844542024-11-07T05:09:12Z2024-06-28T15:00:00ZFull fine-tuning (FT) is commonly employed to tailor general pretrained models for specific downstream tasks. To reduce the training cost, parameter-efficient...
]]>Abhijit Patait<![CDATA[Improving Video Quality with the NVIDIA Video Codec SDK 12.2 for HEVC]]>http://www.open-lab.net/blog/?p=829302024-09-22T15:09:03Z2024-06-26T19:30:00ZNVIDIA Video Codec SDK provides a comprehensive set of APIs for hardware-accelerated video encode and decode on Windows and Linux. The 12.2 release improves...
]]>Nate Bradford<![CDATA[Transforming Microsoft XLS and PPT Files into a Factory Digital Twin with OpenUSD]]>http://www.open-lab.net/blog/?p=844222024-07-10T15:28:34Z2024-06-26T16:00:00ZSyncTwin GmbH, a company that builds software to optimize production, intralogistics, and assembly, is on a mission to unlock industrial digital twins for small...
]]>Elias Wolfberg<![CDATA[AI-Enhanced Navigation Charts Safer Waters for Massive Ships]]>http://www.open-lab.net/blog/?p=840762025-02-04T19:49:56Z2024-06-25T16:00:00ZMaritime startup Orca AI is pioneering safety at sea with its AI-powered navigation system, which provides real-time video processing to help crews make...
]]>1Pengfei Guo<![CDATA[Addressing Medical Imaging Limitations with Synthetic Data Generation]]>http://www.open-lab.net/blog/?p=834682025-02-04T19:51:06Z2024-06-24T17:50:59ZSynthetic data in medical imaging offers numerous benefits, including the ability to augment datasets with diverse and realistic images where real data is...
]]>Monika Jhuria<![CDATA[Real-Time Vision AI From Digital Twins to Cloud-Native Deployment with NVIDIA Metropolis Microservices and NVIDIA Isaac Sim]]>http://www.open-lab.net/blog/?p=834702024-07-30T22:15:36Z2024-06-24T17:00:00ZAs vision AI complexity increases, streamlined deployment solutions are crucial to optimizing spaces and processes. NVIDIA accelerates development, turning...
]]>Alvin Clark<![CDATA[Generate Traffic Insights Using YOLOv8 and NVIDIA JetPack 6.0]]>http://www.open-lab.net/blog/?p=842662025-03-20T16:21:00Z2024-06-18T19:53:22ZIntelligent Transportation Systems (ITS) applications are becoming increasingly valuable and prevalent in modern urban environments. The benefits of using ITS...
]]>Akhil Docca<![CDATA[Supercharge Robotics Workflows with AI and Simulation Using NVIDIA Isaac Sim 4.0 and NVIDIA Isaac Lab]]>http://www.open-lab.net/blog/?p=841202024-06-27T18:32:20Z2024-06-17T13:00:00ZThe era of AI robots powered by physical AI has arrived. Physical AI models understand their environments and autonomously complete complex tasks in the...
]]>Joanne Chang<![CDATA[MediaTek Integrates NVIDIA TAO Toolkit for IoT Edge AI Development]]>http://www.open-lab.net/blog/?p=837542024-06-13T19:08:55Z2024-06-06T18:16:31ZMediaTek is teaming with NVIDIA to integrate NVIDIA TAO training and pretrained models into its development workflow, bringing advanced AI and visual perception...
]]>Meiran Peng<![CDATA[Build a Zero-Copy AI Sensor Processing Pipeline with OpenCV in NVIDIA Holoscan SDK]]>http://www.open-lab.net/blog/?p=831162024-06-13T19:06:01Z2024-06-05T14:00:00ZNVIDIA Holoscan is the NVIDIA domain-agnostic multimodal real-time AI sensor processing platform that delivers the foundation for developers to build their...
]]>1Chintan Shah<![CDATA[Power Cloud-Native Microservices at the Edge with NVIDIA JetPack 6.0, Now GA]]>http://www.open-lab.net/blog/?p=831822024-11-07T05:09:27Z2024-06-04T20:24:32ZNVIDIA JetPack SDK powers NVIDIA Jetson modules, offering a comprehensive solution for building end-to-end accelerated AI applications. JetPack 6 expands the...
]]>Monika Jhuria<![CDATA[Optimize Processes for Large Spaces with the Multi-Camera Tracking Workflow]]>http://www.open-lab.net/blog/?p=831082025-03-21T20:30:26Z2024-06-02T12:30:00ZThis post is the first in a series on building multi-camera tracking vision AI applications. In this part, we introduce the overall end-to-end workflow,...
]]>Jenny Plunkett<![CDATA[How to Train an Object Detection Model for Visual Inspection with Synthetic Data]]>http://www.open-lab.net/blog/?p=708202024-06-17T16:44:02Z2024-05-31T22:30:00ZAI is rapidly changing industrial visual inspection. In a factory setting, visual inspection is used for many issues, including detecting defects and missing or...
]]>0Amr Elmeleegy<![CDATA[Enhancing the Apparel Shopping Experience with AI, Emoji-Aware OCR, and Snapchat��s Screenshop]]>http://www.open-lab.net/blog/?p=822502025-03-18T18:30:13Z2024-05-17T17:33:20ZEver spotted someone in a photo wearing a cool shirt or some unique apparel and wondered where they got it? How much did it cost? Maybe you've even thought...
]]>Carlos Garcia-Sierra<![CDATA[NVIDIA DeepStream 7.0 Milestone Release for Next-Gen Vision AI Development]]>http://www.open-lab.net/blog/?p=820502024-09-04T22:00:17Z2024-05-14T22:50:34ZNVIDIA DeepStream is a powerful SDK that unlocks GPU-accelerated building blocks to build end-to-end vision AI pipelines. With more than 40+ plugins available...
]]>Paul Shin<![CDATA[Mitigating Occlusions in Visual Perception Using Single-View 3D Tracking in NVIDIA DeepStream]]>http://www.open-lab.net/blog/?p=817862024-05-15T17:15:51Z2024-05-08T16:00:00ZWhen it comes to perception for Intelligent Video Analytics (IVA) applications such as traffic monitoring, warehouse safety, and retail shopper analytics, one...
]]>1Yao (Jason) Lu<![CDATA[Visual Language Intelligence and Edge AI 2.0 with NVIDIA Cosmos Nemotron]]>http://www.open-lab.net/blog/?p=815342025-01-09T03:29:25Z2024-05-03T15:00:00ZNote: As of January 6, 2025, VILA is now part of the Cosmos Nemotron VLM family. NVIDIA is proud to announce the release of NVIDIA Cosmos Nemotron, a family of...
]]>1Yao (Jason) Lu<![CDATA[Visual Language Models on NVIDIA Hardware with VILA]]>http://www.open-lab.net/blog/?p=815712025-01-07T04:01:29Z2024-05-03T15:00:00ZNote: As of January 6, 2025 VILA is now part of the new Cosmos Nemotron vision language models. Visual language models have evolved significantly recently....
]]>1Tian Cao<![CDATA[Perception Model Training for Autonomous Vehicles with Tensor Parallelism]]>http://www.open-lab.net/blog/?p=814642024-05-02T19:01:07Z2024-04-27T05:00:00ZDue to the adoption of multicamera inputs and deep convolutional backbone networks, the GPU memory footprint for training autonomous driving perception models...
]]>Vishwesh Nath<![CDATA[Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D]]>http://www.open-lab.net/blog/?p=812502024-05-07T16:54:01Z2024-04-22T18:30:00ZGenomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...
]]>Mahesh Khadatare<![CDATA[Advancing Medical Image Decoding with GPU-Accelerated nvImageCodec]]>http://www.open-lab.net/blog/?p=811552024-04-18T20:14:59Z2024-04-17T20:30:00ZThis post delves into the capabilities of decoding DICOM medical images within AWS HealthImaging using the nvJPEG2000 library. We'll guide you through the...
]]>Tiffany Yeung<![CDATA[Explainer: What Is a Convolutional Neural Network?]]>http://www.open-lab.net/blog/?p=759912024-06-05T22:20:53Z2024-04-12T19:00:00ZA convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images.
]]>0Michelle Horton<![CDATA[Explainer: What Is Computer Vision?]]>http://www.open-lab.net/blog/?p=759882024-06-05T22:19:45Z2024-03-22T19:00:00ZComputer vision defines the field that enables devices to acquire, process, understand, and analyze digital images and videos and extract useful...
]]>Mostafa Toloui<![CDATA[Developing Production-Ready AI Sensor Processing Applications with NVIDIA Holoscan 1.0]]>http://www.open-lab.net/blog/?p=797882024-04-09T23:45:13Z2024-03-20T17:00:00ZEdge AI developers are building AI applications and products for safety-critical and regulated use cases. With NVIDIA Holoscan 1.0, these applications can...
]]>Michael Zephyr<![CDATA[Breaking Barriers in Healthcare with New Models for Generative AI and Cellular Imaging]]>http://www.open-lab.net/blog/?p=795232024-04-09T23:45:17Z2024-03-19T15:00:00ZDriving the future of healthcare imaging, NVIDIA MONAI microservices are creating unique state-of-the-art models and expanded modalities to meet the demands of...
]]>Cem Moluluo<![CDATA[Calculating Video Quality Using NVIDIA GPUs and VMAF-CUDA]]>http://www.open-lab.net/blog/?p=775412024-04-09T23:45:26Z2024-03-12T16:57:38ZVideo quality metrics are used to evaluate the fidelity of video content. They provide a consistent quantitative measurement to assess the performance of the...
]]>Paul Springer<![CDATA[cuTENSOR 2.0: Applications and Performance]]>http://www.open-lab.net/blog/?p=779152024-04-09T23:45:28Z2024-03-09T03:20:47ZWhile part 1 focused on the usage of the new NVIDIA cuTENSOR 2.0 CUDA math library, this post introduces a variety of usage modes beyond that, specifically...
]]>Paul Springer<![CDATA[cuTENSOR 2.0: A Comprehensive Guide for Accelerating Tensor Computations]]>http://www.open-lab.net/blog/?p=779132024-04-09T23:45:29Z2024-03-09T03:20:45ZNVIDIA cuTENSOR is a CUDA math library that provides optimized implementations of tensor operations where tensors are dense, multi-dimensional arrays or array...
]]>Amr Elmeleegy<![CDATA[Generate Stunning Images with Stable Diffusion XL on the NVIDIA AI Inference Platform]]>http://www.open-lab.net/blog/?p=783882025-03-18T18:31:44Z2024-03-07T19:05:46ZDiffusion models are transforming creative workflows across industries. These models generate stunning images based on simple text or image inputs by...
]]>1Tanya Lenz<![CDATA[Featured Smart Spaces Sessions at NVIDIA GTC 2024]]>http://www.open-lab.net/blog/?p=781622024-04-09T23:45:34Z2024-03-07T00:19:10ZFrom cities and airports to Olympic Stadiums, AI is transforming public spaces into safer, smarter, and more sustainable environments.
]]>Jeffrey Renfro<![CDATA[Spotlight: Honeywell Accelerates Industrial Process Simulation with NVIDIA cuDSS]]>http://www.open-lab.net/blog/?p=784962024-04-09T23:45:36Z2024-03-05T19:00:00ZFor over a decade, traditional industrial process modeling and simulation approaches have struggled to fully leverage multicore CPUs or acceleration devices to...
]]>Nate Bradford<![CDATA[Top Synthetic Data Generation Sessions at NVIDIA GTC 2024]]>http://www.open-lab.net/blog/?p=786712024-03-07T19:18:48Z2024-02-29T23:31:18ZLearn how synthetic data is supercharging 3D simulation and computer vision workflows, from visual inspection to autonomous machines.
]]>0Umair Iqbal<![CDATA[Detecting Real-Time Waste Contamination Using Edge Computing and Video Analytics]]>http://www.open-lab.net/blog/?p=764822024-03-07T19:33:06Z2024-02-26T21:00:00ZThe past few decades have witnessed a surge in rates of waste generation, closely linked to economic development and urbanization. This escalation in waste...
]]>0Michelle Horton<![CDATA[Top Computer Vision/Video Analytics Sessions at NVIDIA GTC 2024]]>http://www.open-lab.net/blog/?p=781042024-02-22T19:58:48Z2024-02-21T22:00:00ZDiscover the transformative power of computer vision and video analytics at GTC. Dive into cutting-edge techniques such as vision transformers, AI agents,...
]]>0Michelle Horton<![CDATA[Webinar: Accelerate Edge AI Development With NVIDIA Metropolis Microservices For Jetson]]>http://www.open-lab.net/blog/?p=781002024-02-22T19:58:51Z2024-02-21T17:30:00ZOn March 5, 8am PT, learn how NVIDIA Metropolis microservices for Jetson Orin helps you modernize your app stack, streamline development and deployment, and...
]]>0Gal Chechik<![CDATA[Generative AI Research Spotlight: Personalizing Text-to-Image Models]]>http://www.open-lab.net/blog/?p=773082024-02-22T19:59:02Z2024-02-06T23:41:01ZVisual generative AI is the process of creating images from text prompts. The technology is based on vision-language foundation models that are pretrained on...
]]>0John Yang<![CDATA[Emulating the Attention Mechanism in Transformer Models with a Fully Convolutional Network]]>http://www.open-lab.net/blog/?p=758442024-02-08T18:51:54Z2024-01-29T17:00:00ZThe past decade has seen a remarkable surge in the adoption of deep learning techniques for computer vision (CV) tasks. Convolutional neural networks (CNNs)...
]]>0Chintan Shah<![CDATA[Announcing NVIDIA Metropolis Microservices for Jetson for Rapid Edge AI Development]]>http://www.open-lab.net/blog/?p=766702024-06-17T16:38:04Z2024-01-25T18:30:00ZNVIDIA Metropolis Microservices for Jetson has been renamed to Jetson Platform Services, and is now part of NVIDIA JetPack SDK 6.0. Building vision AI...
]]>1Riccardo Mariani<![CDATA[Using the Power of AI to Make Factories Safer]]>http://www.open-lab.net/blog/?p=771012024-02-08T19:52:28Z2024-01-24T17:00:00ZAs industrial automation increases, safety becomes a greater challenge and top priority for enterprises. Safety encompasses multiple aspects: System...
]]>0Samuel Ochoa<![CDATA[Bringing Generative AI to the Edge with NVIDIA Metropolis Microservices for Jetson]]>http://www.open-lab.net/blog/?p=766632024-06-17T16:36:14Z2024-01-23T17:00:00ZNVIDIA Metropolis Microservices for Jetson has been renamed to Jetson Platform Services, and is now part of NVIDIA JetPack SDK 6.0. NVIDIA Metropolis...
]]>0Bhanu Pisupati<![CDATA[Build Vision AI Applications at the Edge with NVIDIA Metropolis Microservices and APIs]]>http://www.open-lab.net/blog/?p=766842024-06-17T16:37:03Z2024-01-23T17:00:00ZNVIDIA Metropolis Microservices for Jetson has been renamed to Jetson Platform Services, and is now part of NVIDIA JetPack SDK 6.0. NVIDIA Metropolis...
]]>0Raffaello Bonghi<![CDATA[Benchmarking Camera Performance on Your Workstation with NVIDIA Isaac Sim]]>http://www.open-lab.net/blog/?p=769292024-10-23T21:12:50Z2024-01-22T15:00:00ZRobots are typically equipped with cameras. When designing a digital twin simulation, it��s important to replicate its performance in a simulated environment...
]]>1Asawaree Bhide<![CDATA[Generate Synthetic Data for Deep Object Pose Estimation Training with NVIDIA Isaac ROS]]>http://www.open-lab.net/blog/?p=756402024-08-06T20:51:29Z2024-01-18T21:45:18ZFor robotic agents to interact with objects in their environment, they must know the position and orientation of objects around them. This information describes...
]]>0Rishi Puri<![CDATA[Release: PyTorch Geometric Container for GNNs on NGC]]>http://www.open-lab.net/blog/?p=765972024-06-06T16:17:50Z2024-01-17T23:05:40ZThe NVIDIA PyG container, now generally available, packages PyTorch Geometric with accelerations for GNN models, dataloading, and pre-processing using...
]]>0Marc-Michael Horstmann<![CDATA[Simulating Railroads with OpenUSD]]>http://www.open-lab.net/blog/?p=765672024-02-08T18:52:03Z2024-01-17T21:00:00ZRailroad simulation is important in modern transportation and logistics, providing a virtual testing ground for the intricate interplay of tracks, switches, and...