NVIDIA DGX Cloud Serverless Inference is an auto-scaling AI inference solution that enables application deployment with speed and reliability. Powered by NVIDIA Cloud Functions (NVCF), DGX Cloud Serverless Inference abstracts multi-cluster infrastructure setups across multi-cloud and on-premises environments for GPU-accelerated workloads. Whether managing AI workloads…
As AI capabilities advance, understanding the impact of hardware and software infrastructure choices on workload performance is crucial for both technical validation and business planning. Organizations need a better way to assess real-world, end-to-end AI workload performance and the total cost of ownership rather than just comparing raw FLOPs or hourly cost per GPU.
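To make the FLOPs-versus-TCO point concrete, here is a small worked example in C++; the GPU names, hourly prices, and token throughputs are invented purely for illustration, not measurements:

    #include <cstdio>

    // Hypothetical numbers: the pricier GPU still wins on cost per unit of
    // work, which is why end-to-end throughput matters more than $/GPU-hour.
    int main() {
        struct Offer { const char* name; double dollarsPerHour; double tokensPerSecond; };
        const Offer gpus[] = {
            {"GPU A", 2.0, 1000.0},   // cheaper per hour (assumed)
            {"GPU B", 4.0, 3500.0},   // pricier per hour, much faster (assumed)
        };
        for (const Offer& g : gpus) {
            double costPerMillionTokens =
                g.dollarsPerHour / (g.tokensPerSecond * 3600.0) * 1e6;
            printf("%s: $%.2f/hr -> $%.3f per million tokens\n",
                   g.name, g.dollarsPerHour, costPerMillionTokens);
        }
        return 0;
    }

Under these assumed numbers, GPU B costs twice as much per hour yet delivers work at roughly 43% lower cost per million tokens.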
Organizations are embracing AI agents to enhance productivity and streamline operations. To maximize their impact, these agents need strong reasoning abilities to navigate complex problems, uncover hidden connections, and make logical decisions autonomously in dynamic environments. Due to their ability to tackle complex problems, reasoning models have become a key part of the agentic AI…
Large language models (LLMs) have shown remarkable generalization capabilities in natural language processing (NLP). They are used in a wide range of applications, including translation, digital assistants, recommendation systems, context analysis, code generation, cybersecurity, and more. In automotive applications, there is growing demand for LLM-based solutions for both autonomous driving and…
Training AI models on massive GPU clusters presents significant challenges for model builders. Because manual intervention becomes impractical as job scale increases, automation is critical to maintaining high GPU utilization and training productivity. An exceptional training experience requires resilient systems that provide low-latency error attribution and automatic failover based on root…
According to the World Health Organization (WHO), 3.6 billion medical imaging tests are performed every year globally to diagnose, monitor, and treat various conditions. Most of these images are stored in a globally recognized standard called DICOM (Digital Imaging and Communications in Medicine). Imaging studies in DICOM format are a combination of unstructured images and structured metadata.
NVIDIA Enterprise Reference Architectures (Enterprise RAs) can reduce the time and cost of deploying AI infrastructure solutions. They provide a streamlined approach for building flexible and cost-effective accelerated infrastructure while ensuring compatibility and interoperability. The latest Enterprise RA details an optimized cluster configuration for systems integrated with NVIDIA GH200…
In data science, operational efficiency is key to handling increasingly complex and large datasets. GPU acceleration has become essential for modern workflows, offering significant performance improvements. RAPIDS is a suite of open-source libraries and frameworks developed by NVIDIA, designed to accelerate data science pipelines using GPUs with minimal code changes.
Hardware support for ray tracing triangle meshes was introduced as part of NVIDIA RTX in 2018. But ray tracing for hair and fur has remained a compute-intensive problem that has been difficult to further accelerate. That is, until now. NVIDIA GeForce 50 Series GPUs include a major advancement in the acceleration of ray tracing for hair and fur: hardware ray tracing support for the linear…
Geometric detail in computer graphics has increased exponentially in the past 30 years. To render high quality assets with higher instance counts and greater triangle density, NVIDIA introduced RTX Mega Geometry. RTX Mega Geometry is available today through NVIDIA RTX Kit, a suite of rendering technologies to ray trace games with AI, render scenes with immense geometry, and create game characters…
NVIDIA AI Workbench is a free development environment manager to develop, customize, and prototype AI applications on your GPUs. AI Workbench provides a frictionless experience across PCs, workstations, servers, and cloud for AI, data science, and machine learning (ML) projects. The user experience includes: … This post provides details about the January 2025 release of NVIDIA AI Workbench…
The next generation of NVIDIA graphics hardware has arrived. Powered by NVIDIA Blackwell, GeForce RTX 50 Series GPUs deliver groundbreaking new RTX features such as DLSS 4 with Multi Frame Generation, and NVIDIA RTX Kit with RTX Mega Geometry and RTX Neural Shaders. NVIDIA RTX Blackwell architecture introduces fifth-generation Tensor Cores to drive AI workloads and fourth-generation RT Cores with…
Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and multifaceted nature of these systems. Unlike traditional machine learning (ML) models, LLMs generate a wide range of diverse and often unpredictable outputs, making standard evaluation metrics insufficient. Key challenges include the…
As of 3/18/25, NVIDIA Triton Inference Server is now NVIDIA Dynamo. The explosion of AI-driven applications has placed unprecedented demands on both developers, who must balance delivering cutting-edge performance with managing operational complexity and cost, and AI infrastructure. NVIDIA is empowering developers with full-stack innovations spanning chips, systems…
At NVIDIA, the Sales Operations team equips the Sales team with the tools and resources needed to bring cutting-edge hardware and software to market. Managing this across NVIDIA's diverse technology is a complex challenge shared by many enterprises. Through collaboration with our Sales team, we found that they rely on internal and external documentation…
Language models generate text by predicting the next token, given all the previous tokens including the input text tokens. Key and value elements of the previous tokens are used as historical context in LLM serving for generation of the next set of tokens. Caching these key and value elements from previous tokens avoids expensive recomputation and effectively leads to higher throughput. However…
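As a rough illustration of the caching idea, here is a toy C++ sketch (not code from the post; computeKV stands in for the real per-token key/value computation of a transformer layer):

    #include <cstdio>
    #include <vector>

    struct KV { float k, v; };            // toy per-token key/value pair

    KV computeKV(int token) {             // imagine an expensive layer here
        return { token * 0.5f, token * 2.0f };
    }

    int main() {
        const std::vector<int> tokens = {7, 3, 9, 4};
        std::vector<KV> cache;            // grows by one entry per token

        for (size_t step = 0; step < tokens.size(); ++step) {
            // Without the cache, each step would recompute K/V for ALL
            // previous tokens (O(n^2) total work). With it, each token's
            // K/V is computed exactly once and appended.
            cache.push_back(computeKV(tokens[step]));

            // "Attention" for this step reads the whole history from cache.
            float score = 0.0f;
            for (const KV& kv : cache) score += kv.k * kv.v;
            printf("step %zu attends over %zu cached entries, score=%.1f\n",
                   step, cache.size(), score);
        }
        return 0;
    }

The teaser's closing "However" likely points to the cost: the cache grows linearly with context length per request, so memory becomes the constraint to manage.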
Generative AI has revolutionized how people bring ideas to life, and agentic AI represents the next leap forward in this technological evolution. By leveraging sophisticated, autonomous reasoning and iterative planning, AI agents can tackle complex, multistep problems with remarkable efficiency. As AI continues to revolutionize industries, the demand for running AI models locally has surged.
RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries that are well supported for scale-out with distributed engines like Spark and Dask. Ray is a popular open-source distributed Python framework commonly used to scale AI and machine learning (ML) applications. Ray particularly excels at simplifying and scaling training and inference pipelines and can easily target both…
Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this code should be sanitized and executed in a safe environment to mitigate risks from prompt injection and errors in the returned code. Sanitizing Python with regular expressions and restricted runtimes is insufficient…
WEKA, a pioneer in scalable software-defined data platforms, and NVIDIA are collaborating to unite WEKA's state-of-the-art data platform solutions with powerful NVIDIA BlueField DPUs. The WEKA Data Platform advanced storage software unlocks the full potential of AI and performance-intensive workloads, while NVIDIA BlueField DPUs revolutionize data access, movement, and security.
One of the great pastimes of graphics developers and enthusiasts is comparing specifications of GPUs and marveling at the ever-increasing counts of shader cores, RT cores, teraflops, and overall computational power with each new generation. Achieving the maximum theoretical performance represented by those numbers is a major focus in the world of graphics programming. Massive amounts of rendering…
As we move toward denser computing infrastructure, with more compute, more GPUs, accelerated networking, and so forth, multi-GPU training and analysis grows in popularity. Developers and practitioners moving from CPU to GPU clusters need both tools and best practices. RAPIDS is a suite of open-source GPU-accelerated data science and AI libraries. These libraries can easily scale out for…
AI models for science are often trained to make predictions about the workings of nature, such as predicting the structure of a biomolecule or the properties of a new solid that can become the next battery material. These tasks require high precision and accuracy. What makes AI for science even more challenging is that highly accurate and precise scientific data is often scarce…
NVIDIA AI Workbench is a free development environment manager that streamlines data science, AI, and machine learning (ML) projects on systems of choice. The goal is to provide a frictionless way to create, compute, and collaborate on and across PCs, workstations, data centers, and clouds. The basic user experience is straightforward: … This post explores highlights of the October release…
NVIDIA technology helps organizations build and maintain secure, scalable, and high-performance network infrastructure. Advances in AI, with NVIDIA at the forefront, contribute to security improvements every day. One way NVIDIA has taken a more direct approach to network security is through a secure network operating system (NOS), a specialized type of…
There's never enough time to do everything, even in engineering education. Employers want engineers capable of wielding simulation tools to expedite iterative research, design, and development. Some instructors try to address this by teaching for weeks or months on derivations of numerical methods, approaches to discretization, the intricacies of turbulence models, and more. Unfortunately…
The NVIDIA RTX AI for Windows PCs platform offers a thriving ecosystem of thousands of open-source models for application developers to leverage and integrate into Windows applications. Notably, llama.cpp is one popular tool, with over 65K GitHub stars at the time of writing. Originally released in 2023, this open-source repository is a lightweight, efficient framework for large language model…
As large language models (LLMs) continue to evolve at an unprecedented pace, enterprises are looking to build generative AI-powered applications that maximize throughput to lower operational costs and minimize latency to deliver superior user experiences. This post discusses the critical performance metrics of throughput and latency for LLMs, exploring their importance and trade-offs between…
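The trade-off is easy to see with a little arithmetic. The sketch below uses invented batch sizes and per-iteration times (illustrative assumptions, not measurements) to show how batching raises aggregate throughput while each user waits longer per token:

    #include <cstdio>

    int main() {
        // Assumed: one token generated per user per iteration, and the
        // iteration slows down somewhat as the batch grows.
        const int    batch[]  = {1, 8, 32};
        const double stepMs[] = {20.0, 26.0, 40.0};   // hypothetical timings
        for (int i = 0; i < 3; ++i) {
            double tokensPerSecond = batch[i] / (stepMs[i] / 1000.0);
            printf("batch %2d: %6.0f tok/s aggregate, %5.1f ms/token per user\n",
                   batch[i], tokensPerSecond, stepMs[i]);
        }
        return 0;
    }

With these numbers, batch 32 delivers 16x the aggregate throughput of batch 1, but each user's time per token doubles; serving systems pick a point along this curve to meet a latency target.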
Shaders are specialized programs, running on the GPU, that manipulate rays, pixels, vertices, and textures to achieve unique visual effects. With shaders, you can add creative expression and realism to the rendered image. They're essential in ray tracing for simulating realistic lighting, shadows, and reflections. We love shaders, but they can be hard to debug. Shader calculations are complex…
Developers from advertising agencies to software vendors are empowering global brands to deliver hyperpersonalization for digital experiences and visual storytelling with product configurator solutions. Integrating NVIDIA Omniverse with OpenUSD and generative AI into product configurators enables solution providers and software developers to deliver interactive, ray-traced…
General-purpose large language models (LLMs) have proven their usefulness across various fields, offering substantial benefits in applications ranging from text generation to complex problem-solving. However, there are circumstances where developing a bespoke language model becomes not just beneficial but essential. This necessity arises particularly in specialized domains characterized by…
This post is part of the NVIDIA AI Red Team's continuing vulnerability and technique research. Use the concepts presented to responsibly assess and increase the security of your AI development and deployment processes and applications. Large language models (LLMs) don't operate over strings. Instead, prompts are passed through an often-transparent translator called a tokenizer that creates an…
The latest release of NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance computing (HPC) workloads. This post provides an overview of the following updates on cuBLAS matrix multiplications (matmuls) since version 12.0, and a walkthrough: Grouped GEMM APIs can be viewed as a generalization of the batched…
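For context, here is a minimal sketch of the strided batched API that grouped GEMM generalizes: the single call below launches eight identically shaped multiplications, while the grouped APIs additionally allow each group to have its own shapes and parameters. Sizes are arbitrary, data initialization is omitted, and you would compile with -lcublas:

    #include <cstdio>
    #include <cublas_v2.h>
    #include <cuda_runtime.h>

    int main() {
        const int m = 4, n = 4, k = 4, batch = 8;     // 8 independent 4x4 GEMMs
        const long long strideA = m * k, strideB = k * n, strideC = m * n;
        float *A, *B, *C;
        cudaMalloc(&A, sizeof(float) * strideA * batch);
        cudaMalloc(&B, sizeof(float) * strideB * batch);
        cudaMalloc(&C, sizeof(float) * strideC * batch);
        // (Fill A and B with real data in practice.)

        cublasHandle_t handle;
        cublasCreate(&handle);
        const float alpha = 1.0f, beta = 0.0f;
        // C[i] = alpha * A[i] * B[i] + beta * C[i], for i in [0, batch)
        cublasSgemmStridedBatched(handle, CUBLAS_OP_N, CUBLAS_OP_N, m, n, k,
                                  &alpha, A, m, strideA, B, k, strideB,
                                  &beta, C, m, strideC, batch);
        cudaDeviceSynchronize();
        cublasDestroy(handle);
        cudaFree(A); cudaFree(B); cudaFree(C);
        printf("batched GEMM submitted\n");
        return 0;
    }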
Ready to move your pilot to production? Get an expert overview on how to deploy generative AI applications.
Retrieval-augmented generation (RAG) is a technique that combines information retrieval with a set of carefully designed system prompts to provide more accurate, up-to-date, and contextually relevant responses from large language models (LLMs). By incorporating data from various sources such as relational databases, unstructured document repositories, internet data streams, and media news feeds…
At GTC 2024, experts from NVIDIA and our partners shared insights about GPU-accelerated tools, optimizations, and best practices for data scientists. From the hundreds of sessions covering various topics, we've handpicked the top three data science sessions that you won't want to miss. RAPIDS in 2024: Accelerated Data Science Everywhere Speakers: Dante Gama Dessavre…
Many PC games are designed around an eight-core console with an assumption that their software threading system "just works" on all PCs, especially regarding the number of threads in the worker thread pool. This was a reasonable assumption not too long ago when most PCs had similar core counts to consoles: the CPUs were just faster and performance just scaled. In recent years though…
A common technological misconception is that performance and complexity are directly linked. That is, the highest-performance implementation is also the most challenging to implement and manage. When considering data center networking, however, this is not the case. InfiniBand is a protocol that sounds daunting and exotic in comparison to Ethernet, but because it is built from the ground up…
Many CUDA applications running on multi-GPU platforms usually use a single GPU for their compute needs. In such scenarios, applications pay a performance penalty because CUDA has to enumerate and initialize all the GPUs on the system. If a CUDA application does not require other GPUs to be visible and accessible, you can launch such applications by isolating the unwanted GPUs from the CUDA…
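The standard mechanism for this isolation is the CUDA_VISIBLE_DEVICES environment variable, which limits which GPUs the CUDA runtime enumerates. A minimal way to observe the effect, using only standard CUDA runtime API calls:

    #include <cstdio>
    #include <cuda_runtime.h>

    int main() {
        // Run as:  CUDA_VISIBLE_DEVICES=0 ./app
        // and CUDA enumerates (and initializes) only GPU 0; the other GPUs
        // are invisible to the process, avoiding their startup cost.
        int count = 0;
        cudaGetDeviceCount(&count);
        printf("CUDA sees %d device(s)\n", count);
        for (int i = 0; i < count; ++i) {
            cudaDeviceProp prop;
            cudaGetDeviceProperties(&prop, i);
            printf("  device %d: %s\n", i, prop.name);
        }
        return 0;
    }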
Swap chains are an integral part of how you get rendering data output to a screen. They usually consist of a group of output-ready buffers, each of which can be rendered to one at a time in rotation. In parallel with rendering to one of a swap chain's buffers, some other buffer in the swap chain is generally read from for display output. This post covers best practices when working with…
Intrinsics can be thought of as higher-level abstractions of specific hardware instructions. They offer direct access to low-level operations or hardware-specific features, enabling increased performance. In this way, operations can be performed across threads within a warp, also known as a wavefront. The following code example shows…
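The post's own listing is truncated above, so as a stand-in, here is a minimal CUDA sketch of one widely used warp intrinsic, __shfl_down_sync, which lets the 32 threads of a warp sum their values without shared memory:

    #include <cstdio>
    #include <cuda_runtime.h>

    __global__ void warpSum(const float* in, float* out) {
        float v = in[threadIdx.x];
        // Tree reduction across the warp: halve the stride each step.
        for (int offset = 16; offset > 0; offset >>= 1)
            v += __shfl_down_sync(0xffffffff, v, offset);
        if (threadIdx.x == 0) *out = v;   // lane 0 ends up with the total
    }

    int main() {
        float h[32], result, *dIn, *dOut;
        for (int i = 0; i < 32; ++i) h[i] = 1.0f;    // expected sum: 32
        cudaMalloc(&dIn, sizeof(h));
        cudaMalloc(&dOut, sizeof(float));
        cudaMemcpy(dIn, h, sizeof(h), cudaMemcpyHostToDevice);
        warpSum<<<1, 32>>>(dIn, dOut);
        cudaMemcpy(&result, dOut, sizeof(float), cudaMemcpyDeviceToHost);
        printf("warp sum = %.0f\n", result);
        cudaFree(dIn); cudaFree(dOut);
        return 0;
    }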
Large language models (LLMs) provide a wide range of powerful enhancements to nearly any application that processes text. And yet they also introduce new risks, including: … This post walks through these security vulnerabilities in detail and outlines best practices for designing or evaluating a secure LLM-enabled application. Prompt injection is the most common and well-known…
Diamond Light Source is a world-renowned synchrotron facility in the UK that provides scientists with access to intense beams of x-rays, infrared, and other forms of light to study materials and biological structures. The facility boasts over 30 experimental stations, or beamlines, and is home to some of the most advanced and complex scientific research projects in the world. I08-1…
By using descriptor types, you can bind resources to shaders and specify how those resources are accessed. This creates efficient communication between the CPU and GPU and enables shaders to access the necessary data during rendering.
Data is the lifeblood of AI systems, which rely on robust datasets to learn and make predictions or decisions. For perception AI models specifically, it is essential that data reflects real-world environments and incorporates a wide array of scenarios. This includes edge use cases for which data is often difficult to collect, such as street traffic and manufacturing assembly lines.
Traditional cloud data centers have served as the bedrock of computing infrastructure for over a decade, catering to a diverse range of users and applications. However, data centers have evolved in recent years to keep up with advancements in technology and the surging demand for AI-driven computing. This post explores the pivotal role that networking plays in shaping the future of data centers…
The NVIDIA AI Red Team is focused on scaling secure development practices across the data, science, and AI ecosystems. We participate in open-source security initiatives, release tools, present at industry conferences, host educational competitions, and provide innovative training. Covering 3 years and totaling almost 140GB of source code, the recently released Meta Kaggle for Code dataset is…
In today's data center, there are many ways to achieve system redundancy from a server connected to a fabric. Customers usually seek redundancy to increase service availability (such as achieving end-to-end AI workloads) and improve system efficiency using different multihoming techniques. In this post, we discuss the pros and cons of the well-known proprietary multi-chassis link aggregation…
NVIDIA has already made available a GPU driver binary symbols server for Windows. Now, NVIDIA is making available a repository of CUDA Toolkit symbols for Linux. NVIDIA is introducing CUDA Toolkit symbols for Linux to enhance application development. During application development, you can now download obfuscated symbols for NVIDIA libraries that are being debugged or profiled in…
This post covers best practices when working with shaders on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Shaders play a critical role in graphics programming by enabling you to control various aspects of the rendering process. They run on the GPU and are responsible for manipulating vertices, pixels, and other data.
Picture this: You're browsing through an online store, looking for the perfect pair of running shoes. But with thousands of options available, where do you even begin? Suddenly, a section catches your eye: "Recommended for You." Intrigued, you click and, within seconds, a curated list of running shoes tailored to your unique preferences appears. It's as if the website understands your tastes…
This post covers best practices when working with pipeline state objects on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Pipeline state objects (PSOs) define how input data is interpreted and rendered by the hardware when submitting work to the GPUs. Proper management of PSOs is essential for optimal usage of system…
We were stuck. Really stuck. With a hard delivery deadline looming, our team needed to figure out how to process a complex extract-transform-load (ETL) job on trillions of point-of-sale transaction records in a few hours. The results of this job would feed a series of downstream machine learning (ML) models that would make critical retail assortment allocation decisions for a global retailer.
If you are looking to take your machine learning (ML) projects to new levels of speed and scalability, GPU-accelerated data analytics can help you deliver insights quickly with breakthrough performance. From faster computation to efficient model training, GPUs bring many benefits to everyday ML tasks. Update: The blog below describes how to use GPU-only RAPIDS cuDF…
If you are a DirectX 12 (DX12) game developer, you may have noticed that GPU times displayed in real time in your game HUD may change over time for a given pass. This may be the case even if nothing has changed on the application side. One reason for GPU time variations may be GPU Boost dynamically changing the GPU core clock frequency. Still, even with GPU Boost disabled using the DX12…
This post covers CPU best practices when working with NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. To get the best performance from your NVIDIA GPU, pair it with efficient work delegation on the CPU. Frame-rate caps, stutter, and other subpar application performance events can often be traced back to a bottleneck on the CPU.
This post covers best practices for using sampler feedback on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Sampler feedback is a DirectX 12 Ultimate feature for capturing and recording texture sampling information and locations. Sampler feedback was designed to provide better support for streaming and texture-space shading.
There are many benefits of GPUs in scaling AI, ranging from faster model training to GPU-accelerated fraud detection. While planning AI models and deployed apps, scalability challenges, especially performance and storage, must be accounted for. Regardless of the use case, AI solutions have four elements in common: … Of these elements, data storage is often the most neglected during…
Software teams comprise a broad range of professionals, from software engineers and data scientists to project managers and technical writers. Sharing code with other team members is common when working on a project, and it is important to track all changes. This is where pull requests come in. In software development, a pull request is used to push local changes into a shared repository…
Storytelling with data is a crucial soft skill for AI and data professionals. To ensure that stakeholders understand the technical requirements, value, and impact of data science team efforts, it is necessary for data scientists, data engineers, and machine learning (ML) engineers to communicate effectively. This post provides a framework and tips you can adopt to incorporate key elements of…
This post is an update of Best Practices: Using NVIDIA RTX Ray Tracing. This post gathers best practices based on our experiences so far using NVIDIA RTX ray tracing in games. The practical tips are organized into short, actionable items for developers working on ray tracing today. They aim to provide insight into what kind of solutions lead to good performance in most cases.
This post covers best practices for Vulkan clearing and presenting on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. With the recent Vulkan 1.3 release, it's timely to add some Vulkan-specific tips that are not necessarily explicitly covered by the other Advanced API Performance posts. In addition to introducing new Vulkan 1.3…
This post covers best practices for using SetStablePowerState on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Most modern processors, including GPUs, change processor core and memory clock rates during application execution. These changes can cause performance to vary, introducing errors in measurements and rendering comparisons…
This post covers best practices for variable rate shading on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Variable rate shading (VRS) is a graphics feature allowing applications to control the frequency of pixel shader invocations independent of the resolution of the render target. It is available in both D3D12 and Vulkan.
This post covers best practices for clears on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Surface clearing is a widely used accessory operation. Thanks to Michael Murphy, Maurice Harris, Dmitry Zhdan, and Patric Neil for their advice and feedback.
This post covers best practices for mesh shaders on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Mesh shaders are a recent addition to the programmable pipeline and aim to overcome the bottlenecks of the fixed layout used by the classical geometry pipeline. This post covers best practices for both DirectX and Vulkan…
This post covers best practices for memory and resources on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Optimal memory management in DirectX 12 is critical to a performant application. The following advice should be followed for the best performance while avoiding stuttering.
This post covers best practices for command buffers on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Command buffers are the main mechanism for sending commands from the CPU to be executed on the GPU. By following the best practices listed in this post, you can achieve performance gains on both the CPU and the GPU by…
This post covers best practices for barriers on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. For the best performance on our hardware, here's what you should and shouldn't do when you're using barriers with DX12 or Vulkan. This is updated from DX12 Do's And Don'ts.
This post covers best practices for async copy on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. Async copy runs on completely independent hardware, but you have to schedule it onto a separate queue. You can consider turning an async copy into an async compute as a performance strategy. NVIDIA has a dedicated async copy…
This post covers best practices for async compute and overlap on NVIDIA GPUs. To get a high and consistent frame rate in your applications, see all Advanced API Performance tips. The general principle behind async compute is to increase the overall unit throughput by reducing the number of unused warp slots and to facilitate the simultaneous use of nonconflicting datapaths.
The growth of edge computing has been a hot topic in many industries. The value of smart infrastructure can mean improvements to overall operational efficiency, safety, and even the bottom line. However, not all workloads need to be, or even should be, deployed at the edge. Enterprises use a combination of edge computing and cloud computing when developing and deploying AI applications.
In ray tracing, more geometry can reside in GPU memory than with rasterization because rays may hit geometry outside the view frustum. You can let the GPU compact acceleration structures to reduce memory usage. For some games, compaction reduces the memory footprint for a bottom-level acceleration structure (BLAS) by at least 50%. BLASes usually take more GPU memory than top…
A leading global retailer has invested heavily in becoming one of the most competitive technology companies around. Accurate and timely demand forecasting for millions of item-by-store combinations is critical to serving their millions of weekly customers. Key to their success in forecasting is RAPIDS, an open-source suite of GPU-accelerated libraries. RAPIDS helps them tear through their…
DLSS is a deep learning, super-resolution network that boosts frame rates by rendering fewer pixels and then using AI to construct sharp, higher-resolution images. Dedicated computational units on NVIDIA RTX GPUs called Tensor Cores accelerate the AI calculations, allowing the algorithm to run in real time. DLSS pairs perfectly with computationally intensive rendering algorithms such as real-time…
This post has been updated: Best Practices for Using NVIDIA RTX Ray Tracing (Updated). This post gathers best practices based on our experiences so far on using NVIDIA RTX ray tracing in games. I've organized the content into short, actionable items with practical tips for developers working on ray tracing today. They aim to give a broad picture of what kind of solutions lead to good…
Roughly five months ago, we introduced you to the new ray tracing support (via DirectX Raytracing) in the 4.22 release of Unreal Engine. Recently, Epic Games released version 4.23 which brings a number of upgrades for those working with ray tracing. Even better, many of these new and improved features, such as enhancements to performance, quality, and stability, require no direct user effort.
Our most popular question is "What can I do to get great GPU performance for deep learning?" We've recently published a detailed Deep Learning Performance Guide to help answer this question. The guide explains how GPUs process data and gives tips on how to design networks for better performance. We also take a close look at Tensor Core optimization to help improve performance. This post takes a…
Note: This post was updated on 1/14/2025. The increased performance potential of modern graphics APIs is coupled with a dramatically increased level of developer responsibility. Optimal use of Vulkan is not a trivial concept, especially in the context of a large engine, and information about how to maximize performance is still somewhat sparse.
This post presents best practices for implementing ray tracing in games and other real-time graphics applications. We present these as briefly as possible to help you quickly find key ideas. This is based on a presentation made at the 2019 GDC by NVIDIA engineers. 1.1 General Practices Move AS management (build/update) to an async compute queue. Using an async compute queue pairs…
Post updated on December 10, 2024. NVIDIA has deprecated nvprof and NVIDIA Visual Profiler and these tools are not supported on current GPU architectures. The original post still applies to previous GPU architectures, up to and including Volta. For Volta and newer architectures, profile your applications with NVIDIA Nsight Compute and NVIDIA Nsight Systems. For more information about how to…
The last time you used the timeline feature in the NVIDIA Visual Profiler, Nsight VSE, or the new Nsight Systems to analyze a complex application, you might have wished to see a bit more than just CUDA API calls and GPU kernels. In this post I will show you how you can use the NVIDIA Tools Extension (NVTX) to annotate the timeline with useful information. I will demonstrate how to add time…
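The core NVTX pattern is a push/pop pair around each region you want labeled; here is a minimal sketch assuming the NVTX v3 header that ships with the CUDA Toolkit:

    #include <cuda_runtime.h>
    #include <nvtx3/nvToolsExt.h>     // NVTX v3, bundled with the CUDA Toolkit

    __global__ void work() { }        // placeholder kernel

    int main() {
        nvtxRangePushA("initialization");   // open a named range
        cudaFree(0);                        // forces CUDA context creation
        nvtxRangePop();                     // close it

        nvtxRangePushA("compute phase");
        work<<<1, 1>>>();
        cudaDeviceSynchronize();
        nvtxRangePop();
        return 0;
    }

When the application is profiled (for example, with Nsight Systems), each push/pop pair appears as a labeled bar on the timeline alongside the CUDA API calls and kernels.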
]]>