Hardware / Semiconductor

Oct 25, 2024
Advancing Performance with NVIDIA SHARP In-Network Computing
AI and scientific computing applications are great examples of distributed computing problems. The problems are too large and the computations too intensive to...
7 MIN READ

Oct 24, 2024
Building AI Agents to Automate Software Test Case Creation
In software development, testing is crucial for ensuring the quality and reliability of the final product. However, creating test plans and specifications can...
15 MIN READ

Oct 09, 2024
NVIDIA Grace CPU Delivers World-Class Data Center Performance and Breakthrough Energy Efficiency
NVIDIA designed the NVIDIA Grace CPU to be a new kind of high-performance, data center CPU—one built to deliver breakthrough energy efficiency and optimized...
8 MIN READ

Sep 06, 2024
Using Generative AI Models in Circuit Design
Generative models have been making big waves in the past few years, from intelligent text-generating large language models (LLMs) to creative image and...
7 MIN READ

Aug 28, 2024
NVIDIA Blackwell Platform Sets New LLM Inference Records in MLPerf Inference v4.1
Large language model (LLM) inference is a full-stack challenge. Powerful GPUs, high-bandwidth GPU-to-GPU interconnects, efficient acceleration libraries, and a...
13 MIN READ

Aug 27, 2024
Optimize Large-Scale AI Workloads with NVIDIA Spectrum-X
In today’s rapidly evolving technological landscape, staying ahead of the curve is not just a goal—it's a necessity. The surge of innovations, particularly...
5 MIN READ

Aug 12, 2024
NVIDIA NVLink and NVIDIA NVSwitch Supercharge Large Language Model Inference
Large language models (LLM) are getting larger, increasing the amount of compute required to process inference requests. To meet real-time latency requirements...
8 MIN READ

Jul 11, 2024
Next Generation of FlashAttention
NVIDIA is excited to collaborate with Colfax, Together.ai, Meta, and Princeton University on their recent achievement to exploit the Hopper GPU architecture and...
1 MIN READ

Jun 24, 2024
Exploring SONiC on NVIDIA Air
Testing out networking infrastructure and building working PoCs for a new environment can be tricky at best and downright dreadful at worst. You may run into...
6 MIN READ

Jun 17, 2024
Video: Talk to Your Supply Chain Data Using NVIDIA NIM
NVIDIA operates one of the largest and most complex supply chains in the world. The supercomputers we build connect tens of thousands of NVIDIA GPUs with...
2 MIN READ

Jun 12, 2024
Introducing Grouped GEMM APIs in cuBLAS and More Performance Updates
The latest release of NVIDIA cuBLAS library, version 12.5, continues to deliver functionality and performance to deep learning (DL) and high-performance...
7 MIN READ

Jun 10, 2024
Spotlight: Cisco Enhances Workload Security and Operational Efficiency with NVIDIA BlueField-3 DPUs
As cyberattacks become more sophisticated, organizations must constantly adapt with cutting-edge solutions to protect their critical assets. One such solution...
7 MIN READ

Jun 07, 2024
Seamlessly Deploying a Swarm of LoRA Adapters with NVIDIA NIM
The latest state-of-the-art foundation large language models (LLMs) have billions of parameters and are pretrained on trillions of tokens of input text. They...
11 MIN READ

May 14, 2024
RAPIDS on Databricks: A Guide to GPU-Accelerated Data Processing
In today's data-driven landscape, maximizing performance and efficiency in data processing and analytics is critical. While many Databricks users are familiar...
10 MIN READ

May 14, 2024
NVIDIA TensorRT 10.0 Upgrades Usability, Performance, and AI Model Support
NVIDIA today announced the latest release of NVIDIA TensorRT, an ecosystem of APIs for high-performance deep learning inference. TensorRT includes inference...
7 MIN READ

Apr 30, 2024
Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks
This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...
3 MIN READ