Itay Ozery – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-05-29T17:31:03Z http://www.open-lab.net/blog/feed/ Itay Ozery <![CDATA[NVIDIA ConnectX-8 SuperNICs Advance AI Platform Architecture with PCIe Gen6 Connectivity]]> http://www.open-lab.net/blog/?p=99991 2025-05-29T17:31:03Z 2025-05-19T04:07:33Z As AI workloads grow in complexity and scale��from large language models (LLMs) to agentic AI reasoning and physical AI��the demand for faster, more scalable...]]>

As AI workloads grow in complexity and scale—from large language models (LLMs) to agentic AI reasoning and physical AI—the demand for faster, more scalable compute infrastructure has never been greater. Meeting these demands requires rethinking system architecture from the ground up. NVIDIA is advancing platform architecture with NVIDIA ConnectX-8 SuperNICs, the industry’s first SuperNIC to…

Source

]]>
Itay Ozery <![CDATA[Powering Next-Generation AI Networking with NVIDIA SuperNICs]]> http://www.open-lab.net/blog/?p=90176 2024-11-01T14:27:00Z 2024-10-15T16:30:00Z In the era of generative AI, accelerated networking is essential to build high-performance computing fabrics for massively distributed AI workloads. NVIDIA...]]>

In the era of generative AI, accelerated networking is essential to build high-performance computing fabrics for massively distributed AI workloads. NVIDIA continues to lead in this space, offering state-of-the-art Ethernet and InfiniBand solutions that maximize the performance and efficiency of AI factories and cloud data centers. At the core of these solutions are NVIDIA SuperNICs—a new…

Source

]]>
Itay Ozery <![CDATA[Scaling Enterprise RAG with Accelerated Ethernet Networking and Networked Storage]]> http://www.open-lab.net/blog/?p=79614 2024-10-28T21:58:50Z 2024-03-18T22:00:00Z In the era of generative AI, where machines are not just learning from data but generating human-like text, images, video, and more, retrieval-augmented...]]>

In the era of generative AI, where machines are not just learning from data but generating human-like text, images, video, and more, retrieval-augmented generation (RAG) stands out as a groundbreaking approach. A RAG workflow builds on large language models (LLMs), which can understand queries and generate responses. However, LLMs have limitations, including training complexity and a lack of…

Source

]]>
1
Itay Ozery <![CDATA[Explainer: What Is a SuperNIC?]]> http://www.open-lab.net/blog/?p=74303 2024-10-11T20:02:13Z 2023-12-01T17:00:00Z A SuperNIC is a type of network accelerator for AI cloud data centers that delivers robust and seamless connectivity between GPU servers.]]>

A SuperNIC is a type of network accelerator for AI cloud data centers that delivers robust and seamless connectivity between GPU servers.

Source

]]>
0
Itay Ozery <![CDATA[Power the Next Wave of Applications with NVIDIA BlueField-3 DPUs]]> http://www.open-lab.net/blog/?p=64597 2023-07-11T23:12:17Z 2023-05-11T20:00:00Z ChatGPT, Stable Diffusion, DALL-E, and similar applications have awakened the world to generative AI. ChatGPT is the fastest-growing application in history. The...]]>

ChatGPT, Stable Diffusion, DALL-E, and similar applications have awakened the world to generative AI. ChatGPT is the fastest-growing application in history. The ease of use and impressive capabilities have attracted over a hundred million users in just a few months. Generative AI has created a sense of urgency for companies to reimagine their products and business models. As NVIDIA CEO Jensen…

Source

]]>
0
Itay Ozery <![CDATA[Transform the Data Center for the AI Era with NVIDIA DPUs and NVIDIA DOCA]]> http://www.open-lab.net/blog/?p=62095 2023-10-23T17:20:53Z 2023-03-21T17:00:00Z NVIDIA BlueField-3 data processing units (DPUs) are now in full production, and have been selected by Oracle Cloud Infrastructure (OCI) to achieve higher...]]>

NVIDIA BlueField-3 data processing units (DPUs) are now in full production, and have been selected by Oracle Cloud Infrastructure (OCI) to achieve higher performance, better efficiency, and stronger security, as announced at NVIDIA GTC 2023. As a 400 Gb/s infrastructure compute platform, BlueField-3 enables organizations to deploy and operate data centers at massive scale.

Source

]]>
0
Itay Ozery <![CDATA[Accelerate Enterprise Apps with Microsoft Azure Stack HCI and NVIDIA BlueField DPUs]]> http://www.open-lab.net/blog/?p=57175 2022-11-17T19:43:10Z 2022-11-10T14:00:00Z As enterprises continue to shift workloads to the cloud, some applications need to remain on-premises to maximize latency performance and meet security, data...]]>

As enterprises continue to shift workloads to the cloud, some applications need to remain on-premises to maximize latency performance and meet security, data sovereignty, and compliance policies. Microsoft Azure Stack HCI is a hyperconverged infrastructure (HCI) stack delivered as an Azure service. Providing built-in security and manageability, Azure Stack HCI is ideally positioned to run…

Source

]]>
0
Itay Ozery <![CDATA[Scaling Zero Touch RoCE Technology with Round Trip Time Congestion Control]]> http://www.open-lab.net/blog/?p=41691 2022-08-21T23:53:09Z 2021-12-14T22:10:52Z NVIDIA Zero Touch RoCE (ZTR) enables data centers to seamlessly deploy RDMA over Converged Ethernet (RoCE) without requiring any special switch configuration....]]>

NVIDIA Zero Touch RoCE (ZTR) enables data centers to seamlessly deploy RDMA over Converged Ethernet (RoCE) without requiring any special switch configuration. Until recently, ZTR was optimal for only small to medium-sized data centers. Meanwhile, large-scale deployments have traditionally relied on Explicit Congestion Notification (ECN) to enable RoCE network transport…

Source

]]>
15
Itay Ozery <![CDATA[Streamlining Kubernetes Networking in Scale-out GPU Clusters with the new NVIDIA Network Operator 1.0]]> http://www.open-lab.net/blog/?p=34099 2022-08-21T23:52:06Z 2021-07-12T16:00:00Z The growing prevalence of GPU-accelerated computing in the cloud, enterprise, and at the edge increasingly relies on robust and powerful network...]]>

The growing prevalence of GPU-accelerated computing in the cloud, enterprise, and at the edge increasingly relies on robust and powerful network infrastructures. NVIDIA ConnectX SmartNICs and NVIDIA BlueField DPUs provide high-throughput, low-latency connectivity that enables the scaling of GPU resources across a fleet of nodes. To address the demand for cloud-native AI workloads…

Source

]]>
0
Itay Ozery <![CDATA[Securing and Accelerating Modern Data Center Workloads with NVIDIA ASAP2 Technology]]> http://www.open-lab.net/blog/?p=31562 2022-08-21T23:51:38Z 2021-05-17T23:04:47Z NVIDIA accelerated switching and packet processing (ASAP2) technology is becoming ubiquitous to supercharging networking and security for the most demanding...]]>

NVIDIA accelerated switching and packet processing (ASAP2) technology is becoming ubiquitous to supercharging networking and security for the most demanding applications. Modern data center networks are increasingly becoming virtualized and provisioned as a service. These software-defined networks (SDN) deliver great flexibility and control, enabling you to easily scale from the premises of…

Source

]]>
0
Itay Ozery <![CDATA[Securing and Accelerating Cloud Computing Platforms with NVIDIA BlueField-2 DPUs]]> http://www.open-lab.net/blog/?p=21399 2023-03-22T01:09:06Z 2020-10-05T13:00:00Z Cloud technologies are increasingly taking over the worldwide IT infrastructure market. With offerings that include elastic compute, storage, and networking,...]]>

Cloud technologies are increasingly taking over the worldwide IT infrastructure market. With offerings that include elastic compute, storage, and networking, cloud service providers (CSPs) allow customers to rapidly scale their IT infrastructure up and down without having to build and manage it on their own. The increasing demand for differentiated and cost-effective cloud products and services is…

Source

]]>
1
Itay Ozery <![CDATA[Accelerating Bare Metal Kubernetes Workloads, the Right Way]]> http://www.open-lab.net/blog/?p=18182 2022-08-21T23:40:14Z 2020-06-18T19:53:00Z [stextbox id="info"]This post was originally published on the Mellanox blog.[/stextbox] In my previous Kubernetes post, Provision Bare-Metal Kubernetes Like a...]]>

This post was originally published on the Mellanox blog. In my previous Kubernetes post, Provision Bare-Metal Kubernetes Like a Cloud Giant!, I discussed the benefits of using BlueField DPU-programmable SmartNICs to simplify provisioning of Kubernetes clusters in bare-metal infrastructures. A key takeaway from this post was the current rapid shift toward bare metal Kubernetes…

Source

]]>
0
���˳���97caoporen����