NVIDIA JetPack provides a full development environment for hardware-accelerated AI-at-the-edge on Jetson platforms. Previously, a standalone version of NVIDIA JetPack supported a single release of CUDA, and you did not have the ability to upgrade CUDA on a given NVIDIA JetPack version. NVIDIA JetPack is released on a rolling cadence with a single version of CUDA…
NVIDIA announces the newest CUDA Toolkit software release, 11.8. This release is focused on enhancing the programming model and CUDA application speedup through new hardware capabilities. New architecture-specific features in NVIDIA Hopper and Ada Lovelace are initially being exposed through libraries and framework enhancements. The full programming model enhancements for the NVIDIA Hopper…
OpenCL is evolving the way it can interoperate with other APIs, such as Vulkan. This post gives you a tour of the new style of OpenCL interop, which is already supported in the latest NVIDIA drivers. We include downloadable sample code so you can try this new functionality today. Developers often use OpenCL for compute together with other APIs, such as OpenGL…
NVIDIA PhysicsNeMo was previously known as NVIDIA SimNet. Simulations have been widely used to model a variety of real-world problems in the science and engineering domains. Recent developments in AI and machine learning have led to the use of data to build surrogates for simulations, but the latest efforts have focused on the infusion of scientific laws in neural networks.
NVIDIA PhysicsNeMo was previously known as NVIDIA SimNet. Simulations are pervasive in every domain of science and engineering, but they often have constraints such as large computational times, limited compute resources, tedious manual setup efforts, and the need for technical expertise. Neural networks not only accelerate simulations done by traditional solvers, but also simplify simulation…
NVIDIA PhysicsNeMo was previously known as NVIDIA SimNet. Simulations are pervasive in every domain of science and engineering, but they are often constrained by large computational times, limited compute resources, tedious manual setup efforts, and the need for technical expertise. NVIDIA PhysicsNeMo is a simulation toolkit that addresses these challenges with a combination of AI and physics.
NVIDIA PhysicsNeMo was previously known as NVIDIA SimNet. Today, NVIDIA announces the release of PhysicsNeMo v21.06 for general availability, enabling physics simulations across a variety of use cases. NVIDIA PhysicsNeMo is a Physics-Informed Neural Networks (PINNs) toolkit for engineers, scientists, students, and researchers who either want to get started with AI-driven physics…
NVIDIA welcomes OpenCL 3.0’s focus on defining a baseline to enable developer-critical functionality to be widely adopted in future versions of the specification. With the recently released R465 display driver, NVIDIA is now officially OpenCL 3.0 conformant on both Windows and Linux. In September 2020, the Khronos Group released the OpenCL 3.0 final specification.
There is a growing need among embedded and HPC applications to share resources and control execution for pipelined workflows spanning multiple hardware engines and software applications. The following diagram gives an insight into the number of engines that can be supported on NVIDIA-embedded platforms. In addition to GPUs, this includes image processing units like image signal processors…
Designing new custom hardware accelerators for deep learning is clearly popular, but achieving state-of-the-art performance and efficiency with a new design is a complex and challenging problem. Two years ago, NVIDIA opened the source for the hardware design of the NVIDIA Deep Learning Accelerator (NVDLA) to help advance the adoption of efficient AI inferencing in custom hardware designs.
Remote Direct Memory Access (RDMA) allows computers to exchange data in memory without the involvement of a CPU. The benefits include low latency and high bandwidth data exchange. GPUDirect RDMA extends the same philosophy to the GPU and the connected peripherals in Jetson AGX Xavier. GPUDirect RDMA enables a direct path for data exchange between the GPU-accessible memory (the CUDA memory) and a…