Abhishek Sawarkar – NVIDIA Technical Blog

News and tutorials for developers, data scientists, and IT admins 2024-11-20T23:03:22Z http://www.open-lab.net/blog/feed/ Abhishek Sawarkar <![CDATA[Scale High-Performance AI Inference with Google Kubernetes Engine and NVIDIA NIM]]> http://www.open-lab.net/blog/?p=90198 2024-10-30T18:57:03Z 2024-10-16T16:30:00Z

The rapid evolution of AI models has driven the need for more efficient and scalable inferencing solutions. As organizations strive to harness the power of AI,...]]>

The rapid evolution of AI models has driven the need for more efficient and scalable inferencing solutions. As organizations strive to harness the power of AI, they face challenges in deploying, managing, and scaling AI inference workloads. NVIDIA NIM and Google Kubernetes Engine (GKE) together offer a powerful solution to address these challenges. NVIDIA has collaborated with Google Cloud to…

]]> Abhishek Sawarkar <![CDATA[Google Cloud Run Adds Support for NVIDIA L4 GPUs, NVIDIA NIM, and Serverless AI Inference Deployments at Scale]]> http://www.open-lab.net/blog/?p=87666 2024-09-05T17:57:27Z 2024-08-21T18:00:00Z

Deploying AI-enabled applications and services presents enterprises with significant challenges: Performance is critical as it directly shapes user...]]>

Deploying AI-enabled applications and services presents enterprises with significant challenges: Addressing these challenges requires a full-stack approach that can optimize performance, manage scalability effectively, and navigate the complexities of deployment, enabling organizations to maximize AI’s full potential while maintaining operational efficiency and cost-effectiveness.

]]> Abhishek Sawarkar <![CDATA[NVIDIA AI Foundation Models: Build Custom Enterprise Chatbots and Co-Pilots with Production-Ready LLMs]]> http://www.open-lab.net/blog/?p=73296 2024-11-20T23:03:22Z 2023-11-15T16:00:00Z

Large language models (LLMs) are revolutionizing data science, enabling advanced capabilities in natural language understanding, AI, and machine learning....]]>

Large language models (LLMs) are revolutionizing data science, enabling advanced capabilities in natural language understanding, AI, and machine learning. Custom LLMs, tailored for domain-specific insights, are finding increased traction in enterprise applications. The NVIDIA Nemotron-3 8B family of foundation models is a powerful new tool for building production-ready generative AI…

]]> 4 Abhishek Sawarkar <![CDATA[Elevate Enterprise Generative AI App Development with NVIDIA AI on Azure Machine Learning]]> http://www.open-lab.net/blog/?p=73312 2023-12-30T00:41:50Z 2023-11-15T16:00:00Z

Generative AI is revolutionizing how organizations across all industries are leveraging data to increase productivity, advance personalized customer engagement,...]]>

Generative AI is revolutionizing how organizations across all industries are leveraging data to increase productivity, advance personalized customer engagement, and foster innovation. Given its tremendous value, enterprises are looking for tools and expertise that help them integrate this new technology into their business operations and strategies effectively and reliably.

]]> 0 Abhishek Sawarkar <![CDATA[Train Your AI Model Once and Deploy on Any Cloud with NVIDIA and Run:ai]]> http://www.open-lab.net/blog/?p=67035 2023-09-11T21:36:55Z 2023-07-07T16:38:25Z

Organizations are increasingly adopting hybrid and multi-cloud strategies to access the latest compute resources, consistently support worldwide customers, and...]]>

Organizations are increasingly adopting hybrid and multi-cloud strategies to access the latest compute resources, consistently support worldwide customers, and optimize cost. However, a major challenge that engineering teams face is operationalizing AI applications across different platforms as the stack changes. This requires MLOps teams to familiarize themselves with different environments and…

]]> 2 Abhishek Sawarkar <![CDATA[Building a Computer Vision Application to Recognize Human Activities]]> http://www.open-lab.net/blog/?p=49322 2022-07-25T19:18:05Z 2022-06-21T21:43:32Z

Watch this On-Demand webinar, Build A Computer Vision Application with NVIDIA AI on Google Cloud Vertex AI, where we walk you step-by-step...]]>

Watch this On-Demand webinar, Build A Computer Vision Application with NVIDIA AI on Google Cloud Vertex AI, where we walk you step-by-step through using these resources to build your own action recognition application. Advances in computer vision models are providing deeper insights to make our lives increasingly productive, our communities safer, and our planet cleaner. We’ve come a…

]]> 1 Abhishek Sawarkar <![CDATA[Building Transcription and Entity Recognition Apps Using NVIDIA Riva]]> http://www.open-lab.net/blog/?p=24076 2023-03-22T01:16:50Z 2021-11-09T16:15:08Z

In the past several months, many of us have grown accustomed to seeing our doctors over a video call. It’s certainly convenient, but after the call ends,...]]>

In the past several months, many of us have grown accustomed to seeing our doctors over a video call. It’s certainly convenient, but after the call ends, those important pieces of advice from your doctor start to slip away. What was that new medication I needed to take? Were there any side effects to watch out for? Conversational AI can help in building an application to transcribe speech as…

]]> 10 Abhishek Sawarkar <![CDATA[Achieving Noise-Free Audio for Virtual Collaboration and Content Creation Applications]]> http://www.open-lab.net/blog/?p=37611 2023-11-03T07:15:12Z 2021-09-21T19:41:05Z

With audio and video streaming, conferencing, and telecommunication on the rise, it has become essential for developers to build applications with outstanding...]]>

With audio and video streaming, conferencing, and telecommunication on the rise, it has become essential for developers to build applications with outstanding audio quality and enable end users to communicate and collaborate effectively. Various background noises can disrupt communication, ranging from traffic and construction to dogs barking and babies crying. Moreover, a user could talk in a…

]]> 1 Abhishek Sawarkar <![CDATA[Accelerating Inference with Sparsity Using the NVIDIA Ampere Architecture and NVIDIA TensorRT]]> http://www.open-lab.net/blog/?p=34218 2023-06-12T21:09:10Z 2021-07-20T13:00:00Z

This post was updated July 20, 2021 to reflect NVIDIA TensorRT 8.0 updates. When deploying a neural network, it's useful to think about how the network could be...]]>

This post was updated July 20, 2021 to reflect NVIDIA TensorRT 8.0 updates. Join the NVIDIA Triton and NVIDIA TensorRT community to stay current on the latest product updates, bug fixes, content, best practices, and more. When deploying a neural network, it’s useful to think about how the network could be made to run faster or take less space. A more efficient network can make better…

]]> 13 Abhishek Sawarkar <![CDATA[Continuously Improving Recommender Systems for Competitive Advantage Using NVIDIA Merlin and MLOps]]> http://www.open-lab.net/blog/?p=33639 2024-10-28T19:22:30Z 2021-07-01T00:23:02Z

Recommender systems are a critical resource for enterprises that are relentlessly striving to improve customer engagement. They work by suggesting potentially...]]>

Recommender systems are a critical resource for enterprises that are relentlessly striving to improve customer engagement. They work by suggesting potentially relevant products and services amongst an overwhelmingly large and ever-increasing number of offerings. NVIDIA Merlin is an application framework that accelerates all phases of recommender system development on NVIDIA GPUs…

]]> 2 Abhishek Sawarkar <![CDATA[Accelerating Conversational AI Research with New Cutting-Edge Neural Networks and Features from NeMo 1.0]]> http://www.open-lab.net/blog/?p=32233 2023-02-10T22:26:14Z 2021-06-08T16:00:00Z

NVIDIA NeMo is a conversational AI toolkit built for researchers working on automatic speech recognition (ASR), natural language processing (NLP), and...]]>

NVIDIA NeMo is a conversational AI toolkit built for researchers working on automatic speech recognition (ASR), natural language processing (NLP), and text-to-speech synthesis (TTS). The primary objective of NeMo is to help researchers from industry and academia reuse prior work (code and pretrained models) and make it easier to create new conversational AI models. NeMo is an open-source project…

]]> 6 Abhishek Sawarkar <![CDATA[Deploying a Natural Language Processing Service on a Kubernetes Cluster with Helm Charts from NVIDIA NGC]]> http://www.open-lab.net/blog/?p=22018 2022-08-21T23:40:46Z 2020-11-11T22:39:07Z

Conversational AI solutions such as chatbots are now deployed in the data center, on the cloud, and at the edge to deliver lower latency and high quality of...]]>

Conversational AI solutions such as chatbots are now deployed in the data center, on the cloud, and at the edge to deliver lower latency and high quality of service while meeting an ever-increasing demand. The strategic decision to run AI inference on any or all these compute platforms varies not only by the use case but also evolves over time with the business. Hence…

]]> 4 Abhishek Sawarkar <![CDATA[Simplifying AI Inference with NVIDIA Triton Inference Server from NVIDIA NGC]]> http://www.open-lab.net/blog/?p=19889 2022-10-10T18:57:20Z 2020-08-25T00:12:17Z

Seamlessly deploying AI services at scale in production is as critical as creating the most accurate AI model. Conversational AI services, for example, need...]]>

Seamlessly deploying AI services at scale in production is as critical as creating the most accurate AI model. Conversational AI services, for example, need multiple models handling functions of automatic speech recognition (ASR), natural language understanding (NLU), and text-to-speech (TTS) to complete the application pipeline. To provide real-time conversation to users…

]]> 3 Abhishek Sawarkar <![CDATA[Optimizing and Accelerating AI Inference with the TensorRT Container from NVIDIA NGC]]> http://www.open-lab.net/blog/?p=19032 2022-10-10T18:57:20Z 2020-07-23T17:24:26Z

Natural language processing (NLP) is one of the most challenging tasks for AI because it needs to understand context, phonics, and accent to convert human...]]>

Natural language processing (NLP) is one of the most challenging tasks for AI because it needs to understand context, phonics, and accent to convert human speech into text. Building this AI workflow starts with training a model that can understand and process spoken language to text. BERT is one of the best models for this task. Instead of starting from scratch to build state-of-the-art…

]]> 0