Large language models (LLMs) that specialize in coding have been steadily adopted into developer workflows. From pair programming to self-improving AI agents, these models assist developers with various tasks, including enhancing code, fixing bugs, generating tests, and writing documentation. To promote the development of open-source LLMs, the Qwen team recently released Qwen2.5-Coder…
In the rapidly evolving landscape of artificial intelligence, the quality of the data used for training models is paramount. High-quality data ensures that models are accurate, reliable, and capable of generalizing well across various applications. The recent NVIDIA webinar, Enhance Generative AI Model Accuracy with High-Quality Multimodal Data Processing, dove into the intricacies of data…
Classifier models specialize in categorizing data into predefined groups or classes, and play a crucial role in optimizing data processing pipelines for fine-tuning and pretraining generative AI models. Their value lies in enhancing data quality: they filter out low-quality or toxic data, ensuring that only clean and relevant information feeds downstream processes. Beyond filtering…
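The filtering role described above can be sketched in a few lines. This is an illustrative example, not code from the post: `score_quality` is a hypothetical stand-in for a trained quality classifier, and the threshold value is arbitrary.

```python
# Sketch of classifier-based data filtering for a training pipeline.
# score_quality is a toy stand-in; a real pipeline would run a trained
# classifier model here and return its predicted quality score.
def score_quality(doc: str) -> float:
    # Toy heuristic: longer documents score higher, capped at 1.0.
    return min(len(doc.split()) / 10.0, 1.0)

def filter_corpus(docs, threshold=0.5):
    # Keep only documents whose quality score clears the threshold,
    # so downstream fine-tuning or pretraining sees cleaner data.
    return [d for d in docs if score_quality(d) >= threshold]

docs = ["short", "a much longer, cleaner training document with many words"]
print(filter_corpus(docs))  # only the second document survives
```

In a real curation pipeline the scoring model, threshold, and filtering policy would all be tuned to the target dataset; this sketch only shows the shape of the filter step.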
Generative AI has rapidly evolved from text-based models to multimodal capabilities. These models perform tasks like image captioning and visual question answering, reflecting a shift toward more human-like AI. The community is now expanding from text and images to video, opening new possibilities across industries. Video AI models are poised to revolutionize industries such as robotics…
Learn how to build scalable data processing pipelines to create high-quality datasets.
Open-source datasets have significantly democratized access to high-quality data, lowering the barrier to entry for developers and researchers training cutting-edge generative AI models. By providing free access to diverse, well-curated data, these datasets enable the open-source community to train models at or close to the frontier, facilitating the rapid advancement…
NeMo Curator now supports images, enabling you to process data for training accurate generative AI models.
As large language models (LLMs) continue to evolve at an unprecedented pace, enterprises are looking to build generative AI-powered applications that maximize throughput to lower operational costs and minimize latency to deliver superior user experiences. This post discusses the critical performance metrics of throughput and latency for LLMs, exploring their importance and trade-offs between…
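The two metrics named above can be made concrete with a small sketch. This is an illustrative example under assumed definitions (all request timings and token counts are made up): throughput as total tokens generated per unit time, and latency as the mean end-to-end time per request.

```python
# Sketch: computing throughput and latency from hypothetical request timings.
from dataclasses import dataclass

@dataclass
class Request:
    output_tokens: int   # tokens the model generated for this request
    latency_s: float     # wall-clock time from submission to last token

def throughput_tokens_per_s(requests, window_s):
    """System-level view: total tokens produced across the window."""
    return sum(r.output_tokens for r in requests) / window_s

def mean_latency_s(requests):
    """User-level view: average end-to-end time per request."""
    return sum(r.latency_s for r in requests) / len(requests)

batch = [Request(128, 2.0), Request(256, 3.5), Request(64, 1.5)]
print(throughput_tokens_per_s(batch, window_s=4.0))  # 112.0 tokens/s
print(round(mean_latency_s(batch), 3))               # 2.333 s
```

The trade-off the post explores follows directly from these definitions: larger batches usually raise aggregate throughput but lengthen each individual request's latency.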
The newly unveiled Llama 3.1 collection of 8B, 70B, and 405B large language models (LLMs) is narrowing the gap between proprietary and open-source models. Their open nature is attracting more developers and enterprises to integrate these models into their AI applications. These models excel at various tasks including content generation, coding, and deep reasoning, and can be used to power…
Large language models (LLMs) adopted for specific enterprise applications most often benefit from model customization. Enterprises need to tailor LLMs for their specific needs and quickly deploy them for low-latency and high-throughput inferencing. This post will help you get started with this process. Specifically, we’ll show how to customize the Llama 3 8B NIM for answering questions in…
Brev.dev is making it easier to develop AI solutions by leveraging software libraries, frameworks, and Jupyter Notebooks on the NVIDIA NGC catalog. You can use Brev.dev to easily deploy software on an NVIDIA GPU by pairing a cloud orchestration tool with a simple UI. Get an on-demand GPU reliably from any cloud, access the notebook in-browser, or SSH directly into the machine with the Brev…
Generative AI is transforming computing, paving new avenues for humans to interact with computers in natural, intuitive ways. For enterprises, the prospect of generative AI is vast. Businesses can tap into their rich datasets to streamline time-consuming tasks—from text summarization and translation to insight prediction and content generation. But they must also navigate adoption challenges.
As large language models (LLMs) continue to gain traction in enterprise AI applications, the demand for custom models that can understand and integrate specific industry terminology, domain expertise, and unique organizational requirements becomes increasingly important. To address this growing need for customizing LLMs, the NVIDIA NeMo team has announced an early access program for NeMo…
Large language models (LLMs) have demonstrated remarkable capabilities, from tackling complex coding tasks to crafting compelling stories to translating natural language. Enterprises are customizing these models for even greater application-specific effectiveness to deliver higher accuracy and improved responses to end users. However, customizing LLMs for specific tasks can cause the model…
Across the globe, enterprises are realizing the benefits of generative AI models. They are racing to adopt these models in various applications, such as chatbots, virtual assistants, coding copilots, and more. While general-purpose models work well for simple tasks, they underperform when it comes to catering to the unique needs of various industries. Custom generative AI models outperform…
Following the introduction of ChatGPT, enterprises around the globe are realizing the benefits and capabilities of AI, and are racing to adopt it into their workflows. As this adoption accelerates, it becomes imperative for enterprises not only to keep pace with the rapid advancements in AI, but also to address related challenges such as optimization, scalability, and security.
From credit card transactions, social networks, and recommendation systems to transportation networks and protein-protein interactions in biology, graphs are the go-to data structure for modeling and analyzing intricate connections. Graph neural networks (GNNs), with their ability to learn and reason over graph-structured data, have emerged as a game-changer across various domains. However…
NVIDIA recently announced the NVIDIA NeMo SteerLM technique as part of the NVIDIA NeMo framework. This technique enables users to control large language model (LLM) responses during inference. The developer community has shown great interest in using the approach for building custom LLMs. The NVIDIA NeMo team is now open-sourcing a multi-attribute dataset called Helpfulness SteerLM dataset…
Efficiency is paramount in industrial manufacturing, where even minor gains can have significant financial implications. According to the American Society of Quality, “Many organizations will have true quality-related costs as high as 15-20% of sales revenue, some going as high as 40% of total operations.” These staggering statistics reveal a stark reality: defects in industrial applications not…
In the realm of generative AI, building enterprise-grade large language models (LLMs) requires expertise in collecting high-quality data, setting up accelerated infrastructure, and optimizing the models. Developers can begin with pretrained models and fine-tune them for their use case, saving time and getting their solutions to market faster. Developers need an easy way to try out models…