Chris Alexiuk – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-03-06T19:52:48Z http://www.open-lab.net/blog/feed/ Chris Alexiuk <![CDATA[Build an AI Agent with Expert Reasoning Capabilities Using the DeepSeek-R1 NIM]]> http://www.open-lab.net/blog/?p=96030 2025-03-06T19:52:48Z 2025-02-28T20:23:51Z AI agents are transforming business operations by automating processes, optimizing decision-making, and streamlining actions. Their effectiveness hinges on...]]>

AI agents are transforming business operations by automating processes, optimizing decision-making, and streamlining actions. Their effectiveness hinges on expert reasoning, enabling smarter planning and efficient execution. Agentic AI applications could benefit from the capabilities of models such as DeepSeek-R1. Built for solving problems that require advanced AI reasoning…

Source

]]>
Chris Alexiuk <![CDATA[Mastering LLM Techniques: Evaluation]]> http://www.open-lab.net/blog/?p=95447 2025-02-17T05:21:53Z 2025-01-29T20:44:06Z Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and...]]>

Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and multifaceted nature of these systems. Unlike traditional machine learning (ML) models, LLMs generate a wide range of diverse and often unpredictable outputs, making standard evaluation metrics insufficient. Key challenges include the…

Source

]]>
Chris Alexiuk <![CDATA[Accelerate Custom Video Foundation Model Pipelines with New NVIDIA NeMo Framework Capabilities]]> http://www.open-lab.net/blog/?p=94541 2025-02-04T19:34:45Z 2025-01-07T16:00:00Z Generative AI has evolved from text-based models to multimodal models, with a recent expansion into video, opening up new potential uses across various...]]>

Generative AI has evolved from text-based models to multimodal models, with a recent expansion into video, opening up new potential uses across various industries. Video models can create new experiences for users or simulate scenarios for training autonomous agents at scale. They are helping revolutionize various industries including robotics, autonomous vehicles, and entertainment.

Source

]]>
Chris Alexiuk <![CDATA[Deploying Fine-Tuned AI Models with NVIDIA NIM]]> http://www.open-lab.net/blog/?p=91696 2024-12-17T00:07:21Z 2024-11-21T22:04:57Z For organizations adapting AI foundation models with domain-specific data, the ability to rapidly create and deploy fine-tuned models is key to efficiently...]]>

For organizations adapting AI foundation models with domain-specific data, the ability to rapidly create and deploy fine-tuned models is key to efficiently delivering value with enterprise generative AI applications. NVIDIA NIM offers prebuilt, performance-optimized inference microservices for the latest AI foundation models, including seamless deployment of models customized using parameter…

Source

]]>
Chris Alexiuk <![CDATA[An Introduction to Model Merging for LLMs]]> http://www.open-lab.net/blog/?p=90842 2024-10-31T18:33:13Z 2024-10-28T18:30:00Z One challenge organizations face when customizing large language models (LLMs) is the need to run multiple experiments, which produces only one useful model....]]>

One challenge organizations face when customizing large language models (LLMs) is the need to run multiple experiments, which produces only one useful model. While the cost of experimentation is typically low, and the results well worth the effort, this experimentation process does involve “wasted” resources, such as compute assets spent without their product being utilized…

Source

]]>
2
Chris Alexiuk <![CDATA[Leverage the Latest Open Models for Synthetic Data Generation with NVIDIA Nemotron-4-340B]]> http://www.open-lab.net/blog/?p=84322 2024-10-04T21:38:35Z 2024-08-16T16:15:56Z [stextbox id="info"]The Llama-3.1-Nemotron 70B-Reward model helps generate high-quality training data that aligns with human preferences for finance, retail,...]]>

The Llama-3.1-Nemotron 70B-Reward model helps generate high-quality training data that aligns with human preferences for finance, retail, healthcare, scientific research, telecommunications, and sovereign AI. This post was updated on August 16, 2024 to reflect the most recent Reward Bench results. Since the introduction and subsequent wide adoption of large language models (LLMs)…

Source

]]>
1
Chris Alexiuk <![CDATA[New LLM: Snowflake Arctic Model for SQL and Code Generation]]> http://www.open-lab.net/blog/?p=81484 2024-05-07T16:53:04Z 2024-04-27T00:42:50Z Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...]]>

Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text summarization, question answering, and natural language generation. Arctic, developed by Snowflake, is a new open LLM designed to achieve high inference performance while maintaining low cost on various NLP tasks. Arctic Arctic is…

Source

]]>
���˳���97caoporen����