Nave Algarici – NVIDIA Technical Blog News and tutorials for developers, data scientists, and IT admins 2025-03-06T20:05:47Z http://www.open-lab.net/blog/feed/ Nave Algarici <![CDATA[How Using a Reranking Microservice Can Improve Accuracy and Costs of Information Retrieval]]> http://www.open-lab.net/blog/?p=96363 2025-03-06T20:05:47Z 2025-03-06T18:33:38Z Applications requiring high-performance information retrieval span a wide range of domains, including search engines, knowledge management systems, AI agents,...]]>

Applications requiring high-performance information retrieval span a wide range of domains, including search engines, knowledge management systems, AI agents, and AI assistants. These systems demand retrieval processes that are accurate and computationally efficient to deliver precise insights, enhance user experiences, and maintain scalability. Retrieval-augmented generation (RAG) is used to…

Source

]]>
Nave Algarici <![CDATA[Develop Multilingual and Cross-Lingual Information Retrieval Systems with Efficient Data Storage]]> http://www.open-lab.net/blog/?p=93638 2024-12-17T20:42:28Z 2024-12-17T16:00:00Z Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity,...]]>

Efficient text retrieval is critical for a broad range of information retrieval applications such as search, question answering, semantic textual similarity, summarization, and item recommendation. It also plays a pivotal role in retrieval-augmented generation (RAG), a technique that enables large language models (LLMs) to access external context without modifying underlying parameters.

Source

]]>
Nave Algarici <![CDATA[Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever?]]> http://www.open-lab.net/blog/?p=85762 2024-10-28T21:50:54Z 2024-07-23T15:15:00Z Enterprises are sitting on a goldmine of data waiting to be used to improve efficiency, save money, and ultimately enable higher productivity. With generative...]]>

Enterprises are sitting on a goldmine of data waiting to be used to improve efficiency, save money, and ultimately enable higher productivity. With generative AI, developers can build and deploy an agentic flow or a retrieval-augmented generation (RAG) chatbot, while ensuring the insights provided are based on the most accurate and up-to-date information. Building these solutions requires not…

Source

]]>
Nave Algarici <![CDATA[Evaluating Retriever for Enterprise-Grade RAG]]> http://www.open-lab.net/blog/?p=78222 2024-10-28T21:59:05Z 2024-02-23T19:02:26Z The conversation about designing and evaluating Retrieval-Augmented Generation (RAG) systems is a long, multi-faceted discussion. Even when we look at retrieval...]]>

The conversation about designing and evaluating Retrieval-Augmented Generation (RAG) systems is a long, multi-faceted discussion. Even when we look at retrieval on its own, developers selectively employ many techniques, such as query decomposition, re-writing, building soft filters, and more, to increase the accuracy of their RAG pipelines. While the techniques vary from system to system…

Source

]]>
0
Nave Algarici <![CDATA[Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model]]> http://www.open-lab.net/blog/?p=74346 2024-10-28T22:00:06Z 2023-11-28T18:10:50Z Large language models (LLMs) are transforming the AI landscape with their profound grasp of human and programming languages. Essential for next-generation...]]>

Large language models (LLMs) are transforming the AI landscape with their profound grasp of human and programming languages. Essential for next-generation enterprise productivity applications, they enhance user efficiency across tasks like programming, copy editing, brainstorming, and answering questions on a wide range of topics. However, these models often struggle with real-time events and…

Source

]]>
0
���˳���97caoporen����