Build an Agentic RAG Pipeline with Llama 3.1 and NVIDIA NeMo Retriever NIMs

jwitsoe · July 23, 2024, 3:15pm

Originally published at: https://developer.nvidia.com/blog/build-an-agentic-rag-pipeline-with-llama-3-1-and-nvidia-nemo-retriever-nims/

Retrieval-augmented generation (RAG) is an effective strategy for keeping large language model (LLM) responses up to date and reducing hallucinations. While various retrieval strategies can improve the recall of documents for generation, there is no one-size-fits-all approach. The retrieval pipeline depends on your data, from hyperparameters like the chunk size and the number of documents returned,…

Related topics

Topic	Category/Tags	Replies	Views	Activity
Develop Production-Grade Text Retrieval Pipelines for RAG with NVIDIA NeMo Retriever	Technical Blog	1	22	July 23, 2024
Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model	Technical Blog	0	502	November 28, 2023
RAG 101: Demystifying Retrieval-Augmented Generation Pipelines	Technical Blog	1	537	December 19, 2023
Explainer: What Is Retrieval-Augmented Generation aka RAG?	Technical Blog	0	448	November 24, 2023
Enhancing RAG Pipelines with Re-Ranking	Technical Blog	1	40	July 30, 2024
Building a Simple VLM-Based Multimodal Information Retrieval System with NVIDIA NIM	Technical Blog, nim	2	22	February 26, 2025
A Guide to Retrieval-Augmented Generation for AEC	Technical Blog	2	28	January 2, 2025
Tips for Building a RAG Pipeline with NVIDIA AI LangChain AI Endpoints	Technical Blog	10	495	August 28, 2024
Building Your First LLM Agent Application	Technical Blog	0	664	November 30, 2023
Advanced RAG Techniques for Telco O-RAN Specifications Using NVIDIA NIM Microservices	Technical Blog, nim	1	18	October 10, 2024