Advanced AI and Retrieval-Augmented Generation for Code Development in High-Performance Computing

jwitsoe · May 13, 2024, 4:35am

Originally published at: https://developer.nvidia.com/blog/advanced-ai-and-retrieval-augmented-generation-for-code-development-in-high-performance-computing/

In the rapidly evolving field of software development, AI tools such as chatbots and GitHub Copilot have significantly transformed how developers write and manage code. These tools, built on large language models (LLMs), enhance productivity by automating routine coding tasks. Parallel computing challenges However, the use of LLMs in generating parallel computing code—essential for high-performance…

sandeepwins10 · May 24, 2024, 3:22am

If I understand this Advanced RAG solution, it recommends code for HPC development that is more syntactically accurate and operationally efficient than previous AI models AND it does so without requiring HPC-specific fine-tuning. Previous LLMs generate serial code effectively but struggle with parallel operations such as deadlocks and race-conditions. They also do not account for user code running efficiently on diverse HPC architectures with unique hardware complexities.
Sandia’s contribution to the solution seems to be some level of automation and integration of Kokkos which is a leading tool for abstracting performance-portable applications from the underlying hardware.
Also Sandia’s benchmarked improvements in query relevancy, accuracy and balance of breadth vs. depth are very promising. Please confirm.

hpetty · July 9, 2024, 4:06pm

You captured the overall messages from the RAG blog and Kokkos portability. If you wish to discuss further, I recommend to reach out to Sarah Tsai, co-author at Sandia, and implementer of the RAG model described. You may find her on LinkedIn too.

Topic		Replies	Views
Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model Technical Blog	0	502	November 28, 2023
A Guide to Retrieval-Augmented Generation for AEC Technical Blog	2	28	January 2, 2025
Optimize AI Model Performance and Maintain Data Privacy with Hybrid RAG Technical Blog	1	57	July 11, 2024
Evaluating Medical RAG with NVIDIA AI Endpoints and Ragas Technical Blog	1	42	October 1, 2024
Evolving AI-Powered Game Development with Retrieval-Augmented Generation Technical Blog	1	10	October 1, 2024
Accelerating Inference on End-to-End Workflows with H2O.ai and NVIDIA Technical Blog	2	468	January 4, 2024
Tips for Building a RAG Pipeline with NVIDIA AI LangChain AI Endpoints Technical Blog	10	491	August 28, 2024
How to Take a RAG Application from Pilot to Production in Four Steps Technical Blog	1	227	March 18, 2024
Scaling Enterprise RAG with Accelerated Ethernet Networking and Networked Storage Technical Blog	2	317	April 2, 2024
NVIDIA AI Platform Delivers Big Gains for Large Language Models Technical Blog	0	415	July 28, 2022

Advanced AI and Retrieval-Augmented Generation for Code Development in High-Performance Computing

Related topics