Hey everyone 👋
We have been building traceAI, an open-source observability tool
for LLM applications in production.
It traces every LLM call, capturing inputs, outputs, latency, costs,
and errors, with minimal setup. Useful for teams running inference
on NVIDIA hardware who want full visibility into what their models
are actually doing in prod.
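To make the idea concrete, here's a minimal sketch of what call-level tracing captures. This is NOT traceAI's actual API; the names `trace_llm`, `TRACES`, and `fake_completion` are hypothetical, and a real tool would export records to a backend instead of a list:

```python
import time
import functools

# Collected trace records; a real observability tool would export
# these to a backend rather than keep them in memory.
TRACES = []

def trace_llm(fn):
    """Hypothetical decorator: records input, output, latency, and errors per call."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        record = {"fn": fn.__name__, "input": {"args": args, "kwargs": kwargs}}
        start = time.perf_counter()
        try:
            result = fn(*args, **kwargs)
            record["output"] = result
            return result
        except Exception as exc:
            record["error"] = repr(exc)
            raise
        finally:
            record["latency_s"] = time.perf_counter() - start
            TRACES.append(record)
    return wrapper

@trace_llm
def fake_completion(prompt):
    # Stand-in for a real model/API call.
    return f"echo: {prompt}"

fake_completion("hello")
```

After the call, `TRACES[0]` holds the prompt, the response, and the measured latency; a failed call would carry an `error` field instead of `output`.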
Would love feedback from folks running LLMs at scale. What does
your current monitoring setup look like, and what's missing from it?