GPU Usage Stuck at Placeholder in C++ Llama 3.2 App - Need NVML Help!

Description

Hey NVIDIA crew, I’m working on a C++ terminal app for Llama 3.2 (shoutout to your GPU tech in the README!), and I’ve hit a snag. The GPU usage readout is hardcoded as int gpu_usage = 0; (no real measurement, just a placeholder). I’m on the “optimize-algorithm” branch trying to squeeze out more performance, but without actual GPU stats I’m stuck. The README teases “GPU usage: 5%” in an example, but it’s fake. How do I hook up something like NVML to get the real numbers? Appreciate any pointers!

Environment

Here’s my setup:

  • TensorRT Version: N/A (not using it here)
  • GPU Type: NVIDIA GeForce GTX 1660
  • Nvidia Driver Version: 535.104.05
  • CUDA Version: 11.8
  • CUDNN Version: 8.9.0
  • Operating System + Version: Ubuntu 22.04
  • Python Version: N/A (pure C++)
  • TensorFlow Version: N/A
  • PyTorch Version: N/A
  • Baremetal or Container: Baremetal

Relevant Files

Check my repo: https://github.com/bniladridas/cpp_terminal_app (branch: optimize-algorithm)

Steps To Reproduce

  1. Clone it: git clone -b optimize-algorithm https://github.com/bniladridas/cpp_terminal_app.git
  2. Build: mkdir build && cd build && cmake .. && make
  3. Run: ./LlamaTerminalApp
  4. Output shows “GPU usage: 0%” (or 5% in the README example). It’s all placeholder output: no crash, just no real data.

No traceback, just a quiet fail on the GPU front.

Question

I’m thinking NVML (the NVIDIA Management Library) is the right tool here, since you guys own the CUDA stack. How do I plug it into the app to measure actual GPU utilization while Llama 3.2 is running? Code snippets or tips would be clutch. Thanks!
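
To frame what I’m after, here’s an untested sketch of the kind of NVML call I have in mind (assumptions on my side: nvml.h ships with the CUDA toolkit, the app links against libnvidia-ml, and query_gpu_usage is just a hypothetical helper name, not something already in my repo):

    // Untested sketch: query GPU utilization via NVML instead of hardcoding 0.
    #include <cstdio>
    #include <nvml.h>

    // Returns GPU core utilization in percent, or -1 on any NVML error.
    // Hypothetical helper; meant to replace the `int gpu_usage = 0;` placeholder.
    static int query_gpu_usage(unsigned int device_index = 0) {
        nvmlReturn_t rc = nvmlInit();
        if (rc != NVML_SUCCESS) {
            std::fprintf(stderr, "nvmlInit failed: %s\n", nvmlErrorString(rc));
            return -1;
        }

        int usage = -1;
        nvmlDevice_t device;
        rc = nvmlDeviceGetHandleByIndex(device_index, &device);
        if (rc == NVML_SUCCESS) {
            nvmlUtilization_t util;  // .gpu and .memory are percentages
            rc = nvmlDeviceGetUtilizationRates(device, &util);
            if (rc == NVML_SUCCESS) {
                usage = static_cast<int>(util.gpu);  // utilization over the last sample period
            } else {
                std::fprintf(stderr, "nvmlDeviceGetUtilizationRates failed: %s\n", nvmlErrorString(rc));
            }
        } else {
            std::fprintf(stderr, "nvmlDeviceGetHandleByIndex failed: %s\n", nvmlErrorString(rc));
        }

        nvmlShutdown();  // in a long-running app, init once at startup and shut down at exit instead
        return usage;
    }

    int main() {
        int gpu_usage = query_gpu_usage();  // replaces the hardcoded placeholder
        std::printf("GPU usage: %d%%\n", gpu_usage);
        return 0;
    }

On the build side, I’m guessing the CMake hookup would be roughly this (also untested; the target name is taken from the binary in step 3, and the CUDA::nvml imported target needs CMake 3.17 or newer):

    # Assumes the CUDA toolkit (which ships nvml.h) is installed.
    find_package(CUDAToolkit REQUIRED)                    # provides the CUDA::nvml imported target
    target_link_libraries(LlamaTerminalApp PRIVATE CUDA::nvml)

If there’s a more idiomatic way to sample utilization during inference, I’m all ears.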

Hi @bniladridas,
Apologies for the delay.
This forum covers issues related to TensorRT, so I suggest you please raise this on the CUDA forum instead.

Thanks