11 GB of GPU RAM used, and no process listed by nvidia-smi

  1. log out of the username that issued the interrupted work to that gpu

  2. as root, find all running processes associated with the username that issued the interrupted work on that gpu (replace username below with the actual account name):

ps -ef | grep username

  3. as root, kill all of those

  4. as root, retry the nvidia-smi gpu reset
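
For what it's worth, steps 2 to 4 could be scripted roughly like this. It is only a sketch: the function names, the DRY_RUN switch, and the user/GPU arguments are my own placeholders rather than anything standard, and it assumes a ps that accepts the POSIX -u and -o options. Set DRY_RUN=1 first to see what it would do without killing anything.

```shell
#!/bin/sh
# Sketch only: function names, DRY_RUN, and the arguments are hypothetical.

# Print the PIDs of every process owned by the given user.
user_pids() {
    ps -o pid= -u "$1"
}

# Kill the user's processes, then retry the GPU reset. Run as root.
# With DRY_RUN set to a non-empty value, commands are printed, not executed.
cleanup_and_reset() {
    target_user=$1
    gpu_id=$2
    run=${DRY_RUN:+echo}
    for pid in $(user_pids "$target_user"); do
        $run kill "$pid"    # sends SIGTERM; stubborn processes may need kill -9
    done
    $run nvidia-smi --gpu-reset -i "$gpu_id"
}

# Example (hypothetical values): DRY_RUN=1 cleanup_and_reset someuser 0
```

Don't point it at root or at the account you are logged in as, since it kills every process that user owns.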

If that doesn’t work, I’m out of ideas.
