Managing Rogue Memory from OpenWebUI + Ollama - Problem and Solution:

Problem Overview

While configuring a DGX system with OpenWebUI and Ollama to run large-scale models such as GPT-OSS 120B, a critical problem was encountered:

Stopping the application with a docker stop command did not fully release system memory.
This resulted in "rogue" memory consumption: the RAM used by the model (~64 GB or more) remained allocated even though the application appeared to be shut down.

This behavior caused confusion and inefficiency: users expected the model to unload, but the memory footprint persisted, degrading system performance and leaving it unclear whether OpenWebUI or Ollama had truly stopped.
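One way to confirm the symptom is to check, after stopping the container, whether any process matching the model runner's name still holds resident memory. The sketch below is a minimal diagnostic, assuming a Linux host with a /proc filesystem; the process name "ollama" reflects this setup, and the helper name is illustrative, not part of any tool described here.

```python
import os

def lingering_rss_kb(name="ollama"):
    """Scan /proc for processes whose command line mentions `name`
    and sum their resident set size (VmRSS) in kB.

    A large nonzero result after `docker stop` indicates the model
    is still resident in host RAM.
    """
    total = 0
    for pid in filter(str.isdigit, os.listdir("/proc")):
        try:
            with open(f"/proc/{pid}/cmdline", "rb") as f:
                cmd = f.read().replace(b"\0", b" ").decode(errors="ignore")
            if name not in cmd.lower():
                continue
            with open(f"/proc/{pid}/status") as f:
                for line in f:
                    if line.startswith("VmRSS:"):
                        # Second field is the RSS value in kB
                        total += int(line.split()[1])
        except (FileNotFoundError, ProcessLookupError, PermissionError):
            # Process exited mid-scan or is not readable; skip it
            continue
    return total

if __name__ == "__main__":
    gb = lingering_rss_kb() / (1024 * 1024)
    print(f"Lingering ollama RSS: {gb:.1f} GB")
```

Running this immediately after docker stop makes the problem visible even when docker ps and nvidia-smi look clean, since neither reports host RAM held by orphaned CPU-side processes.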


Investigation Findings

  1. Stopping the Docker container (docker stop open-webui) was insufficient.

  2. Ollama (which loads the large model into memory) spawns background processes that can remain active even after the Docker container stops.

  3. These processes consume significant RAM and are not visible in simple docker ps or nvidia-smi checks.

  4. Attempts to clear shared memory segments with ipcrm (after identifying them via ipcs -m) did not reliably release the RAM.

  5. The actual RAM release was achieved only through: