"CUDA out of memory" error when running Helmholtz example in Modulus

Hi, I’m new to Modulus and I get the following error whenever I run the Helmholtz Python example script. How do I solve this issue?

Hi @nga77 ,

Thanks for trying out Modulus. This is a general deep learning problem, not one specific to Modulus. Unfortunately, PINN training can take up quite a bit of memory compared to data-driven problems, since additional gradients need to be stored. We develop on V100 and A100 GPUs, so the GPU memory we work with is larger than 4 GB. Fortunately, there are some simple solutions you can try:
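To see why batch size matters so much here, a rough back-of-the-envelope estimate helps. The sketch below is purely illustrative (the layer sizes and the 3x PINN multiplier are assumptions, not Modulus internals), but it shows that activation memory scales linearly with batch size:

```python
# Rough activation-memory estimate for a fully connected PINN
# (hypothetical sizes; real training also stores weights, optimizer
# state, and the extra graphs kept for PDE-residual derivatives).
batch_size = 4096
layers = [2, 512, 512, 512, 512, 1]  # input coords -> hidden -> output

floats_per_sample = sum(layers[1:])            # activations kept per sample
bytes_fwd = batch_size * floats_per_sample * 4  # float32 forward pass

# PINN losses differentiate the network output w.r.t. its inputs, so
# autograd retains additional intermediate buffers; a ~3x multiplier
# over a plain forward pass is a common rule of thumb, not an exact figure.
bytes_pinn = 3 * bytes_fwd

print(f"forward ~{bytes_fwd / 1e6:.1f} MB, PINN ~{bytes_pinn / 1e6:.1f} MB")
```

Halving the batch size halves both estimates, which is why option 1 below is usually the first thing to try.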

  1. Lower your batch size. This can typically be done in your config file. (This will impact convergence.)
  2. Make your neural network smaller. This can be done in your config file or in the code itself; have a look at the API docs for the parameters you can change. (This will also impact convergence.)
  3. Train on hardware with more memory.
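For options 1 and 2, a config change might look like the sketch below. The key names are illustrative only; check the config file that ships with the Helmholtz example for the exact names used by your Modulus version:

```yaml
# Illustrative config sketch -- actual key names vary by Modulus version
batch_size:
  interior: 2000    # try halving these values first
  boundary: 500
arch:
  fully_connected:
    layer_size: 256  # smaller hidden layers use less memory
    nr_layers: 4     # fewer hidden layers also reduce the footprint
```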

Can this problem also be solved by using Modulus on public cloud instances like AWS?

Yes, provided that your remote instance has a GPU with sufficient memory. In development we may test on smaller problem sizes on test machines, then scale to larger systems for bigger problems. It greatly depends on the problem you’re working on.

Alternatively, you could try running in CPU mode, but this will be much slower.
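One generic way to force CPU execution in any PyTorch-based framework (Modulus may also expose its own device setting; this is just the environment-variable approach) is to hide the GPUs before any CUDA-aware library is imported:

```python
import os

# Hide all GPUs from this process so CUDA-aware libraries
# (e.g. PyTorch) fall back to the CPU. This must run before
# the framework is imported.
os.environ["CUDA_VISIBLE_DEVICES"] = ""
```

The same effect can be achieved from the shell by exporting `CUDA_VISIBLE_DEVICES=""` before launching the script.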

Alright, thank you so much for the help :)

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.