I have 2 Titan Xp GPUs in my PC and I use Python 2.7 and TensorFlow 1.12.0 for training on Ubuntu 16.04. Sometimes I need both GPUs' memory (24 GB total), and this used to work fine. However, for some reason I had to completely wipe the PC and reinstall everything. Now TensorFlow only allocates memory on one of the GPUs and never uses both: it fully uses one GPU's memory and gives me an OOM error while the other GPU's memory allocation stays at 0. CUDA version is 9.0, cuDNN is 7.4. "./bin/ppc64le/linux/release/deviceQuery" reports PASS.
My code is the same as before the reinstall.
When I set os.environ["CUDA_VISIBLE_DEVICES"] = "0" (or "0,1", or omit the line entirely), it runs on gpu:0 without any problem.
When I set os.environ["CUDA_VISIBLE_DEVICES"] = "1" or "1,0", it runs on gpu:1 perfectly.
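For reference, here is a minimal sketch of how I select the devices. I am assuming the variable has to be set before TensorFlow is imported (my understanding is that setting it afterwards has no effect, because the process has already enumerated the GPUs); the tf.device example in the comments is just an illustration, not my actual model code.

```python
import os

# Assumption: this must run before `import tensorflow`, otherwise the
# device list is already fixed for the process.
os.environ["CUDA_VISIBLE_DEVICES"] = "0,1"

# As far as I know, TF 1.x places ops on /gpu:0 by default even when both
# GPUs are visible; to use the second GPU, ops must be placed explicitly:
#
#   import tensorflow as tf
#   with tf.device("/gpu:1"):
#       ...  # build part of the graph here
#
# Passing log_device_placement=True in tf.ConfigProto shows where each
# op actually lands.

print(os.environ["CUDA_VISIBLE_DEVICES"])  # -> 0,1
```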
Is this a CUDA problem? How can I get TensorFlow to allocate memory across multiple GPUs?