Errors when loading/unloading a module repeatedly I get CUDA_UNKNOWN_ERROR

Veronica · January 6, 2008, 8:11pm

I am loading and unloading a module repeatedly, and after a certain number of iterations, cuModuleLoad() results in CUDA_UNKNOWN_ERROR. It seems like the number of texture references in the device code determines the number of successful iterations; for example, if I declare four textures (all on the form texture<float, 2, cudaReadModeElementType> tex), I get an error after 32 iterations. If I reduce the number of textures to three, I get an error after 42 iterations. The corresponding number of successful iterations is 64 for two textures and 128 for one texture. If I don’t declare any textures at all in my .cu-file, no error is produced. Changing the number of global functions in the .cu-file doesn’t seem to have any effect on the execution.

This is my code:

CUdevice cuDevice;
CUcontext cuContext;
CUmodule cuModule;
char* module_path;

CUT_DEVICE_INIT_DRV(cuDevice);

CUresult status = cuCtxCreate( &cuContext, 0, cuDevice );        
module_path = cutFindFilePath("simpleTexture_kernel.cubin", argv[0]);    

bool firstError = true;

for(int i = 0; i < 1000; i++)
{
    char* memfname = (char*) malloc(200*sizeof(char));
    unsigned int freemem;
    unsigned int totmem;
    CUresult getmeminfo = cuMemGetInfo(&freemem, &totmem);
    sprintf(memfname, "%d free mem %.1f MB out of total mem %.1f MB.txt",i,freemem/1000000.0f, totmem/1000000.0f);                     
    int getmeminfoi = (int) getmeminfo;
    cutWriteFilei(memfname, &getmeminfoi, 1);
    free(memfname); 

    status = cuModuleLoad(&cuModule, module_path);

    if(!status)
    {
        printf("%d OK when loaded module: %d \n", i, status);
    }
    else if(firstError)
    {
        printf("%d first error while loading module: %d \n", i, status);
        firstError = false;          
    }
    
    status = cuModuleUnload(cuModule);
    if(!status)
    {
        printf("%d ok when unloaded module: %d \n", i, status);
    }
    else if(firstError)
    {
        printf("%d first error while unloading module: %d \n", i, status);
        firstError = false;          
    }

    cuModule = 0;
}
cutFree(module_path);        
cuCtxDetach(cuContext);
CUT_EXIT(argc, argv);

According to the Cuda Programming guide, “if the memory for functions and data (constant and global) needed by the module cannot be allocated, cuModuleLoad() fails”, but I thought I was freeing the memory used by the module when unloading it?
As you can see I have checked the amount of memory available for allocation by the Cuda context (using cuMemGetInfo), and I seem to have plenty of memory left even when the loading of the module fails. Am I mixing up memory spaces?

I would be very grateful if somebody could explain why this error arises. I am new to Cuda and there are probably plenty of things about the driver API that I haven’t understood yet.

chrisse27 · June 24, 2008, 11:12am

I have a similar problem. After a number of repeated load/unload operations I get a CUDA_OUT_OF_MEMORY error. Don’t know if it is the totally related to the problem described above but it seems likely. In my case the number of iterations lies around 5000.

netllama · June 24, 2008, 11:39am

Which OS are you using?
Which driver version?
Which GPU?

Please provide a test app which reproduces the problem.

chrisse27 · June 24, 2008, 7:50pm

Windows XP 64 (will test on our Win XP 32)
CUDA 1.1
GeForce 9800 GX2

Test app will follow.

netllama · June 24, 2008, 7:51pm

Does this reproduce with the 2.0-beta release?

Topic		Replies	Views
Unknown Error CUDA Programming and Performance	4	5891	October 17, 2018
Cuda Out of Memory with tons of memory left? CUDA Programming and Performance	5	38953	December 23, 2009
Bug report: 8400M GS + Win7 errors errors and more errors CUDA Programming and Performance	0	4518	January 19, 2010
Cuda uncomprensible error CUDA Programming and Performance	5	7087	August 17, 2010
CUDA Texture Memory Example for Beginners CUDA Programming and Performance	6	4112	July 10, 2023
Device memory size CUDA Programming and Performance	11	46792	June 6, 2008
Clearing Cuda Errors CUDA Programming and Performance	6	11337	December 1, 2009
Getting around apparent CUDA bugs CUDA Programming and Performance	5	963	September 20, 2011
Using texture memory over iterations causes incorrect read/write of some lines CUDA Programming and Performance cuda	2	511	September 2, 2020
cudaMallocHost not returning errors in emulation mode CUDA Programming and Performance	1	1471	June 3, 2009

Errors when loading/unloading a module repeatedly I get CUDA_UNKNOWN_ERROR

Related topics