Non-consistent memory error

ChepeNolon · September 13, 2011, 8:56am

Hi guys,

I’m quite new to cuda and gpu computing. I’m goint through the book cuda by example and trying to run the julia example in chapter four. I’ve done some changes from the code in the book because the struct constructor was not marked with device. I compile with the -lglut flag and it compiles just fine. However, some strange things happen when I try to run the program. At my first try I got

out of memory in line 62: HANDLE_ERROR(cudaMalloc((void**)&dev_bitmap, bitmap.image_size()));

I then tried to set down the DIM definition 50 and still got the same error. I waited a little while, set DIM to 20, and suddenly it worked. I was then able to increase DIM all the way up to the original 10000. To experiment a bit I continued to increase DIM, and gave it the value 1500, I then got the same memory error. However, when I set it back down to 1000 (which worked just a minute ago) I continue to get the memory error, and get it even if I decrease DIM to 2!

Can someone help me understand this behavior? Isn’t the gpu’s memory freed, or is something entirely different causing this error?

#include "../../cuda-by-example-src/common/book.h"

#include "../../cuda-by-example-src/common/cpu_bitmap.h"

#define DIM 1000

struct cuComplex {

    float r;

    float i;

    __device__ cuComplex(float a, float b) : r(a), i(b) {}

    __device__ float magnitude2(void) { 

        return r*r + i*i; 

    }

__device__ cuComplex operator*(const cuComplex& a) {

        return cuComplex(r*a.r - i*a.i, i*a.r + r*a.i);

    }

    __device__ cuComplex operator+(const cuComplex& a) {

        return cuComplex(r+a.r, i+a.i);

    }

};

__device__ int julia(int x, int y)

{

    const float scale = 1.5;

    float jx = scale * (float) (DIM/2 - x)/(DIM/2);

    float jy = scale * (float) (DIM/2 - y)/(DIM/2);

cuComplex c(-0.8, 0.156);

    cuComplex a(jx, jy);

int i = 0;

    for (i = 0; i < 200; i++) {

        a = a*a + c;

        if (a.magnitude2() > 1000)

            return 0;

    }

    return 1;

}

__global__ void kernel(unsigned char *ptr) {

    // map from blockIdx to pixel position

    int x = blockIdx.x;

    int y = blockIdx.y;

    int offset = x + y *gridDim.x;

int juliaValue = julia(x, y);

    ptr[offset*4 + 0] = 255*juliaValue;

    ptr[offset*4 + 1] = 0;

    ptr[offset*4 + 2] = 0;

    ptr[offset*4 + 3] = 255;

}   

int main(void)

{

    CPUBitmap bitmap(DIM, DIM);

    unsigned char *dev_bitmap;

    printf("image_size = %ld\n", bitmap.image_size());

    HANDLE_ERROR(cudaMalloc((void**)&dev_bitmap,

                bitmap.image_size()));

dim3 grid(DIM, DIM);

    kernel<<<grid,1>>>(dev_bitmap);

HANDLE_ERROR(cudaMemcpy(bitmap.get_ptr(),

                dev_bitmap,

                bitmap.image_size(),

                cudaMemcpyDeviceToHost));

    bitmap.display_and_exit();

cudaFree(dev_bitmap);

}

Paracelsus · September 13, 2011, 12:46pm

Hi there,

That is a bit strange. The only things I can think of are to check dmesg and see if there are any kernel messages from the Nvida driver (they will start with NVRM:), if there is something odd going on there may be some diagnostic info you can glean that way. Another option is to unload and reload the nvidia kernel module - just to see if it makes a difference.

Also, for what it’s worth, are you using the latest driver and did you compile the book examples using CUDA 4.0 or 3.2? When the book was written 3.2 was the current version, and while I believe I compiled those examples on 4.0 okay, there may be some issues? (Just a thought.)

I know the above is not enormously helpful, but it might help a bit.

Pete

Topic		Replies	Views
Cuda by example, Julia set performance improved failed CUDA Programming and Performance	1	836	November 1, 2013
Glut: Unresolved external CUDA by Example CUDA Programming and Performance	3	1928	June 21, 2011
A problem when a new hand in cuda programing CUDA Programming and Performance	6	695	July 30, 2015
Cuda Out of Memory with tons of memory left? CUDA Programming and Performance	5	38952	December 23, 2009
Strange change in behaviour between float and double CUDA Programming and Performance	6	1309	April 1, 2009
Weird memory behavior CUDA Programming and Performance	0	575	January 13, 2014
cuda error out of memory how to increase the size of matrix in multiplication CUDA Programming and Performance	10	11730	October 1, 2009
Errors when loading/unloading a module repeatedly I get CUDA_UNKNOWN_ERROR CUDA Programming and Performance	4	5457	June 24, 2008
How to properly debug this memory issue CUDA Programming and Performance	2	427	August 27, 2020
This is driving me nuts! memory access problem.. CUDA Programming and Performance	5	2662	December 7, 2007

Non-consistent memory error

Related topics