copy device memory to constant memory

QD4_33 · October 22, 2008, 12:37pm

Hi,

I tried to copy some kernel calculated values to constant memory, because I want to reuse them in other kernels.

Unfortunately I get segmentation faults.

Here is a small code snippet:

#include <stdio.h>

__constant__ float testVar[1];

__global__ void test_kernel( float *testVar_d )
{
	testVar_d[ 0 ] = 7.0f;
}

__global__ void output_kernel( float *out_d )
{
	out_d[ 0 ] = testVar[0];
}

int main()
{
	float *testVar_d;
	cudaMalloc( (void**)&testVar_d, sizeof( float ) );

	float *out_d;
	cudaMalloc( (void**)&out_d, sizeof( float ) );

	float h = 1;
	cudaMemcpyToSymbol( testVar, &h, sizeof( float ) );

	dim3 dimGrid( 1 );
	dim3 dimBlock( 1 );

	test_kernel<<< dimGrid, dimBlock >>>( testVar_d );
	cudaThreadSynchronize();

	cudaMemcpyToSymbol( testVar, testVar_d, sizeof( float ), cudaMemcpyDeviceToDevice );

	output_kernel<<< dimGrid, dimBlock >>>( out_d );

	float result = 2;
	cudaMemcpy( &result, out_d, sizeof( float ), cudaMemcpyDeviceToHost );

	printf( "%f\n", result );

	return 0;
}

This program should print 7 if everything works…

Any ideas how to copy from global memory to constant memory without making a copy to host memory?

Thx ;o)

DarkAr · October 23, 2008, 7:39am

You cannot copy from device memory to constant memory directly.
You first need to copy the value from device memory to system memory, and then from system memory to the symbol (constant memory) :D
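
The staging approach described in this reply can be sketched roughly as follows. This is only an illustration built from the kernels in the first post: the helper name `copyViaHost` and the variable `staged` are made up for the example, `cudaFree` calls are added for tidiness, and error checking is left out to keep it short.

```cuda
#include <cuda_runtime.h>

__constant__ float testVar[1];

__global__ void test_kernel(float *testVar_d) { testVar_d[0] = 7.0f; }
__global__ void output_kernel(float *out_d)   { out_d[0] = testVar[0]; }

// Hypothetical helper (not from the thread): moves a kernel-computed value
// into constant memory by staging it through a host variable.
float copyViaHost()
{
    float *testVar_d = nullptr;
    float *out_d = nullptr;
    cudaMalloc((void**)&testVar_d, sizeof(float));
    cudaMalloc((void**)&out_d, sizeof(float));

    test_kernel<<<1, 1>>>(testVar_d);

    // Step 1: device (global) memory -> host memory.
    // This blocking cudaMemcpy also waits for test_kernel to finish.
    float staged = 0.0f;
    cudaMemcpy(&staged, testVar_d, sizeof(float), cudaMemcpyDeviceToHost);

    // Step 2: host memory -> constant symbol.
    // The defaulted arguments apply here: offset 0, cudaMemcpyHostToDevice.
    cudaMemcpyToSymbol(testVar, &staged, sizeof(float));

    // Read the constant back through a kernel to check the value.
    output_kernel<<<1, 1>>>(out_d);
    float result = 0.0f;
    cudaMemcpy(&result, out_d, sizeof(float), cudaMemcpyDeviceToHost);

    cudaFree(testVar_d);
    cudaFree(out_d);
    return result;
}
```

Calling `copyViaHost()` from `main` and printing the return value should give the 7 that the original program was expected to print, at the cost of one extra round trip over the bus.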

QD4_33 · October 27, 2008, 10:20am

I think you are right.

But the reference manual says…

I hope whoever writes the reference manual reads this.

If a device-to-device copy is offered but copying to constant device memory is not allowed, the manual should say so.

Quoc_Vinh · October 30, 2008, 3:23am

Yes, I have the same problem.

I was trying to copy data from device (global) memory to constant memory.

But it never worked.

So I think we can only copy data from host memory to constant memory.

I think the “reference manual” has a mistake here.

QD4_33 · November 11, 2008, 12:05pm

I got a hint!

Definition after includes…

__constant__ float constant_device_variable;

Host code…

float *device_pointer;

cudaMalloc( (void**)&device_pointer, device_pointer_size );

[...]

cudaMemcpyToSymbol( "constant_device_variable", device_pointer, sizeof( float ), 0, cudaMemcpyDeviceToDevice );

This version works.

The important part is to pass an explicit offset of 0. Perhaps cudaMemcpyDeviceToDevice is interpreted as the offset when the offset argument is left out of the function call.

edit: perhaps cudaMemcpyHostToDevice is equal to zero. If this assumption is true, the reference manual is ok but the sample code in the programming guide is misleading.
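
For readers on a current toolkit, here is a sketch of the same fix applied to the program from the first post. It relies on the declared parameter order of `cudaMemcpyToSymbol`, which is `(symbol, src, count, offset = 0, kind = cudaMemcpyHostToDevice)`. In the original four-argument call, `cudaMemcpyDeviceToDevice` therefore lands in the `offset` slot (an unscoped enum converts silently to `size_t`) and `kind` keeps its host-to-device default, so the runtime treats the device pointer as a host address. That would be consistent with the reported segmentation fault. Two things differ from the snippets above and are assumptions about the reader's setup: the symbol is passed directly instead of as a string (string symbol names were removed from the runtime API in CUDA 5.0), and `cudaDeviceSynchronize` replaces the deprecated `cudaThreadSynchronize`. The helper name `copyDeviceToConstant` is made up for the example, and error checking is omitted.

```cuda
#include <cuda_runtime.h>

__constant__ float testVar[1];

__global__ void test_kernel(float *testVar_d) { testVar_d[0] = 7.0f; }
__global__ void output_kernel(float *out_d)   { out_d[0] = testVar[0]; }

// Hypothetical helper (not from the thread): copies a kernel-computed value
// from global memory straight into constant memory, with no host staging.
float copyDeviceToConstant()
{
    float *testVar_d = nullptr;
    float *out_d = nullptr;
    cudaMalloc((void**)&testVar_d, sizeof(float));
    cudaMalloc((void**)&out_d, sizeof(float));

    test_kernel<<<1, 1>>>(testVar_d);
    cudaDeviceSynchronize();

    // Fourth argument: byte offset into the symbol (0).
    // Fifth argument: copy kind. Leaving out the 0 would shift the enum
    // into the offset position and fall back to a host-to-device copy.
    cudaMemcpyToSymbol(testVar, testVar_d, sizeof(float), 0,
                       cudaMemcpyDeviceToDevice);

    // Read the constant back through a kernel to check the value.
    output_kernel<<<1, 1>>>(out_d);
    float result = 2.0f;
    cudaMemcpy(&result, out_d, sizeof(float), cudaMemcpyDeviceToHost);

    cudaFree(testVar_d);
    cudaFree(out_d);
    return result;
}
```

Called from `main`, the helper should return the 7 written by `test_kernel`. Checking the `cudaError_t` returned by each call would surface this kind of argument mix-up earlier than a crash does.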
