Warning: Unified Memory Profiling is not supported on this configuration

I’m trying to profile the following simple code:

#include <stdio.h>

#define BLOCKSIZE 32

/**********/
/* iDivUp */
/**********/
int iDivUp(int a, int b) { return ((a % b) != 0) ? (a / b + 1) : (a / b); }

/********************/
/* CUDA ERROR CHECK */
/********************/
#define gpuErrchk(ans) { gpuAssert((ans), __FILE__, __LINE__); }
inline void gpuAssert(cudaError_t code, const char *file, int line, bool abort=true)
{
   if (code != cudaSuccess) 
   {
      fprintf(stderr,"GPUassert: %s %s %d\n", cudaGetErrorString(code), file, line);
      if (abort) exit(code);
   }
}

/*******************/
/* KERNEL FUNCTION */
/*******************/
__global__ void kernel(int *vec1, int *vec2, int *vec3, int N) {

	int tid = threadIdx.x + blockIdx.x * blockDim.x;
	
	if (tid < N) vec3[tid] = vec1[tid] + vec2[tid];
	
}
 
/********/
/* MAIN */
/********/
int main() {
	
	const int N = 10;
	
	int *vec1, *vec2, *vec3; 
	
	gpuErrchk(cudaMallocManaged(&vec1, N*sizeof(int)));
	gpuErrchk(cudaMallocManaged(&vec2, N*sizeof(int)));
	gpuErrchk(cudaMallocManaged(&vec3, N*sizeof(int)));

	for (int i=0; i<N; i++) {
		vec1[i] = i;
		vec2[i] = 2*i;
	}
	
	kernel<<<iDivUp(N, BLOCKSIZE), BLOCKSIZE>>>(vec1, vec2, vec3, N);	
	gpuErrchk(cudaPeekAtLastError());
	gpuErrchk(cudaDeviceSynchronize());	

	for (int i=0; i<N; i++) {
		printf("vec1 = %i; vec2 = %i; vec3 = %i \n", vec1[i], vec2[i], vec3[i]);
	}
	
	return 0;
}

However, the NVIDIA Visual Profiler gives me the following warning in the console panel:

Warning: Unified Memory Profiling is not supported on this configuration

As a result, the timeline does not show any relevant information about the kernel launch.

My configuration: CUDA 6.5; Kepler K20c; Windows 7.

Do you have another NVIDIA GPU besides the K20c in that system?

Is the Windows installation 32-bit or 64-bit?

The workstation has 4 Kepler K20c GPUs.

Windows is 64-bit and I’m compiling in Release mode for a 64-bit architecture.

Besides the 4 Kepler K20c GPUs, are there any other GPUs? What is driving the display?

Yes, the display is driven by a Matrox G200eR.

This description in the profiler user’s guide that accompanies the CUDA 6.5 RC docs may be relevant:

“On multi-GPU configurations without P2P support between any pair of devices that
support Unified Memory, managed memory allocations are placed in zero-copy
memory. In this case Unified Memory profiling is not supported. In certain cases,
the environment variable CUDA_MANAGED_FORCE_DEVICE_ALLOC can be set to force
managed allocations to be in device memory and to enable migration on these hardware
configurations. In this case Unified Memory profiling is supported. Normally, using the
environment variable CUDA_VISIBLE_DEVICES is recommended to restrict CUDA to
only use those GPUs that have P2P support. Please refer to the environment variables
section in the CUDA C Programming Guide for further details.”

Although this is in section 3.2.6, which pertains to nvprof, I suspect nvvp may have a similar limitation. You might try launching nvvp with the CUDA_VISIBLE_DEVICES environment variable set to a single GPU and see if that helps it work.
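A minimal sketch of that suggestion (the device index 0 is an assumption — check nvidia-smi for the index of the K20c you want; the syntax below is Linux/bash, on Windows cmd it would be `set CUDA_VISIBLE_DEVICES=0` followed by launching nvvp from the same prompt):

```shell
# Expose only one GPU to CUDA, so managed allocations are not
# forced into zero-copy memory on a multi-GPU machine without
# P2P support between all device pairs.
export CUDA_VISIBLE_DEVICES=0

# Launch the Visual Profiler from this same shell so it (and the
# application it profiles) inherits the variable. Uncomment on a
# machine with the CUDA toolkit installed:
# nvvp

echo "CUDA_VISIBLE_DEVICES=$CUDA_VISIBLE_DEVICES"
```

The variable must be set in the environment that nvvp inherits, since the CUDA runtime inside the profiled process reads it at context creation.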

I think txbob raised a very good point.
I would also suggest setting the CUDA device to the K20c before running the computation.
As far as I know, compute capability 3.2 devices don’t support Unified Memory profiling.
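A sketch of that device-selection suggestion, assuming the K20c you want is device 0 (enumerate the devices to find the right index). Note that the programming guide passage quoted above implies CUDA_VISIBLE_DEVICES is still the recommended route; I’m not certain cudaSetDevice alone changes where managed allocations are placed on a multi-GPU box without P2P:

```cuda
#include <stdio.h>
#include <cuda_runtime.h>

int main() {
    int dev = 0;  // assumption: the K20c to profile is device 0

    // Report which device we are about to select.
    cudaDeviceProp prop;
    if (cudaGetDeviceProperties(&prop, dev) != cudaSuccess) return 1;
    printf("Using device %d: %s (cc %d.%d)\n",
           dev, prop.name, prop.major, prop.minor);

    // Select the device before any allocation or kernel launch
    // creates a context on it.
    if (cudaSetDevice(dev) != cudaSuccess) return 1;

    // ... cudaMallocManaged / kernel launch as in the question ...
    return 0;
}
```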