Hi, I was trying out the graph API and I am seeing weird behavior with the following code: #include <cuda_runtime.h> #include <stdio.h> #include <array> #include <iostream> #include <vector> #include "helper_cuda.h" __global__ void debugPrint(float* data, int size) { int index = threadIdx…

The programming guide states: 3.2.8.7.7.1.1. Device Graph Requirements … Memcpy nodes: Only copies involving device memory and/or pinned device-mapped host memory are permitted. Your usage of std::array does not constitute pinned device-mapped host memory. The following adaptation of your c…
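The adapted code from the reply is cut off above, so the following is a separate illustration and not the code that was posted. It is a minimal sketch of the pattern under discussion (host node, then host-to-device memcpy node, then kernel node), assuming the host buffers are allocated with `cudaHostAlloc` and `cudaHostAllocMapped` instead of `std::array`. The names `addOne`, `fillHost`, `runHostNodeGraph`, `kSize`, and the `CHECK` macro are hypothetical, and building the graph through stream capture rather than explicit node creation is a simplification chosen here, not something the thread prescribes.

```cuda
#include <cuda_runtime.h>
#include <cstdio>

// Hypothetical helper (not from the thread): bail out of the enclosing
// function with 1 on any CUDA error. No cleanup is attempted on failure.
#define CHECK(call)                                                \
    do {                                                           \
        cudaError_t err_ = (call);                                 \
        if (err_ != cudaSuccess) {                                 \
            std::fprintf(stderr, "%s failed: %s\n", #call,         \
                         cudaGetErrorString(err_));                \
            return 1;                                              \
        }                                                          \
    } while (0)

constexpr int kSize = 4;

// Kernel node: adds 1 to every element so the result shows which
// host values the device actually received.
__global__ void addOne(float* data, int size) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < size) data[i] += 1.0f;
}

// Host node: writes the input values at graph execution time.
// Host callbacks must not call CUDA API functions.
void CUDART_CB fillHost(void* userData) {
    float* h = static_cast<float*>(userData);
    for (int i = 0; i < kSize; ++i) h[i] = 10.0f * (i + 1);
}

int runHostNodeGraph() {
    const size_t bytes = kSize * sizeof(float);
    float *hIn = nullptr, *hOut = nullptr, *dBuf = nullptr;

    // Assumption: pinned, device-mapped host memory replaces std::array.
    CHECK(cudaHostAlloc(reinterpret_cast<void**>(&hIn), bytes, cudaHostAllocMapped));
    CHECK(cudaHostAlloc(reinterpret_cast<void**>(&hOut), bytes, cudaHostAllocMapped));
    CHECK(cudaMalloc(reinterpret_cast<void**>(&dBuf), bytes));

    cudaStream_t stream;
    CHECK(cudaStreamCreate(&stream));

    // Capture host callback -> H2D copy -> kernel -> D2H copy as one graph.
    cudaGraph_t graph;
    CHECK(cudaStreamBeginCapture(stream, cudaStreamCaptureModeGlobal));
    CHECK(cudaLaunchHostFunc(stream, fillHost, hIn));
    CHECK(cudaMemcpyAsync(dBuf, hIn, bytes, cudaMemcpyHostToDevice, stream));
    addOne<<<1, kSize, 0, stream>>>(dBuf, kSize);
    CHECK(cudaMemcpyAsync(hOut, dBuf, bytes, cudaMemcpyDeviceToHost, stream));
    CHECK(cudaStreamEndCapture(stream, &graph));

    cudaGraphExec_t exec;
    CHECK(cudaGraphInstantiateWithFlags(&exec, graph, 0));
    CHECK(cudaGraphLaunch(exec, stream));
    CHECK(cudaStreamSynchronize(stream));

    // Compare against the values written by the host node, plus one.
    int mismatches = 0;
    for (int i = 0; i < kSize; ++i) {
        const float expected = 10.0f * (i + 1) + 1.0f;
        std::printf("out[%d] = %.1f (expected %.1f)\n", i, hOut[i], expected);
        if (hOut[i] != expected) ++mismatches;
    }

    CHECK(cudaGraphExecDestroy(exec));
    CHECK(cudaGraphDestroy(graph));
    CHECK(cudaStreamDestroy(stream));
    CHECK(cudaFree(dBuf));
    CHECK(cudaFreeHost(hIn));
    CHECK(cudaFreeHost(hOut));
    return mismatches == 0 ? 0 : 1;
}

int main() { return runHostNodeGraph(); }
```

This should build with something like `nvcc -o pinned_graph pinned_graph.cu` (the file name is arbitrary). The process exits with 0 only if every value written by the host node was seen by the kernel, which makes it a quick check of whether the host node's writes are reflected on the device in a given setup.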

Setting host memory via an hostnode before a memcopy node to device is not reflected in the device kernel execution


Robert_Crovella January 24, 2025, 4:11pm 6

~~My expectation is that this is a doc oversight.~~

~~If it's of concern, you can always request CUDA documentation updates by filing a bug.~~
