mnistCUDNN failed to compile with cuDNN v7.6.1

Hello,
I recently installed CUDA v10.1 successfully on my Dell laptop (driver version 418.67, GeForce GTX 960M, Ubuntu 18.04.1).

deviceQuery

thomas@thomas-Inspiron-7559:~/NVIDIA_CUDA-10.1_Samples/bin/x86_64/linux/release$ ./deviceQuery
./deviceQuery Starting...

 CUDA Device Query (Runtime API) version (CUDART static linking)

Detected 1 CUDA Capable device(s)

Device 0: "GeForce GTX 960M"
  CUDA Driver Version / Runtime Version          10.1 / 10.1
  CUDA Capability Major/Minor version number:    5.0
  Total amount of global memory:                 4046 MBytes (4242604032 bytes)
  ( 5) Multiprocessors, (128) CUDA Cores/MP:     640 CUDA Cores
  GPU Max Clock rate:                            1176 MHz (1.18 GHz)
  Memory Clock rate:                             2505 Mhz
  Memory Bus Width:                              128-bit
  L2 Cache Size:                                 2097152 bytes
  Maximum Texture Dimension Size (x,y,z)         1D=(65536), 2D=(65536, 65536), 3D=(4096, 4096, 4096)
  Maximum Layered 1D Texture Size, (num) layers  1D=(16384), 2048 layers
  Maximum Layered 2D Texture Size, (num) layers  2D=(16384, 16384), 2048 layers
  Total amount of constant memory:               65536 bytes
  Total amount of shared memory per block:       49152 bytes
  Total number of registers available per block: 65536
  Warp size:                                     32
  Maximum number of threads per multiprocessor:  2048
  Maximum number of threads per block:           1024
  Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
  Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
  Maximum memory pitch:                          2147483647 bytes
  Texture alignment:                             512 bytes
  Concurrent copy and kernel execution:          Yes with 1 copy engine(s)
  Run time limit on kernels:                     Yes
  Integrated GPU sharing Host Memory:            No
  Support host page-locked memory mapping:       Yes
  Alignment requirement for Surfaces:            Yes
  Device has ECC support:                        Disabled
  Device supports Unified Addressing (UVA):      Yes
  Device supports Compute Preemption:            No
  Supports Cooperative Kernel Launch:            No
  Supports MultiDevice Co-op Kernel Launch:      No
  Device PCI Domain ID / Bus ID / location ID:   0 / 2 / 0
  Compute Mode:
     < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.1, CUDA Runtime Version = 10.1, NumDevs = 1
Result = PASS

bandwidthTest

thomas@thomas-Inspiron-7559:~/NVIDIA_CUDA-10.1_Samples/bin/x86_64/linux/release$ ./bandwidthTest
[CUDA Bandwidth Test] - Starting...
Running on...

 Device 0: GeForce GTX 960M
 Quick Mode

 Host to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)	Bandwidth(GB/s)
   32000000			5.9

 Device to Host Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)	Bandwidth(GB/s)
   32000000			6.0

 Device to Device Bandwidth, 1 Device(s)
 PINNED Memory Transfers
   Transfer Size (Bytes)	Bandwidth(GB/s)
   32000000			62.5

Result = PASS

NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.

I followed the cuDNN installation guide and installed all three Debian packages, then copied the cuDNN samples folder to a writable path, as the guide suggests, and changed into the mnistCUDNN directory:

$ cp -r /usr/src/cudnn_samples_v7/ $HOME
$ cd $HOME/cudnn_samples_v7/mnistCUDNN

But when I tried to verify the installation by compiling the mnistCUDNN sample, this is what was returned:

thomas@thomas-Inspiron-7559:~/cudnn_samples_v7/mnistCUDNN$ make clean && make
rm -rf *o
rm -rf mnistCUDNN
Linking agains cublasLt = true
CUDA VERSION: 10010
TARGET ARCH: x86_64
HOST_ARCH: x86_64
TARGET OS: linux
SMS: 30 35 50 53 60 61 62 70 72 75
/usr/local/cuda/bin/nvcc -ccbin g++ -I/usr/local/cuda/include -IFreeImage/include  -m64    -gencode arch=compute_30,code=sm_30 -gencode arch=compute_35,code=sm_35 -gencode arch=compute_50,code=sm_50 -gencode arch=compute_53,code=sm_53 -gencode arch=compute_60,code=sm_60 -gencode arch=compute_61,code=sm_61 -gencode arch=compute_62,code=sm_62 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_72,code=sm_72 -gencode arch=compute_75,code=sm_75 -gencode arch=compute_75,code=compute_75 -o fp16_dev.o -c fp16_dev.cu
g++ -I/usr/local/cuda/include -IFreeImage/include   -o fp16_emu.o -c fp16_emu.cpp
g++ -I/usr/local/cuda/include -IFreeImage/include   -o mnistCUDNN.o -c mnistCUDNN.cpp
In file included from /usr/local/cuda/include/channel_descriptor.h:61:0,
                 from /usr/local/cuda/include/cuda_runtime.h:95,
                 from /usr/include/cudnn.h:64,
                 from mnistCUDNN.cpp:30:
/usr/local/cuda/include/cuda_runtime_api.h:2248:25: error: expected ‘)’ before ‘*’ token
 typedef void (CUDART_CB *cudaStreamCallback_t)(cudaStream_t stream, cudaError_t status, void *userData);
                         ^
/usr/local/cuda/include/cuda_runtime_api.h:2316:9: error: ‘cudaStreamCallback_t’ has not been declared
         cudaStreamCallback_t callback, void *userData, unsigned int flags);
         ^~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:2483:88: error: use of enum ‘cudaStreamCaptureMode’ without previous declaration
 extern __host__ cudaError_t CUDARTAPI cudaStreamBeginCapture(cudaStream_t stream, enum cudaStreamCaptureMode mode);
                                                                                        ^~~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:2534:80: error: use of enum ‘cudaStreamCaptureMode’ without previous declaration
 extern __host__ cudaError_t CUDARTAPI cudaThreadExchangeStreamCaptureMode(enum cudaStreamCaptureMode *mode);
                                                                                ^~~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:2562:81: error: ‘cudaGraph_t’ has not been declared
 extern __host__ cudaError_t CUDARTAPI cudaStreamEndCapture(cudaStream_t stream, cudaGraph_t *pGraph);
                                                                                 ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:2600:87: error: use of enum ‘cudaStreamCaptureStatus’ without previous declaration
 extern __host__ cudaError_t CUDARTAPI cudaStreamIsCapturing(cudaStream_t stream, enum cudaStreamCaptureStatus *pCaptureStatus);
                                                                                       ^~~~~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:2628:90: error: use of enum ‘cudaStreamCaptureStatus’ without previous declaration
 extern __host__ cudaError_t CUDARTAPI cudaStreamGetCaptureInfo(cudaStream_t stream, enum cudaStreamCaptureStatus *pCaptureStatus, unsigned long long *pId);
                                                                                          ^~~~~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3012:64: error: ‘cudaExternalMemory_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaImportExternalMemory(cudaExternalMemory_t *extMem_out, const struct cudaExternalMemoryHandleDesc *memHandleDesc);
                                                                ^~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3012:64: note: suggested alternative: ‘CUexternalMemory’
 extern __host__ cudaError_t CUDARTAPI cudaImportExternalMemory(cudaExternalMemory_t *extMem_out, const struct cudaExternalMemoryHandleDesc *memHandleDesc);
                                                                ^~~~~~~~~~~~~~~~~~~~
                                                                CUexternalMemory
/usr/local/cuda/include/cuda_runtime_api.h:3012:86: error: ‘extMem_out’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaImportExternalMemory(cudaExternalMemory_t *extMem_out, const struct cudaExternalMemoryHandleDesc *memHandleDesc);
                                                                                      ^~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3012:98: error: expected primary-expression before ‘const’
 extern __host__ cudaError_t CUDARTAPI cudaImportExternalMemory(cudaExternalMemory_t *extMem_out, const struct cudaExternalMemoryHandleDesc *memHandleDesc);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3012:154: error: expression list treated as compound expression in initializer [-fpermissive]
 tExternalMemory(cudaExternalMemory_t *extMem_out, const struct cudaExternalMemoryHandleDesc *memHandleDesc);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:3066:88: error: ‘cudaExternalMemory_t’ has not been declared
 extern __host__ cudaError_t CUDARTAPI cudaExternalMemoryGetMappedBuffer(void **devPtr, cudaExternalMemory_t extMem, const struct cudaExternalMemoryBufferDesc *bufferDesc);
                                                                                        ^~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3121:111: error: ‘cudaExternalMemory_t’ has not been declared
 __ cudaError_t CUDARTAPI cudaExternalMemoryGetMappedMipmappedArray(cudaMipmappedArray_t *mipmap, cudaExternalMemory_t extMem, const struct cudaExternalMemoryMipmappedArrayDesc *mipmapDesc);
                                                                                                  ^~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3144:65: error: ‘cudaExternalMemory_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaDestroyExternalMemory(cudaExternalMemory_t extMem);
                                                                 ^~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3144:65: note: suggested alternative: ‘CUexternalMemory’
 extern __host__ cudaError_t CUDARTAPI cudaDestroyExternalMemory(cudaExternalMemory_t extMem);
                                                                 ^~~~~~~~~~~~~~~~~~~~
                                                                 CUexternalMemory
/usr/local/cuda/include/cuda_runtime_api.h:3238:67: error: ‘cudaExternalSemaphore_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaImportExternalSemaphore(cudaExternalSemaphore_t *extSem_out, const struct cudaExternalSemaphoreHandleDesc *semHandleDesc);
                                                                   ^~~~~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3238:67: note: suggested alternative: ‘CUexternalSemaphore’
 extern __host__ cudaError_t CUDARTAPI cudaImportExternalSemaphore(cudaExternalSemaphore_t *extSem_out, const struct cudaExternalSemaphoreHandleDesc *semHandleDesc);
                                                                   ^~~~~~~~~~~~~~~~~~~~~~~
                                                                   CUexternalSemaphore
/usr/local/cuda/include/cuda_runtime_api.h:3238:92: error: ‘extSem_out’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaImportExternalSemaphore(cudaExternalSemaphore_t *extSem_out, const struct cudaExternalSemaphoreHandleDesc *semHandleDesc);
                                                                                            ^~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3238:104: error: expected primary-expression before ‘const’
  __host__ cudaError_t CUDARTAPI cudaImportExternalSemaphore(cudaExternalSemaphore_t *extSem_out, const struct cudaExternalSemaphoreHandleDesc *semHandleDesc);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3238:163: error: expression list treated as compound expression in initializer [-fpermissive]
 Semaphore(cudaExternalSemaphore_t *extSem_out, const struct cudaExternalSemaphoreHandleDesc *semHandleDesc);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:3277:79: error: ‘cudaExternalSemaphore_t’ does not name a type; did you mean ‘CUexternalSemaphore’?
 extern __host__ cudaError_t CUDARTAPI cudaSignalExternalSemaphoresAsync(const cudaExternalSemaphore_t *extSemArray, const struct cudaExternalSemaphoreSignalParams *paramsArray, unsigned int numExtSems, cudaStream_t stream __dv(0));
                                                                               ^~~~~~~~~~~~~~~~~~~~~~~
                                                                               CUexternalSemaphore
/usr/local/cuda/include/cuda_runtime_api.h:3320:77: error: ‘cudaExternalSemaphore_t’ does not name a type; did you mean ‘CUexternalSemaphore’?
 extern __host__ cudaError_t CUDARTAPI cudaWaitExternalSemaphoresAsync(const cudaExternalSemaphore_t *extSemArray, const struct cudaExternalSemaphoreWaitParams *paramsArray, unsigned int numExtSems, cudaStream_t stream __dv(0));
                                                                             ^~~~~~~~~~~~~~~~~~~~~~~
                                                                             CUexternalSemaphore
/usr/local/cuda/include/cuda_runtime_api.h:3342:68: error: ‘cudaExternalSemaphore_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaDestroyExternalSemaphore(cudaExternalSemaphore_t extSem);
                                                                    ^~~~~~~~~~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:3342:68: note: suggested alternative: ‘CUexternalSemaphore’
 extern __host__ cudaError_t CUDARTAPI cudaDestroyExternalSemaphore(cudaExternalSemaphore_t extSem);
                                                                    ^~~~~~~~~~~~~~~~~~~~~~~
                                                                    CUexternalSemaphore
/usr/local/cuda/include/cuda_runtime_api.h:3855:79: error: ‘cudaHostFn_t’ has not been declared
 extern __host__ cudaError_t CUDARTAPI cudaLaunchHostFunc(cudaStream_t stream, cudaHostFn_t fn, void *userData);
                                                                               ^~~~~~~~~~~~
In file included from /usr/local/cuda/include/channel_descriptor.h:61:0,
                 from /usr/local/cuda/include/cuda_runtime.h:95,
                 from /usr/include/cudnn.h:64,
                 from mnistCUDNN.cpp:30:
/usr/local/cuda/include/cuda_runtime_api.h:8238:55: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphCreate(cudaGraph_t *pGraph, unsigned int flags);
                                                       ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8238:55: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphCreate(cudaGraph_t *pGraph, unsigned int flags);
                                                       ^~~~~~~~~~~
                                                       cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8238:68: error: ‘pGraph’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphCreate(cudaGraph_t *pGraph, unsigned int flags);
                                                                    ^~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8238:68: note: suggested alternative: ‘CUgraph’
 extern __host__ cudaError_t CUDARTAPI cudaGraphCreate(cudaGraph_t *pGraph, unsigned int flags);
                                                                    ^~~~~~
                                                                    CUgraph
/usr/local/cuda/include/cuda_runtime_api.h:8238:76: error: expected primary-expression before ‘unsigned’
 extern __host__ cudaError_t CUDARTAPI cudaGraphCreate(cudaGraph_t *pGraph, unsigned int flags);
                                                                            ^~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8238:94: error: expression list treated as compound expression in initializer [-fpermissive]
 extern __host__ cudaError_t CUDARTAPI cudaGraphCreate(cudaGraph_t *pGraph, unsigned int flags);
                                                                                              ^
/usr/local/cuda/include/cuda_runtime_api.h:8335:62: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddKernelNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                              ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8335:62: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddKernelNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                              ^~~~~~~~~~~~~~~
                                                              cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8335:79: error: ‘pGraphNode’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddKernelNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                                               ^~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8335:79: note: suggested alternative: ‘CUgraphNode’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddKernelNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                                               ^~~~~~~~~~
                                                                               CUgraphNode
/usr/local/cuda/include/cuda_runtime_api.h:8335:91: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddKernelNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                                                           ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8335:91: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddKernelNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                                                           ^~~~~~~~~~~
                                                                                           cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8335:110: error: expected primary-expression before ‘const’
 t__ cudaError_t CUDARTAPI cudaGraphAddKernelNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8335:155: error: expected primary-expression before ‘numDependencies’
 ode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                                                                  ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8335:172: error: expected primary-expression before ‘const’
 _t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8335:218: error: expression list treated as compound expression in initializer [-fpermissive]
 nst cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaKernelNodeParams *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8368:68: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphKernelNodeGetParams(cudaGraphNode_t node, struct cudaKernelNodeParams *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8368:68: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphKernelNodeGetParams(cudaGraphNode_t node, struct cudaKernelNodeParams *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
                                                                    cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8368:90: error: expected primary-expression before ‘struct’
 extern __host__ cudaError_t CUDARTAPI cudaGraphKernelNodeGetParams(cudaGraphNode_t node, struct cudaKernelNodeParams *pNodeParams);
                                                                                          ^~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8368:130: error: expression list treated as compound expression in initializer [-fpermissive]
 or_t CUDARTAPI cudaGraphKernelNodeGetParams(cudaGraphNode_t node, struct cudaKernelNodeParams *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8393:68: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphKernelNodeSetParams(cudaGraphNode_t node, const struct cudaKernelNodeParams *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8393:68: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphKernelNodeSetParams(cudaGraphNode_t node, const struct cudaKernelNodeParams *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
                                                                    cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8393:90: error: expected primary-expression before ‘const’
 extern __host__ cudaError_t CUDARTAPI cudaGraphKernelNodeSetParams(cudaGraphNode_t node, const struct cudaKernelNodeParams *pNodeParams);
                                                                                          ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8393:136: error: expression list treated as compound expression in initializer [-fpermissive]
 UDARTAPI cudaGraphKernelNodeSetParams(cudaGraphNode_t node, const struct cudaKernelNodeParams *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8437:62: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemcpyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                              ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8437:62: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemcpyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                              ^~~~~~~~~~~~~~~
                                                              cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8437:79: error: ‘pGraphNode’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemcpyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                                               ^~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8437:79: note: suggested alternative: ‘CUgraphNode’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemcpyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                                               ^~~~~~~~~~
                                                                               CUgraphNode
/usr/local/cuda/include/cuda_runtime_api.h:8437:91: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemcpyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                                                           ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8437:91: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemcpyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                                                           ^~~~~~~~~~~
                                                                                           cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8437:110: error: expected primary-expression before ‘const’
 t__ cudaError_t CUDARTAPI cudaGraphAddMemcpyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8437:155: error: expected primary-expression before ‘numDependencies’
 ode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                                                                  ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8437:172: error: expected primary-expression before ‘const’
 _t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8437:215: error: expression list treated as compound expression in initializer [-fpermissive]
  const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemcpy3DParms *pCopyParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8460:68: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemcpyNodeGetParams(cudaGraphNode_t node, struct cudaMemcpy3DParms *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8460:68: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemcpyNodeGetParams(cudaGraphNode_t node, struct cudaMemcpy3DParms *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
                                                                    cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8460:90: error: expected primary-expression before ‘struct’
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemcpyNodeGetParams(cudaGraphNode_t node, struct cudaMemcpy3DParms *pNodeParams);
                                                                                          ^~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8460:127: error: expression list treated as compound expression in initializer [-fpermissive]
 Error_t CUDARTAPI cudaGraphMemcpyNodeGetParams(cudaGraphNode_t node, struct cudaMemcpy3DParms *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8483:68: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemcpyNodeSetParams(cudaGraphNode_t node, const struct cudaMemcpy3DParms *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8483:68: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemcpyNodeSetParams(cudaGraphNode_t node, const struct cudaMemcpy3DParms *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
                                                                    cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8483:90: error: expected primary-expression before ‘const’
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemcpyNodeSetParams(cudaGraphNode_t node, const struct cudaMemcpy3DParms *pNodeParams);
                                                                                          ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8483:133: error: expression list treated as compound expression in initializer [-fpermissive]
 t CUDARTAPI cudaGraphMemcpyNodeSetParams(cudaGraphNode_t node, const struct cudaMemcpy3DParms *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8525:62: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemsetNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                              ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8525:62: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemsetNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                              ^~~~~~~~~~~~~~~
                                                              cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8525:79: error: ‘pGraphNode’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemsetNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                                               ^~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8525:79: note: suggested alternative: ‘CUgraphNode’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemsetNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                                               ^~~~~~~~~~
                                                                               CUgraphNode
/usr/local/cuda/include/cuda_runtime_api.h:8525:91: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemsetNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                                                           ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8525:91: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddMemsetNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                                                           ^~~~~~~~~~~
                                                                                           cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8525:110: error: expected primary-expression before ‘const’
 t__ cudaError_t CUDARTAPI cudaGraphAddMemsetNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8525:155: error: expected primary-expression before ‘numDependencies’
 ode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                                                                  ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8525:172: error: expected primary-expression before ‘const’
 _t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8525:216: error: expression list treated as compound expression in initializer [-fpermissive]
 const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaMemsetParams *pMemsetParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8548:68: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemsetNodeGetParams(cudaGraphNode_t node, struct cudaMemsetParams *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8548:68: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemsetNodeGetParams(cudaGraphNode_t node, struct cudaMemsetParams *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
                                                                    cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8548:90: error: expected primary-expression before ‘struct’
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemsetNodeGetParams(cudaGraphNode_t node, struct cudaMemsetParams *pNodeParams);
                                                                                          ^~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8548:126: error: expression list treated as compound expression in initializer [-fpermissive]
 aError_t CUDARTAPI cudaGraphMemsetNodeGetParams(cudaGraphNode_t node, struct cudaMemsetParams *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8571:68: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemsetNodeSetParams(cudaGraphNode_t node, const struct cudaMemsetParams *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8571:68: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemsetNodeSetParams(cudaGraphNode_t node, const struct cudaMemsetParams *pNodeParams);
                                                                    ^~~~~~~~~~~~~~~
                                                                    cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8571:90: error: expected primary-expression before ‘const’
 extern __host__ cudaError_t CUDARTAPI cudaGraphMemsetNodeSetParams(cudaGraphNode_t node, const struct cudaMemsetParams *pNodeParams);
                                                                                          ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8571:132: error: expression list treated as compound expression in initializer [-fpermissive]
 _t CUDARTAPI cudaGraphMemsetNodeSetParams(cudaGraphNode_t node, const struct cudaMemsetParams *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8612:60: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddHostNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                            ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8612:60: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddHostNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                            ^~~~~~~~~~~~~~~
                                                            cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8612:77: error: ‘pGraphNode’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddHostNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                                             ^~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8612:77: note: suggested alternative: ‘CUgraphNode’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddHostNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                                             ^~~~~~~~~~
                                                                             CUgraphNode
/usr/local/cuda/include/cuda_runtime_api.h:8612:89: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddHostNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                                                         ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8612:89: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddHostNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                                                         ^~~~~~~~~~~
                                                                                         cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8612:108: error: expected primary-expression before ‘const’
 ost__ cudaError_t CUDARTAPI cudaGraphAddHostNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8612:153: error: expected primary-expression before ‘numDependencies’
 ode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                                                                  ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8612:170: error: expected primary-expression before ‘const’
 _t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8612:214: error: expression list treated as compound expression in initializer [-fpermissive]
 const cudaGraphNode_t *pDependencies, size_t numDependencies, const struct cudaHostNodeParams *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8635:66: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphHostNodeGetParams(cudaGraphNode_t node, struct cudaHostNodeParams *pNodeParams);
                                                                  ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8635:66: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphHostNodeGetParams(cudaGraphNode_t node, struct cudaHostNodeParams *pNodeParams);
                                                                  ^~~~~~~~~~~~~~~
                                                                  cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8635:88: error: expected primary-expression before ‘struct’
 extern __host__ cudaError_t CUDARTAPI cudaGraphHostNodeGetParams(cudaGraphNode_t node, struct cudaHostNodeParams *pNodeParams);
                                                                                        ^~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8635:126: error: expression list treated as compound expression in initializer [-fpermissive]
 aError_t CUDARTAPI cudaGraphHostNodeGetParams(cudaGraphNode_t node, struct cudaHostNodeParams *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8658:66: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphHostNodeSetParams(cudaGraphNode_t node, const struct cudaHostNodeParams *pNodeParams);
                                                                  ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8658:66: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphHostNodeSetParams(cudaGraphNode_t node, const struct cudaHostNodeParams *pNodeParams);
                                                                  ^~~~~~~~~~~~~~~
                                                                  cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8658:88: error: expected primary-expression before ‘const’
 extern __host__ cudaError_t CUDARTAPI cudaGraphHostNodeSetParams(cudaGraphNode_t node, const struct cudaHostNodeParams *pNodeParams);
                                                                                        ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8658:132: error: expression list treated as compound expression in initializer [-fpermissive]
 _t CUDARTAPI cudaGraphHostNodeSetParams(cudaGraphNode_t node, const struct cudaHostNodeParams *pNodeParams);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8696:66: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddChildGraphNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                  ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8696:66: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddChildGraphNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                  ^~~~~~~~~~~~~~~
                                                                  cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8696:83: error: ‘pGraphNode’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddChildGraphNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                                   ^~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8696:83: note: suggested alternative: ‘CUgraphNode’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddChildGraphNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                                   ^~~~~~~~~~
                                                                                   CUgraphNode
/usr/local/cuda/include/cuda_runtime_api.h:8696:95: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddChildGraphNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                                               ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8696:95: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddChildGraphNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                                               ^~~~~~~~~~~
                                                                                               cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8696:114: error: expected primary-expression before ‘const’
 cudaError_t CUDARTAPI cudaGraphAddChildGraphNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8696:159: error: expected primary-expression before ‘numDependencies’
 ode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                                                  ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8696:176: error: ‘cudaGraph_t’ was not declared in this scope
 _t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                                                  ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8696:176: note: suggested alternative: ‘cudaError_t’
 _t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                                                  ^~~~~~~~~~~
                                                                                                                                                                                cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8696:198: error: expression list treated as compound expression in initializer [-fpermissive]
 e, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies, cudaGraph_t childGraph);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8720:71: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphChildGraphNodeGetGraph(cudaGraphNode_t node, cudaGraph_t *pGraph);
                                                                       ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8720:71: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphChildGraphNodeGetGraph(cudaGraphNode_t node, cudaGraph_t *pGraph);
                                                                       ^~~~~~~~~~~~~~~
                                                                       cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8720:93: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphChildGraphNodeGetGraph(cudaGraphNode_t node, cudaGraph_t *pGraph);
                                                                                             ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8720:93: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphChildGraphNodeGetGraph(cudaGraphNode_t node, cudaGraph_t *pGraph);
                                                                                             ^~~~~~~~~~~
                                                                                             cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8720:106: error: ‘pGraph’ was not declared in this scope
 n __host__ cudaError_t CUDARTAPI cudaGraphChildGraphNodeGetGraph(cudaGraphNode_t node, cudaGraph_t *pGraph);
                                                                                                     ^~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8720:106: note: suggested alternative: ‘CUgraph’
 n __host__ cudaError_t CUDARTAPI cudaGraphChildGraphNodeGetGraph(cudaGraphNode_t node, cudaGraph_t *pGraph);
                                                                                                     ^~~~~~
                                                                                                          CUgraph
/usr/local/cuda/include/cuda_runtime_api.h:8720:112: error: expression list treated as compound expression in initializer [-fpermissive]
 n __host__ cudaError_t CUDARTAPI cudaGraphChildGraphNodeGetGraph(cudaGraphNode_t node, cudaGraph_t *pGraph);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8757:61: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddEmptyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies);
                                                             ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8757:61: note: suggested alternative: ‘cudaGraphCreate’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddEmptyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies);
                                                             ^~~~~~~~~~~~~~~
                                                             cudaGraphCreate
/usr/local/cuda/include/cuda_runtime_api.h:8757:78: error: ‘pGraphNode’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddEmptyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies);
                                                                              ^~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8757:78: note: suggested alternative: ‘CUgraphNode’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddEmptyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies);
                                                                              ^~~~~~~~~~
                                                                              CUgraphNode
/usr/local/cuda/include/cuda_runtime_api.h:8757:90: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddEmptyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies);
                                                                                          ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8757:90: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphAddEmptyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies);
                                                                                          ^~~~~~~~~~~
                                                                                          cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8757:109: error: expected primary-expression before ‘const’
 st__ cudaError_t CUDARTAPI cudaGraphAddEmptyNode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies);
                                                                                                  ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8757:154: error: expected primary-expression before ‘numDependencies’
 ode(cudaGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies);
                                                                                                  ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8757:169: error: expression list treated as compound expression in initializer [-fpermissive]
 daGraphNode_t *pGraphNode, cudaGraph_t graph, const cudaGraphNode_t *pDependencies, size_t numDependencies);
                                                                                                           ^
/usr/local/cuda/include/cuda_runtime_api.h:8784:54: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphClone(cudaGraph_t *pGraphClone, cudaGraph_t originalGraph);
                                                      ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8784:54: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphClone(cudaGraph_t *pGraphClone, cudaGraph_t originalGraph);
                                                      ^~~~~~~~~~~
                                                      cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8784:67: error: ‘pGraphClone’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphClone(cudaGraph_t *pGraphClone, cudaGraph_t originalGraph);
                                                                   ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8784:67: note: suggested alternative: ‘cuGraphClone’
 extern __host__ cudaError_t CUDARTAPI cudaGraphClone(cudaGraph_t *pGraphClone, cudaGraph_t originalGraph);
                                                                   ^~~~~~~~~~~
                                                                   cuGraphClone
/usr/local/cuda/include/cuda_runtime_api.h:8784:80: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphClone(cudaGraph_t *pGraphClone, cudaGraph_t originalGraph);
                                                                                ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8784:80: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphClone(cudaGraph_t *pGraphClone, cudaGraph_t originalGraph);
                                                                                ^~~~~~~~~~~
                                                                                cudaError_t
/usr/local/cuda/include/cuda_runtime_api.h:8784:105: error: expression list treated as compound expression in initializer [-fpermissive]
 extern __host__ cudaError_t CUDARTAPI cudaGraphClone(cudaGraph_t *pGraphClone, cudaGraph_t originalGraph);
                                                                                                         ^
/usr/local/cuda/include/cuda_runtime_api.h:8812:64: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphNodeFindInClone(cudaGraphNode_t *pNode, cudaGraphNode_t originalNode, cudaGraph_t clonedGraph);
                                                                ^~~~~~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8812:64: note: suggested alternative: ‘cudaGraphClone’
 extern __host__ cudaError_t CUDARTAPI cudaGraphNodeFindInClone(cudaGraphNode_t *pNode, cudaGraphNode_t originalNode, cudaGraph_t clonedGraph);
                                                                ^~~~~~~~~~~~~~~
                                                                cudaGraphClone
/usr/local/cuda/include/cuda_runtime_api.h:8812:81: error: ‘pNode’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphNodeFindInClone(cudaGraphNode_t *pNode, cudaGraphNode_t originalNode, cudaGraph_t clonedGraph);
                                                                                 ^~~~~
/usr/local/cuda/include/cuda_runtime_api.h:8812:81: note: suggested alternative: ‘pclose’
 extern __host__ cudaError_t CUDARTAPI cudaGraphNodeFindInClone(cudaGraphNode_t *pNode, cudaGraphNode_t originalNode, cudaGraph_t clonedGraph);
                                                                                 ^~~~~
                                                                                 pclose
/usr/local/cuda/include/cuda_runtime_api.h:8812:88: error: ‘cudaGraphNode_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphNodeFindInClone(cudaGraphNode_t *pNode, cudaGraphNode_t originalNode, cudaGraph_t clonedGraph);
                                                                                        ^~~~~~~~~~~~~~~
                                                            ^~~~~~~~~~~~~~~

(it won’t let me post the entire thing, but it is much the same until the end:)

/usr/local/cuda/include/cuda_runtime_api.h:9226:56: error: ‘cudaGraph_t’ was not declared in this scope
 extern __host__ cudaError_t CUDARTAPI cudaGraphDestroy(cudaGraph_t graph);
                                                        ^~~~~~~~~~~
/usr/local/cuda/include/cuda_runtime_api.h:9226:56: note: suggested alternative: ‘cudaError_t’
 extern __host__ cudaError_t CUDARTAPI cudaGraphDestroy(cudaGraph_t graph);
                                                        ^~~~~~~~~~~
                                                        cudaError_t
Makefile:226: recipe for target 'mnistCUDNN.o' failed
make: *** [mnistCUDNN.o] Error 1

I have followed the guide very thoroughly up to this point and can’t think of what went wrong. Any help would be greatly appreciated.

Found the fix in another post here.

Suggested fix:

Open the file:
/usr/include/cudnn.h

And change the line:
#include "driver_types.h"

to:
#include <driver_types.h>
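
For anyone who would rather script the edit than open the file by hand, here is a minimal sketch. It assumes GNU sed (the default on Ubuntu) and that the header contains the quoted include exactly as shown above. The function name fix_cudnn_include is only an illustrative helper, not part of CUDA or cuDNN. The likely reason the change helps is that a quoted include searches the including file's own directory (/usr/include) first, where an older driver_types.h without the cudaGraph_t types may be sitting, while the angle-bracket form goes straight to the compiler's include paths such as /usr/local/cuda/include. That explanation is an assumption and was not confirmed in this thread.

```shell
#!/bin/sh
# Hypothetical helper: switch the quoted include of driver_types.h in a
# cuDNN header to the angle-bracket form. The header path is passed as the
# first argument; the post above uses /usr/include/cudnn.h.
fix_cudnn_include() {
    header="$1"
    # Keep a backup next to the original before editing in place.
    cp "$header" "$header.bak" || return 1
    # Only the driver_types.h include is rewritten; other lines are untouched.
    sed -i 's|#include "driver_types.h"|#include <driver_types.h>|' "$header"
}
```

Call it as fix_cudnn_include /usr/include/cudnn.h from a root shell, since the header is a system file. The original is kept as cudnn.h.bak in case the change needs to be reverted.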