Dear All,
Is it possible to pass a multi-dimentional array to the first argument in the function cudaMemPrefetchAsync() ? At that time, what numeric should I pass to the second argument ?
Here is the test code.
integer(acc_handle_kind) :: stream
real(kind=8),dimension(:,:),allocatable :: A
allocate(A(4096,4096))
A = 1.0d0
stream = acc_get_cuda_stream(acc_async_sync)
!$acc host_data use_device(A)
call cudaMemPrefetchAsync(A,4096*4096, 0, stream)
!$acc end host_data
deallocate(A)