Hi,
I have an example code here:
program testsum
implicit none
integer::i,j,k,lj,rj
integer::a(10,3),suma(5,3)
!$acc data region local(a,suma)
do k=1,500
!$acc region
do i=1,10
a(i,1)=i
a(i,2)=i*2
a(i,3)=i*3
end do
!$acc end region
!$acc region
do j=1,5
lj=(j-1)*2 + 1
rj=j*2
suma(j,1)=sum(a(lj:rj,1))
suma(j,2)=sum(a(lj:rj,2))
suma(j,3)=sum(a(lj:rj,3))
end do
!$acc end region
end do
!$acc updateout(suma)
!$acc end data region
write(*,*)suma(:,1)
end program
It fails to run:
call to EventSynchronize returned error 700: Launch failed
CUDA driver version: 2030
Accelerator Kernel Timing data
/mnt/home/../test.f
testsum
18: region entered 138 times
time(us): total=4499 init=14 region=4485
kernels=1499 data=0
w/o init: total=4485 max=136 min=31 avg=32
19: kernel launched 138 times
grid: [1] block: [5]
time(us): total=1499 max=12 min=10 avg=10
/mnt/home/../test.f
testsum
10: region entered 138 times
time(us): total=4142 init=19 region=4123
kernels=1109 data=0
w/o init: total=4123 max=134 min=28 avg=29
11: kernel launched 138 times
grid: [1] block: [10]
time(us): total=1109 max=13 min=7 avg=8
/mnt/home/../test.f
testsum
7: region entered 1 time
time(us): init=696724
Can someone explain this error?
Thanks.