Sorry for the delay I used to install cuda from lambda stack, I finally managed to use compute-sanitizer
error:
========= COMPUTE-SANITIZER
========= Program hit invalid argument (error 1) on CUDA API call to cudaMemcpy.
========= Saved host backtrace up to driver entry point at error
========= Host Frame: [0x355b43]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0x5b77d]
========= in /home/venus/./a.out
========= Host Frame: [0x7c77]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (0,0,0) in block (0,0,0)
========= Address 0x3e000000 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (1,0,0) in block (0,0,0)
========= Address 0x3e000010 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (2,0,0) in block (0,0,0)
========= Address 0x3e000020 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (3,0,0) in block (0,0,0)
========= Address 0x3e000030 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (4,0,0) in block (0,0,0)
========= Address 0x3e000040 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (5,0,0) in block (0,0,0)
========= Address 0x3e000050 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (6,0,0) in block (0,0,0)
========= Address 0x3e000060 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (7,0,0) in block (0,0,0)
========= Address 0x3e000070 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (8,0,0) in block (0,0,0)
========= Address 0x3e000080 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (9,0,0) in block (0,0,0)
========= Address 0x3e000090 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (10,0,0) in block (0,0,0)
========= Address 0x3e0000a0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (11,0,0) in block (0,0,0)
========= Address 0x3e0000b0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (12,0,0) in block (0,0,0)
========= Address 0x3e0000c0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (13,0,0) in block (0,0,0)
========= Address 0x3e0000d0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (14,0,0) in block (0,0,0)
========= Address 0x3e0000e0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (15,0,0) in block (0,0,0)
========= Address 0x3e0000f0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (16,0,0) in block (0,0,0)
========= Address 0x3e000100 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (17,0,0) in block (0,0,0)
========= Address 0x3e000110 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (18,0,0) in block (0,0,0)
========= Address 0x3e000120 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (19,0,0) in block (0,0,0)
========= Address 0x3e000130 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (20,0,0) in block (0,0,0)
========= Address 0x3e000140 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (21,0,0) in block (0,0,0)
========= Address 0x3e000150 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (22,0,0) in block (0,0,0)
========= Address 0x3e000160 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (23,0,0) in block (0,0,0)
========= Address 0x3e000170 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (24,0,0) in block (0,0,0)
========= Address 0x3e000180 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (25,0,0) in block (0,0,0)
========= Address 0x3e000190 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (26,0,0) in block (0,0,0)
========= Address 0x3e0001a0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (27,0,0) in block (0,0,0)
========= Address 0x3e0001b0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (28,0,0) in block (0,0,0)
========= Address 0x3e0001c0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (29,0,0) in block (0,0,0)
========= Address 0x3e0001d0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (30,0,0) in block (0,0,0)
========= Address 0x3e0001e0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Invalid __shared__ write of size 16 bytes
========= at 0x90 in Test(float*, unsigned int*, float*)
========= by thread (31,0,0) in block (0,0,0)
========= Address 0x3e0001f0 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame: [0x25428a]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0xc74b]
========= in /home/venus/./a.out
========= Host Frame: [0x5fe70]
========= in /home/venus/./a.out
========= Host Frame: [0x8090]
========= in /home/venus/./a.out
========= Host Frame: [0x7ee3]
========= in /home/venus/./a.out
========= Host Frame: [0x7f3a]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf0]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Program hit unspecified launch failure (error 719) on CUDA API call to cudaDeviceSynchronize.
========= Saved host backtrace up to driver entry point at error
========= Host Frame: [0x355b43]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0x3fa47]
========= in /home/venus/./a.out
========= Host Frame: [0x7cf5]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
========= Program hit unspecified launch failure (error 719) on CUDA API call to cudaMemcpy.
========= Saved host backtrace up to driver entry point at error
========= Host Frame: [0x355b43]
========= in /lib/x86_64-linux-gnu/libcuda.so.1
========= Host Frame: [0x5b77d]
========= in /home/venus/./a.out
========= Host Frame: [0x7d0f]
========= in /home/venus/./a.out
========= Host Frame:__libc_start_main [0x270b3]
========= in /lib/x86_64-linux-gnu/libc.so.6
========= Host Frame: [0x7ace]
========= in /home/venus/./a.out
=========
30.6386========= ERROR SUMMARY: 35 errors
thank