My application needs large 2D arrays on both the host and the device. How do I create them on the device using CUDA? I am allocating the arrays dynamically with malloc() and cudaMalloc(), but I get an error during compilation … relocation truncated to fit: R_X86_64_PC32 against `.bss'
May I get help?
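
For context, here is a minimal sketch of one common way to allocate a 2D array on both sides: a single contiguous heap block on the host, and a pitched allocation from cudaMallocPitch() on the device. The dimensions ROWS and COLS and the kernel addOne are hypothetical names chosen for illustration, not taken from my application. The `.bss` relocation message usually appears when very large arrays are declared with a fixed size at global or static scope (more than about 2 GB in total), so this sketch assumes everything large lives on the heap instead.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical dimensions, chosen only for illustration.
constexpr size_t ROWS = 4096;
constexpr size_t COLS = 4096;

// Hypothetical kernel: adds 1.0f to every element of a pitched 2D array.
__global__ void addOne(float *data, size_t pitch, size_t rows, size_t cols) {
    size_t col = blockIdx.x * blockDim.x + threadIdx.x;
    size_t row = blockIdx.y * blockDim.y + threadIdx.y;
    if (row < rows && col < cols) {
        // pitch is in bytes, so step through rows with a char pointer.
        float *rowPtr = (float *)((char *)data + row * pitch);
        rowPtr[col] += 1.0f;
    }
}

int main() {
    // Host: one contiguous heap block, indexed as h[row * COLS + col].
    // No large global or static array is declared, so nothing big lands in .bss.
    float *h = (float *)malloc(ROWS * COLS * sizeof(float));
    if (h == nullptr) {
        fprintf(stderr, "malloc failed\n");
        return 1;
    }
    for (size_t i = 0; i < ROWS * COLS; ++i) h[i] = 0.0f;

    // Device: pitched allocation; the runtime may pad each row for alignment.
    float *d = nullptr;
    size_t pitch = 0;
    cudaError_t err = cudaMallocPitch((void **)&d, &pitch,
                                      COLS * sizeof(float), ROWS);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaMallocPitch: %s\n", cudaGetErrorString(err));
        free(h);
        return 1;
    }

    // Copy host -> device, row by row, honoring the two different pitches.
    cudaMemcpy2D(d, pitch, h, COLS * sizeof(float),
                 COLS * sizeof(float), ROWS, cudaMemcpyHostToDevice);

    dim3 block(16, 16);
    dim3 grid((unsigned)((COLS + block.x - 1) / block.x),
              (unsigned)((ROWS + block.y - 1) / block.y));
    addOne<<<grid, block>>>(d, pitch, ROWS, COLS);
    err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        fprintf(stderr, "kernel: %s\n", cudaGetErrorString(err));
    }

    // Copy device -> host.
    cudaMemcpy2D(h, COLS * sizeof(float), d, pitch,
                 COLS * sizeof(float), ROWS, cudaMemcpyDeviceToHost);

    printf("h[0] = %f\n", h[0]);

    cudaFree(d);
    free(h);
    return 0;
}
```

If the fixed-size global arrays have to stay, building the host code with GCC's `-mcmodel=medium` option is another route that is sometimes suggested for this linker message, though whether it fits a given nvcc build is an assumption that needs checking.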