Hi there
When using the normal malloc() call on a Linux system, newly allocated memory is always aligned to an address that is a multiple of four. Does anyone know whether this is also the case with cudaMalloc()?
I'm asking because I'm currently writing a program in which it's important that the two lowest bits of a memory address are always zero, which is the case for newly allocated host memory on Linux (at least when allocated with malloc()).
I checked the CUDA Programming Guide, but couldn't find this specific information anywhere.
Thanks!