I am aware than the size of the printf buffer is 1MB.
cudaDeviceGetLimit(&printfBufferSz,cudaLimitPrintfFifoSize) also reports 1048576 on my system.
However, i have some code that generates quite a lot of printf data and when i run it i get roughly 120KB of output. What’s more weird is that if i set
cudaThreadSetLimit(cudaLimitPrintfFifoSize, printfBufferSz * 2) i indeed get roughly 240KB of output. Is there any reason for this incosistency? Am i missing something?
I am working on a gtx 950 and cuda 10.