I’m experiencing a ULF when executing the same kernel repeatedly on the same data, but this error seems to occur at different iterations each time I run the program. I’ve gone through the forums and read that a few people have experienced similar problems.
Have you guys got any suggestions on how to move forward? In particular, my kernel doesn’t exhibit the ULF at all if I remove the “volatile” keyword from shared memory - which is really strange.
I’ve kind of hit a brick wall on this :(
I can post my code if you think it will help, but it’s a bit long. The reason for using “volatile” is that I need threads to perform atomic operations on shared memory using the “write-combining” approach shown in the Histogram SDK example.