Xptxas default cache modifier on global/generic load and store not working

According to my testing on godbolt there was a change in compiler behavior sometime between CUDA 12.3.1 and CUDA 12.4.1.

If this is of concern to you, you may wish to file a bug

1 Like