Similar to this post, I’m trying to run Valgrind, but on an AGX Orin. I do get more details in the strack trace though, so I can tell you that this instruction occurs during a call to cudaStreamCreate. Could you please try one of the CUDA Samples that uses streams? I’m running CUDA 11.4.243.