Incorrect "double" calculations when profiling with Nsight Systems

Vectorizer · October 5, 2025, 9:58am

My CUDA kernels, which calculate double values, produce results consistent with the reference values when I run it under Visual Studio (debug and release), standalone or with Nsight Compute. However when I profile with Nsight Systems, I am getting a ton of errors:

...
at 3357224 -3.14949 should have been -2.67323
at 3357225 -3.14752 should have been -2.67126
at 3357226 -3.14776 should have been -2.6715
at 3357227 -3.14772 should have been -2.67146
at 3357228 -3.14823 should have been -2.67197
at 3357229 -3.14669 should have been -2.67042
at 3357230 -3.14863 should have been -2.67237
at 3357231 -3.14981 should have been -2.67354
at 3357232 -3.15097 should have been -2.6747
at 3357233 -3.15086 should have been -2.67459
at 3357234 -3.14987 should have been -2.6736
at 3357235 -3.14975 should have been -2.67349
at 3357236 -3.15079 should have been -2.67453
at 3357237 -3.15018 should have been -2.6739
...

I observe this when using shuffle instructions, the kernel that does not use shuffle is not experiencing this.

hwilper · October 6, 2025, 1:55pm

@mjain could this be a CUPTI issue?

Greg · October 8, 2025, 7:23am

Please provide a minimal reproducible, driver version, GPU model, and tools versions.

Vectorizer · October 10, 2025, 12:10pm

Turned out there was a subtle race condition and it manifested itself only when run under Nsight Systems and not under Nsight compute or standalone etc(why?)

Greg · October 11, 2025, 6:50am

Please provide sufficient information if you would like help. Please describe the race condition in sufficient detail and please provide the requested information and a minimal reproducible.

Topic		Replies	Views
Kernel output all correct but got NAN when profiling with nsight-compute Nsight Compute cuda	5	985	January 12, 2024
NSight incorrectly displays doubles expression result CUDA Programming and Performance	1	434	October 9, 2018
CUDA kernel launched from Nsight Compute gives inconsistent results Nsight Compute	1	508	October 20, 2022
Inconsistent results with nsight systems Profiling Embedded Targets	5	959	June 20, 2023
nsight-compute's profiling result is different from nvprof's Nsight Compute	5	716	October 12, 2021
Device computed value significantly different from precomputed value displayed in NSight/VS2010 Nsight Visual Studio Edition	5	1166	July 5, 2013
Executable under Nsight Systems profiler doesn't work Nsight Visual Studio Edition profiling	2	946	October 30, 2021
Cannot profile CUDA kernel using NC : Run Bottleneck returned an error Nsight Compute	4	607	October 12, 2021
The profiler returned an error code:1 Nsight Compute	1	2165	March 2, 2022
nsight compute ui and cli can't profiling any cuda application Nsight Compute	6	3981	August 21, 2019

Incorrect "double" calculations when profiling with Nsight Systems

Related topics