| Topic | Replies | Views | Activity |
|---|---|---|---|
| Is it expected on to see many NOPs in double precision code on Blackwell CC 12? | 15 | 135 | February 12, 2026 |
| cudaMemcpyBatchAsync | 3 | 18 | February 11, 2026 |
| CUDA device not found while using blackwell | 1 | 24 | February 11, 2026 |
| Pytorch matmul vs cudaTensorCoreGemm on Jetson Orin NX | 1 | 19 | February 11, 2026 |
| Unstable CUDA timing on Jetson AGX Orin compared to Windows GPU | 3 | 26 | February 11, 2026 |
| Assessing the Impact of High Launch Latency in CUDA Applications | 14 | 74 | February 10, 2026 |
| CUDA installation and driver/runtime incompatibility | 2 | 777 | February 9, 2026 |
| Mismatch in CUDA driver and runtime versions | 7 | 3471 | February 9, 2026 |
| cudaMemcpyAsync (P2P D2D) serializes with kernel execution | 1 | 37 | February 8, 2026 |
| The flag -gencode is not recognized | 4 | 34 | February 7, 2026 |
| Instruction 'tcgen05.alloc' not supported on .target 'sm_110' | 1 | 26 | February 7, 2026 |
| Distributed Shared Memory | 0 | 18 | February 7, 2026 |
| NVIDIA Driver Installed but nvidia-smi fails - Ubuntu kernel 6.17 DKMS installed module not loading | 1 | 89 | February 6, 2026 |
| Single-Bit Corruption Detected by Device-Side Compare in Trivial Global Copy Kernel on RTX 3060 Ti (memcheck/racecheck clean) | 5 | 33 | February 6, 2026 |
| Sequential SM Resource Splitting with CUDA Green Contexts | 0 | 19 | February 6, 2026 |
| Clarification: bank_conflicts metric vs wavefronts for shared memory LDS.128 | 1 | 24 | February 6, 2026 |
| RTX 5070 not detected by CUDA / PyTorch (no kernel image available, GPU not usable for AI frameworks) | 1 | 108 | February 6, 2026 |
| LSU Wavefront Scheduling and Shared Memory Bank Utilization on Blackwell | 6 | 58 | February 6, 2026 |
| CUDA 13 Support for libc++ on x86_64 | 0 | 17 | February 5, 2026 |
| CMake, CMAKE_CUDA_USE_RESPONSE_FILE_FOR_INCLUDES and --options-file not working | 0 | 9 | February 4, 2026 |
| CUDA - Make a specific memory access skip the cache | 2 | 47 | February 4, 2026 |
| Understanding warp scheduling on a Streaming multiprocessor | 3 | 56 | February 4, 2026 |
| Disable Logging of CUDA APIs | 0 | 34 | February 3, 2026 |
| CUDA-Vulkan image interop broken on Windows | 3 | 84 | February 2, 2026 |
| Clarification on legacy CUDA Toolkit EOL/EOS policy? | 1 | 49 | February 2, 2026 |
| CUDA Programming Guide v13.1: Missing kernel argument in 2.1.5 “Explicit Memory Management” example | 1 | 27 | February 2, 2026 |
| Getting started with parallel programming Suggested reading | 9 | 29596 | February 2, 2026 |
| cudaMemPrefetchAsync does not migrate managed memory back to host (device -> host) | 2 | 54 | February 1, 2026 |
| Configure VS 2022 / CUDA 13.1 Runtime project for OpenSSL | 0 | 19 | January 31, 2026 |
| Rtx6000 not recognized by slurm-mig-discovery | 0 | 37 | January 30, 2026 |