this may be of interest
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Can I do async copy from global memory to register in hopper? | 7 | 179 | July 2, 2024 | |
too many registers issue with memory writes and registers | 7 | 1921 | July 13, 2011 | |
Can we directly use register value for tensor core calculation? | 4 | 574 | October 18, 2023 | |
Mixing ressource use to increase the SM occupancy | 5 | 3352 | August 6, 2008 | |
Avoiding a device write using textures and arrays. | 3 | 2795 | August 7, 2008 | |
How to use PTX prefetch.global with ASM? compiles but do not see prefetch instruction with cuobjdump | 7 | 5166 | May 7, 2012 | |
Issues about async on A100 | 22 | 41 | March 19, 2025 | |
cudaMemcpy() behavior question | 4 | 6658 | August 8, 2007 | |
Register Indexing | 4 | 1190 | March 6, 2011 | |
Global memory vs register storage How to force the compiler to use registers? | 6 | 4987 | July 3, 2009 |