I’m unable to find any updates on Compute Capabilities 10.0 and 12.0 for instruction throughput and latencies for Blackwell GPUs despite the official release of the hardware and CUDA 12.8. E.g. the tables in ‘CUDA C++ Programming Guide - 5.4. Maximize Instruction Throughput’ only go up to CC 9.0.
Is there somewhere to find this information or perhaps CUDA docs team have accidentally omitted an update?