My interpretation is that the cuda toolkit “binary blob” i.e. the CUDA toolkit download with associated drivers and libraries, is designed/intended to support both early members of the hopper family (i.e. primarily H100) and early members of the Ada Lovelace family (all currently announced members, presumably).
w.r.t. the documentation, the 9.0 SM architecture evidently does not represent Ada Lovelace GPUs that are based on the publicly announced chips such as AD102 (see the whitepaper). It seems evident (to me) that the 9.0 arch description mentions 64 FP64 cores, and no mention of those is made in the aforementioned ada lovelace whitepaper (&). I doubt that is a careless omission. So it seems evident to me that the 9.0 architecture description does not match various members of the Ada Lovelace family GPUs.
((&)In fact, in a note to figure 1, the whitepaper explicitly states that the AD102 SM contains 2 FP64 units per SM, so it is evident there is a mismatch against 9.0 SM description.)
Unfortunately without documentation I can’t go beyond that. Although supported, the Ada Lovelace SM is not yet documented to the same extent as the 9.0 SM.