Hi, I tested deviceQuery
sample in CUDASamples, part of the result output is shown below
CUDA Device Query (Runtime API) version (CUDART static linking)
Detected 1 CUDA Capable device(s)
Device 0: "NVIDIA A800 80GB PCIe"
CUDA Driver Version / Runtime Version 12.2 / 12.1
CUDA Capability Major/Minor version number: 8.0
Total amount of global memory: 81051 MBytes (84987740160 bytes)
(108) Multiprocessors, (064) CUDA Cores/MP: 6912 CUDA Cores
GPU Max Clock rate: 1410 MHz (1.41 GHz)
Memory Clock rate: 1512 Mhz
Memory Bus Width: 5120-bit
L2 Cache Size: 41943040 bytes
The results show that L2 Cache Size is 40MB, but as far as I could google, the L2 cache size of A800 80GB is 80MB, So what went wrong?
And someone say the A800 is made from two GA100 chips, each of which has 40MB of L2 cache, is that true?