CUDA 2.0 Beta deviceQuery on 8800 GTS 320MB reports 16 multiprocessors

The latest CUDA 2.0 Beta sample project deviceQuery reports:

There is 1 device supporting CUDA

Device 0: “GeForce 8800 GTS”
Major revision number: 1
Minor revision number: 0
Total amount of global memory: 335216640 bytes
Number of multiprocessors: 16
Number of cores: 128
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 16384 bytes
Total number of registers available per block: 8192
Warp size: 32
Maximum number of threads per block: 512
Maximum sizes of each dimension of a block: 512 x 512 x 64
Maximum sizes of each dimension of a grid: 65535 x 65535 x 1
Maximum memory pitch: 262144 bytes
Texture alignment: 256 bytes
Clock rate: 1.19 GHz
Concurrent copy and execution: Yes


Press ENTER to exit…

But, as you know, the 8800 GTS 320MB (G80) has only 96 streaming processors (cores), i.e. 12 multiprocessors with 8 cores each. So the reported 16 multiprocessors and 128 cores look like a bug.
My driver is the CUDA 2.0 Beta driver on Windows XP.
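
To make the arithmetic explicit, here is a minimal sketch in plain C (no GPU needed). It assumes 8 cores per multiprocessor, which holds for compute capability 1.x parts like G80, and it assumes that deviceQuery derives its "Number of cores" line from the multiprocessor count rather than measuring it separately. The helper names are made up for illustration and are not part of the CUDA SDK.

```c
#include <stdio.h>

/* Assumption: compute capability 1.x GPUs (e.g. G80) have
   8 scalar cores per multiprocessor. */
enum { CORES_PER_MULTIPROCESSOR = 8 };

/* Hypothetical helper: core count implied by a multiprocessor count. */
int cores_from_multiprocessors(int multiprocessors)
{
    return multiprocessors * CORES_PER_MULTIPROCESSOR;
}

/* Hypothetical helper: print reported vs. expected figures and
   return 1 if the multiprocessor counts disagree, 0 otherwise. */
int report_mismatch(int reported_mp, int expected_mp)
{
    printf("reported: %d multiprocessors -> %d cores\n",
           reported_mp, cores_from_multiprocessors(reported_mp));
    printf("expected: %d multiprocessors -> %d cores\n",
           expected_mp, cores_from_multiprocessors(expected_mp));
    return reported_mp != expected_mp;
}
```

Calling report_mismatch(16, 12) with the numbers from this post gives 128 cores for the reported value and 96 cores for the expected one. If the assumption about deviceQuery holds, the wrong core count is just a consequence of the wrong multiprocessor count, so there is only one number to track down.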