Determine CUDA Context Memory Usage

Hi,

Here is some information from our internal team.

The amount of memory pre-allocated for the CUDA context depends on the number of SMs on the GPU.
A GPU with more SMs requires more memory.

Currently, there is no reliable mechanism to measure it across different GPUs.
You will need to test it directly on the target to get the information.

Thanks.