Hi,
Do you use TensorRT for inference?
If yes, it is normal. Since TensorRT tends to use all the available GPU resources to achieve the best performance.
You can control this by adjusting the frequency of feeding input to TensorRT.
Thanks.
Hi,
Do you use TensorRT for inference?
If yes, it is normal. Since TensorRT tends to use all the available GPU resources to achieve the best performance.
You can control this by adjusting the frequency of feeding input to TensorRT.
Thanks.