I am running model inference experiments on an A30. I noticed that running the same inference on the full GPU and on MIG instances can produce inconsistent results, even though the model and the inference code are unchanged between the two runs.
By "inconsistencies" I mean that the output tensors from the two runs are not even close to each other.
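To make "not even close" concrete, it may help to quantify the gap between the two outputs. Below is a minimal sketch, assuming the output tensors from the full-GPU run and the MIG run have each been flattened and exported to plain Python lists of floats (how to do that depends on the framework). The function name `compare_outputs` and the argument names are hypothetical, and the default tolerances are simply the defaults used by `numpy.allclose`, not values specific to the A30 or MIG.

```python
def compare_outputs(full_gpu, mig, rtol=1e-5, atol=1e-8):
    """Compare two flat lists of floats element by element.

    Hypothetical helper: `full_gpu` and `mig` are assumed to be the
    flattened outputs of the same model on the same input.
    Uses the closeness rule |a - b| <= atol + rtol * |b|.
    """
    if len(full_gpu) != len(mig):
        raise ValueError("outputs must have the same number of elements")

    max_abs = 0.0     # largest absolute difference seen
    max_rel = 0.0     # largest relative difference seen (where b != 0)
    mismatched = 0    # number of elements outside the tolerance

    for a, b in zip(full_gpu, mig):
        diff = abs(a - b)
        max_abs = max(max_abs, diff)
        if b != 0.0:
            max_rel = max(max_rel, diff / abs(b))
        # "not (diff <= tol)" also counts NaN differences as mismatches
        if not diff <= atol + rtol * abs(b):
            mismatched += 1

    return {
        "max_abs_diff": max_abs,
        "max_rel_diff": max_rel,
        "mismatched": mismatched,
        "allclose": mismatched == 0,
    }


if __name__ == "__main__":
    # Illustrative values only, not measured data.
    report = compare_outputs([1.0, 2.0, 3.0], [1.0, 2.5, 3.0])
    print(report)
```

Reporting the maximum absolute and relative differences alongside the mismatch count helps distinguish small floating-point drift from outputs that genuinely diverge.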