Hi,
I am developing a program which need capture the GPU hardware error use DCGM, such as PCI error, ECC error and so on. However I cannot get any error even warning when use dcgmi health -g 1 -c. Is there any way to simulate the GPU hardware error? I need do the test.
Thanks