The power can achieve 37W (super mode), and the temperature is closed to 80-90C. But I got 0.137 TOPS only, that is far less than the 157 TOPS which the official announced. Could you give me some advice?
And the method that the program calculates the GFlop :
GFlop = (2* $Matrix_size *10e-9) / (operation time / 10e-3)
Oh, it seem like Ops is matrix size 240924092*4092 Ops=137438953472 Ops. I misunderstanded the value …
But is the 375417490.63 GFlop/s also too larger right?
The jetson clock is running, and the power mode is already MAXN_SUPER.
But, the performance is still only 37 TOPS, which is significantly below the sparse INT8 performance (100 TOPS) written in the datasheet.
Could you give me some advise?
BTW, my jetson is ORIN NX, not AGX ORIN. Should the parameters be changed for my case?
Thanks !!!
I got the 46909 GFLOP/s = 46.9TOPS in m=512, n=512, k=16256, Identity2.
But, I can’t understand that the SOL is 46.9/60 or 46.9/100 ?
BTW, I don’t know how to test DLA’s TOPS, does NV provide the test tool recently?