[p2pBandwidthLatencyTest] failed in DGX-1

Hi All,

I am running p2pBandwidthLatencyTest on DGX-1, but it got failed as following:

/////////////////////////////
***NOTE: In case a device doesn’t have P2P access to other one, it falls back to normal memcopy procedure.
So you can see lesser Bandwidth (GB/s) in those cases.

P2P Connectivity Matrix
D\D 0 1 2 3 4 5 6 7
0 1 1 1 1 1 0 0 0
1 1 1 1 1 0 1 0 0
2 1 1 1 1 0 0 1 0
3 1 1 1 1 0 0 0 1
4 1 0 0 0 1 1 1 1
5 0 1 0 0 1 1 1 1
6 0 0 1 0 1 1 1 1
7 0 0 0 1 1 1 1 1
Unidirectional P2P=Disabled Bandwidth Matrix (GB/s)
Cuda failure p2pBandwidthLatencyTest.cu:105: ‘unspecified launch failure’
////////////////////////////

If I run it in another system with only two P100, p2pBandwidthLatencyTest can run correctly, what’s the problem on my DGX-1 setting ?