The p2pBandwidthLatencyTest only run on 1 GPU

phungdaovinhchung1 · September 13, 2019, 3:47am

Hi, I’m trying to check the GPUs on SYS-4029GP-TRT3 and I ran into this error.

My system:
cuda: 10.0.130
driver: 410.57
8x RTX 2080 ti

As the result below, I ran the p2pBandwidthLatencyTest build from the this repo: https://github.com/NVIDIA/cuda-samples/tree/master/Samples/p2pBandwidthLatencyTest. I got the result of only 1 GPU. While running the nvidia-smi command, I can get all the GPUs.

[P2P (Peer-to-Peer) GPU Bandwidth Latency Test]
Device: 0, GeForce RTX 2080 Ti, pciBusID: 24, pciDeviceID: 0, pciDomainID:0

***NOTE: In case a device doesn't have P2P access to other one, it falls back to normal memcopy procedure.
So you can see lesser Bandwidth (GB/s) and unstable Latency (us) in those cases.

P2P Connectivity Matrix
     D\D     0
     0	     1
Unidirectional P2P=Disabled Bandwidth Matrix (GB/s)
   D\D     0 
     0 528.25 
Unidirectional P2P=Enabled Bandwidth (P2P Writes) Matrix (GB/s)
   D\D     0 
     0 530.74 
Bidirectional P2P=Disabled Bandwidth Matrix (GB/s)
   D\D     0 
     0 532.91 
Bidirectional P2P=Enabled Bandwidth Matrix (GB/s)
   D\D     0 
     0 531.46 
P2P=Disabled Latency Matrix (us)
   GPU     0 
     0   2.07 

   CPU     0 
     0   8.67 
P2P=Enabled Latency (P2P Writes) Matrix (us)
   GPU     0 
     0   2.07 

   CPU     0 
     0   8.70 

NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.

Edit: I checked on the nvlink status using

nvidia-smi nvlink -s

and found that all the links are inactive. Is this the reason the test cannot work on all the GPUs?

Please let me know if there is any problem with my system.
Thank you,
Chung

Robert_Crovella · September 13, 2019, 1:13pm

This might a result of having the CUDA_VISIBLE_DEVICES environment variable set.

Topic		Replies	Views
the bandwidth is low between my gpus. tested with p2pBandwidthLatencyTest CUDA Programming and Performance	0	649	March 28, 2018
How can I improve the 'p2p enabled' bandwidth when testing NCCL performance with two A5000 GPU using PCIe 4.0 x16? CUDA Programming and Performance cuda	2	1142	September 15, 2023
K80 peer-to-peer transfers: Slow bandwidth and high latency. CUDA Programming and Performance	7	3680	August 31, 2016
Question about P2P transfer bandwidth between two RTX2080s CUDA Programming and Performance	1	573	November 2, 2023
cuda p2p access not working for multiple k80s CUDA Programming and Performance	0	499	July 8, 2016
Failed to monitor nvlink throughput CUDA Developer Tools	0	915	December 29, 2020
Peer-to-Peer Memory Access can suppport a system-wide max of 8 peer connections CUDA Programming and Performance	4	1463	August 30, 2017
P2P: How do I know if cudaMemcpy falls back to non-P2P? CUDA Programming and Performance	8	2386	October 12, 2021
How to enable P2P access? CUDA Setup and Installation cuda	3	4511	February 6, 2023
Low P2P GPU bandwidth performance between GeForce GPUs CUDA Programming and Performance	20	845	October 9, 2024

The p2pBandwidthLatencyTest only run on 1 GPU

Related topics