Hello,
I have two exactly the same graphic cards on my computer. I found that using two GPUs make my program slower than using only one, and then I doubt whether there are any setting problem. Here are some information of my computer:
Model: Dell Precision Tower 5810
OS: Windows 10
Graphic Cards: 2 * NVidia Quadro M4000
Cuda: 8.0
cuDNN: 5.1
I run the simpleP2P test and the results are:
[simpleP2P.exe] - Starting…
Checking for multiple GPUs…
CUDA-capable device count: 2
GPU0 = " Quadro M4000" IS capable of Peer-to-Peer (P2P)
GPU1 = " Quadro M4000" NOT capable of Peer-to-Peer (P2P)
Two or more GPUs with SM 2.0 or higher capability are required for simpleP2P.exe.
Also, a TCC driver must be installed and enabled to run simpleP2P.exe.
What should I do to solve the above problem and make two GPUs working faster than using only one GPU? Thank you.
There is no topo option in my nvidia-smi command, therefore, I use -a to list as much as possible as below.
==============NVSMI LOG==============
Timestamp : Wed Apr 12 13:12:54 2017
Driver Version : 376.51
Attached GPUs : 2
GPU 0000:03:00.0
Product Name : Quadro M4000
Product Brand : Quadro
Display Mode : Enabled
Display Active : Enabled
Persistence Mode : N/A
Accounting Mode : Disabled
Accounting Mode Buffer Size : 1920
Driver Model
Current : WDDM
Pending : WDDM
Serial Number : 0325016106824
GPU UUID : GPU-035131cf-9127-b14a-eccb-bdc741376f02
Minor Number : N/A
VBIOS Version : 84.04.88.00.06
MultiGPU Board : No
Board ID : 0x300
GPU Part Number : 900-5G400-0100-000
Inforom Version
Image Version : G400.0501.01.03
OEM Object : 1.1
ECC Object : N/A
Power Management Object : N/A
GPU Operation Mode
Current : N/A
Pending : N/A
GPU Virtualization Mode
Virtualization mode : None
PCI
Bus : 0x03
Device : 0x00
Domain : 0x0000
Device Id : 0x13F110DE
Bus Id : 0000:03:00.0
Sub System Id : 0x115310DE
GPU Link Info
PCIe Generation
Max : 3
Current : 1
Link Width
Max : 16x
Current : 16x
Bridge Chip
Type : N/A
Firmware : N/A
Replays since reset : 0
Tx Throughput : 8000 KB/s
Rx Throughput : 3000 KB/s
Fan Speed : 50 %
Performance State : P8
Clocks Throttle Reasons
Idle : Active
Applications Clocks Setting : Not Active
SW Power Cap : Not Active
HW Slowdown : Not Active
Sync Boost : Not Active
Unknown : Not Active
FB Memory Usage
Total : 8192 MiB
Used : 7019 MiB
Free : 1173 MiB
BAR1 Memory Usage
Total : 256 MiB
Used : 229 MiB
Free : 27 MiB
Compute Mode : Default
Utilization
Gpu : 0 %
Memory : 6 %
Encoder : 0 %
Decoder : 0 %
Ecc Mode
Current : N/A
Pending : N/A
ECC Errors
Volatile
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : N/A
Aggregate
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : N/A
Retired Pages
Single Bit ECC : N/A
Double Bit ECC : N/A
Pending : N/A
Temperature
GPU Current Temp : 50 C
GPU Shutdown Temp : 104 C
GPU Slowdown Temp : 99 C
Power Readings
Power Management : Supported
Power Draw : 22.25 W
Power Limit : 120.00 W
Default Power Limit : 120.00 W
Enforced Power Limit : 120.00 W
Min Power Limit : 10.00 W
Max Power Limit : 120.00 W
Clocks
Graphics : 135 MHz
SM : 135 MHz
Memory : 324 MHz
Video : 405 MHz
Applications Clocks
Graphics : 772 MHz
Memory : 3005 MHz
Default Applications Clocks
Graphics : 772 MHz
Memory : 3005 MHz
Max Clocks
Graphics : 772 MHz
SM : 772 MHz
Memory : 3005 MHz
Video : 710 MHz
Clock Policy
Auto Boost : On
Auto Boost Default : On
Processes
Process ID : 292
Type : C+G
Name : C:\Program Files (x86)\Internet Explorer\iexplore.exe
Used GPU Memory : Not available in WDDM driver model
Process ID : 1352
Type : Insufficient Permissions
Name : Insufficient Permissions
Used GPU Memory : Not available in WDDM driver model
Process ID : 3880
Type : C+G
Name : C:\Windows\explorer.exe
Used GPU Memory : Not available in WDDM driver model
Process ID : 7560
Type : C+G
Name : C:\Windows\SystemApps\ShellExperienceHost_cw5n1h2txyewy\ShellExperienceHost.exe
Used GPU Memory : Not available in WDDM driver model
Process ID : 8252
Type : C+G
Name : C:\Program Files (x86)\Microsoft Office\Office16\OUTLOOK.EXE
Used GPU Memory : Not available in WDDM driver model
Process ID : 8936
Type : C+G
Name : C:\Program Files (x86)\Google\Chrome\Application\chrome.exe
Used GPU Memory : Not available in WDDM driver model
Process ID : 9212
Type : C+G
Name : C:\Windows\SystemApps\Microsoft.Windows.Cortana_cw5n1h2txyewy\SearchUI.exe
Used GPU Memory : Not available in WDDM driver model
Process ID : 10644
Type : C+G
Name : C:\Program Files\WindowsApps\Microsoft.WindowsCalculator_10.1604.21020.0_x64__8wekyb3d8bbwe\Calculator.exe
Used GPU Memory : Not available in WDDM driver model
Process ID : 10700
Type : C+G
Name : C:\Program Files (x86)\Microsoft Visual Studio 10.0\Common7\IDE\devenv.exe
Used GPU Memory : Not available in WDDM driver model
Process ID : 10880
Type : C+G
Name : C:\Windows\System32\ApplicationFrameHost.exe
Used GPU Memory : Not available in WDDM driver model
Process ID : 11024
Type : C
Name : C:\Users\brandon5\AppData\Local\Programs\Python\Python35\python.exe
Used GPU Memory : Not available in WDDM driver model
GPU 0000:04:00.0
Product Name : Quadro M4000
Product Brand : Quadro
Display Mode : Disabled
Display Active : Disabled
Persistence Mode : N/A
Accounting Mode : Disabled
Accounting Mode Buffer Size : 1920
Driver Model
Current : TCC
Pending : TCC
Serial Number : 0323216038356
GPU UUID : GPU-c609aa7a-58ff-2ac3-06ba-7e563761c5f9
Minor Number : N/A
VBIOS Version : 84.04.88.00.06
MultiGPU Board : No
Board ID : 0x400
GPU Part Number : N/A
Inforom Version
Image Version : G400.0501.01.03
OEM Object : 1.1
ECC Object : N/A
Power Management Object : N/A
GPU Operation Mode
Current : N/A
Pending : N/A
GPU Virtualization Mode
Virtualization mode : None
PCI
Bus : 0x04
Device : 0x00
Domain : 0x0000
Device Id : 0x13F110DE
Bus Id : 0000:04:00.0
Sub System Id : 0x115310DE
GPU Link Info
PCIe Generation
Max : 3
Current : 3
Link Width
Max : 16x
Current : 16x
Bridge Chip
Type : N/A
Firmware : N/A
Replays since reset : 0
Tx Throughput : 25000 KB/s
Rx Throughput : 68000 KB/s
Fan Speed : 65 %
Performance State : P0
Clocks Throttle Reasons
Idle : Not Active
Applications Clocks Setting : Not Active
SW Power Cap : Not Active
HW Slowdown : Not Active
Sync Boost : Not Active
Unknown : Not Active
FB Memory Usage
Total : 8121 MiB
Used : 7826 MiB
Free : 295 MiB
BAR1 Memory Usage
Total : 256 MiB
Used : 2 MiB
Free : 254 MiB
Compute Mode : Default
Utilization
Gpu : 74 %
Memory : 42 %
Encoder : 0 %
Decoder : 0 %
Ecc Mode
Current : N/A
Pending : N/A
ECC Errors
Volatile
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : N/A
Aggregate
Single Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : N/A
Double Bit
Device Memory : N/A
Register File : N/A
L1 Cache : N/A
L2 Cache : N/A
Texture Memory : N/A
Texture Shared : N/A
Total : N/A
Retired Pages
Single Bit ECC : N/A
Double Bit ECC : N/A
Pending : N/A
Temperature
GPU Current Temp : 79 C
GPU Shutdown Temp : 104 C
GPU Slowdown Temp : 99 C
Power Readings
Power Management : Supported
Power Draw : 98.35 W
Power Limit : 120.00 W
Default Power Limit : 120.00 W
Enforced Power Limit : 120.00 W
Min Power Limit : 10.00 W
Max Power Limit : 120.00 W
Clocks
Graphics : 772 MHz
SM : 772 MHz
Memory : 3004 MHz
Video : 712 MHz
Applications Clocks
Graphics : 772 MHz
Memory : 3005 MHz
Default Applications Clocks
Graphics : 772 MHz
Memory : 3005 MHz
Max Clocks
Graphics : 772 MHz
SM : 772 MHz
Memory : 3005 MHz
Video : 710 MHz
Clock Policy
Auto Boost : On
Auto Boost Default : On
Processes
Process ID : 11024
Type : C
Name : C:\Users\brandon5\AppData\Local\Programs\Python\Python35\python.exe
Used GPU Memory : 7826 MiB