I am exploring NVIDIA's DRIVE software, specifically the DriveWorks package. I installed the DRIVE SDK successfully, and all the binaries were created in /usr/local/driveworks/bin. I was also able to run the ./sample_hello_world example, which produced the following output:
Welcome to Driveworks SDK
[13-05-2020 06:16:32] Platform: Detected Generic x86 Platform
[13-05-2020 06:16:32] Initialize DriveWorks SDK v2.0.2081
[13-05-2020 06:16:32] Release build with GNU 4.9.4 from heads/buildbrain-branch-0-g36b127f
[13-05-2020 06:16:32] TimeSource: monotonic epoch time offset is 1589299953526456
[13-05-2020 06:16:33] Platform: number of GPU devices detected 4
[13-05-2020 06:16:33] Platform: currently selected GPU device discrete ID 0
[13-05-2020 06:16:33] SDK: Resources mounted from /usr/local/driveworks-2.0/data/
Context of Driveworks SDK successfully initialized.
Version: 2.0.2081
GPU devices detected: 4
[13-05-2020 06:16:33] Platform: currently selected GPU device discrete ID 0
----------------------------------------------
Device: 0, GeForce GTX 1080
CUDA Driver Version / Runtime Version : 10.2 / 10.0
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory in MBytes:8116.44
Memory Clock rate Khz: 5005000
Memory Bus Width bits: 256
L2 Cache Size: 2097152
Maximum 1D Texture Dimension Size (x): 131072
Maximum 2D Texture Dimension Size (x,y): 131072, 65536
Maximum 3D Texture Dimension Size (x,y,z): 16384, 16384, 16384
Maximum Layered 1D Texture Size, (x): 32768 num: 2048
Maximum Layered 2D Texture Size, (x,y): 32768, 32768 num: 2048
Total amount of constant memory bytes: 65536
Total amount of shared memory per block bytes: 49152
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): 1024,1024,64
Max dimension size of a grid size (x,y,z): 2147483647,65535,65535
Maximum memory pitch bytes: 2147483647
Texture alignment bytes: 512
Concurrent copy and kernel execution: Yes, copy engines num: 2
Run time limit on kernels: Yes
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID: 0, Device PCI Bus ID: 5, Device PCI location ID: 0
Compute Mode: Default (multiple host threads can use ::cudaSetDevice() with device simultaneously)
Concurrent kernels: 1
Concurrent memory: 1
[13-05-2020 06:16:33] Platform: currently selected GPU device discrete ID 1
----------------------------------------------
Device: 1, GeForce GTX 1080
CUDA Driver Version / Runtime Version : 10.2 / 10.0
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory in MBytes:8119.56
Memory Clock rate Khz: 5005000
Memory Bus Width bits: 256
L2 Cache Size: 2097152
Maximum 1D Texture Dimension Size (x): 131072
Maximum 2D Texture Dimension Size (x,y): 131072, 65536
Maximum 3D Texture Dimension Size (x,y,z): 16384, 16384, 16384
Maximum Layered 1D Texture Size, (x): 32768 num: 2048
Maximum Layered 2D Texture Size, (x,y): 32768, 32768 num: 2048
Total amount of constant memory bytes: 65536
Total amount of shared memory per block bytes: 49152
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): 1024,1024,64
Max dimension size of a grid size (x,y,z): 2147483647,65535,65535
Maximum memory pitch bytes: 2147483647
Texture alignment bytes: 512
Concurrent copy and kernel execution: Yes, copy engines num: 2
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID: 0, Device PCI Bus ID: 6, Device PCI location ID: 0
Compute Mode: Default (multiple host threads can use ::cudaSetDevice() with device simultaneously)
Concurrent kernels: 1
Concurrent memory: 1
[13-05-2020 06:16:33] Platform: currently selected GPU device discrete ID 2
----------------------------------------------
Device: 2, GeForce GTX 1080
CUDA Driver Version / Runtime Version : 10.2 / 10.0
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory in MBytes:8119.56
Memory Clock rate Khz: 5005000
Memory Bus Width bits: 256
L2 Cache Size: 2097152
Maximum 1D Texture Dimension Size (x): 131072
Maximum 2D Texture Dimension Size (x,y): 131072, 65536
Maximum 3D Texture Dimension Size (x,y,z): 16384, 16384, 16384
Maximum Layered 1D Texture Size, (x): 32768 num: 2048
Maximum Layered 2D Texture Size, (x,y): 32768, 32768 num: 2048
Total amount of constant memory bytes: 65536
Total amount of shared memory per block bytes: 49152
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): 1024,1024,64
Max dimension size of a grid size (x,y,z): 2147483647,65535,65535
Maximum memory pitch bytes: 2147483647
Texture alignment bytes: 512
Concurrent copy and kernel execution: Yes, copy engines num: 2
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID: 0, Device PCI Bus ID: 9, Device PCI location ID: 0
Compute Mode: Default (multiple host threads can use ::cudaSetDevice() with device simultaneously)
Concurrent kernels: 1
Concurrent memory: 1
[13-05-2020 06:16:33] Platform: currently selected GPU device discrete ID 3
----------------------------------------------
Device: 3, GeForce GTX 1080
CUDA Driver Version / Runtime Version : 10.2 / 10.0
CUDA Capability Major/Minor version number: 6.1
Total amount of global memory in MBytes:8119.56
Memory Clock rate Khz: 5005000
Memory Bus Width bits: 256
L2 Cache Size: 2097152
Maximum 1D Texture Dimension Size (x): 131072
Maximum 2D Texture Dimension Size (x,y): 131072, 65536
Maximum 3D Texture Dimension Size (x,y,z): 16384, 16384, 16384
Maximum Layered 1D Texture Size, (x): 32768 num: 2048
Maximum Layered 2D Texture Size, (x,y): 32768, 32768 num: 2048
Total amount of constant memory bytes: 65536
Total amount of shared memory per block bytes: 49152
Total number of registers available per block: 65536
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): 1024,1024,64
Max dimension size of a grid size (x,y,z): 2147483647,65535,65535
Maximum memory pitch bytes: 2147483647
Texture alignment bytes: 512
Concurrent copy and kernel execution: Yes, copy engines num: 2
Run time limit on kernels: No
Integrated GPU sharing Host Memory: No
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Device PCI Domain ID: 0, Device PCI Bus ID: 10, Device PCI location ID: 0
Compute Mode: Default (multiple host threads can use ::cudaSetDevice() with device simultaneously)
Concurrent kernels: 1
Concurrent memory: 1
[13-05-2020 06:16:33] Driveworks SDK released
Happy autonomous driving!
However, when I try to run the DNN plugin sample, ./sample_dnn_plugin, I get the error "WindowGLFW: Failed create window". Initially I was running CUDA 9.2 and hit this problem. After seeing @SivaRamaKrishnaNV's post, I updated CUDA to 10.2, but I still face the same issue. Any leads would be really helpful. Thanks in advance.