Hi,
Lately we are testing different hardware-accelerate video-analytic solutions, and Deepstream seems promising.
We would like to try it out on our MEC device, that has 2 Quadro RTX 8000 and NVIDIA GRID installed in it.
The host OS is Centos7.
Nvidia version info:
[root@198 ~]# nvidia-smi
Fri May 29 13:38:56 2020
±----------------------------------------------------------------------------+
| NVIDIA-SMI 430.46 Driver Version: 430.46 CUDA Version: N/A |
|-------------------------------±---------------------±---------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 Quadro RTX 8000 On | 00000000:37:00.0 Off | Off |
| 33% 40C P8 32W / 260W | 48984MiB / 49151MiB | 0% Default |
±------------------------------±---------------------±---------------------+
| 1 Quadro RTX 8000 On | 00000000:86:00.0 Off | Off |
| 33% 39C P8 36W / 260W | 48984MiB / 49151MiB | 0% Default |
±------------------------------±---------------------±---------------------+
±----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| 0 42351 C+G vgpu 4056MiB |
| 0 42997 C+G vgpu 4056MiB |
| 0 43192 C+G vgpu 4056MiB |
| 0 43255 C+G vgpu 4056MiB |
| 0 43612 C+G vgpu 4056MiB |
| 0 43982 C+G vgpu 4056MiB |
| 0 44081 C+G vgpu 4056MiB |
| 0 44138 C+G vgpu 4056MiB |
| 0 44535 C+G vgpu 4056MiB |
| 0 44790 C+G vgpu 4056MiB |
| 0 44905 C+G vgpu 4056MiB |
| 0 45553 C+G vgpu 4056MiB |
| 1 45039 C+G vgpu 3042MiB |
| 1 45698 C+G vgpu 3042MiB |
| 1 45786 C+G vgpu 3042MiB |
| 1 45934 C+G vgpu 3042MiB |
| 1 46442 C+G vgpu 3042MiB |
| 1 46581 C+G vgpu 3042MiB |
| 1 46712 C+G vgpu 3042MiB |
| 1 46792 C+G vgpu 3042MiB |
| 1 47388 C+G vgpu 3042MiB |
| 1 47505 C+G vgpu 3042MiB |
| 1 47693 C+G vgpu 3042MiB |
| 1 47743 C+G vgpu 3042MiB |
| 1 48318 C+G vgpu 3042MiB |
| 1 48461 C+G vgpu 3042MiB |
| 1 48652 C+G vgpu 3042MiB |
| 1 48727 C+G vgpu 3042MiB |
±----------------------------------------------------------------------------+
[root@198 ~]# modinfo nvidia
filename: /lib/modules/3.10.0-1062.9.1.el7.x86_64/weak-updates/nvidia/nvidia.ko
alias: char-major-195-*
version: 430.46
supported: external
license: NVIDIA
retpoline: Y
rhelversion: 7.7
srcversion: 922226EAFE970320108DB9A
alias: pci:v000010DEd00000E00svsdbc04sc80i00*
alias: pci:v000010DEdsvsdbc03sc02i00
alias: pci:v000010DEdsvsdbc03sc00i00
depends: ipmi_msghandler
vermagic: 3.10.0-1057.el7.x86_64 SMP mod_unload modversions
parm: NvSwitchRegDwords:NvSwitch regkey (charp)
parm: NVreg_Mobile:int
parm: NVreg_ResmanDebugLevel:int
parm: NVreg_RmLogonRC:int
parm: NVreg_ModifyDeviceFiles:int
parm: NVreg_DeviceFileUID:int
parm: NVreg_DeviceFileGID:int
parm: NVreg_DeviceFileMode:int
parm: NVreg_InitializeSystemMemoryAllocations:int
parm: NVreg_UsePageAttributeTable:int
parm: NVreg_MapRegistersEarly:int
parm: NVreg_RegisterForACPIEvents:int
parm: NVreg_EnablePCIeGen3:int
parm: NVreg_EnableMSI:int
parm: NVreg_TCEBypassMode:int
parm: NVreg_EnableStreamMemOPs:int
parm: NVreg_EnableBacklightHandler:int
parm: NVreg_RestrictProfilingToAdminUsers:int
parm: NVreg_PreserveVideoMemoryAllocations:int
parm: NVreg_DynamicPowerManagement:int
parm: NVreg_EnableUserNUMAManagement:int
parm: NVreg_MemoryPoolSize:int
parm: NVreg_KMallocHeapMaxSize:int
parm: NVreg_VMallocHeapMaxSize:int
parm: NVreg_IgnoreMMIOCheck:int
parm: NVreg_NvLinkDisable:int
parm: NVreg_RegistryDwords:charp
parm: NVreg_RegistryDwordsPerDevice:charp
parm: NVreg_RmMsg:charp
parm: NVreg_GpuBlacklist:charp
parm: NVreg_TemporaryFilePath:charp
parm: NVreg_AssignGpus:charp
As we are not planning to update our nvidia driver for now, and Deepstream is not supported on Centos yet, we tried out dockerized version: nvcr.io/nvidia/deepstream:4.0.2-19.12-devel.
We start it with the following command:
[centos@hp-gpu-node1 ~]$ docker run --gpus all -it --rm -v /tmp/.X11-unix:/tmp/.X11-unix --env=“DISPLAY” --net=host -e DISPLAY=$DISPLAY -w /opt/nvidia/deepstream/deepstream-4.0 --volume=“$HOME/.Xauthority:/root/.Xauthority:rw” nvcr.io/nvidia/deepstream:4.0.2-19.12-devel
And try to run the deepstream-test1-app, but get following error:
root@hp-gpu-node1:/opt/nvidia/deepstream/deepstream-4.0# cd ~/deepstream_sdk_v4.0.2_x86_64/sources/apps/sample_apps/deepstream-test1/
root@hp-gpu-node1:~/deepstream_sdk_v4.0.2_x86_64/sources/apps/sample_apps/deepstream-test1# deepstream-test1-app …/…/…/…/samples/streams/sample_720p.h264
Now playing: …/…/…/…/samples/streams/sample_720p.h264
libEGL warning: DRI3: failed to query the version
libEGL warning: DRI2: failed to authenticate
Creating LL OSD context new
0:00:08.188922891 10 0x557358461430 INFO nvinfer gstnvinfer.cpp:519:gst_nvinfer_logger: NvDsInferContext[UID 1]:initialize(): Trying to create engine from model files
0:00:18.115738173 10 0x557358461430 INFO nvinfer gstnvinfer.cpp:519:gst_nvinfer_logger: NvDsInferContext[UID 1]:generateTRTModel(): Storing the serialized cuda engine to file at /root/deepstream_sdk_v4.0.2_x86_64/samples/models/Primary_Detector/resnet10.caffemodel_b1_int8.engine
Running…
Cuda failure: status=801
Could you please help what could be the problem?