System Reboot when running TensorRT on 536.40 driver

nathan.meyer · July 1, 2023, 1:31pm

Description

I am running object detection on my GPU inside a container. As soon as object detection starts, my system hard reboots: there is no bluescreen and no warning in any system log.

After rolling back to driver version 528.49, the issue goes away and object detection runs normally. I’m not yet sure which driver release between 528.49 and 536.40 introduced the problem.
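
For anyone trying to narrow down the range, here is a minimal Python sketch that reads the installed driver version and compares it with the two versions reported above. It assumes `nvidia-smi` is on the PATH. The helper names (`parse_driver_version`, `classify`, `installed_driver_version`) are made up for illustration and are not part of any NVIDIA tool.

```python
import subprocess

# The two data points from this report; nothing in between has been tested.
KNOWN_GOOD = (528, 49)  # object detection runs without issue
KNOWN_BAD = (536, 40)   # system hard reboots


def parse_driver_version(text):
    """Turn a version string such as '536.40' into the tuple (536, 40)."""
    major, minor = text.strip().split(".")[:2]
    return int(major), int(minor)


def classify(version):
    """Place a (major, minor) driver version relative to the reported range."""
    if version == KNOWN_GOOD:
        return "known good"
    if version == KNOWN_BAD:
        return "known bad"
    if KNOWN_GOOD < version < KNOWN_BAD:
        return "between (untested)"
    return "outside the reported range"


def installed_driver_version():
    """Ask nvidia-smi for the driver version (assumes nvidia-smi is installed)."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        capture_output=True, text=True, check=True,
    )
    # One line per GPU; the driver version is the same for all of them.
    return parse_driver_version(result.stdout.splitlines()[0])


if __name__ == "__main__":
    version = installed_driver_version()
    print("%d.%02d: %s" % (version[0], version[1], classify(version)))
```

Recording the result for each intermediate driver you install would show where between 528.49 and 536.40 the reboot first appears.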

Environment

TensorRT Version: 8.4.1.5 - 8.6.1
GPU Type: GTX 1080 Ti
Nvidia Driver Version: 536.40
CUDA Version: 11.7 - 12.1
CUDNN Version: 8.7 - 8.9
Operating System + Version: Windows 11 + WSL2 (Ubuntu 20.04)
Python Version (if applicable): 3.9
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): Package frigate · GitHub

Relevant Files

The issue occurs with various YOLO models generated from GitHub - yeahme49/tensorrt_demos: TensorRT MODNet, YOLOv4, YOLOv3, SSD, MTCNN, and GoogLeNet.
The model is automatically downloaded and converted when the above container runs.

docker-compose.yml (640 Bytes)
config.yml (816 Bytes)

Steps To Reproduce

Create test folder and copy docker-compose.yml inside
Create folder named “config” in test folder
Copy config.yml into config folder
Edit config.yml with the path to an RTSP stream (or follow the Frigate documentation to pass in a video file), and set the detect height and width to match the video frame size.
From test folder, run “docker compose up”
The system reboots; no traceback or errors are found.
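
The folder setup in the steps above can be sketched in Python as follows. This only reproduces the layout described (a test folder containing docker-compose.yml and a config subfolder containing config.yml). The function names are hypothetical, and editing config.yml for your stream is still a manual step. `run_compose` is kept separate because, on an affected driver, it is expected to trigger the reboot.

```python
import shutil
import subprocess
from pathlib import Path


def prepare_test_folder(compose_src, config_src, test_dir):
    """Create the test folder layout and copy both attached files into it."""
    test_dir = Path(test_dir)
    config_dir = test_dir / "config"
    # Creates the test folder and its "config" subfolder in one call.
    config_dir.mkdir(parents=True, exist_ok=True)
    shutil.copy(compose_src, test_dir / "docker-compose.yml")
    shutil.copy(config_src, config_dir / "config.yml")
    return test_dir


def run_compose(test_dir):
    """Run "docker compose up" from the test folder.

    Warning: on the affected driver this is the step that hard reboots
    the machine, so save your work first.
    """
    subprocess.run(["docker", "compose", "up"], cwd=test_dir, check=True)
```

Call `prepare_test_folder(...)` with the paths of the two downloaded attachments, edit config/config.yml, and only then call `run_compose(...)`.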

spolisetty · July 13, 2023, 12:42pm

Hi,

We recommend installing TensorRT from https://developer.nvidia.com/nvidia-tensorrt-8x-download or TensorRT | NVIDIA NGC and trying again.

Thank you.
