4-card RTX 3090 server restarts automatically during deep learning training on Ubuntu 20.04

I have a machine with four RTX 3090 cards.

My configuration is as follows:
CPU: 2 × Intel 4216
Memory: 128 GB
GPU: 4 × RTX 3090

When I run deep learning training, the server restarts automatically.
My log file is attached:
nvidia-bug-report.log.gz (1.1 MB)

A reboot can only be triggered by the mainboard, e.g. by a lack of power. Please check or replace your PSU.
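To put rough numbers on the power question, here is a minimal sketch of a power-budget estimate for the build described above. Every wattage in it is an assumption rather than a measurement from this server: 350 W per RTX 3090 and 100 W per CPU are nominal spec figures, the 150 W allowance for everything else is a guess, and the PSU ratings are purely hypothetical because the thread does not say which PSU is installed.

```python
# Rough power-budget estimate for a 4-GPU, 2-CPU server.
# All wattages are nominal assumptions, not measurements from this machine.

GPU_POWER_W = 350  # assumed board power per RTX 3090 (nominal spec)
CPU_POWER_W = 100  # assumed TDP per CPU (nominal spec)
OTHER_W = 150      # hypothetical allowance for mainboard, RAM, drives, fans


def sustained_load_w(n_gpus: int, n_cpus: int) -> int:
    """Estimate sustained system draw in watts under full training load."""
    return n_gpus * GPU_POWER_W + n_cpus * CPU_POWER_W + OTHER_W


def headroom_w(psu_rating_w: int, load_w: int) -> int:
    """Return the PSU margin in watts; a negative value means a shortfall."""
    return psu_rating_w - load_w


if __name__ == "__main__":
    load = sustained_load_w(n_gpus=4, n_cpus=2)
    print(f"Estimated sustained load: {load} W")
    for psu in (1600, 2000):  # hypothetical PSU ratings
        print(f"PSU {psu} W -> headroom {headroom_w(psu, load)} W")
```

With these assumed figures the sustained estimate alone is 1750 W, and short transient spikes from the GPUs can exceed the nominal value, so a PSU sized close to that number may still trip under training load. As a diagnostic, temporarily capping each card with `nvidia-smi -pl <watts>` and checking whether the restarts stop can help confirm that power is the cause.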
