"VkDeviceLost-4" error message keeps throwing out

jack-jubo · December 19, 2017, 1:15am

OS: Windows 10
Vulkan Version: 1.0.65.0
Nvidia driver version: 388.43
GPU: 750ti

I’m working on a project which is to build a high-performance rendering server based on Vulkan. The rendering server is multi-threaded in order to process rendering requests sent by users. Initially, for each unique model, an exclusive logical device is created and taken care by one rendering thread. Basically, it works fine until we found that there seems to be a limitation on maximum number of devices that can be created. And this limitation varies from device to device (My tested results: 750ti [Windows10 driver] → Max = ~81, on Titan Xp [Linux driver] → Max = ~35). In order to overcome this pitfall, we tried to create only one single logical device which is shared by every 3D model rendering threads. Every model has their own command buffer which is built in multi-threaded fashion and submitted to one graphics queue whenever a render request is received. Locks are heavily used in drawCall, buildSecondBuffer and uploadModelData stages in order to prevent from race conditions. But it doesn’t work this time. Error msg “VkDeviceLost-4” keeps throwing out when multiple rendering requests received at the same time.

According to Vulkan Specification:

A logical device may become lost because of hardware errors, execution timeouts, power management events and/or platform-specific events.

I guess this problem may caused by execution timeout, but I have no idea how this could happen.

Due to patented issue, I cannot show you the source code(~5000 lines). But still, can any of you guys got any idea about this issue? What could potentially cause this problem? Does anyone else have the same issue? Really appreciate it if you can help me! Thanks in advance!

Topic		Replies	Views
vkCreateDevice still failed with VK_ERROR_INITIALIZATION_FAILED when creating large numbers of logic device . Vulkan	1	2962	October 19, 2022
Creating new Device after DeviceLostError not possible? Drivers - Linux, Windows, MacOS nvbugs	0	413	August 17, 2022
Logical device creation maximum limit Vulkan	3	1236	May 30, 2016
Device lost on multi-queue presentation Vulkan	3	2516	April 13, 2016
Vulkan logical device limit Vulkan	2	1077	October 19, 2022
VK_DEVICE_LOST only on RTX devices Vulkan	4	2213	August 17, 2021
Rare crash deep inside vkQueueSubmit Vulkan	2	2363	March 23, 2017
vkEnumeratePhysicalDevices returns 1 VkPhysicalDevice with 3 1080's installed Vulkan	4	2045	August 3, 2017
Vulkan multi-GPU Linux	11	3407	January 22, 2023
Dont found physical device count Vulkan	10	2067	May 15, 2017

"VkDeviceLost-4" error message keeps throwing out

Related topics