Linux Vulkan driver issue with timeline semaphores

zbendefy · April 17, 2024, 7:41pm

Hi!

I’m investigating a curious issue on Linux when using Vulkan timeline semaphores.

At one point in our pipeline, we run the following workflow a large number of times:

for i = 1 to 40:
    CommandRecord();
    Submit(waitForTimeline: i, signal: i+1); // Submits the commands and signals i+1 once they finish
    vkWaitSemaphores(waitForTimeline: i+1, timeout=UINT64_MAX); // Waits on the CPU for the GPU commands to finish

(Multiple threads are doing similar things, but the timeline index is properly synchronized across threads.)

Now, the GPU workload usually runs for ~200 microseconds; however, in some seemingly random cases one iteration takes almost exactly 10 milliseconds.

In Nsight, I can see that the 10 milliseconds consist of about 200 microseconds of GPU work, after which there is absolutely no workload on either the GPU or the CPU.

The interesting thing is that if I replace the vkWaitSemaphores() call with the following, logically equivalent loop, this 10 ms bubble disappears completely and performance is good again:

while (vkWaitSemaphores(waitForTimeline: i+1, timeout=5000) == VK_TIMEOUT) {}

This does a busy-wait poll on the timeline semaphore (the timeout is in nanoseconds, so each call waits at most 5 microseconds), and for some reason it works fine.
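
For anyone who wants to try the same workaround, here is a minimal sketch of the polling loop in C++. It is not our production code: WaitStatus, pollTimeline, waitSliceVk and the maxSlices bound are names made up for this post, and only VkSemaphoreWaitInfo, vkWaitSemaphores, VK_SUCCESS and VK_TIMEOUT come from Vulkan itself (core since 1.2). The Vulkan part is compiled only if the headers are found, so the loop can also be tried with a mock.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>

#if defined(__has_include)
#  if __has_include(<vulkan/vulkan.h>)
#    include <vulkan/vulkan.h>
#  endif
#endif

// Hypothetical status type for this sketch; it mirrors VK_SUCCESS, VK_TIMEOUT and "anything else".
enum class WaitStatus { Signaled, TimedOut, Failed };

// Calls waitSlice(sliceNs) repeatedly until it stops timing out.
// maxSlices is an assumption of this sketch: it bounds the loop so that a
// lost signal cannot spin the CPU forever.
inline WaitStatus pollTimeline(const std::function<WaitStatus(std::uint64_t)>& waitSlice,
                               std::uint64_t sliceNs, std::uint64_t maxSlices)
{
    assert(waitSlice && "waitSlice must be callable");
    for (std::uint64_t i = 0; i < maxSlices; ++i) {
        const WaitStatus status = waitSlice(sliceNs);
        if (status != WaitStatus::TimedOut) {
            return status;  // signaled, or a real error such as device loss
        }
    }
    return WaitStatus::TimedOut;  // gave up after maxSlices short waits
}

#ifdef VK_VERSION_1_2
// One short vkWaitSemaphores call on a single timeline semaphore.
inline WaitStatus waitSliceVk(VkDevice device, VkSemaphore timeline,
                              std::uint64_t value, std::uint64_t sliceNs)
{
    VkSemaphoreWaitInfo info{};  // zero-initializes pNext and flags
    info.sType = VK_STRUCTURE_TYPE_SEMAPHORE_WAIT_INFO;
    info.semaphoreCount = 1;
    info.pSemaphores = &timeline;
    info.pValues = &value;
    const VkResult result = vkWaitSemaphores(device, &info, sliceNs);  // timeout in nanoseconds
    if (result == VK_SUCCESS) return WaitStatus::Signaled;
    if (result == VK_TIMEOUT) return WaitStatus::TimedOut;
    return WaitStatus::Failed;
}
#endif
```

With a real device, the loop above would be called roughly as pollTimeline([&](std::uint64_t ns) { return waitSliceVk(device, timeline, i + 1, ns); }, 5000, UINT64_MAX), which behaves like the one-line while loop except that errors other than VK_TIMEOUT also end the wait.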

The driver I use is 535.171.04, but I have reports of this on the latest drivers as well.
This issue is not present on Windows.

Here is an image from Nsight with the 10 ms of no workload:

zbendefy · May 10, 2024, 7:54am

Bump.

For now this workaround seems to work, but it would be good to rely on the driver’s function instead of the busy wait :)

zoltan.bendefy · June 3, 2024, 12:58pm

Bump.
