[Windows] Possible driver bug: Fragment shader interlock has almost no effect on RTX GPU driver

hgen264 · August 12, 2019, 8:21am

Hi, I’m encountering an issue where fragment shader interlock appears to have almost no effect with the RTX GPU driver, while the same code with ARB_fragment_shader_interlock enabled works perfectly with GTX GPU drivers and Intel drivers.

The symptom is flickering black texels moving around the bottom right of the screen. That is where I use shader interlock to do limited programmable blending, but it doesn’t seem to function properly with the newest RTX GPU driver, while with the 1050 Ti driver listed below and with Intel drivers it seems to work properly.

Here is a GIF of what happens (RTX 2070 SUPER, driver 435.80):
[url]https://media.discordapp.net/attachments/442344583293304833/610265078788390912/clip3981.gif[/url]

And here is what I expected (running on a GTX 1050 Ti with driver 430.86; a GTX 1070 with driver 435.27 also works):
[url]https://media.discordapp.net/attachments/442344583293304833/610093633030586379/nvidia-interlock-repor_2019-08-11_20-53-59-897.png[/url]

VS solution and executable to reproduce the issue (in the x64 folder):
[url]https://github.com/pent0/nv-rtx-fragment-shader-interlock-bug[/url]

External issue:
[url]https://github.com/Vita3K/Vita3K/issues/569[/url]

I’m not sure whether this is a regression, whether it only affects the RTX driver, or whether it’s our fault. Anyway, I would appreciate any help, thanks!

hgen264 · August 29, 2019, 3:16pm

Are there any updates on this? Are you tracking the issue internally? Thanks.
I think D3D12 ROV is broken on Turing too (maybe related); I will confirm it with another dev later.

hgen264 · August 31, 2019, 2:50am

Hi, driver 419.67 works for us (on a non-SUPER RTX card), in case that helps you trace the issue.

hgen264 · September 4, 2019, 4:26pm

Hi, for future readers: we did not put a memory barrier before each draw call, so a new draw call doesn’t wait for the previous draw call to finish storing to the image, causing a race condition.

Before each draw call, we now issue a barrier like this:
glMemoryBarrier(GL_SHADER_IMAGE_ACCESS_BARRIER_BIT | GL_TEXTURE_FETCH_BARRIER_BIT);
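
As a rough illustration of the fix above, the sketch below issues that barrier before every draw in a multi-pass loop. The `GlCalls` struct, `drawPassesWithBarriers`, and the `drawPass` callback are hypothetical names introduced only for this example and are not taken from the linked repository. The two constants mirror the values in the standard OpenGL headers so the snippet compiles without a GL loader or context. In a real renderer the callbacks would simply forward to `glMemoryBarrier` and the actual draw functions.

```cpp
#include <cassert>

// Values as defined in the standard OpenGL headers (e.g. glcorearb.h).
constexpr unsigned int kTextureFetchBarrierBit      = 0x00000008u; // GL_TEXTURE_FETCH_BARRIER_BIT
constexpr unsigned int kShaderImageAccessBarrierBit = 0x00000020u; // GL_SHADER_IMAGE_ACCESS_BARRIER_BIT

// Hypothetical indirection so the sketch does not need a GL context:
// memoryBarrier would wrap glMemoryBarrier, drawPass would wrap one draw call.
struct GlCalls {
    void (*memoryBarrier)(unsigned int barriers);
    void (*drawPass)(int passIndex);
};

// Issues a memory barrier before every draw so that image stores from the
// previous draw are visible to image accesses and texture fetches in the next.
// Returns the number of barriers issued.
inline int drawPassesWithBarriers(const GlCalls& gl, int passCount) {
    assert(gl.memoryBarrier != nullptr && gl.drawPass != nullptr);
    int barriersIssued = 0;
    for (int pass = 0; pass < passCount; ++pass) {
        gl.memoryBarrier(kShaderImageAccessBarrierBit | kTextureFetchBarrierBit);
        ++barriersIssued;
        gl.drawPass(pass);
    }
    return barriersIssued;
}
```

Issuing the barrier on every draw is the conservative choice described in this post. It is probably only needed between draws where the later one reads or overwrites image data stored by an earlier one, so a renderer could skip it elsewhere if profiling shows the cost matters.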

apesch · September 6, 2019, 6:54pm

Thanks for the update, glad the added barrier sorted out the issue.

For what it’s worth, here are the relevant sections from the 4.5 spec (7.12.2) outlining why this is necessary:

Explicit synchronization is required to ensure that the effects of buffer and texture data stores performed by shaders will be visible to subsequent operations using the same objects and will not overwrite data still to be read by previously requested operations. Without manual synchronization, shader stores for a “new” primitive may complete before processing of an “old” primitive completes. Additionally, stores for an “old” primitive might not be completed before processing of a “new” primitive starts.

And then a few pages down in the glMemoryBarrier guidelines:

Data written to image variables in one rendering pass and read by the shader in a later pass need not use coherent variables or memoryBarrier. Calling MemoryBarrier with the SHADER_IMAGE_ACCESS_BARRIER_BIT set in barriers between passes is necessary.

user25321 · March 8, 2022, 9:16am

Hi, it’s 2022 and I’m still having the same problem, but in a slightly different situation. I’m using instanced rendering, so how do I guarantee a barrier between instances within a single draw command?
Also, my code (without memory barriers) works fine on a GTX 1060 but gives incorrect results on my RTX 2060.
Looking forward to your reply!
