Hi All,
I’m working with CUTLASS, and when I compile I see the BAR.SYNC.DEFER_BLOCKING SASS instruction. I don’t really expect to find out what it actually means, but if anyone knows, it would be really nice :)
My main question is: when does nvcc generate this? I always see bar.sync in the PTX code, but in the SASS it sometimes becomes BAR.SYNC.DEFER_BLOCKING and sometimes plain BAR.SYNC. Does anyone know how that switch works?
I’m asking because, depending on how I use CUTLASS, this instruction is either generated or not. It would be great to understand what’s going on.
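For anyone who wants to look at this outside of CUTLASS, a minimal sketch of a repro might look like the following. This is not the CUTLASS code in question: the kernel name `reverse_block`, the helper `run_reverse_block`, the file name `repro.cu` and the `sm_80` target are all placeholders chosen for illustration. It only shows how to put the PTX and SASS for one `__syncthreads()` side by side, and makes no claim about which of the two SASS forms this kernel will produce.

```cuda
#include <cuda_runtime.h>

// To compare the barrier in PTX and SASS (sm_80 is an assumed target):
//   nvcc -arch=sm_80 -ptx   repro.cu -o repro.ptx
//   nvcc -arch=sm_80 -cubin repro.cu -o repro.cubin
//   cuobjdump -sass repro.cubin

constexpr int kThreads = 128;

// Reverses one block-sized array through shared memory.
// The __syncthreads() below is the barrier to look for in the PTX and SASS.
__global__ void reverse_block(int *data) {
    __shared__ int tile[kThreads];
    int t = threadIdx.x;
    tile[t] = data[t];
    __syncthreads();  // all writes to tile must finish before any thread reads
    data[t] = tile[kThreads - 1 - t];
}

// Hypothetical host helper: runs the kernel once and checks the result.
// Returns 0 on success, 1 on a CUDA runtime error, 2 on a wrong result.
int run_reverse_block() {
    int host[kThreads];
    for (int i = 0; i < kThreads; ++i) host[i] = i;

    int *dev = nullptr;
    if (cudaMalloc(&dev, sizeof(host)) != cudaSuccess) return 1;
    if (cudaMemcpy(dev, host, sizeof(host), cudaMemcpyHostToDevice) != cudaSuccess) {
        cudaFree(dev);
        return 1;
    }

    reverse_block<<<1, kThreads>>>(dev);

    // The blocking copy back also surfaces any kernel launch or execution error.
    cudaError_t err = cudaMemcpy(host, dev, sizeof(host), cudaMemcpyDeviceToHost);
    cudaFree(dev);
    if (err != cudaSuccess) return 1;

    for (int i = 0; i < kThreads; ++i) {
        if (host[i] != kThreads - 1 - i) return 2;
    }
    return 0;
}
```

Varying things like the target architecture or the control flow around the barrier in a small kernel like this, and diffing the `cuobjdump -sass` output each time, may be an easier way to narrow down the switch than doing it through a full CUTLASS build.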
Thanks in advance