can anybody explain warp vote functions

rocksportrocker · January 12, 2009, 5:07pm

Hi, the only information the CUDA Programming Guide gives about warp vote functions

is their signature, eg.

[codebox]int __all(int predicate);[/codebox]

But what is the semantics of this function ? what is predicate ? what happens if

a thread calls __all() ???

Greetings, Uwe

E.D_Riedijk · January 12, 2009, 7:13pm

I believe it can be called like this (for example)

int val = __all(threadIdx.x < 112)

val will be 1 if all threads of the warp this thread is in are < 112

if one of the threads in this warp will have threadIdx 112 or higher, it will return 0

So threads:

0 - 31 : 1

32 - 63 : 1

64 - 95 : 1

96 - 127 : 0

128 - 159 : 0

This can be used to minimize warp divergence as far as I understood.

rocksportrocker · January 13, 2009, 8:50am

Thanks, now I got it.

Uwe.

Sarnath · June 8, 2010, 5:00am

So, Does this mean that “_all” is an execution barrier like __syncthreads() ??

SPWorley · June 8, 2010, 5:27am

No. The voting is warp-wide, and all threads within a warp are (by definition) synced anyway so no barrier is needed.

Sarnath · June 8, 2010, 6:21am

Thank you Steve…!

jjtapiav · June 8, 2010, 6:38pm

While we are on the topic of warp voting functions, is there any information about the latency or the number of instructions that this instruction translates to? (e.g. is it an expensive function or anything like that?)

gpuguy · June 26, 2010, 6:48am

Is there any example of warp vote function in SDK. Basically I would like to know how it can be used to minimize warp divergence…

fji · February 11, 2011, 7:24pm

I guess a warp vote function will lead to a dead lock if it is used in a diverged warp, e.g.

if(threadIdx.x % warpSize == 0) {

if(__any(…)) …

} else {

…

}

Could anyone from Nvidia explain this topic?

fji · February 11, 2011, 7:28pm

Please ignore my previous post.
I found this thread much useful:
http://forums.nvidia.com/index.php?showtopic=162743

Topic		Replies	Views
Vote functions in a warp-divergent branch? Are they allowed? How idle threads are handled? CUDA Programming and Performance	5	19353	September 24, 2010
throughput of warp vote functions? CUDA Programming and Performance	6	7625	March 29, 2010
__ballot(..) and warpSize warpSize will never surpass 32 threads? CUDA Programming and Performance	3	4092	March 13, 2010
WARP Voting function CUDA Programming and Performance	6	6492	March 25, 2010
do warp vote functions cause branching? CUDA Programming and Performance	16	3626	August 11, 2010
Warp Vote Functions CUDA Programming and Performance	1	1225	December 3, 2009
Forced Convergence in Divergent Code Paths CUDA Programming and Performance	1	2699	July 7, 2009
Warp Vote Functions..When are they useful? CUDA Programming and Performance	1	1392	August 20, 2013
Most efficient blockmin function? CUDA Programming and Performance	12	4916	April 6, 2009
How to control warps? CUDA Programming and Performance	2	518	May 14, 2018

can anybody explain warp vote functions

Related topics