What happens for load instructions ?

Skybuck · July 22, 2011, 2:09pm

The guide doesn’t seem to be very clear what happens during a load instruction.

There are different solutions thinkable:

The thread stalls, the entire warp stalls, the schedular tries to find another warp to execute, the other warp stalls as well, until all warps are stalled and out of warp resources.
The thread tries to continue with executing other instructions which do not depend on the load, until it hits instructions which depend on the load, it stalls, and everything else stalls like in 1.
The thread stalls and is switched with another thread from the block but warp continues. (Doesn’t seem to be the case).

I am starting to suspect it’s case 1 this would mean it’s impossible to hide the latency inside a single thread by trying to execute other instructions in the same thread while the load happens ?!?

So the claim of “latency hiding” seems exagerated/inflated.

It seems only other warps could be run but those also stall real fast, and then everything is stalled ?!

The guide should be more clear on this.

tera · July 22, 2011, 2:51pm

Depending on the decision of the scheduler it’s either 1 or 2. 3 is’t possible since the minimal scheduling unit is a warp, not a thread.

Skybuck · July 22, 2011, 4:26pm

Are you unsure how the schedular works ?

Or do you mean the schedular can make different decisions ? If the latter then is there a way to influence the decision making ?

tera · July 22, 2011, 4:39pm

The scheduler decides which of the runnable warps it picks. In this decision it has to take into account several undocumented factors like banking of registers.

I’m not aware of any options to influence the scheduler’s decisions.

Topic		Replies	Views
Basic question about hiding latency CUDA Programming and Performance	6	2222	July 9, 2014
Latency Hiding Question CUDA Programming and Performance	2	1712	May 13, 2011
Trouble in understanding this concept CUDA Programming and Performance	2	550	September 17, 2018
Understanding Data Fetching Mechanics During a Load Instruction in CUDA Container: CUDA	0	302	April 19, 2024
Things related to stall reasons... or not so related CUDA Programming and Performance	6	2131	April 14, 2017
questions about warp scheduling CUDA Programming and Performance	5	1438	December 5, 2016
Questin regarding latency CUDA Programming and Performance	6	4352	August 26, 2010
How to understand the "hide latency" CUDA Programming and Performance	13	4501	August 8, 2024
Thread and Instruction Scheduling CUDA Programming and Performance	3	3399	August 17, 2007
Dual warp scheduler...quick question... CUDA Programming and Performance	0	1097	July 23, 2010

What happens for load instructions ?

Related topics