Any resource tells about instruction cache?

susangao · April 24, 2014, 12:39am

Hi,

Any one can help to recommend readings that tells about instruction cache?

Thanks,
Susan

tera · April 24, 2014, 12:57am

Old, but presumably not too much has changed since:

Demystifying GPU Microarchitecture through Microbenchmarking

susangao · April 24, 2014, 4:40pm

Cool! Thank you!

Susan

susangao · April 25, 2014, 4:46pm

One more to confirm, Are warps blocked at barrier are not considered for instruction fetching? I saw following statement in the paper: “Effect of instruction fetch and memory scheduling on GPU performance” HPArch

“Warps that are blocked at
a barrier, are waiting for loads/stores to complete, or are waiting
for a branch to be resolved are not considered for fetching.”

Thanks,
Susan

tera · April 28, 2014, 11:41pm

Note the statement you are citing from the paper is a description of the simulated architecture, not necessarily of any actual Nvidia card.

I would believe that, due to the pipelined nature of instruction execution, in actual Nvidia cards blocked warps can have a handful of follow-up instructions fetched. This is due to the fact that at the time of instruction fetch it is not even known whether a warp is blocked or not.

Topic		Replies	Views
"Instruction Fetch" in Nsight Performance Analysis CUDA Programming and Performance	8	2518	January 7, 2016
code instruction cache? CUDA Programming and Performance	12	4622	July 31, 2015
How does reducing unrolling or branching code actually reduce instruction fetch? CUDA Programming and Performance	16	2755	December 4, 2016
Instruction cache and instruction fetch stalls CUDA Programming and Performance	2	1899	June 26, 2019
Does the prefetch instruction delay the loading of the ld instruction? CUDA Programming and Performance	5	108	August 9, 2024
How can I tell whether my kernel will thrash the instruction cache? CUDA Programming and Performance	4	630	August 21, 2022
instruction fetch latency CUDA Programming and Performance	1	515	March 4, 2020
Some issues regarding the use of prefetch in the cuda kernel CUDA Programming and Performance cuda , kernel	19	151	June 11, 2025
Three questions about register shuffle and shared memory CUDA Programming and Performance	3	1330	October 12, 2021
threads in a warp still in lock-step? CUDA Programming and Performance	4	3194	January 31, 2019

Any resource tells about instruction cache?

Related topics