Optimal Loop Blocking (Tiling) Factor


What loop blocking factor (tile size or block size) does pgcc choose when performing loop tiling (loop blocking)? Does pgcc use internal heuristics to decide the optimal block or tile size, or is it hardcoded to a default value (51 in the case of gcc version 4.6, defined in params.def)?

Hi Ajith,

It’s based on the cache size of the targeted processor: the size of the L3 cache, or the size of the L2 cache if no L3 is present.

  • Mat
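
For readers who want to see how a cache size can translate into a tile size, here is a minimal C sketch. It is not pgcc's actual heuristic, which is not described in this thread. It assumes a simple rule of thumb: pick the largest tile edge such that three square tiles of doubles (one each from A, B and C) fit in the given cache. The names tile_edge_for_cache and matmul_tiled are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (an assumption, not pgcc's rule): largest tile edge t
   such that three t x t tiles of double fit in cache_bytes. */
size_t tile_edge_for_cache(size_t cache_bytes)
{
    assert(cache_bytes >= 3 * sizeof(double));
    size_t max_elems = cache_bytes / (3 * sizeof(double));
    size_t t = 1;
    while ((t + 1) * (t + 1) <= max_elems)
        ++t;
    return t;
}

/* C += A * B for n x n row-major matrices, blocked with tile edge t (t >= 1).
   The three outer loops walk over tiles; the three inner loops work inside
   one tile, so the data they touch stays cache-resident. */
void matmul_tiled(size_t n, size_t t,
                  const double *a, const double *b, double *c)
{
    for (size_t ii = 0; ii < n; ii += t) {
        size_t i_end = (ii + t < n) ? ii + t : n;   /* clamp partial tiles */
        for (size_t kk = 0; kk < n; kk += t) {
            size_t k_end = (kk + t < n) ? kk + t : n;
            for (size_t jj = 0; jj < n; jj += t) {
                size_t j_end = (jj + t < n) ? jj + t : n;
                for (size_t i = ii; i < i_end; ++i)
                    for (size_t k = kk; k < k_end; ++k)
                        for (size_t j = jj; j < j_end; ++j)
                            c[i * n + j] += a[i * n + k] * b[k * n + j];
            }
        }
    }
}
```

A caller would pass the L3 size (or the L2 size when there is no L3) to tile_edge_for_cache and hand the result to matmul_tiled as t. In practice the best tile size also depends on associativity, line size and what else shares the cache, so a value derived this way is only a starting point for tuning.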