As you know one can use texture for flating point memory index accessing. This is (at least in my experience) faster than interpolating the data oneself as it is “hardware accelerated” (according to all docs).
But what is going on in that hardware acceleration? Does the L2 cache have some sort of software that interpolates? Is it som sort of Application-specific integrated circuit (ASIC) that does the job? is that happening before or after the L2 cache loads from global texture memory?