It does have a chunk of code, and I am trying to implement it, but at one point the slides use:
comp16(b, &ashare[k][0],c)
and I have no idea what it does. I tried googling it, but with no luck. Can anyone shed some light on it? And if anyone has implemented this method for matrix multiplication, is it actually faster?
I am struggling big time, so I hope you can help me out a bit.