SVD algorithm for complex matrices of size (32*32)

Hi all!
I’m wondering whether a GPU can compute the SVDs of 120 complex matrices of size (32*32) in less than 1 ms. I’ve tried, but it takes too much time!