Reducing time for processing videos

I am working on reducing the time required for processing of high definition medical videos. I was previously working on MATLAB but videos took around 2 hrs for execution. So I switched on to NVIDIA CUDA. Will it be possible using CUDA? Also how should I proceed further? Is there any information or are there any books that I can refer to regarding cutting down the time for executing videos?
Plz. help me…

Waiting desperately… :)