I had a quick question regarding the performance of cudamalloc opeartion on Win 7 machine and xp machine.
For about 3.5 Kbyte of memory allocation , it takes about 1.4msec on an win7 machine and < than 0.1ms on an xp machine.
Is it a known fact that the cudaMalloc operation is a lot slower on an windows 7 machine when compared to an xp machine? If so what is the reason? and is there any possible work around?
Any inputs is greatly appreciated