(Since unified memory usage is different between Pascal and earlier models,
and it has bug when running on Windows,
I am testing without unified memory.)
For just running program for image calculation (calculate disparity, etc),
GTX1060 runs slower than GTX960,
and it is even slower than notebook GPU (slower than 940M).
(do not include transmission time, just the kernel running time)
Is there something (etc: warp usage?) should be changed for running on Pascal?
I have checked some documentation pages, but cannot find useful information.