I'm seeing a constant overhead of about 0.013 seconds on every call to clCreateKernel with the Linux NVIDIA implementation on my Tesla C1060 card. Why is this call so expensive?
My MacBook Pro (also with an NVIDIA card) sees only about 0.00001 seconds for the same operation. Why is the Linux NVIDIA driver so much slower in comparison? Is this something that will go away as the driver matures, or do I need to cache kernel objects myself to avoid the performance hit?
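
For reference, the kind of caching I have in mind is roughly the sketch below: look a kernel up by name and only call the creator on a miss. This is a minimal illustration, not tested against either driver. The names kernel_cache, kernel_cache_get and demo_create are made up for this post, and the callback is a stand-in so the snippet builds without the OpenCL headers; in real code it would wrap clCreateKernel(program, name, &err) and return the cl_kernel.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define KERNEL_CACHE_CAPACITY 16

/* Hypothetical creator callback. In real code this would wrap
 * clCreateKernel(program, name, &err) and return the cl_kernel handle. */
typedef void *(*kernel_create_fn)(const char *name, void *user_data);

typedef struct {
    const char *name;   /* not copied: the caller must keep the string alive */
    void *kernel;       /* opaque handle returned by the creator */
} kernel_cache_entry;

typedef struct {
    kernel_cache_entry entries[KERNEL_CACHE_CAPACITY];
    size_t count;
    kernel_create_fn create;
    void *user_data;
} kernel_cache;

void kernel_cache_init(kernel_cache *cache, kernel_create_fn create,
                       void *user_data)
{
    assert(cache != NULL && create != NULL);
    cache->count = 0;
    cache->create = create;
    cache->user_data = user_data;
}

/* Returns the cached handle for `name`, creating it on first use.
 * Returns NULL if the creator fails or the cache is full. */
void *kernel_cache_get(kernel_cache *cache, const char *name)
{
    assert(cache != NULL && name != NULL);

    /* Linear scan is fine for a handful of kernels. */
    for (size_t i = 0; i < cache->count; ++i) {
        if (strcmp(cache->entries[i].name, name) == 0)
            return cache->entries[i].kernel;
    }

    if (cache->count == KERNEL_CACHE_CAPACITY)
        return NULL;

    /* Cache miss: pay the creation cost once. */
    void *kernel = cache->create(name, cache->user_data);
    if (kernel == NULL)
        return NULL;

    cache->entries[cache->count].name = name;
    cache->entries[cache->count].kernel = kernel;
    cache->count++;
    return kernel;
}

/* Stand-in creator for illustration only: counts how often it is called
 * (user_data points to an int) and returns that pointer as a dummy handle. */
void *demo_create(const char *name, void *user_data)
{
    (void)name;
    int *calls = (int *)user_data;
    (*calls)++;
    return calls;
}
```

With this, repeated lookups of the same kernel name would hit the creator only once, and the cached handles would be released with clReleaseKernel at shutdown. One catch I'm aware of: a cl_kernel carries its argument state, so a cached kernel shared between threads would need its own locking around clSetKernelArg and the enqueue.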