Cuda Profiler Issues

I have a computer with 3 cuda devices (GTX 295 + 8800GTX), and I want to profile multiple GPU cuda code. The problem is that the profiler only shows usage of the 2nd and 3rd devices - the 1st device invariably shows no data, although it is clear that it is actually running kernals as well.

It also returns a empty column/header error at the completion of the profiling run, which I assume is correlated to the lack of data for the 1st device.

Any ideas what’s going on here?

I have 2 CUDA devices - TESLA C1060 and GeForce 9400. I can see both in the profiler but all profiler counters are 0 always. When you profile - do you obtain non-zero counter values?
I work under 64bit VISTA.