This is really just informational and I hope I am posting into the correct area.
cudaMalloc fails in a number of the samples if the hardware is an 8800GTS with 320. BlackScholes is an example of the cudaMalloc failure. Perhaps it is stipulated in the programming guide but I was unable to find this anywhere in the docs or on the Forums. Anyway it is pretty straightforward. OPT_SZ is specified as 20000000 and the malloc id for sizeof(float) * OPT_SZ. I believe BlackScholes has at least 5 buffers of this size and it does fail.
This was driving me crazy as I am new to cuda and GPGPU and I have a Dell XPS 600 with a 7800GTX and 8800GTS and I was assuming I was making a mistake with the drivers or the mixed mode cards were in someway not supported by cuda or PTX. What made it worse was that some samples ran fine! The good news is the configuration works great if OPT_SZ is set small enough on those applicable samples.
This is really just picky as the 8800 320 is certainly not the target for cuda but I hope this helps some other newbie from too much hairpulling (mine is thin enough).