I am looking for a fast CUDA implementation of the N-Queens problem. I would like to compare it with my own solution.
I found this, but I can't find the full source code.

The implementation must run on an 8800GTX or higher and be faster than a highly optimized CPU version running on one core.

A (working) link to the zip file was (re)posted at the end of that same thread.