I have been wondering whether there are any thread synchronization constructs other than the sync threads function usable in CUDA compute capability 1.0 (since atomic operations are not there)? There don’t seem to be any kind of built in locks, for instance, or anything else for that matter.
Correct, all that is there, is written in the programming guide.