Inline PTX Assembly

Hi folks,

I recently discovered the possibility of inline PTX assembly in CUDA C code. Most of my knowledge I got from http://forums.nvidia.com/index.php?showtopic=151666. But are there more code examples or tutorials available?