Dynamic parallelism is conceptually a good fit for this nested loop:
    for each octree node o
        for each sample s in the octree node o
So: one kernel launch over all nodes in the octree, and inside that kernel, one child kernel launch per node over all samples in that node. In practice, though, my GTX Titan is struggling badly with this approach. Could someone from NVIDIA get in touch with me, or start a discussion here?
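
For reference, here is a minimal sketch of the launch pattern described above, not the actual code. It assumes a hypothetical flattened octree in which node o owns the samples in the range [sampleOffset[o], sampleOffset[o + 1]); the kernel names, the array names, and the "multiply by two" per-sample work are placeholders invented for illustration.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Child kernel: one thread per sample of a single octree node.
// The per-sample work (doubling the value) is only a placeholder.
__global__ void processSamples(const float* samples, float* out, int first, int count)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < count)
        out[first + i] = 2.0f * samples[first + i];
}

// Parent kernel: one thread per octree node; each thread launches a
// child grid covering that node's samples (dynamic parallelism).
__global__ void processNodes(const int* sampleOffset, const float* samples,
                             float* out, int numNodes)
{
    int o = blockIdx.x * blockDim.x + threadIdx.x;
    if (o >= numNodes) return;

    int first = sampleOffset[o];
    int count = sampleOffset[o + 1] - first;
    if (count == 0) return;  // empty node: nothing to launch

    int threads = 128;
    int blocks = (count + threads - 1) / threads;
    processSamples<<<blocks, threads>>>(samples, out, first, count);
}

// Builds a tiny hypothetical octree, runs both levels, and returns the
// number of wrong results (0 on success, -1 on a CUDA error).
int runOctreeDemo()
{
    const int numNodes = 4;
    const int numSamples = 16;
    const int hOffset[numNodes + 1] = {0, 3, 3, 10, 16};  // node 1 is empty

    float hSamples[numSamples], hOut[numSamples];
    for (int i = 0; i < numSamples; ++i) hSamples[i] = float(i);

    int* dOffset = nullptr;
    float* dSamples = nullptr;
    float* dOut = nullptr;
    cudaMalloc(&dOffset, sizeof(hOffset));
    cudaMalloc(&dSamples, sizeof(hSamples));
    cudaMalloc(&dOut, sizeof(hOut));
    cudaMemcpy(dOffset, hOffset, sizeof(hOffset), cudaMemcpyHostToDevice);
    cudaMemcpy(dSamples, hSamples, sizeof(hSamples), cudaMemcpyHostToDevice);

    processNodes<<<1, 32>>>(dOffset, dSamples, dOut, numNodes);
    // The parent grid only counts as finished once all child grids are done,
    // so this host-side synchronize also waits for the child kernels.
    cudaError_t err = cudaDeviceSynchronize();
    cudaMemcpy(hOut, dOut, sizeof(hOut), cudaMemcpyDeviceToHost);

    cudaFree(dOffset);
    cudaFree(dSamples);
    cudaFree(dOut);
    if (err != cudaSuccess) return -1;

    int mismatches = 0;
    for (int i = 0; i < numSamples; ++i)
        if (hOut[i] != 2.0f * hSamples[i]) ++mismatches;
    return mismatches;
}

int main()
{
    int mismatches = runOctreeDemo();
    printf("mismatches: %d\n", mismatches);
    return mismatches == 0 ? 0 : 1;
}
```

Device-side launches need compute capability 3.5 or higher, relocatable device code, and the device runtime library, so something like `nvcc -arch=sm_35 -rdc=true octree.cu -lcudadevrt` should build it (the file name is arbitrary). With many small nodes, this pattern issues one small child launch per node, and the per-launch overhead may be part of what makes it slow.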