Overview of memory - driver, kernel, DMA, userspace, CUDA, zero-copy

As a partial answer to my original question about an overview of the V4L2 memory architecture and concepts, the link below from the Linux kernel documentation may be a useful starting point. It covers generic V4L2 only, without NVIDIA's refinements.

https://docs.kernel.org/userspace-api/media/v4l/io.html