Trying to get an educated guess how long it should take to process a 4K (3840x2160) RGB image using a cuda kernel that uses a 3D texture that represents a 3D lookup table. Assume one thread per pixel for processing and the maximum allowed threads running in parallel. Reminder this is for the Jetson Nano.
Thanks.