I’m trying to understand ClearSightNet’s output format. From what I saw in the sample video, the network outputs an RGB image for each input frame. Can you let me know the meaning of each channel?
From what I see, it seems like the the network outputs a visibility level per pixel and for the purpose of visualisation, red, green, blue pixel corresponds to heavily, moderately and lightly occluded (black means fully-visible). Or each channel is representing different class of occlusion/visibility (red is occlusion, green is visibility, etc…)?
Thanks a lot