The objective is to pan, tilt and zoom within a video stream, keeping the focus on where a specific object (a ball) is.
How can I design a queue in the video stream so that the machine learning operates on whatever goes into the queue, and each frame is held in the queue for 1-3 seconds before it is processed with input from the machine learning?
This would give the machine learning more time (1-3 seconds) to get a proper object detection, as the object might be blocked by other objects for short periods (< 1 s).
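To make the idea concrete, here is a minimal sketch of the kind of delay buffer I have in mind, in plain Python. All names (`DelayBuffer`, `Slot`, `pop_ready`) are hypothetical. The detector is assumed to write its result into `slot.ball` at some point while the frame waits in the queue, and the linear interpolation across missing detections is only a placeholder for a proper tracker or smoother.

```python
from collections import deque
from dataclasses import dataclass
from typing import Any, List, Optional, Tuple

Point = Tuple[float, float]


@dataclass
class Slot:
    t: float                      # capture time in seconds
    frame: Any                    # image data (opaque in this sketch)
    ball: Optional[Point] = None  # detector output, filled in later


class DelayBuffer:
    """Holds frames for `delay` seconds so late detections can still be used."""

    def __init__(self, delay: float = 2.0):
        self.delay = delay
        self.slots: deque = deque()
        # Last real (not interpolated) detection that already left the buffer.
        self.last_known: Optional[Tuple[float, Point]] = None

    def push(self, t: float, frame: Any) -> Slot:
        """Add a new frame; the returned slot is handed to the detector."""
        slot = Slot(t, frame)
        self.slots.append(slot)
        return slot

    def pop_ready(self, now: float) -> List[Tuple[Any, Optional[Point]]]:
        """Return (frame, ball_xy) for every frame older than the delay."""
        out = []
        while self.slots and now - self.slots[0].t >= self.delay:
            slot = self.slots.popleft()
            if slot.ball is None:
                position = self._fill_gap(slot.t)
            else:
                position = slot.ball
                self.last_known = (slot.t, slot.ball)
            out.append((slot.frame, position))
        return out

    def _fill_gap(self, t: float) -> Optional[Point]:
        """Estimate a missing position from the detections around time t."""
        # Look ahead: first frame still in the buffer that has a detection.
        nxt = next((s for s in self.slots if s.ball is not None), None)
        prev = self.last_known
        if prev is not None and nxt is not None:
            (t0, p0), (t1, p1) = prev, (nxt.t, nxt.ball)
            if t1 == t0:
                return p0
            a = (t - t0) / (t1 - t0)
            return (p0[0] + a * (p1[0] - p0[0]),
                    p0[1] + a * (p1[1] - p0[1]))
        if prev is not None:
            return prev[1]
        if nxt is not None:
            return nxt.ball
        return None


if __name__ == "__main__":
    buf = DelayBuffer(delay=1.0)
    a = buf.push(0.0, "frame0")
    b = buf.push(0.5, "frame1")   # ball occluded here, no detection
    c = buf.push(1.0, "frame2")
    a.ball = (0.0, 0.0)
    c.ball = (10.0, 20.0)
    for frame, ball in buf.pop_ready(now=2.0):
        print(frame, ball)
```

The frames that come out of `pop_ready` would then be cropped and scaled around the returned position to get the pan, tilt and zoom. The price is that the output video lags the input by the chosen delay.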
Any advice, demos, papers etc. would be helpful.