I have an AGX and want to process 4 video sources in parallel and display them in one window (2x2 grid).
I can run dusty object detection perfectly. But in Python I have to process each source in sequence, which I guess slows the loop down a lot. Would threading help? I tried it, but it was slow (maybe I did something wrong; is there any example?).
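To make the threading question concrete, here is a minimal sketch of the one-thread-per-source pattern. It is only an illustration under stated assumptions: `fake_capture` and `fake_detect` are hypothetical placeholders (they just sleep) and are not calls from any real library, and `SourceWorker` and `run_all` are names made up for this sketch.

```python
import threading
import time

NUM_SOURCES = 4


def fake_capture(source_id, frame_no):
    """Hypothetical stand-in for grabbing one frame from a video source."""
    time.sleep(0.005)  # pretend to wait for the camera/stream
    return (source_id, frame_no)


def fake_detect(frame):
    """Hypothetical stand-in for running detection on one frame."""
    time.sleep(0.005)  # pretend to wait for inference
    source_id, frame_no = frame
    return {"source": source_id, "frame": frame_no}


class SourceWorker(threading.Thread):
    """One thread per source; keeps only the most recent result."""

    def __init__(self, source_id, num_frames):
        super().__init__(daemon=True)
        self.source_id = source_id
        self.num_frames = num_frames
        self._lock = threading.Lock()
        self._latest = None

    def run(self):
        for frame_no in range(self.num_frames):
            result = fake_detect(fake_capture(self.source_id, frame_no))
            with self._lock:  # publish the newest result for the display loop
                self._latest = result

    def get_latest(self):
        with self._lock:
            return self._latest


def run_all(num_frames=5):
    """Start one worker per source, wait for all, return each last result."""
    workers = [SourceWorker(i, num_frames) for i in range(NUM_SOURCES)]
    for w in workers:
        w.start()
    for w in workers:
        w.join()
    return [w.get_latest() for w in workers]
```

In a real loop the main thread would call `get_latest()` on each worker and draw the four results instead of joining. Python threads only overlap while a call releases the GIL (as `time.sleep` does here); whether the actual capture and detection calls do that is an assumption I have not verified, and it may be why my threaded attempt was slow.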
I saw that there is some CUDA synchronization I can do in Python. Is that true?
Plan B: I run 4 processes, each outputting an RTSP stream, and use another program to receive all 4 RTSP streams and display them in a single window. But I am also worried about memory usage and speed.
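For the single-window part of either approach, the placement of the four streams is plain arithmetic. The sketch below shows one way to compute it; the helper names `grid_offsets` and `canvas_size` and the 640x360 tile size are assumptions for illustration only, not part of any library.

```python
def grid_offsets(num_tiles, cols, tile_w, tile_h):
    """Return the top-left (x, y) pixel offset of each tile, row by row."""
    return [((i % cols) * tile_w, (i // cols) * tile_h)
            for i in range(num_tiles)]


def canvas_size(num_tiles, cols, tile_w, tile_h):
    """Return the (width, height) of the window that holds all tiles."""
    rows = (num_tiles + cols - 1) // cols  # ceiling division
    return cols * tile_w, rows * tile_h


# Assumed example: four 640x360 tiles in a 2x2 grid.
offsets = grid_offsets(4, 2, 640, 360)
size = canvas_size(4, 2, 640, 360)
```

With these assumed values the window would be 1280x720, and each stream would be copied (or scaled) into its tile at the corresponding offset.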