We were forced to limit each run to 1000 output frames and launch many separate containers, because it would always eventually crash no matter how much RAM we gave it.
I strongly suspect the issue is related to the USD instantiation bug described here: https://forums.developer.nvidia.com/t/incorrect-bboxes-in-basicwriter-output
You should talk to @jlafleche about it.
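
In case it helps, here is a rough sketch of the chunking workaround: split the total frame count into batches of at most 1000 and build one container command per batch. The image name `my-sdg-image` and the `--start-frame` / `--num-frames` flags are hypothetical placeholders for whatever your own generation script accepts, not real options of any NVIDIA tool.

```python
from typing import Iterator, List, Tuple

MAX_FRAMES_PER_RUN = 1000  # per-run cap that kept individual runs from crashing for us


def frame_chunks(total_frames: int,
                 chunk_size: int = MAX_FRAMES_PER_RUN) -> Iterator[Tuple[int, int]]:
    """Yield (start_frame, num_frames) pairs that together cover total_frames."""
    for start in range(0, total_frames, chunk_size):
        yield start, min(chunk_size, total_frames - start)


def container_command(start: int, count: int,
                      image: str = "my-sdg-image") -> List[str]:
    """Build a docker command for one batch.

    The image name and the two script flags are hypothetical; replace them
    with whatever your own entrypoint expects.
    """
    return [
        "docker", "run", "--rm", "--gpus", "all", image,
        "--start-frame", str(start),
        "--num-frames", str(count),
    ]


if __name__ == "__main__":
    # Print the commands instead of running them, so they can be reviewed
    # or handed to a job scheduler.
    for start, count in frame_chunks(2500):
        print(" ".join(container_command(start, count)))
```

Each container starts from a fresh process, so whatever accumulates over a long run is discarded between batches.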