Hi, I have a brand new AGX Orin 64GB. I have followed the instructions here, but when I generate TTS within the docker container after getting Riva started successfully (no errors), I get the following intermittent error, seemingly randomly:
Error: Triton model failed during inference. Error message: Streaming timed out
The first Riva TTS call after riva_start.sh results in longer latency, and can throw a timeout error on some GPUs. Subsequent calls will exhibit normal latency.