Accelerating Leaderboard-Topping ASR Models 10x with NVIDIA NeMo

Originally published at: https://developer.nvidia.com/blog/accelerating-leaderboard-topping-asr-models-10x-with-nvidia-nemo/

NVIDIA NeMo has consistently developed automatic speech recognition (ASR) models that set the industry benchmark, particularly those topping the Hugging Face Open ASR Leaderboard. These NeMo ASR models transcribe speech into text and come in a range of architectures designed to optimize both speed and accuracy:

CTC model (nvidia/parakeet-ctc-1.1b): This model features a…
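
For readers unfamiliar with the CTC (connectionist temporal classification) architecture named above, the following is a minimal sketch of greedy CTC decoding, the step that turns per-frame label predictions into text. It is a generic illustration and not NeMo's implementation: the function name `ctc_greedy_decode`, the toy vocabulary, and the choice of index 0 for the blank token are all assumptions made for this example.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank token (hypothetical choice)


def ctc_greedy_decode(frame_ids, vocab, blank=BLANK):
    """Greedy CTC decoding: merge consecutive repeats, then drop blanks."""
    # Step 1: collapse runs of identical labels into a single label.
    collapsed = [label for label, _ in groupby(frame_ids)]
    # Step 2: remove blank tokens and map the remaining ids to characters.
    return "".join(vocab[i] for i in collapsed if i != blank)


# Toy vocabulary for illustration only; index 0 is the blank symbol.
VOCAB = ["_", "c", "a", "t"]

# Per-frame argmax labels, as an acoustic model might emit them.
frames = [1, 1, 0, 2, 2, 2, 0, 0, 3, 3]
print(ctc_greedy_decode(frames, VOCAB))  # prints "cat"
```

The blank token is what lets a CTC model emit a doubled letter: the frames `[1, 0, 1]` decode to "cc", whereas `[1, 1]` collapses to a single "c".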