Hi NVIDIA team,
I am trying to follow the Riva / NeMo tutorial for n-gram LM training and fine-tuning:
“How To Train, Evaluate, and Fine-Tune an n-gram Language Model”
(official Riva tutorial)
My use case is with the model nvidia/parakeet-ctc-0.6b-Vietnamese.
From the tutorial, the LM adaptation workflow appears to require:
- generating an intermediate ARPA file, and
- using ngram_merge.py to interpolate between a base LM and a domain LM.
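To check my understanding of what the interpolation step computes, here is a toy sketch (pure Python, not NeMo code; the probability tables and alpha value are made up) of linear interpolation between a base LM and a domain LM:

```python
def interpolate(p_base, p_domain, alpha=0.5):
    """Linearly interpolate two probability tables:
    p(w) = alpha * p_base(w) + (1 - alpha) * p_domain(w).
    Toy unigram version of what an ARPA-level merge would do per n-gram."""
    vocab = set(p_base) | set(p_domain)
    return {
        w: alpha * p_base.get(w, 0.0) + (1 - alpha) * p_domain.get(w, 0.0)
        for w in vocab
    }

# Hypothetical distributions for illustration only.
base = {"xin": 0.4, "chao": 0.6}
domain = {"xin": 0.1, "loi": 0.9}
merged = interpolate(base, domain, alpha=0.7)
# merged["xin"] = 0.7 * 0.4 + 0.3 * 0.1 = 0.31
```

As I understand it, this is exactly the step that needs the base LM in ARPA form, which is why the missing .arpa file blocks me.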
However, in the Hugging Face repo for nvidia/parakeet-ctc-0.6b-Vietnamese, I can only find:
- a KenLM .bin file
- a lexicon file
I do not see the original .arpa LM.
So I would like to ask:
- Is the original ARPA LM for this model available anywhere?
- If not, what is the recommended way to adapt the provided LM for a specific domain?
- Should we train a new n-gram LM from text and use that directly for decoding?
- Or is there any supported way to recover / export ARPA from the provided KenLM .bin?
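Regarding the third question, for my own understanding I sketched the .arpa text format I would expect to produce. This is only a toy unsmoothed unigram writer (a real domain LM would of course be trained with KenLM's lmplz at a higher order, with smoothing); the corpus below is invented:

```python
import math
from collections import Counter

def build_unigram_arpa(sentences):
    """Emit a minimal unigram ARPA string from a list of sentences.
    Uses raw maximum-likelihood log10 probabilities with no smoothing,
    purely to illustrate the file layout."""
    counts = Counter(w for sent in sentences for w in sent.split())
    total = sum(counts.values())
    lines = ["\\data\\", f"ngram 1={len(counts)}", "", "\\1-grams:"]
    for w, c in sorted(counts.items()):
        lines.append(f"{math.log10(c / total):.6f}\t{w}")
    lines += ["", "\\end\\"]
    return "\n".join(lines)

# Hypothetical two-sentence "domain corpus" for illustration.
arpa_text = build_unigram_arpa(["xin chao", "xin loi"])
```

If training from scratch like this is indeed the recommended route, I would still like to know how (or whether) to combine the result with the released .bin LM.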
My understanding is that without the original ARPA, the official NeMo interpolation workflow cannot be applied directly to the released LM artifact.
Could you please advise on the recommended workflow for domain adaptation in this case?
Thanks a lot.