Why is CER very high when serving a NeMo model in Riva?

Hardware - GPU L4
Riva Version - 1.18.0
NeMo - 1.23.0
Nemo2Riva - 1.18.0

I fine-tuned the parakeet-tdt_ctc-0.6b-ja model on my custom dataset and got a test/validation character error rate (CER) of ~17% with the CTC decoder.

After getting this CER, I:

  1. Extracted the CTC head from this NeMo model
  2. Converted it to .riva
  3. Built an offline Riva model with a greedy decoder
  4. Served the Riva model
  5. Generated transcripts from this offline Riva-deployed model on the same test/validation dataset

This time I'm getting a CER of ~27%. I have no clue why there is a ~10-point CER jump for the Riva model. Is this expected behavior?
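One thing worth double-checking is that both evaluations compute CER the same way. For Japanese in particular, differences in Unicode normalization (full-width vs. half-width characters) or whitespace handling between the NeMo and Riva transcripts can inflate the measured gap. Below is a minimal normalization-aware CER sketch using only the Python standard library; the NFKC/whitespace choices here are my assumptions, not what NeMo or Riva do internally:

```python
import unicodedata

def normalize(text: str) -> str:
    # NFKC folds full-width/half-width variants to a canonical form;
    # stripping whitespace avoids penalizing tokenizer spacing differences.
    return "".join(unicodedata.normalize("NFKC", text).split())

def cer(ref: str, hyp: str) -> float:
    """Character error rate = Levenshtein distance / reference length."""
    r, h = normalize(ref), normalize(hyp)
    # Standard dynamic-programming edit distance over characters.
    prev = list(range(len(h) + 1))
    for i, rc in enumerate(r, 1):
        cur = [i]
        for j, hc in enumerate(h, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (rc != hc)))   # substitution
        prev = cur
    return prev[-1] / max(len(r), 1)

print(cer("こんにちは", "こんにちわ"))  # one substitution over five chars -> 0.2
```

Running both the NeMo and Riva hypothesis sets through the same function like this rules out metric/normalization mismatch as a cause of the jump.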

For the streaming low-latency configuration, the CER jumped to ~33%. For the Riva build, I used the default configuration from the Riva pipeline configs.

I also tried fine-tuning a Conformer-CTC model and saw similar behavior: a 10-15-point CER jump in the Riva model.

If you need any other information regarding this, please let me know.

Thanks for your help

Can you share the build command you used?

@mayjain Thanks for your reply.

I used this riva-build command for offline STT:

riva-build speech_recognition -f \
    "/servicemaker-dev/$RMIR_MODEL:tlt_encode"\
    "/servicemaker-dev/$RIVA_MODEL:tlt_encode"\
    --offline \
    --name=parakeet-0.6b-unified-ml-cs-es-ja-JP-asr-offline \
    --return_separate_utterances=True \
    --featurizer.use_utterance_norm_params=False \
    --featurizer.precalc_norm_time_steps=0 \
    --featurizer.precalc_norm_params=False \
    --ms_per_timestep=80 \
    --endpointing.residue_blanks_at_start=-16 \
    --nn.fp16_needs_obey_precision_pass \
    --unified_acoustic_model \
    --chunk_size=4.8 \
    --left_padding_size=1.6 \
    --right_padding_size=1.6 \
    --featurizer.max_batch_size=256 \
    --featurizer.max_execution_batch_size=256 \
    --decoder_type=greedy \
    --greedy_decoder.asr_model_delay=-1 \
    --language_code=ja-JP \
    --force
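As a sanity check on these flags (not an official Riva formula; the 10 ms feature hop and 8x FastConformer subsampling are my assumptions for Parakeet-family encoders), the timing parameters work out as follows. A `--ms_per_timestep` that does not match the model's actual subsampling is one thing that can silently degrade accuracy:

```python
# Sanity-check the offline build's timing parameters (values taken from the
# riva-build command above; the 8x subsampling factor is an assumption for
# the FastConformer encoder that Parakeet models are based on).
feature_hop_ms = 10              # typical mel-spectrogram hop (assumed)
subsampling = 8                  # assumed FastConformer encoder subsampling
ms_per_timestep = feature_hop_ms * subsampling
print(ms_per_timestep)           # 80, matching --ms_per_timestep=80

chunk_ms, left_pad_ms, right_pad_ms = 4800, 1600, 1600  # 4.8 s / 1.6 s / 1.6 s
window_ms = left_pad_ms + chunk_ms + right_pad_ms
print(window_ms / 1000)          # 8.0 s of audio per inference window
print(chunk_ms // ms_per_timestep)    # 60 encoder timesteps per chunk
print(window_ms // ms_per_timestep)   # 100 encoder timesteps per window
```

So even in offline mode the model only ever sees 8 s of context per window, which is one place a chunked Riva deployment can diverge from a full-utterance NeMo evaluation.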

@mayjain I also tried a streaming config with:

--name=parakeet-0.6b-unified-ml-cs-es-ja-JP-asr-streaming \
    --return_separate_utterances=False \
    --featurizer.use_utterance_norm_params=False \
    --featurizer.precalc_norm_time_steps=0 \
    --featurizer.precalc_norm_params=False \
    --ms_per_timestep=80 \
    --endpointing.residue_blanks_at_start=-16 \
    --nn.fp16_needs_obey_precision_pass \
    --unified_acoustic_model \
    --chunk_size=0.32 \
    --left_padding_size=3.92 \
    --right_padding_size=3.92 \
    --decoder_type=greedy \
    --greedy_decoder.asr_model_delay=-1 \
    --append_space_to_transcripts=False \
    --language_code=ja-JP \
    --force

With this, I get a CER of ~28%.
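Worth noting: with 3.92 s of padding on each side, the streaming window actually covers slightly more audio than the offline one, so limited acoustic context alone probably does not explain the jump. A quick arithmetic sketch, using the same assumed 80 ms/timestep as the build flag:

```python
# Streaming build timing (values from the riva-build flags above).
ms_per_timestep = 80                     # from --ms_per_timestep=80
chunk_ms, pad_ms = 320, 3920             # --chunk_size=0.32, padding 3.92 s
window_ms = pad_ms + chunk_ms + pad_ms
print(window_ms / 1000)                  # 8.16 s of audio per streaming window
print(chunk_ms // ms_per_timestep)       # 4 new encoder timesteps per chunk
```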

To extract the CTC head, I used this script: NeMo/examples/asr/asr_hybrid_transducer_ctc/helpers/convert_nemo_asr_hybrid_to_ctc.py at main · NVIDIA-NeMo/NeMo · GitHub

Can you try our latest Parakeet CTC NIM containers?
I am not seeing such a spike in CER in the latest containers.

Could you please share the Parakeet NIM model for the Japanese language?

For the Riva build, I used this image:

nvcr.io/nvidia/riva/riva-speech:2.18.0

You can check out the NIM docs on how to deploy a custom NIM.
CONTAINER_ID = parakeet-1-1b-ctc-en-us

Thanks. I will try the NIM deployment and let you know how it goes.

Hi @mayjain, I tried with NIM and I'm getting the same issue:

export NIM_EXPORT_PATH=~/nim_export
export CONTAINER_ID=parakeet-0-6b-ctc-en-us
docker run -it --rm --name=$CONTAINER_ID \
   --runtime=nvidia \
   --gpus '"device=0"' \
   --shm-size=8GB \
   -e NGC_API_KEY \
   -e NIM_TAGS_SELECTOR \
   -e NIM_DISABLE_MODEL_DOWNLOAD=true \
   -e NIM_HTTP_API_PORT=9000 \
   -e NIM_GRPC_API_PORT=50051 \
   -p 9000:9000 \
   -p 50051:50051 \
   -v $NIM_EXPORT_PATH:/opt/nim/export \
   -e NIM_EXPORT_PATH=/opt/nim/export \
   nvcr.io/nim/nvidia/$CONTAINER_ID:3.1.0

Can you try the base Parakeet checkpoint, to understand whether this is a model issue or something wrong with the setup/params?

I fine-tuned this model. Are you asking me to deploy the base model using NIM and evaluate it on the same dataset?

Yes. If the CER is bad even for the base model, then something is wrong with the setup or params.

@mayjain With the base model, I get this result