NeMo-trained model not producing transcripts when deployed on Jarvis (both offline and streaming)

Please provide the following information when requesting support.

Hardware - GPU (V100)
Hardware - CPU
Operating System :
Riva Version
TLT Version (if relevant)
How to reproduce the issue? (This is for errors. Please share the command and the detailed log here.)

  1. Trained the ASR Jasper model from pre-trained weights using NeMo 1.1.

  2. Converted the .nemo model to an .ejrvs model using nemo2jarvis:
    nemo2jarvis --out=./model.ejrvs ./model.nemo

  3. Deployed the converted model with Jarvis 1.0.0-b.2.

Offline build: jarvis-build speech_recognition ./custommodel/model/5821/modeloffline.jmir ./custommodel/model/5821/5821/model.ejrvs --offline

Streaming build: jarvis-build speech_recognition ./custommodel/model/5821/model.jmir ./custommodel/model/5821/5821/model.ejrvs

Model deploy: jarvis-deploy ./custommodel/model/5821/modeloffline.jmir ./custommodel/model/offline/

  4. When an audio file is used to check inference (using the speech_to_text example), no transcript is returned in the results. The same setup works fine with a pre-trained .ejrvs model from NGC.

Hi @saurabh.sharma94
Could you please share the script and model file to reproduce the issue so we can help better?

Thanks

Please find Models in following link:

https://drive.google.com/drive/folders/1bcNJRuOerYusrP-vMKMLMcQTA0DdxXPv?usp=sharing

Use the NeMo example script from GitHub to train the model with the following parameters: ./examples/asr/speech_to_text.py model.train_ds.manifest_filepath=/workspace/saurabh/asr/code/valid_with_corrected_labels_manifest_part_0e0f11.json model.validation_ds.manifest_filepath=/workspace/saurabh/asr/code/valid_with_corrected_labels_manifest_part_0e0f11.json trainer.max_epochs=10 trainer.gpus=-1 +init_from_pretrained_model=stt_en_jasper10x5dr --config-path=/workspace/saurabh/asr/code/examples/asr/conf/jasper/ --config-name=jasper_10x5dr
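Since the training command points at manifest files, it may be worth sanity-checking them first: empty `text` fields or characters outside the model's label set are a common cause of a model that trains without errors but emits empty transcripts. Below is a minimal sketch of such a check in plain Python; the field names follow the standard NeMo JSON-lines manifest layout, but the `check_manifest` helper and its arguments are hypothetical, not part of NeMo.

```python
import json

def check_manifest(path, vocabulary=None):
    """Sanity-check a NeMo JSON-lines manifest: each line must be a JSON
    object with audio_filepath, duration, and a non-empty text field.
    Returns a list of (line_number, problem) tuples."""
    problems = []
    with open(path) as f:
        for i, line in enumerate(f, 1):
            entry = json.loads(line)
            for key in ("audio_filepath", "duration", "text"):
                if key not in entry:
                    problems.append((i, f"missing {key}"))
            text = entry.get("text", "")
            if not text.strip():
                problems.append((i, "empty transcript"))
            elif vocabulary is not None:
                # Characters the model's label set cannot represent
                bad = set(text) - set(vocabulary) - {" "}
                if bad:
                    problems.append((i, f"chars outside vocabulary: {sorted(bad)}"))
    return problems
```

If this reports empty transcripts or out-of-vocabulary characters for many lines, the trained model can converge to predicting blanks, which would match the symptom described above.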

Hi @saurabh.sharma94
Sorry for the delayed response.
It seems you are using the old Jarvis 1.0.0-b.2 release; could you please try the latest Riva (renamed from Jarvis) release?
Below are updated steps for custom model deployment in Riva.

https://docs.nvidia.com/deeplearning/riva/user-guide/docs/custom-model-deployment.html#
https://docs.nvidia.com/deeplearning/riva/user-guide/docs/service-asr.html#streaming-offline-configuration

For use cases where supporting additional concurrent audio streams matters more than latency, run:

riva-build speech_recognition \
    /servicemaker-dev/<rmir_filename>:<encryption_key> \
    /servicemaker-dev/<riva_filename>:<encryption_key> \
    --name=<pipeline_name> \
    --decoder_type=greedy \
    --chunk_size=0.8 \
    --padding_size=0.8

Thanks

Hi @saurabh.sharma94 ,
We tried reproducing the issue, and it appears to lie with the NeMo model itself: we loaded your model with the NeMo package and ran inference on 3-4 audio files, but got empty responses.
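For context on why a CTC model like Jasper can return empty strings rather than failing: under greedy decoding, an empty transcript means the blank token had the highest score at every timestep, which is typical of an acoustic model that did not converge or whose vocabulary mismatches the training labels. The sketch below illustrates this with plain Python; the vocabulary and logit values are made up for illustration, not taken from the model in question.

```python
def ctc_greedy_decode(logits, vocabulary, blank_id):
    """Greedy CTC decoding: take the argmax at each timestep, collapse
    consecutive repeats, then drop blanks. If blank wins every frame,
    the decoded string is empty."""
    best = [max(range(len(frame)), key=frame.__getitem__) for frame in logits]
    decoded, prev = [], None
    for idx in best:
        if idx != prev and idx != blank_id:
            decoded.append(vocabulary[idx])
        prev = idx
    return "".join(decoded)

vocab = ["a", "b", "c", "_"]  # "_" stands in for the CTC blank token
BLANK = 3

# Healthy model: character scores dominate some frames -> non-empty text
good = [[0.9, 0.0, 0.0, 0.1],
        [0.9, 0.0, 0.0, 0.1],
        [0.1, 0.0, 0.0, 0.9],
        [0.0, 0.8, 0.0, 0.2]]
# Broken model: blank dominates every frame -> empty transcript
bad = [[0.1, 0.0, 0.0, 0.9],
       [0.0, 0.1, 0.0, 0.9],
       [0.0, 0.0, 0.1, 0.9]]

print(ctc_greedy_decode(good, vocab, BLANK))  # -> "ab"
print(ctc_greedy_decode(bad, vocab, BLANK))   # -> "" (empty transcript)
```

So an empty result from both NeMo and Riva is consistent with a training-side problem (labels, vocabulary, or convergence) rather than a deployment-side one.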

I have retrained the model in new pods using the NeMo 1.1 image and deployed it with Riva 1.5, but I am still facing the same issue. Please find the updated model:
https://drive.google.com/drive/folders/1UNLhumBWASkfZ_t95kX62W__5n6l1hjB?usp=sharing

Please suggest a solution, as we are unable to move forward with ASR without it. As per my understanding, the Riva model is similar to an ONNX model and should be platform independent, so there should be no version issue (as suggested earlier). Also, if possible, please train a model on your end and check whether the same issue persists when training the ASR model.