Riva model deployment issue

iamgarimanarang · January 11, 2023, 7:39am

We have fine-tuned an ASR model with “tao speech_to_text_conformer fine-tune”. Further, we want to deploy it using Riva, hence an export was done with “tao speech_to_text_conformer export”. To build this model further, we are trying to do riva-build and deploy.

When we are using riva-deploy, we are getting below error:

when importing initializer: onnx::MatMul_6791
[01/02/2023-14:32:31] [TRT] [E] parsers/onnx/ModelImporter.cpp:745: ERROR: parsers/onnx/ModelImporter.cpp:106 In function parseGraph:
[8] Assertion failed: convertOnnxWeights(initializer, &weights, ctx) && “Failed to import initializer.”
[01/02/2023-14:32:31] [TRT] [E] [network.cpp::addInput::1595] Error Code 3: API Usage Error (Parameter check failed at: optimizer/api/network.cpp::addInput::1595, condition: inName != knownInput->getName()
)
[01/02/2023-14:32:31] [TRT] [E] parsers/onnx/ModelImporter.cpp:745: ERROR: parsers/onnx/ModelImporter.cpp:106 In function parseGraph:
[8] Assertion failed: convertOnnxWeights(initializer, &weights, ctx) && “Failed to import initializer.”
[01/02/2023-14:32:31] [TRT] [E] parsers/onnx/ModelImporter.cpp:745: ERROR: audio_signal:269 In function importInput:
[8] Assertion failed: *tensor && “Failed to add input to the network.”
2023-01-02 14:32:31,252 [INFO] Mixed-precision net: 0 layers, 0 tensors, 0 outputs…
2023-01-02 14:32:31,252 [ERROR] Traceback (most recent call last):
File “/usr/local/lib/python3.8/dist-packages/servicemaker/cli/deploy.py”, line 100, in deploy_from_rmir
generator.serialize_to_disk(
File “/usr/local/lib/python3.8/dist-packages/servicemaker/triton/triton.py”, line 445, in serialize_to_disk
module.serialize_to_disk(repo_dir, rmir, config_only, verbose, overwrite)
File “/usr/local/lib/python3.8/dist-packages/servicemaker/triton/triton.py”, line 311, in serialize_to_disk
self.update_binary(version_dir, rmir, verbose)
File “/usr/local/lib/python3.8/dist-packages/servicemaker/triton/asr.py”, line 159, in update_binary
RivaTRTConfigGenerator.update_binary_from_copied(self, version_dir, rmir, copied, verbose)
File “/usr/local/lib/python3.8/dist-packages/servicemaker/triton/triton.py”, line 738, in update_binary_from_copied
bindings = self.build_trt_engine_from_onnx(model_weights, engine_path=trt_file, verbose=verbose)
File “/usr/local/lib/python3.8/dist-packages/servicemaker/triton/triton.py”, line 646, in build_trt_engine_from_onnx
network = fix_fp16_network(network)
File “/usr/local/lib/python3.8/dist-packages/servicemaker/triton/trt_bindings.py”, line 249, in fix_fp16_network
sys.setrecursionlimit(len(network_definition))
ValueError: recursion limit must be greater or equal than 1

The riva-build and deploy worked fine when we used the pre-trained Nemo model (stt_hi_conformer_ctc) but with our fine-tuned model, there is an error while Riva deploy.

Steps that we performed for model fine-tuning and deployment:

tao speech_to_text_conformer create_tokenizer
tao speech_to_text_conformer finetune
tao speech_to_text_conformer export (export_format=RIVA)
inside the Riva service maker
riva-build speech_recognition
riva-deploy

Kindly, provide some guidance for the issue.

rvinobha · January 11, 2023, 12:59pm

Hi @iamgarimanarang

Thanks for your interest in Riva,

Apologies you are facing issue

Will it be possible to share the model with us via GoogleDrive/OneDrive etc (If Possible)

Also Please share the

complete riva-build command used
complete riva-deploy command used

Thanks

iamgarimanarang · January 12, 2023, 4:31am

Hi @rvinobha ,

Thanks for the response. I’ve added the command in a text file. Please find below the link for the same:

https://drive.google.com/drive/folders/1jpjQX6PZM_4ScZPT-P6vrxZBOsJ_GeZ8?usp=sharing

Thanks and Regards,
Garima Narang

yoav.ellinson · January 16, 2023, 10:31am

Hi,
I have the same issue with a finetuned model. did you manage to find a way to make it work?

Yoav

iamgarimanarang · January 17, 2023, 9:07am

Hi @yoav.ellinson

We haven’t received any response to the above query. But, we were able to build the model with Nemo.
Please refer to this for the Nemo fine-tuning:
docker run --gpus=all -it --rm -v /project/path:/rift --shm-size=32g --net=host --ulimit memlock=-1 --ulimit stack=67108864 nvcr.io/nvidia/nemo:22.09 bash
python examples/asr//script_to_<script_name>.py
–config-path=
–config-name=<name of config without .yaml>)
model.train_ds.manifest_filepath=“”
model.validation_ds.manifest_filepath=“”
trainer.devices=-1
trainer.accelerator=‘gpu’
trainer.max_epochs=50
+init_from_nemo_model=“<path to .nemo model file>”

Reference links:

github.com

NVIDIA/NeMo/blob/main/examples/asr/asr_ctc/speech_to_text_ctc_bpe.py

# Copyright (c) 2020, NVIDIA CORPORATION.  All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

"""
# Preparing the Tokenizer for the dataset
Use the `process_asr_text_tokenizer.py` script under <NEMO_ROOT>/scripts/tokenizers/ in order to prepare the tokenizer.

```sh
python <NEMO_ROOT>/scripts/tokenizers/process_asr_text_tokenizer.py \

This file has been truncated. show original

https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/main/asr/configs.html#fine-tuning-configurations

Thanks
Garima

yoav.ellinson · January 18, 2023, 11:17am

thanks @iamgarimanarang , I’ll try it.

adjohnson · February 14, 2023, 7:41pm

Hello @rvinobha ,
I realize it has been over a month since your last response, but is there any update on this issue? We have the exact same problem on our end. Any update or suggestion is much appreciated.
Thank you,
A. Johnson

mel.adl · August 30, 2023, 1:30pm

Hello @rvinobha,
I encountered the same issue. I have search potential suggestions on more recent posts without success.
Any updates?

Thank you

RenardH · April 4, 2024, 9:05am

Hi @rvinobha,
Exact same problem here. I have seen that it’s quite common that people having this issue during the phrase of deployment.

my commands for the building and deployment are shown as below:

riva-build speech_recognition
/servicemaker-dev/conformer_ctc_de_finetuned.rmir:tlt_encode
/servicemaker-dev/conformer_ctc_de_finetuned.riva:tlt_encode
–name=conformer-de-DE-asr-streaming
–return_separate_utterances=False
–featurizer.use_utterance_norm_params=False
–featurizer.precalc_norm_time_steps=0
–featurizer.precalc_norm_params=False
–ms_per_timestep=40
–endpointing.start_history=200
–nn.fp16_needs_obey_precision_pass
–endpointing.residue_blanks_at_start=-2
–chunk_size=0.16
–left_padding_size=1.92
–right_padding_size=1.92
–decoder_type=flashlight
–decoding_language_model_binary=/data/conformer-de-DE-asr-streaming-ctc-decoder-cpu-streaming/1/de-DE_default_2.0.bin
–decoding_vocab=/data/conformer-de-DE-asr-streaming-ctc-decoder-cpu-streaming/1/de-DE_default_2.0_dict_vocab.txt
–flashlight_decoder.lm_weight=0.7
–flashlight_decoder.word_insertion_score=0.75
–flashlight_decoder.beam_threshold=20.
–language_code=de-DE \

riva-deploy /servicemaker-dev/conformer_ctc_de_finetuned.rmir:tlt_encode /data/

Could you please take a look ? Any helps will be appreciated.

Topic		Replies	Views
Error in riva deployment Riva deployment aborted Riva ubuntu , nemo , riva	3	1116	February 27, 2023
Wrong outputs from our fine-tuned version of speechtotext_english_citrinet_1024.tlt after deploying using riva_init.sh Riva inception	3	786	August 12, 2022
No ASR text output after building riva-build to use en-GB, and the running riva-start Riva	19	1124	October 21, 2022
RIVA error, when deploying official Conformer ASR network Riva riva	10	1966	January 27, 2023
Failed to deploy citrinet nemo to riva Riva riva	0	612	December 3, 2021
Encounter "Unsupported model IR version: 9, max supported IR version: 8" during deploy custom model in riva for TTS Riva onnx , riva	9	3400	January 22, 2024
Unable to load riva model build with --nn.use_trt_fp32 flag Riva riva	3	572	December 16, 2022
RIVA v2.15.0 fails to build NeMo model Riva	0	402	March 30, 2024
Riva waiting for Triton server to load all models...retrying in 1 second Riva riva	2	1014	March 22, 2023
Issue Deploying Fine-Tuned Arabic Conformer Model in NVIDIA Riva: No Transcriptions Returned Riva	0	69	December 1, 2024

Riva model deployment issue

Related topics