Unable to load riva model build with --nn.use_trt_fp32 flag

tim.xia · December 7, 2022, 3:03am

Following discussion here: Riva providing empty transcriptions for a few audios, but nemo does not for those audios - Deep Learning (Training & Inference) / Riva - NVIDIA Developer Forums

I tried to riva_build a model with fp32. But riva_start.sh will not work, citing
| riva-trt-asr-conformer_en_us-offline-flashlight-am-streaming-offline | 1 | UNAVAILABLE: Unavailable: unable to find ‘/data/models/riva-trt-asr-conformer_en_us-offline-flashlight-am-streaming-offline/1/model.plan’ for model instance 'riva-trt-asr-conformer_en_us-offline-flashlight- |

rvinobha · December 13, 2022, 7:46am

Hi @tim.xia

Thanks for your interest in Riva,

riva-build creates the .rmir
riva-deploy generates the model directories

After that we can deploy the model using riva_server

Was riva-deploy performed ?

Request to kindly share the

Complete riva-build command used
Complete riva-deploy command used
config.sh used for starting the riva-sever

Thanks

tim.xia · December 14, 2022, 4:47am

I usually follow the route of riva_quickstart, with riva_init.sh then riva_start.sh.
It works for most models except when I add the nn.use_trt_fp32 flag. Then there is just no model.plan in the models dir. Is there other settings that are required for this –nn.use_trt_fp32 flag?

rvinobha · December 16, 2022, 6:27am

Hi @tim.xia

My Apologies, if i made wrong understanding,
I will reproduce the issue from my end
Request to kindly share the

Complete riva-build command used
Complete riva-deploy command used
config.sh used for starting the riva-sever

If it is a custom trained conformer model, are we doing steps suggested in the below doc
https://docs.nvidia.com/deeplearning/riva/user-guide/docs/tutorials/asr-python-advanced-finetune-am-conformer-ctc-tao-deployment.html?highlight=custom%20model

If not please let me know

Thanks

Topic		Replies	Views
Docker - Riva fails to launch Model not specified Riva	3	629	January 11, 2023
Unable to deploy riva model trained in with tao 4.0.0 Riva tensorrt , riva , tao	1	538	February 2, 2023
Something wrong with riva quickstart Riva	6	1206	May 28, 2023
Finetuned ASR conformer returns only empty transcripts Riva	13	953	October 20, 2022
Problems when running ./riva_init.sh with custom Quartznet Model Riva	1	750	September 7, 2021
Unable to download RIVA models during riva_init.sh Riva	2	947	October 21, 2022
Riva model deployment issue Riva inception	8	1560	April 4, 2024
Found 0 live models and 0 in-flight non-inference requests Riva riva	4	998	August 4, 2024
No ASR text output after building riva-build to use en-GB, and the running riva-start Riva	19	1088	October 21, 2022
Riva-build does not work Riva	2	262	March 26, 2024

Unable to load riva model build with --nn.use_trt_fp32 flag

Related topics