Hi,
im trying to initialize Jarvis’ ASR with a pretrained german model and i cant find any documentation on how to do it properly.
Following the Quickstart-Guide, i ran jarvis_init.sh
and jarvis_start.sh
.
Before that i changed the config.sh
file according to my needs, and stumbled upon that:
# JMIR ($jarvis_model_loc/jmir)
# Jarvis uses an intermediate representation (JMIR) for models
# that are ready to deploy but not yet fully optimized for deployment. Pretrained
# versions can be obtained from NGC (by specifying NGC models below) and will be
# downloaded to $jarvis_model_loc/jmir by `jarvis_init.sh`
with emphasis on (by specifying NGC models below).
The ENGLISH Jasper Offline ASR Model can be obtained by callling it in the config.sh
file as follows:
### Jasper Offline w/ CPU decoder
"${jarvis_ngc_org}/${jarvis_ngc_team}/jmir_jarvis_asr_jasper_english_offline:${jarvis_ngc_model_version}"
How do i have to modify this to deploy the german Jarvis Model?
I know that u can get the according .nemo
file from here:
https://ngc.nvidia.com/catalog/models/nvidia:nemo:stt_de_quartznet15x5
but im clueless how to proceed from there.
They mention to run
import nemo.collections.asr as nemo_asr
asr_model = nemo_asr.models.EncDecCTCModel.from_pretrained(model_name="stt_de_quartznet15x5")
but where do i call this?
Do i have to call it when running inference, in the transcripe_file_offline.py
file?
To be clear, Offline ASR does work on my setup when im in the
jarvis_quickstart_v1.2.1-beta/examples
directory and run
python3 transcribe_file_offline.py --audio /path/to/audio
I just want it to use the German Jasper Model and not the English one.
Thanks alot. Any advice is appreciated. Hava a nice weekend