Any recognized file produces text like “individual case return sc case transform them transform return sc return sc return case return case sc case sc does sc individual return case return still scie still transformie transform return case w”. I tried AN4 dataset recognition, but it didn’t help either, the recognized text was about the same. The only thing is that I downloaded the dataset from another source and converted from sph format to wav 16khz using audacity.
I also tried the Russian model, the recognized text is always different from what is pronounced in the audio file.
using the checkpoint you sent me, the result was “university was university one university one” which is completely different from what is pronounced in the file.
I ran all the commands as per the notepad via console, all folders are mounted for tao docker.
I just completed all the steps to prepare AH4 already using the official nvidia notebook, Speech to Text Citrinet Notebook | NVIDIA NGC the result of recognizing files in the notepad using the “ASR Inference” cell is identical results of file recognition in the console - a set of random words. All I did was download the model and follow the instructions on the site. I think that there may be some mistake on your part, perhaps you updated something recently.
Indeed, using this checkpoint I was able to get great results on different audio files even outside the AH4 dataset. But why are the checkpoints for the Russian and English versions available at RIVA Citrinet ASR Russian | NVIDIA NGC and RIVA Citrinet ASR English | NVIDIA NGC not recognized correctly? I also want to check the quality of models for other languages.
Still checking. Not sure if there is something mismatching.
Could you try to run speech-to-text instead of speech-to-text-citrinet for these two models you mentioned?
For both models, when running speech_to_text instead of speech_to_text_citrinet I get the error:
FileNotFoundError: [Errno 2] No such file or directory: ‘/tmp/tmpjatcuk3h/model_weights.ckpt’
There is no update from you for a period, assuming this is not an issue anymore.
Hence we are closing this topic. If need further support, please open a new one.
Thanks