Building Transcription and Entity Recognition Apps Using NVIDIA Jarvis

jwitsoe · February 25, 2021, 4:00pm

Originally published at: https://developer.nvidia.com/blog/building-transcription-and-entity-recognition-apps-using-jarvis/

In the past several months, many of us have grown accustomed to seeing our doctors over a video call. It’s certainly convenient, but after the call ends, those important pieces of advice from your doctor start to slip away. What was that new medication I needed to take? Were there any side effects to watch…

mk.faraz · April 24, 2021, 5:18pm

Can sample rate for Jarvis ASR be 8000hz?

cparisien · April 27, 2021, 3:42pm

Thanks for your question! Yes, Jarvis supports 8 kHz input for ASR. The server will automatically upsample to 16 kHz. Just make sure you specify the sample rate as 8000 in the recognitionConfig being sent in the request.

Andrey1984 · January 28, 2022, 4:04pm

@jwitsoe
Hello
We were able to run the rive docker client service
However how exactly to replicate the demo in the article https://developer.nvidia.com/blog/building-transcription-and-entity-recognition-apps-using-riva/#entry-content-comments ?

cparisien · January 28, 2022, 4:09pm

Thanks for your interest in the demo. Glad to hear you’ve been able to run the docker client. We have included instructions on running that demo from the docker here: Riva Contact — NVIDIA Riva

Cheers,
Chris

Andrey1984 · February 9, 2022, 5:34am

@cparisien
Hi, thank you for your reply
Could you extend on how to start the callcenter with Medical NER German model, please?
AV

cparisien · February 9, 2022, 3:53pm

Hi @Andrey1984,

We don’t have a medical NER model in German, and unfortunately I’m not immediately aware of a good dataset you could use. A quick search turned up this one, which might be worth a try – I can’t comment on how well it might suit your application, and you’d likely have to do some data preprocessing.

If you do find a suitable dataset, you could use TAO to train or adapt an appropriate model:
https://docs.nvidia.com/tao/tao-toolkit/text/nlp/token_classification.html

Then once you have that model, you can deploy it in Riva:
https://docs.nvidia.com/deeplearning/riva/user-guide/docs/custom-model-deployment.html

I hope that helps!

Chris

jwitsoe · February 23, 2022, 3:54pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Andrey1984 · June 10, 2022, 9:26am

@cparisien
Hi, so what are the existing options for german ASR?
Is there more support by now for the Medical NER ?
How do we integrate GitHub - frankkramer-lab/GERNERMED: GERNERMED is the first open neural NER model for medical entities designed for German data. ?
Could you extend on steps to get it as a proof of concept implemented by using “TAO to train or adapt an appropriate model”, please?
Thanks
AV

cparisien · June 16, 2022, 3:07pm

We do have some updates here:

NVIDIA Riva now supports German ASR. You can try it in your browser. It probably won’t perform well on medical terminology. With the English ASR model that we used in the demo, we trained a custom language model using KenLM and a bunch of medical-themed text pulled from PubMed and medical subreddits. Riva now has a variety of mechanisms to customize the recognition.
Unfortunately we don’t have any German NER out of the box in Riva, let alone medical NER. You would need to train your own model and export it to Riva.
GERNERMED looks like a nice model, though it won’t be possible to integrate it directly into Riva. Riva is intended for transformer-based models trained using TAO or NeMo. When I put up the link to GERNERMED, I wanted to highlight the training data they have shared, which looks like a translation of i2b2 data. You should be able to use this data to train a relevant NER model, using the instructions for TAO above.

I hope that helps!
Chris

svaha · December 21, 2022, 7:59pm

I was looking to replicate this demo (https://developer.nvidia.com/blog/building-transcription-and-entity-recognition-apps-using-riva/) which led me to this forum entry. The solution link in this chat is trying to point to a “callcenter.html” demo. After some searching I realized the demo has been renamed to “Riva Contact” (Riva Contact — NVIDIA Riva). Hope this helps someone.

jwitsoe · December 21, 2022, 10:58pm

@svaha – Link fixed, thank you!

Topic		Replies	Views
Init. Jarvis with german model Riva riva	9	1466	November 4, 2021
Deploying NVIDIA Riva Multilingual ASR with Whisper and Canary Architectures While Selectively Deactivating NMT Technical Blog	1	23	February 20, 2025
Quickly Voice Your Apps with NVIDIA NIM Microservices for Speech and Translation Technical Blog nim	1	26	September 18, 2024
Error creating GRPC channel: Unable to establish connection to server Riva	9	1759	May 11, 2024
JARVIS throwing errors for offline ASR when using own model Riva riva	12	2845	September 4, 2021
Riva_start.sh will not start the server Riva riva	4	1114	August 31, 2023
Not able to run LM fine tuned qurtznet model Riva riva	13	1264	October 8, 2021
Help with custom deploy and perform inference using citrinet-mandarin NGC pre-trained model in Riva Riva riva	6	1122	October 12, 2021
Riva quickstart 2.11 fails on xavier nx Riva	3	913	June 29, 2023
Riva_start.sh will not load the models Riva riva	3	1157	April 23, 2024

Building Transcription and Entity Recognition Apps Using NVIDIA Jarvis

Related topics