Building and Deploying Conversational AI Models Using the NVIDIA Transfer Learning Toolkit

Originally published at: https://developer.nvidia.com/blog/building-and-deploying-conversational-ai-models-using-the-transfer-learning-toolkit/

Conversational AI is a set of technologies enabling human-like interactions between humans and devices based on the most natural interfaces for us: speech and natural language. Systems based on conversational AI can understand commands by recognizing speech and text, translating on-the-fly between different languages, understanding our intents, and responding in a way that mimics human…

Hello! For the fine-tuning part of this exercise, is the data available? When I stepped through the example, I hit an error there. Also, for the KEY bash variable, I set this to my own user-specified value, but I kept getting errors telling me the format was incorrect. In a different TLT example, I found the authors set the key to “tlt_encode”, and with that value I was able to make progress.

Hi @dvanstee ,

Generally, we at NVIDIA don’t distribute datasets that we don’t own. Still, for the Text Classification task, TLT supports two public datasets out-of-the-box (SST-2 and IMDB) that you can download on your own. Then simply run the dataset_convert script with the proper dataset name (please refer to the TLT Text Classification user guide). Those datasets can be used both for training from scratch and for fine-tuning.

When it comes to models used as a starting point for fine-tuning, I assume you downloaded them from NGC. If so, please follow the instructions regarding usage of the given model (and its key in particular) provided on the associated NGC Model Card.
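For reference, a rough sketch of how the key typically fits into the workflow. This is a hypothetical outline, not a verified recipe: the spec files, result directories, and model filename below are placeholders, and the exact subcommands and flags should be checked against the TLT Text Classification user guide for your installed version. The “tlt_encode” value is only correct when the NGC Model Card for your downloaded model says so; a model you trained yourself uses whatever key you chose at training time.

```shell
# KEY must match the key listed on the NGC Model Card of the model you
# downloaded (some public models document "tlt_encode"); a mismatched or
# malformed key produces decryption/format errors during fine-tuning.
export KEY="tlt_encode"

if command -v tlt >/dev/null 2>&1; then
    # Convert one of the supported public datasets (SST-2 or IMDB) into
    # TLT's format. Spec file and output dir are placeholders -- see the
    # TLT Text Classification user guide for the real arguments.
    tlt text_classification dataset_convert \
        -e convert_spec.yaml \
        -r results/convert

    # Fine-tune starting from the downloaded .tlt model, passing the key
    # with -k so the encrypted model can be loaded.
    tlt text_classification finetune \
        -e finetune_spec.yaml \
        -m pretrained_model.tlt \
        -k "$KEY" \
        -r results/finetune
else
    echo "tlt launcher not found; install it before running these steps"
fi
```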

Hope it helps,
Tom