Adding External Knowledge and Controllability to Language Models with Megatron-CNTRL

Originally published at: https://developer.qa.nvidia.com/blog/adding-external-knowledge-and-controllability-to-language-models-with-megatron-cntrl/

Large language models such as Megatron and GPT-3 are transforming AI. We are excited about applications that can take advantage of these models to create better conversational AI. A major problem with generative language models in conversational AI applications is that they lack controllability and consistency with real-world facts. In this work, we try…