Originally published at: https://developer.nvidia.com/blog/using-deepspeed-and-megatron-to-train-megatron-turing-nlg-530b-the-worlds-largest-and-most-powerful-generative-language-model/
MT-NLG has 3x the number of parameters compared to the existing largest model of this type and demonstrates unmatched accuracy in a broad set of natural language tasks
Awesome. How can end-user get access to this model? Is it integrated with Azure cognitive services - given it is a joint effort by Microsoft?
1 Like