Originally published at: https://developer.nvidia.com/blog/increasing-inference-acceleration-of-kogpt-with-fastertransformer/
Transformers are one of the most influential AI model architectures today and are shaping the direction of future AI R&D. First invented as a tool for natural language processing (NLP), transformers are now used in almost every AI task, including computer vision, automatic speech recognition, molecular structure classification, and financial data processing. In Korea, KakaoBrain…