Originally published at: What Is a Transformer Model? | NVIDIA Blogs
A transformer model is a neural network that learns context and thus meaning by tracking relationships in sequential data like the words in this sentence.
jwitsoe
1
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Explainer: What Is a Transformer Model? | 0 | 276 | October 12, 2022 | |
What Are Foundation Models? | 0 | 389 | March 27, 2023 | |
What Are Foundation Models? | 2 | 657 | July 18, 2023 | |
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, the World’s Largest and Most Powerful Generative Language Model | 2 | 1437 | May 15, 2025 | |
Introduction to Neural Machine Translation with GPUs (part 3) | 36 | 583 | August 19, 2018 | |
Demystifying AI Inference Deployments for Trillion Parameter Large Language Models | 3 | 189 | April 17, 2025 | |
NVIDIA Faster Transformer를 통한 KoGPT의 추론 가속화 | 0 | 583 | August 24, 2023 | |
Accelerated Inference for Large Transformer Models Using FasterTransformer and Triton Inference Server | 1 | 554 | August 10, 2023 | |
Explainer: What Is a Transformer Model? | 0 | 271 | October 20, 2022 | |
NVIDIA's 2017 Open-Source Deep Learning Frameworks Contributions | 0 | 222 | August 21, 2022 |