Announcing SteerLM: A Simple and Practical Technique to Customize LLMs During Inference

Originally published at: https://developer.nvidia.com/blog/announcing-steerlm-a-simple-and-practical-technique-to-customize-llms-during-inference/

With the advent of large language models (LLMs) such as GPT-3, Megatron-Turing, Chinchilla, PaLM-2, Falcon, and Llama 2, remarkable progress in natural language generation has been made in recent years. However, despite their ability to produce human-like text, ‌foundation LLMs can fail to provide helpful and nuanced responses aligned with user preferences.  The current approach…