Training language models to follow instructions with human feedback

Read article

Related Concepts

Reinforcement Learning from Human Feedback (RLHF)

← Back to all articles