Reinforcement Learning - Graph View A machine learning paradigm where an agent learns to make decisions by taking actions in an environment and receiving rewards or penalties as feedback. View concept details Related ConceptsMachine Learning Reinforcement Learning from Human Feedback (RLHF) Reward Model Reward Hacking Deep Learning Neural Networks AI Alignment Direct Preference Optimization ← Back to full graph