AI Alignment - Graph View

Ensuring AI systems behave in accordance with human intentions and values.

View concept details

Related Concepts

AI Guardrails
AI Safety
Constitutional AI
Dual-Use Dilemma
Human-in-the-Loop
Large Language Models (LLMs)
Oppenheimerian Guilt
Reinforcement Learning from Human Feedback (RLHF)
Endogenous Goals

← Back to full graph