Beta. Content is under active construction and has not been peer-reviewed. Report errors on
GitHub
.
Disclaimer
Theorem
Path
Curriculum
Paths
Demos
Diagnostic
Search
Quiz Hub
/
Exploration vs Exploitation
Exploration vs Exploitation
6 questions
Difficulty 2-6
View topic
Foundation
0 / 6
2 foundation
4 intermediate
Adapts to your performance
1 / 6
foundation (2/10)
conceptual
The exploration-exploitation tradeoff is central to reinforcement learning and multi-armed bandits. What is the fundamental tension?
Hide and think first
A.
The tradeoff only applies to RL; supervised learning never involves exploration-exploitation decisions
B.
Exploit uses known-good options for immediate reward; explore tries unknown options that might be better but are risky
C.
Exploit always has higher expected reward than explore; exploration is only valuable for its aesthetic diversity
D.
Explore requires more compute per step than exploit, making it infeasible for large action spaces
Submit Answer