Reinforcement Learning Pulse

Five questions across 6 topics. No timer.

1 / 5Reinforcement Learning

Question 1 of 5

foundation (2/10)conceptual

The exploration-exploitation tradeoff is central to reinforcement learning and multi-armed bandits. What is the fundamental tension?