Beta. Content is under active construction and has not been peer-reviewed. Report errors on
GitHub
.
Disclaimer
Theorem
Path
Curriculum
Paths
Demos
Diagnostic
Search
Quiz Hub
/
Bellman Equations
Bellman Equations
12 questions
Difficulty 2-7
View topic
Foundation
0 / 12
3 foundation
8 intermediate
1 advanced
Adapts to your performance
1 / 12
foundation (2/10)
state theorem
The RL framework models an agent interacting with an environment. What are the four primary quantities in a single interaction step?
Hide and think first
A.
Temperature, entropy, loss, gradient — training-dynamics quantities
B.
Input, layer, activation, output — the standard neural network pipeline
C.
Pixels, frames, buffer, replay — replay buffer components
D.
State, action, reward, next state —
(
s
,
a
,
r
,
s
′
)
tuple
Submit Answer