Actor-Critic Methods

Question 1 of 2
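Actor-critic methods use the advantage function $A(s, a) = Q(s, a) - V(s)$ in the policy gradient $\nabla_\theta J(\theta) = \mathbb{E}_{\pi_\theta}\left[\nabla_\theta \log \pi_\theta(a \mid s)\, A(s, a)\right]$. Why subtract $V(s)$ from $Q(s, a)$?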
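
One way to explore the question is a small numeric check. The sketch below is illustrative only and rests on several assumptions that are not part of the question: a single state, three actions, a softmax policy with equal logits, and made-up action values `q`. It computes the exact mean and variance of the single-sample gradient estimate for one logit, once weighted by $Q(s, a)$ and once by $A(s, a)$. The helper names `softmax` and `grad_stats` are hypothetical.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def grad_stats(pi, weights, k=0):
    """Exact mean and variance of the single-sample gradient estimate for logit k.

    For a softmax policy, d/d(theta_k) log pi(a) = 1[a == k] - pi[k].
    Each action a contributes that score times weights[a], with probability pi[a].
    """
    samples = [((1.0 if a == k else 0.0) - pi[k]) * weights[a]
               for a in range(len(pi))]
    mean = sum(p * g for p, g in zip(pi, samples))
    var = sum(p * (g - mean) ** 2 for p, g in zip(pi, samples))
    return mean, var

# Assumed toy setup: one state, three actions, uniform softmax policy.
pi = softmax([0.0, 0.0, 0.0])
q = [10.0, 11.0, 12.0]                     # hypothetical Q(s, a) values
v = sum(p * qa for p, qa in zip(pi, q))    # V(s) = E_pi[Q(s, a)]
adv = [qa - v for qa in q]                 # A(s, a) = Q(s, a) - V(s)

mean_q, var_q = grad_stats(pi, q)      # estimator weighted by Q(s, a)
mean_a, var_a = grad_stats(pi, adv)    # estimator weighted by A(s, a)

print("mean with Q:", mean_q, "variance with Q:", var_q)
print("mean with A:", mean_a, "variance with A:", var_a)
```

In this toy setup the two estimators have the same expected value, because $V(s)$ does not depend on the action and the score function has zero mean under the policy, while the advantage-weighted estimator has much lower variance. That matches the usual answer: $V(s)$ acts as a baseline that reduces variance without biasing the gradient.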