Actor-Critic
Definition
Actor-Critic is a temporal-difference (TD) version of the policy gradient method. It has two networks: an actor and a critic. The actor decides which action to take, and the critic tells the actor how good that action was and how it should adjust. The actor learns using the policy gradient approach, while the critic evaluates the actions produced by the actor by computing the value function.
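The interplay between the two parts can be illustrated with a minimal sketch of one-step actor-critic. Everything here is an assumption made for illustration: the four-state corridor environment, the tabular softmax actor and tabular critic (used in place of neural networks to keep the example dependency-free), and the names `step`, `policy`, and `train` are hypothetical and not taken from the definition above.

```python
import math
import random

N_STATES = 4      # hypothetical corridor: states 0..3, state 3 is the goal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right
GAMMA = 0.9       # discount factor (assumed value)


def step(state, action):
    """Toy environment: reward 1 on reaching the goal, 0 otherwise."""
    nxt = max(0, state - 1) if action == 0 else state + 1
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def policy(prefs, state):
    """Actor: softmax over the action preferences of one state."""
    m = max(prefs[state])
    exps = [math.exp(p - m) for p in prefs[state]]
    total = sum(exps)
    return [e / total for e in exps]


def train(episodes=2000, alpha_actor=0.1, alpha_critic=0.2,
          seed=0, max_steps=200):
    rng = random.Random(seed)
    prefs = [[0.0, 0.0] for _ in range(N_STATES)]  # actor parameters
    values = [0.0] * N_STATES                      # critic's value estimates
    for _episode in range(episodes):
        state = 0
        for _t in range(max_steps):
            probs = policy(prefs, state)
            action = rng.choices(ACTIONS, weights=probs)[0]
            nxt, reward, done = step(state, action)

            # Critic: the TD error says how much better or worse the
            # outcome was than the current value estimate predicted.
            target = reward + (0.0 if done else GAMMA * values[nxt])
            td_error = target - values[state]
            values[state] += alpha_critic * td_error

            # Actor: policy gradient step, scaled by the critic's TD error.
            # For a softmax policy, d log pi(action) / d pref[a]
            # equals 1[a == action] - pi(a).
            for a in ACTIONS:
                grad = (1.0 if a == action else 0.0) - probs[a]
                prefs[state][a] += alpha_actor * td_error * grad

            state = nxt
            if done:
                break
    return prefs, values
```

Calling `prefs, values = train()` and then `policy(prefs, 0)` gives the learned action probabilities for the start state. In this toy setup the critic's TD error is the only feedback the actor receives, which mirrors the division of roles described above.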
References
The Actor-Critic Reinforcement Learning Algorithm. Medium.
D3FEND™
A knowledge graph of cybersecurity countermeasures