Actor-Critic & Continuous ControlDDPGDeep Deterministic Policy Gradient for continuous action spaces.Copy MarkdownOpenPlaceholder content for DDPG.Actor-Critic FrameworkA2C, advantage estimation, and Generalized Advantage Estimation (GAE).TD3 and SACTwin Delayed DDPG and Soft Actor-Critic with maximum entropy RL.