StudyPreprintWikiReinforcement LearningSequential DecisionsModerateSoft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic ActorRead full paper →AuthorsTuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey LevineYear2018Citations11,170Read full paper →More Reinforcement Learning research