Second-Order Actor-Critic Methods for Discounted MDPs via Policy Hessian Decomposition
Reinforcement Learning · Moderate
Authors: Sanjeev Manivannan, Shuban V
Year: 2026