StudyPreprintWikiReinforcement LearningModerateBayesian policy gradient and actor-critic algorithmsRead full paper →AuthorsMohammad Ghavamzadeh, Yaakov Engel, Michal ValkoYear2026Read full paper →More Reinforcement Learning research