StudyPreprintWikiReinforcement LearningSequential DecisionsModerateFinite-Time Regret Analysis of Retry-Aware BanditsRead full paper →AuthorsBingkui Tong, Junpei Komiyama, Soichiro Nishimori, Paavo ParmasYear2026Read full paper →More Reinforcement Learning research