StudyPreprintWikiCausal EstimationReinforcement LearningSequential DecisionsModerateOffline Contextual Bandits in the Presence of New ActionsRead full paper →AuthorsRen Kishimoto, Tatsuhiro Shimizu, Kazuki Kawamura, Takanori Muroi, Yusuke Narita, Yuki Sasamoto, Kei Tateno, Takuma Udagawa, Yuta SaitoYear2026Read full paper →More Causal Estimation research