oirl: Robust adversarial inverse reinforcement learning with temporally extended actions D Venuto, J Chakravorty, L Boussioux, J Wang, G McCracken, D Precup arXiv preprint arXiv:2002.09043, 2020 | 3 | 2020 |
PAC-Bayesian analysis of counterfactual risk in stochastic contextual bandits J Wang, B Mazoure, G McCracken, D Venuto, A Durand Multi-disciplinary Conference on Reinforcement Learning and Decision Making, 2019 | 2 | 2019 |
Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions D Venuto McGill University (Canada), 2020 | 1 | 2020 |
Using Exact Models to Analyze Policy Gradient Algorithms G McCracken McGill University (Canada), 2021 | | 2021 |
A Study of Policy Gradient on a Class of Exactly Solvable Models G McCracken, C Daniels, R Zhao, A Brandenberger, P Panangaden, ... arXiv preprint arXiv:2011.01859, 2020 | | 2020 |
Extending Fluctuation Dissipation Relations to Policy Gradient Methods in Reinforcement Learning A Brandenberger, G McCracken | | |