Towards minimax optimality of model-based robust reinforcement learning P Clavier, EL Pennec, M Geist arXiv preprint arXiv:2302.05372, 2023 | 12 | 2023 |
Robust reinforcement learning with distributional risk-averse formulation P Clavier, S Allassonière, EL Pennec arXiv preprint arXiv:2206.06841, 2022 | 4 | 2022 |
Sum-Product Network in the context of missing data P Clavier | 1 | 2020 |
Time-Constrained Robust MDPs A Zouitine, D Bertoin, P Clavier, M Geist, E Rachelson arXiv preprint arXiv:2406.08395, 2024 | | 2024 |
RRLS: Robust Reinforcement Learning Suite A Zouitine, D Bertoin, P Clavier, M Geist, E Rachelson arXiv preprint arXiv:2406.08406, 2024 | | 2024 |
Bootstrapping Expectiles in Reinforcement Learning P Clavier, E Rachelson, EL Pennec, M Geist arXiv preprint arXiv:2406.04081, 2024 | | 2024 |
Gaussian Sum-Product Networks Learning in the Presence of Interval Censored Data P Clavier, O Bouaziz, G Nuel International Conference on Probabilistic Graphical Models, 125-136, 2020 | | 2020 |
Variational Inference Thompson Sampling for contextual bandits P Clavier, T Huix, AO Durmus Forty-first International Conference on Machine Learning | | |