Distributed multi-agent reinforcement learning by actor-critic method PC Heredia, S Mou IFAC-PapersOnLine 52 (20), 363-368, 2019 | 20 | 2019 |
Finite-sample analysis of multi-agent policy evaluation with kernelized gradient temporal difference P Heredia, S Mou 2020 59th IEEE Conference on Decision and Control (CDC), 5647-5652, 2020 | 7 | 2020 |
Finite-Sample Analysis of Distributed Q-learning for Multi-Agent Networks P Heredia, H Ghadialy, S Mou 2020 American Control Conference (ACC), 3511-3516, 2020 | 6 | 2020 |
Policy Learning based on Deep Koopman Representation W Hao, PC Heredia, B Huang, Z Lu, Z Liang, S Mou arXiv preprint arXiv:2305.15188, 2023 | | 2023 |
Distributed Offline Reinforcement Learning P Heredia, J George, S Mou 2022 IEEE 61st Conference on Decision and Control (CDC), 4621-4626, 2022 | | 2022 |
Distributed State Estimation for Nonlinear Systems with Unknown Parameters P Heredia, E Garcia, S Mou 2022 American Control Conference (ACC), 96-101, 2022 | | 2022 |
Multi-Agent Reinforcement Learning: Analysis and Application PC Heredia Purdue University Graduate School, 2022 | | 2022 |