Mapgo: Model-assisted policy optimization for goal-oriented tasks M Zhu, M Liu, J Shen, Z Zhang, S Chen, W Zhang, D Ye, Y Yu, Q Fu, ... arXiv preprint arXiv:2105.06350, 2021 | 20 | 2021 |
Maviper: Learning decision tree policies for interpretable multi-agent reinforcement learning S Milani, Z Zhang, N Topin, ZR Shi, C Kamhoua, EE Papalexakis, F Fang Joint European Conference on Machine Learning and Knowledge Discovery in …, 2022 | 11 | 2022 |