Variance-reduced off-policy TDC learning: Non-asymptotic convergence analysis S Ma, Y Zhou, S Zou Advances in Neural Information Processing Systems 33, 14796-14806, 2020 | 17 | 2020 |
Greedy-GQ with variance reduction: Finite-time analysis and improved complexity S Ma, Z Chen, Y Zhou, S Zou International Conference on Learning Representations, 2021 | 15 | 2021 |
Sample efficient stochastic policy extragradient algorithm for zero-sum Markov game Z Chen, S Ma, Y Zhou International Conference on Learning Representations, 2021 | 13 | 2021 |
Finding correlated equilibrium of constrained Markov game: A primal-dual approach Z Chen, S Ma, Y Zhou Advances in Neural Information Processing Systems 35, 25560-25572, 2022 | 6 | 2022 |
Accelerated proximal alternating gradient-descent-ascent for nonconvex minimax machine learning Z Chen, S Ma, Y Zhou 2022 IEEE International Symposium on Information Theory (ISIT), 672-677, 2022 | 6 | 2022 |
Understanding the impact of model incoherence on convergence of incremental SGD with random reshuffle S Ma, Y Zhou International Conference on Machine Learning, 6565-6574, 2020 | 4 | 2020 |
Decentralized Robust V-learning for Solving Markov Games with Model Uncertainty S Ma, Z Chen, S Zou, Y Zhou Journal of Machine Learning Research 24 (371), 1-40, 2023 | 2 | 2023 |
Data sampling affects the complexity of online SGD over dependent data S Ma, Z Chen, Y Zhou, K Ji, Y Liang Uncertainty in Artificial Intelligence, 1296-1305, 2022 | 2 | 2022 |
End-to-End Mesh Optimization of a Hybrid Deep Learning Black-Box PDE Solver S Ma, J Diffenderfer, B Kailkhura, Y Zhou arXiv preprint arXiv:2404.11766, 2024 | | 2024 |
Towards Understanding Reinforcement Learning from Optimization Perspectives S Ma | | 2021 |