Follow
Shaocong Ma
Title
Cited by
Cited by
Year
Variance-reduced off-policy TDC learning: Non-asymptotic convergence analysis
S Ma, Y Zhou, S Zou
Advances in neural information processing systems 33, 14796-14806, 2020
172020
Greedy-GQ with variance reduction: Finite-time analysis and improved complexity
S Ma, Z Chen, Y Zhou, S Zou
International Conference on Learning Representations 2021, 2021
152021
Sample efficient stochastic policy extragradient algorithm for zero-sum markov game
Z Chen, S Ma, Y Zhou
International Conference on Learning Representations, 2021
132021
Finding correlated equilibrium of constrained markov game: A primal-dual approach
Z Chen, S Ma, Y Zhou
Advances in Neural Information Processing Systems 35, 25560-25572, 2022
62022
Accelerated proximal alternating gradient-descent-ascent for nonconvex minimax machine learning
Z Chen, S Ma, Y Zhou
2022 IEEE International Symposium on Information Theory (ISIT), 672-677, 2022
62022
Understanding the impact of model incoherence on convergence of incremental sgd with random reshuffle
S Ma, Y Zhou
International Conference on Machine Learning, 6565-6574, 2020
42020
Decentralized Robust V-learning for Solving Markov Games with Model Uncertainty
S Ma, Z Chen, S Zou, Y Zhou
Journal of Machine Learning Research 24 (371), 1-40, 2023
22023
Data sampling affects the complexity of online sgd over dependent data
S Ma, Z Chen, Y Zhou, K Ji, Y Liang
Uncertainty in Artificial Intelligence, 1296-1305, 2022
22022
End-to-End Mesh Optimization of a Hybrid Deep Learning Black-Box PDE Solver
S Ma, J Diffenderfer, B Kailkhura, Y Zhou
arXiv preprint arXiv:2404.11766, 2024
2024
Towards Understanding Reinforcement Learning from Optimization Perspectives
S Ma
2021
The system can't perform the operation now. Try again later.
Articles 1–10