Yufei Zhang

Cited by

	All	Since 2019
Citations	329	328
h-index	11	11
i10-index	11	11

120

2019202020212022202320246 25 48 60 120 58

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

Christoph ReisingerProfessor of Applied Mathematics, University of OxfordVerified email at maths.ox.ac.uk
Xin GuoUC Berkeley, Cornell Univeristy, IBMVerified email at berkeley.edu
Lukasz SzpruchUniversity of Edinburgh and The Alan Turing InstituteVerified email at ed.ac.uk
Anran HuUniversity of OxfordVerified email at maths.ox.ac.uk
Tanut TreetanthiploetThe Alan Turing InstituteVerified email at turing.ac.uk
Kazufumi ItoNorth Carolina State UniversityVerified email at math.ncsu.edu
Matteo BaseiQuant researcher at EDF R&DVerified email at edf.fr
Le SongBiomap, Mohamed bin Zayed University of Artificial IntelligenceVerified email at biomap.com
Xinshi ChenGeorgia Institution of TechnologyVerified email at bytedance.com
Roxana DumitrescuAssociate Professor, King's College LondonVerified email at kcl.ac.uk
David SiskaSchool of Mathematics, University of Edinburgh and Vega ResearchVerified email at ed.ac.uk
Eyal NeumanImperial College LondonVerified email at imperial.ac.uk
James-Michael LeahyImperial College LondonVerified email at imperial.ac.uk

Yufei Zhang

Imperial College London

Verified email at imperial.ac.uk - Homepage

Stochastic Control Reinforcement Learning Mathematical Finance


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Rectified deep neural networks overcome the curse of dimensionality for nonsmooth value functions in zero-sum games of nonlinear stiff systems C Reisinger, Y Zhang Analysis and Applications 18 (06), 951-999, 2020	77	2020
A Neural Network-Based Policy Iteration Algorithm with Global -Superlinear Convergence for Stochastic Games on Domains K Ito, C Reisinger, Y Zhang Foundations of Computational Mathematics 21 (2), 331-374, 2021	40	2021
Logarithmic regret for episodic continuous-time linear-quadratic reinforcement learning over a finite-time horizon M Basei, X Guo, A Hu, Y Zhang Journal of Machine Learning Research 23 (178), 1-34, 2022	32*	2022
Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models L Szpruch, T Treetanthiploet, Y Zhang arXiv preprint arXiv:2112.10264, 2021	17	2021
Reinforcement learning for linear-convex models with jumps via stability analysis of feedback controls X Guo, A Hu, Y Zhang SIAM Journal on Control and Optimization 61 (2), 755-787, 2023	16	2023
A fast iterative PDE-based algorithm for feedback controls of nonsmooth mean-field control problems C Reisinger, W Stockinger, Y Zhang arXiv preprint arXiv:2108.06740, 2021	16	2021
Regularity and stability of feedback relaxed controls C Reisinger, Y Zhang SIAM Journal on Control and Optimization 59 (5), 3118-3151, 2021	16	2021
Understanding deep architecture with reasoning layer X Chen, Y Zhang, C Reisinger, L Song Advances in Neural Information Processing Systems 33, 1240-1252, 2020	16	2020
Approximation schemes for mixed optimal stopping and control problems with nonlinear expectations and jumps R Dumitrescu, C Reisinger, Y Zhang Applied Mathematics & Optimization 83, 1387-1429, 2021	14	2021
A posteriori error estimates for fully coupled McKean-Vlasov forward-backward SDEs C Reisinger, W Stockinger, Y Zhang arXiv preprint arXiv:2007.07731, 2020	14	2020
Convergence of Policy Gradient Methods for Finite-Horizon Exploratory Linear-Quadratic Control Problems M Giegrich, C Reisinger, Y Zhang SIAM Journal on Control and Optimization 62 (2), 1060-1092, 2024	11*	2024
Linear convergence of a policy gradient method for some finite horizon continuous time control problems C Reisinger, W Stockinger, Y Zhang SIAM Journal on Control and Optimization 61 (6), 3526-3558, 2023	9	2023
Optimal Scheduling of Entropy Regularizer for Continuous-Time Linear-Quadratic Reinforcement Learning L Szpruch, T Treetanthiploet, Y Zhang SIAM Journal on Control and Optimization 62 (1), 135-166, 2024	8	2024
Path regularity of coupled McKean-Vlasov FBSDEs C Reisinger, W Stockinger, Y Zhang arXiv preprint arXiv:2011.06664, 2020	8	2020
Error estimates of penalty schemes for quasi-variational inequalities arising from impulse control problems C Reisinger, Y Zhang SIAM Journal on Control and Optimization 58 (1), 243-276, 2020	8	2020
A penalty scheme for monotone systems with interconnected obstacles: convergence and error estimates C Reisinger, Y Zhang SIAM Journal on Numerical Analysis 57 (4), 1625-1648, 2019	7	2019
A penalty scheme and policy iteration for nonlocal HJB variational inequalities with monotone nonlinearities C Reisinger, Y Zhang Computers & Mathematics with Applications 93, 199-213, 2021	4	2021
A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces B Kerimkulov, JM Leahy, D Siska, L Szpruch, Y Zhang arXiv preprint arXiv:2310.02951, 2023	3	2023
Towards an analytical framework for potential games X Guo, Y Zhang arXiv preprint arXiv:2310.02259, 2023	3	2023
Fully Discrete Schemes and Their Analyses for Forward-Backward Stochastic Differential Equations K Ito, Y Zhang, J Zou arXiv preprint arXiv:1804.10944, 2018	3	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors