Chuheng Zhang

Cited by

	All	Since 2019
Citations	258	250
h-index	8	8
i10-index	7	7

100

201720182019202020212022202320242 5 8 9 30 64 99 39

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Jian LiTsinghua UniversityVerified email at mail.tsinghua.edu.cn
Tie-Yan LiuDistinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA FellowVerified email at microsoft.com
Luming DuanC.C. Yao Professor, Tsinghua UniversityVerified email at tsinghua.edu.cn

Chuheng Zhang

Microsoft Research

Verified email at microsoft.com

Machine Learning Reinforcement Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Exploration by maximizing Rényi entropy for reward-free RL framework C Zhang, Y Cai, L Huang, J Li Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10859 …, 2021	46	2021
Return-based contrastive representation learning for reinforcement learning G Liu, C Zhang, L Zhao, T Qin, J Zhu, J Li, N Yu, TY Liu arXiv preprint arXiv:2102.10960, 2021	42	2021
Observation of topological links associated with Hopf insulators in a solid-state quantum simulator XX Yuan, L He, ST Wang, DL Deng, F Wang, WQ Lian, X Wang, ... Chinese Physics Letters 34 (6), 060302, 2017	38	2017
Cross DQN: Cross deep Q network for ads allocation in feed G Liao, Z Wang, X Wu, X Shi, C Zhang, Y Wang, X Wang, D Wang Proceedings of the ACM Web Conference 2022, 401-409, 2022	26	2022
Inductive matrix completion using graph autoencoder W Shen, C Zhang, Y Tian, L Zeng, X He, W Dou, X Xu Proceedings of the 30th ACM International Conference on Information …, 2021	19	2021
Auxiliary-task based deep reinforcement learning for participant selection problem in mobile crowdsourcing W Shen, X He, C Zhang, Q Ni, W Dou, Y Wang Proceedings of the 29th ACM International Conference on Information …, 2020	18	2020
DoubleEnsemble: A new ensemble method based on sample reweighting and feature selection for financial data analysis C Zhang, Y Li, X Chen, Y Jin, P Tang, J Li 2020 IEEE International Conference on Data Mining (ICDM), 781-790, 2020	14	2020
Multi-agent reinforcement learning with shared resources for inventory management Y Ding, M Feng, G Liu, W Jiang, C Zhang, L Zhao, L Song, H Li, Y Jin, ... arXiv preprint arXiv:2212.07684, 2022	8	2022
Policy Search by Target Distribution Learning for Continuous Control. C Zhang, Y Li, J Li AAAI, 6770-6777, 2020	8	2020
Deep page-level interest network in reinforcement learning for ads allocation G Liao, X Shi, Z Wang, X Wu, C Zhang, Y Wang, X Wang, D Wang Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022	7	2022
Pre-trained large language models for industrial control L Song, C Zhang, L Zhao, J Bian arXiv preprint arXiv:2308.03028, 2023	6	2023
Venlafaxine as an adjuvant therapy for inflammatory bowel disease patients with anxious and depressive symptoms: a randomized controlled trial C Liang, P Chen, Y Tang, C Zhang, N Lei, Y Luo, S Duan, Y Zhang Frontiers in Psychiatry 13, 880058, 2022	5	2022
RePreM: representation pre-training with masked model for reinforcement learning Y Cai, C Zhang, W Shen, X Zhang, W Ruan, L Huang Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 6879-6887, 2023	3	2023
A versatile multi-agent reinforcement learning benchmark for inventory management X Yang, Z Liu, W Jiang, C Zhang, L Zhao, L Song, J Bian arXiv preprint arXiv:2306.07542, 2023	3	2023
Towards generalizable reinforcement learning for trade execution C Zhang, Y Duan, X Chen, J Chen, J Li, L Zhao arXiv preprint arXiv:2307.11685, 2023	3	2023
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks Z Wang, G Liao, X Shi, X Wu, C Zhang, Y Wang, X Wang, D Wang Proceedings of the 31st ACM International Conference on Information …, 2022	3	2022
Imitation learning to outperform demonstrators by directly extrapolating demonstrations Y Cai, C Zhang, W Shen, X He, X Zhang, L Huang Proceedings of the 31st ACM International Conference on Information …, 2022	2	2022
A transformer-based user satisfaction prediction for proactive interaction mechanism in DuerOS W Shen, X He, C Zhang, X Zhang, J Xie Proceedings of the 31st ACM International Conference on Information …, 2022	2	2022
Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation Z Wang, G Liao, X Shi, X Wu, C Zhang, B Zhu, Y Wang, X Wang, D Wang Proceedings of the 31st ACM International Conference on Information …, 2022	2	2022
LLM+ A: Grounding Large Language Models in Physical World with Affordance Prompting G Cheng, C Zhang, W Cai, L Zhao, C Sun, J Bian	1	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors