Banghua Zhu

Cited by

	All	Since 2019
Citations	948	946
h-index	13	13
i10-index	17	17

360

180

270

20192020202120222023202422 56 98 199 354 205

Public access

View all

11 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Jiantao JiaoAssistant Professor of EECS and Statistics, University of California, BerkeleyVerified email at berkeley.edu
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC BerkeleyVerified email at cs.berkeley.edu
Cong MaUniversity of ChicagoVerified email at uchicago.edu
Stuart RussellProfessor of Computer Science, University of California, BerkeleyVerified email at cs.berkeley.edu
Paria RashidinejadPostdoctoral Scholar, University of California, BerkeleyVerified email at berkeley.edu
Jacob SteinhardtStanford UniversityVerified email at cs.stanford.edu
Ying ShengPhD student of Stanford UniversityVerified email at stanford.edu
Song JianTsinghua UniversityVerified email at tsinghua.edu.cn
Lianmin ZhengUC BerkeleyVerified email at berkeley.edu
Ikechukwu UchenduHarvard UniversityVerified email at g.harvard.edu
Haris VikaloProfessor, University of Texas at AustinVerified email at ece.utexas.edu
Abolfazl HashemiAssistant Professor of ECE, Purdue UniversityVerified email at purdue.edu
Ion StoicaProfessor of Computer Science, UC BerkeleyVerified email at cs.berkeley.edu
Joseph E. GonzalezProfessor of Computer Science, UC BerkeleyVerified email at berkeley.edu
Dacheng LiUC BerkeleyVerified email at berkeley.edu
Kurt KeutzerProfessor of the Graduate School, EECS, University of California, BerkeleyVerified email at berkeley.edu
Lele WangUniversity of British ColumbiaVerified email at ece.ubc.ca
Nadim GhaddarPostdoctoral Fellow, University of TorontoVerified email at utoronto.ca
Hanlin ZhuPh.D. student, University of California, BerkeleyVerified email at berkeley.edu
Sergey LevineUC Berkeley, Physical IntelligenceVerified email at eecs.berkeley.edu

Banghua Zhu

University of California, Berkeley

Verified email at berkeley.edu - Homepage

foundation models human-AI interaction statistics information theory reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bridging offline reinforcement learning and imitation learning: A tale of pessimism P Rashidinejad, B Zhu, C Ma, J Jiao, S Russell Advances in Neural Information Processing Systems 34, 11702-11716, 2021	254	2021
Deconstructing Generative Adversarial Networks B Zhu, J Jiao, D Tse arXiv preprint arXiv:1901.09465, 2019	132*	2019
Joint transceiver optimization for wireless communication PHY using neural network B Zhu, J Wang, L He, J Song IEEE Journal on Selected Areas in Communications 37 (6), 1364-1373, 2019	101	2019
Principled reinforcement learning with human feedback from pairwise or k-wise comparisons B Zhu, M Jordan, J Jiao International Conference on Machine Learning, 43037-43067, 2023	72	2023
Jump-start reinforcement learning I Uchendu, T Xiao, Y Lu, B Zhu, M Yan, J Simon, M Bennice, C Fu, C Ma, ... International Conference on Machine Learning, 34556-34583, 2023	64	2023
Generalized resilience and robust statistics B Zhu, J Jiao, J Steinhardt The Annals of Statistics 50 (4), 2256-2283, 2022	44	2022
Robust estimation via generalized quasi-gradients B Zhu, J Jiao, J Steinhardt Information and Inference: A Journal of the IMA 11 (2), 581-636, 2022	39	2022
The sample complexity of online contract design B Zhu, S Bates, Z Yang, Y Wang, J Jiao, MI Jordan arXiv preprint arXiv:2211.05732, 2022	28	2022
Sparse tensor decomposition for haplotype assembly of diploids and polyploids A Hashemi, B Zhu, H Vikalo BMC genomics 19, 1-15, 2018	26	2018
Byzantine-robust federated learning with optimal statistical rates B Zhu, L Wang, Q Pang, S Wang, J Jiao, D Song, MI Jordan International Conference on Artificial Intelligence and Statistics, 3151-3178, 2023	23*	2023
Fine-tuning language models with advantage-induced policy alignment B Zhu, H Sharma, FV Frujeri, S Dong, C Zhu, MI Jordan, J Jiao arXiv preprint arXiv:2306.02231, 2023	18	2023
S-lora: Serving thousands of concurrent lora adapters Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ... arXiv preprint arXiv:2311.03285, 2023	17	2023
When does the Tukey median work? B Zhu, J Jiao, J Steinhardt 2020 IEEE International Symposium on Information Theory (ISIT), 1201-1206, 2020	15	2020
Minimax off-policy evaluation for multi-armed bandits C Ma, B Zhu, J Jiao, MJ Wainwright IEEE Transactions on Information Theory 68 (8), 5314-5339, 2022	11	2022
Starling-7B: Improving LLM Helpfulness & Harmlessness with RLAIF B Zhu, E Frick, T Wu, H Zhu, J Jiao https://starling.cs.berkeley.edu/, 2023	10	2023
Online learning in stackelberg games with an omniscient follower G Zhao, B Zhu, J Jiao, M Jordan International Conference on Machine Learning, 42304-42316, 2023	10	2023
Linear representation meta-reinforcement learning for instant adaptation M Peng, B Zhu, J Jiao arXiv preprint arXiv:2101.04750, 2021	10	2021
On optimal caching and model multiplexing for large model inference B Zhu, Y Sheng, L Zheng, C Barrett, MI Jordan, J Jiao arXiv preprint arXiv:2306.02003, 2023	8	2023
Noisy Sorting Capacity Z Wang, N Ghaddar, B Zhu, L Wang arXiv preprint arXiv:2202.01446, 2023	8	2023
Principled reinforcement learning with human feedback from pairwise or k-wise comparisons (2023) B Zhu, J Jiao, MI Jordan arXiv preprint arXiv:2301.11270, 0	6

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors