关注
Zihan Qiu
Zihan Qiu
在 mails.tsinghua.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Supported policy optimization for offline reinforcement learning
J Wu, H Wu, Z Qiu, J Wang, M Long
Advances in Neural Information Processing Systems 35, 31278-31291, 2022
362022
Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures?
Z Qiu, Z Huang, J Fu
arXiv preprint arXiv:2310.10908, 2023
32023
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts
H Zhao, Z Qiu, H Wu, Z Wang, Z He, J Fu
arXiv preprint arXiv:2402.12656, 2024
2024
Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers
Z Qiu, Z Huang, Y Huang, J Fu
Tiny Paper @ ICLR 2024, 2024
2024
Heterogenous Memory Augmented Neural Networks
Z Qiu, Z Liu, S Yan, S Zhang, J Fu
arXiv preprint arXiv:2310.10909, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–5