Follow
Zihan Qiu
Title
Cited by
Cited by
Year
Supported policy optimization for offline reinforcement learning
J Wu, H Wu, Z Qiu, J Wang, M Long
Advances in Neural Information Processing Systems 35, 31278-31291, 2022
362022
Emergent Mixture-of-Experts: Can Dense Pre-trained Transformers Benefit from Emergent Modular Structures?
Z Qiu, Z Huang, J Fu
arXiv preprint arXiv:2310.10908, 2023
32023
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts
H Zhao, Z Qiu, H Wu, Z Wang, Z He, J Fu
arXiv preprint arXiv:2402.12656, 2024
2024
Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers
Z Qiu, Z Huang, Y Huang, J Fu
Tiny Paper @ ICLR 2024, 2024
2024
Heterogenous Memory Augmented Neural Networks
Z Qiu, Z Liu, S Yan, S Zhang, J Fu
arXiv preprint arXiv:2310.10909, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–5