Follow
Ziming Liu
Title
Cited by
Cited by
Year
Hanayo: Harnessing Wave-like Pipeline Parallelism for Enhanced Large Model Training Efficiency
Z Liu, S Cheng, H Zhou, Y You
SC '23: Proceedings of the International Conference for High Performance …, 2023
82023
EnergonAI: An inference system for 10-100 billion parameter transformer models
J Du, Z Liu, J Fang, S Li, Y Li, Y Lu, Y You
arXiv preprint arXiv:2209.02341, 2022
32022
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers
X Zhao, S Cheng, Z Zheng, Z Yang, Z Liu, Y You
arXiv preprint arXiv:2403.10266, 2024
2024
HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices
X Zhao, B Jia, H Zhou, Z Liu, S Cheng, Y You
arXiv preprint arXiv:2403.01164, 2024
2024
ATP: Adaptive Tensor Parallelism for Foundation Models
S Cheng, Z Liu, J Du, Y You
arXiv preprint arXiv:2301.08658, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–5