Follow
Jiaao He
Jiaao He
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Cited by
Year
Prague: High-performance heterogeneity-aware asynchronous decentralized training
Q Luo, J He, Y Zhuo, X Qian
Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020
692020
Fastmoe: A fast mixture-of-expert training system
J He, J Qiu, A Zeng, Z Yang, J Zhai, J Tang
arXiv preprint arXiv:2103.13262, 2021
552021
BaGuaLu: targeting brain scale pretrained models with over 37 million cores
Z Ma, J He, J Qiu, H Cao, Y Wang, Z Sun, L Zheng, H Wang, S Tang, ...
Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022
342022
Fastermoe: modeling and optimizing training of large-scale dynamic pre-trained models
J He, J Zhai, T Antunes, H Wang, F Luo, S Shi, Q Li
Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022
332022
SmartMoE: Efficiently Training Sparsely-Activated Models through Combining Offline and Online Parallelization
M Zhai, J He, Z Ma, Z Zong, R Zhang, J Zhai
2023 USENIX Annual Technical Conference (USENIX ATC 23), 961-975, 2023
82023
Critique of “Planetary Normal Mode Computation: Parallel Algorithms, Performance, and Reproducibility” by SCC Team From Tsinghua University
C Zhang, C Zhao, J He, S Chen, L Zheng, K Huang, W Han, J Zhai
IEEE Transactions on Parallel and Distributed Systems 32 (11), 2631-2634, 2021
22021
Efficiently emulating high-bitwidth computation with low-bitwidth hardware
Z Ma, H Wang, G Feng, C Zhang, L Xie, J He, S Chen, J Zhai
Proceedings of the 36th ACM International Conference on Supercomputing, 1-12, 2022
12022
FastDecode: High-Throughput GPU-Efficient LLM Serving using Heterogeneous Pipelines
J He, J Zhai
arXiv preprint arXiv:2403.11421, 2024
2024
POSTER: Pattern-Aware Sparse Communication for Scalable Recommendation Model Training
J He, S Chen, J Zhai
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024
2024
Student Cluster Competition 2018, Team Tsinghua University: Reproducing performance of multi-physics simulations of the Tsunamigenic 2004 Sumatra megathrust earthquake on the …
J He, C Zhao, J Yu, X Yu, L Zheng, C Lou, S Tang, W Han, J Zhai
Parallel Computing 90, 102570, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–10