FastFold: Optimizing AlphaFold Training and Inference on GPU Clusters S Cheng, X Zhao, G Lu, J Fang, T Zheng, R Wu, X Zhang, J Peng, Y You Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024 | 21* | 2024 |
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers X Zhao, S Cheng, Z Zheng, Z Yang, Z Liu, Y You arXiv preprint arXiv:2403.10266, 2024 | | 2024 |
HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices X Zhao, B Jia, H Zhou, Z Liu, S Cheng, Y You Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024 | | 2024 |
AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference X Zhao, S Cheng, G Lu, J Fang, H Zhou, B Jia, Z Liu, Y You Proceedings of the 12th International Conference on Learning Representations, 2024 | | 2024 |