Guangxuan Xiao
Ph.D. student, MIT
Verified email at mit.edu
Title
Cited by
Year
SmoothQuant: Accurate and efficient post-training quantization for large language models
G Xiao, J Lin, M Seznec, H Wu, J Demouth, S Han
International Conference on Machine Learning, 38087-38099, 2023
Cited by: 339; Year: 2023
AWQ: Activation-aware weight quantization for LLM compression and acceleration
J Lin, J Tang, H Tang, S Yang, WM Chen, WC Wang, G Xiao, X Dang, ...
MLSys 2024, 2023
Cited by: 200; Year: 2023
Efficient streaming language models with attention sinks
G Xiao, Y Tian, B Chen, S Han, M Lewis
International Conference on Learning Representations (ICLR), 2024
Cited by: 127; Year: 2024
FastComposer: Tuning-free multi-subject image generation with localized attention
G Xiao, T Yin, WT Freeman, F Durand, S Han
arXiv preprint arXiv:2305.10431, 2023
Cited by: 73; Year: 2023