Luning Wang
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
A survey on efficient inference for large language models
Z Zhou, X Ning, K Hong, T Fu, J Xu, S Li, Y Lou, L Wang, Z Yuan, X Li, ...
arXiv preprint arXiv:2404.14294, 2024
Cited by: 10; Year: 2024
Evaluating quantized large language models
S Li, X Ning, L Wang, T Liu, X Shi, S Yan, G Dai, H Yang, Y Wang
arXiv preprint arXiv:2402.18158, 2024
Cited by: 8; Year: 2024
LLM-MQ: Mixed-precision quantization for efficient LLM deployment
S Li, X Ning, K Hong, T Liu, L Wang, X Li, K Zhong, G Dai, H Yang, ...
The Efficient Natural Language and Speech Processing Workshop with NeurIPS 9, 2023
Cited by: 5; Year: 2023