Follow
Luning Wang
Luning Wang
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Cited by
Year
LLM-MQ: Mixed-precision Quantization for Efficient LLM Deployment
S Li, X Ning, K Hong, T Liu, L Wang, X Li, K Zhong, G Dai, H Yang, ...
5
A survey on efficient inference for large language models
Z Zhou, X Ning, K Hong, T Fu, J Xu, S Li, Y Lou, L Wang, Z Yuan, X Li, ...
arXiv preprint arXiv:2404.14294, 2024
42024
Evaluating Quantized Large Language Models
S Li, X Ning, L Wang, T Liu, X Shi, S Yan, G Dai, H Yang, Y Wang
arXiv preprint arXiv:2402.18158, 2024
32024
The system can't perform the operation now. Try again later.
Articles 1–3