Follow
Jiaming Tang
Jiaming Tang
Ph.D. student, MIT
Verified email at mit.edu - Homepage
Title
Cited by
Cited by
Year
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
J Lin*, J Tang*, H Tang, S Yang, X Dang, S Han
MLSys 2024, 2023
269*2023
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization
C Guo*, J Tang*, W Hu, J Leng, C Zhang, F Yang, Y Liu, M Guo, Y Zhu
ISCA 2023, 2023
372023
Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
J Tang*, Y Zhao*, K Zhu, G Xiao, B Kasikci, S Han
ICML 2024, 2024
22024
The system can't perform the operation now. Try again later.
Articles 1–3