Follow
Jiaming Tang
Jiaming Tang
Verified email at sjtu.edu.cn - Homepage
Title
Cited by
Cited by
Year
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
J Lin*, J Tang*, H Tang, S Yang, X Dang, S Han
MLSys 2024, 2023
1862023
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization
C Guo*, J Tang*, W Hu, J Leng, C Zhang, F Yang, Y Liu, M Guo, Y Zhu
ISCA 2023, 2023
322023
The system can't perform the operation now. Try again later.
Articles 1–2