Follow
Nicholas Lee
Title
Cited by
Cited by
Year
Squeezeformer: An efficient transformer for automatic speech recognition
S Kim, A Gholami, A Shaw, N Lee, K Mangalam, J Malik, MW Mahoney, ...
Advances in Neural Information Processing Systems 35, 9361-9373, 2022
1122022
S-lora: Serving thousands of concurrent lora adapters
Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ...
arXiv preprint arXiv:2311.03285, 2023
542023
An LLM compiler for parallel function calling
S Kim, S Moon, R Tabrizi, N Lee, MW Mahoney, K Keutzer, A Gholami
arXiv preprint arXiv:2312.04511, 2023
262023
Integer-only zero-shot quantization for efficient speech recognition
S Kim, A Gholami, Z Yao, N Lee, P Wang, A Nrusimha, B Zhai, T Gao, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
232022
Llm2llm: Boosting llms with novel iterative data enhancement
N Lee, T Wattanawong, S Kim, K Mangalam, S Shen, G Anumanchipali, ...
arXiv preprint arXiv:2403.15042, 2024
212024
Tinyagent: Function calling at the edge
LE Erdogan, N Lee, S Jha, S Kim, R Tabrizi, S Moon, C Hooper, ...
arXiv preprint arXiv:2409.00608, 2024
32024
SLoRA: Scalable Serving of Thousands of LoRA Adapters
Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ...
Proceedings of Machine Learning and Systems 6, 296-311, 2024
22024
Sylber: Syllabic Embedding Representation of Speech from Raw Audio
CJ Cho, N Lee, A Gupta, D Agarwal, E Chen, AW Black, ...
arXiv preprint arXiv:2410.07168, 2024
2024
TinyAgent: Function Calling at the Edge
L Eren Erdogan, N Lee, S Jha, S Kim, R Tabrizi, S Moon, C Hooper, ...
arXiv e-prints, arXiv: 2409.00608, 2024
2024
Exploring the Limits of Small Language Models
N Lee, K Keutzer, GK Anumanchipalli
2023
The system can't perform the operation now. Try again later.
Articles 1–10