Group Preference Optimization: Few-Shot Alignment of Large Language Models S Zhao, J Dang, A Grover International Conference on Learning Representations (ICLR 2024), 2023 | 9 | 2023 |
Object Insertion Based Data Augmentation for Semantic Segmentation Y Ren, S Zhao, L Bingbing 2022 IEEE International Conference on Robotics and Automation (ICRA), 2022 | 8 | 2022 |
Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models S Zhao, A Grover Conference on Neural Information Processing Systems (NeurIPS 2023), 2023 | 2 | 2023 |
Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models S Zhao, D Israel, GV Broeck, A Grover arXiv preprint arXiv:2404.09529, 2024 | 1 | 2024 |
One demonstration imitation learning BC Stadie*, S Zhao*, Q Xu, B Li, L Zhang | 1 | 2020 |