Kaiyan Zhang
PhD Student at Tsinghua University
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
BoB: BERT over BERT for training persona-based dialogue models from limited personalized data
H Song, Y Wang, K Zhang, WN Zhang, T Liu
ACL 2021, 2021
Cited by: 123; Year: 2021
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
X Zhu, B Qi, K Zhang, X Long, Z Lin, B Zhou
NAACL 2024, 2023
Cited by: 31*; Year: 2023
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
B Qi*, K Zhang*, K Tian, H Li, ZR Chen, S Zeng, E Hua, H Jinfang, B Zhou
COLM 2024, 2024
Cited by: 20*; Year: 2024
Generative Multi-Modal Knowledge Retrieval with Large Language Models
X Long, J Zeng, F Meng, Z Ma, K Zhang, B Zhou, J Zhou
AAAI 2024, 2024
Cited by: 12; Year: 2024
Ultramedical: Building specialized generalists in biomedicine
K Zhang, S Zeng, E Hua, N Ding, ZR Chen, Z Ma, H Li, G Cui, B Qi, X Zhu, ...
NeurIPS 2024 D&B Track, 2024
Cited by: 9; Year: 2024
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following
K Zhang, J Wang, E Hua, B Qi, N Ding, B Zhou
ACL 2024, 2024
Cited by: 9; Year: 2024
A static and dynamic attention framework for multi turn dialogue generation
W Zhang, Y Cui, K Zhang, Y Wang, Q Zhu, L Li, T Liu
ACM Transactions on Information Systems 41 (1), 1-30, 2023
Cited by: 7; Year: 2023
Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing
B Qi, P Li, F Li, J Gao, K Zhang, B Zhou
Preprint, 2024
Cited by: 4; Year: 2024
SMR: State Memory Replay for Long Sequence Modeling
B Qi, J Gao, K Zhang, D Li, J Liu, L Wu, B Zhou
Findings of ACL 2024, 2024
Cited by: 4*; Year: 2024
A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
H Song, WN Zhang, K Zhang, T Liu
ACM Transactions on Information Systems 41 (3), 1-36, 2023
Cited by: 4; Year: 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
K Zhang, N Ding, B Qi, X Zhu, X Long, B Zhou
EMNLP 2023, 2023
Cited by: 3; Year: 2023
Towards Building Specialized Generalist AI with System 1 and System 2 Fusion
K Zhang, B Qi, B Zhou
Preprint, 2024
Cited by: 2; Year: 2024
Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
E Hua, B Qi, K Zhang, Y Yu, N Ding, X Lv, K Tian, B Zhou
Preprint, 2024
Cited by: 2*; Year: 2024
A survey of multi-party dialogue research based on deep learning
K Zhang, WN Zhang, T Liu
SCIENTIA SINICA Informationis 51 (8), 1217-1232, 2021
Cited by: 2*; Year: 2021
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding
K Zhang, J Wang, N Ding, B Qi, E Hua, X Lv, B Zhou
Preprint, 2024
Cited by: 1; Year: 2024
Automating Exploratory Proteomics Research via Language Models
N Ding, S Qu, L Xie, Y Li, Z Liu, K Zhang, Y Xiong, Y Zuo, Z Chen, E Hua, ...
arXiv preprint arXiv:2411.03743, 2024
Year: 2024
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
X Lv, N Ding, K Zhang, E Hua, G Cui, B Zhou
arXiv preprint arXiv:2411.02063, 2024
Year: 2024
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Z Ma, Y Zhang, G Jia, L Zhao, Y Ma, M Ma, G Liu, K Zhang, J Li, B Zhou
arXiv preprint arXiv:2410.11795, 2024
Year: 2024