关注
Weixin Chen
标题
引用次数
引用次数
年份
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models
B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ...
Advances in Neural Information Processing Systems (NeurIPS), 2023
1132023
Effective Backdoor Defense by Exploiting Sensitivity of Poisoned Samples
W Chen, B Wu, H Wang
Advances in Neural Information Processing Systems (NeurIPS), 2022
412022
TrojDiff: Trojan Attacks on Diffusion Models with Diverse Targets
W Chen, D Song, B Li
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
332023
GRATH: Gradual Self-Truthifying for Large Language Models
W Chen, D Song, B Li
arXiv preprint arXiv:2401.12292, 2024
12024
系统目前无法执行此操作,请稍后再试。
文章 1–4