Survey of vulnerabilities in large language models revealed by adversarial attacks | E Shayegani, MAA Mamun, Y Fu, P Zaree, Y Dong, N Abu-Ghazaleh | arXiv preprint arXiv:2310.10844, 2023 | 29 | 2023 |
Watermarking conditional text generation for AI detection: Unveiling challenges and a semantic-aware watermark remedy | Y Fu, D Xiong, Y Dong | AAAI 2024, 2023 | 11 | 2023 |
An efficient policy evaluation engine for XACML policy management | F Deng, Z Yu, W Liu, X Luo, Y Fu, B Qiang, C Xu, Z Li | Information Sciences 547, 1105-1121, 2021 | 4 | 2021 |
Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack | Y Fu, Y Li, W Xiao, C Liu, Y Dong | arXiv preprint arXiv:2312.06924, 2023 | 1 | 2023 |
Inverse Reinforcement Learning for Text Summarization | Y Fu, D Xiong, Y Dong | Findings of EMNLP 2023, 2023 | | 2023 |
MetaXCR: Reinforcement-Based Meta-Transfer Learning for Cross-Lingual Commonsense Reasoning | J He, Y Fu | Transfer Learning for Natural Language Processing Workshop, 74-87, 2023 | | 2023 |