Gptfuzzer: Red teaming large language models with auto-generated jailbreak prompts J Yu, X Lin, Z Yu, X Xing arXiv preprint arXiv:2309.10253, 2023 | 109 | 2023 |
Voiceprint mimicry attack towards speaker verification system in smart home L Zhang, Y Meng, J Yu, C Xiang, B Falk, H Zhu IEEE INFOCOM 2020-IEEE conference on computer communications, 377-386, 2020 | 49 | 2020 |
Speedup robust graph structure learning with low-rank information H Xu, L Xiang, J Yu, A Cao, X Wang Proceedings of the 30th ACM International Conference on Information …, 2021 | 22 | 2021 |
Assessing prompt injection risks in 200+ custom gpts J Yu, Y Wu, D Shu, M Jin, X Xing ICLR 2024 Workshop on Secure and Trustworthy Large Language Models, 2023 | 18 | 2023 |
Matrix gaussian mechanisms for differentially-private learning J Yang, L Xiang, J Yu, X Wang, B Guo, Z Li, B Li IEEE Transactions on Mobile Computing 22 (2), 1036-1048, 2021 | 10 | 2021 |
Research on Application of Artificial Intelligence Technology in Electrical Automation Control C Jiang, X Xiong, T Zhu, J Cao, J Yu Journal of Physics: Conference Series 1601 (5), 052006, 2020 | 9 | 2020 |
Statemask: Explaining deep reinforcement learning through state mask Z Cheng, X Wu, J Yu, W Sun, W Guo, X Xing Advances in Neural Information Processing Systems 36, 2024 | 3 | 2024 |
{AIRS}: Explanation for Deep Reinforcement Learning based Security Applications J Yu, W Guo, Q Qin, G Wang, T Wang, X Xing 32nd USENIX Security Symposium (USENIX Security 23), 7375-7392, 2023 | 3 | 2023 |
Decoupled Alignment for Robust Plug-and-Play Adaptation H Luo, J Yu, W Zhang, J Li, JYC Hu, X Xin, H Liu arXiv preprint arXiv:2406.01514, 2024 | | 2024 |
Enhancing Jailbreak Attack Against Large Language Models through Silent Tokens J Yu, H Luo, J Yao-Chieh, W Guo, H Liu, X Xing arXiv preprint arXiv:2405.20653, 2024 | | 2024 |
RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation Z Cheng, X Wu, J Yu, S Yang, G Wang, X Xing Proceedings of the 41st International Conference on Machine Learning, 2024 | | 2024 |
BandFuzz: A Practical Framework for Collaborative Fuzzing with Reinforcement Learning W Shi, H Li, J Yu, W Guo, X Xing | | 2024 |