‪Jiongxiao Wang‬ - ‪Google Scholar‬

Cited by

	All	Since 2019
Citations	150	150
h-index	6	6
i10-index	5	5

Citations per year: 2022: 3, 2023: 74, 2024: 73

Public access (based on funding mandates)

2 articles available

0 articles not available

Jiongxiao Wang

PhD Student, University of Wisconsin–Madison

Verified email at wisc.edu

Trustworthy Machine Learning, Diffusion Model, Large Language Model, AI for Science


Title	Cited by	Year
Densepure: Understanding diffusion models for adversarial robustness. C Xiao, Z Chen, K Jin, J Wang, W Nie, M Liu, A Anandkumar, B Li, D Song. The Eleventh International Conference on Learning Representations, 2022	37*	2022
Adversarial demonstration attacks on large language models. J Wang, Z Liu, KH Park, M Chen, C Xiao. arXiv preprint arXiv:2305.14950, 2023	29	2023
On the exploitability of instruction tuning. M Shu, J Wang, C Zhu, J Geiping, C Xiao, T Goldstein. Advances in Neural Information Processing Systems 36, 2024	24	2024
Conversational Drug Editing Using Retrieval and Domain Feedback. S Liu, J Wang, Y Yang, C Wang, L Liu, H Guo, C Xiao. The Twelfth International Conference on Learning Representations, 2023	23*	2023
Defending against adversarial audio via diffusion model. S Wu, J Wang, W Ping, W Nie, C Xiao. arXiv preprint arXiv:2303.01507, 2023	16	2023
Fast and reliable evaluation of adversarial robustness with minimum-margin attack. R Gao, J Wang, K Zhou, F Liu, B Xie, G Niu, B Han, J Cheng. International Conference on Machine Learning, 7144-7163, 2022	9	2022
A critical revisit of adversarial robustness in 3D point cloud recognition with diffusion-driven purification. J Sun, J Wang, W Nie, Z Yu, Z Mao, C Xiao. International Conference on Machine Learning, 33100-33114, 2023	5	2023
Test-time backdoor mitigation for black-box large language models with defensive demonstrations. W Mo, J Xu, Q Liu, J Wang, J Yan, C Xiao, M Chen. arXiv preprint arXiv:2311.09763, 2023	3	2023
On the exploitability of reinforcement learning with human feedback for large language models. J Wang, J Wu, M Chen, Y Vorobeychik, C Xiao. arXiv preprint arXiv:2311.09641, 2023	3	2023
Mitigating Fine-tuning Jailbreak Attack with Backdoor Enhanced Alignment. J Wang, J Li, Y Li, X Qi, M Chen, J Hu, Y Li, B Li, C Xiao. arXiv preprint arXiv:2402.14968, 2024	1	2024
Preference Poisoning Attacks on Reward Model Learning. J Wu, J Wang, C Xiao, C Wang, N Zhang, Y Vorobeychik. arXiv preprint arXiv:2402.01920, 2024		2024
