Follow
Xiangming Gu
Title
Cited by
Cited by
Year
Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
L Ou*, X Gu*, Y Wang
International Society for Music Information Retrieval Conference, 2022
212022
On memorization in diffusion models
X Gu, C Du, T Pang, C Li, M Lin, Y Wang
arXiv preprint arXiv:2310.02664, 2023
142023
MM-ALT: A multimodal automatic lyric transcription system
X Gu, L Ou, D Ong, Y Wang
Proceedings of the 30th ACM International Conference on Multimedia, 3328-3337, 2022
122022
Boosting monocular 3d human pose estimation with part aware attention
Y Xue, J Chen, X Gu, H Ma, H Ma
IEEE Transactions on Image Processing 31, 4278-4291, 2022
122022
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
X Gu, X Zheng, T Pang, C Du, Q Liu, Y Wang, J Jiang, M Lin
International Conference on Machine Learning, 2024
92024
Laser endoscopic manipulator using spring-reinforced multi-DoF soft actuator
B Zhang, P Yang, X Gu, H Liao
IEEE Robotics and Automation Letters 6 (4), 7736-7743, 2021
72021
Distilling a deep neural network into a Takagi-Sugeno-Kang fuzzy inference system
X Gu, X Cheng
arXiv preprint arXiv:2010.04974, 2020
72020
Extrapolative continuous-time bayesian neural network for fast training-free test-time adaptation
H Huang, X Gu, H Wang, C Xiao, H Liu, Y Wang
Advances in Neural Information Processing Systems 35, 36000-36013, 2022
62022
Elucidate gender fairness in singing voice transcription
X Gu, W Zeng, Y Wang
Proceedings of the 31st ACM International Conference on Multimedia, 8760-8769, 2023
32023
Deep audio-visual singing voice transcription based on self-supervised learning models
X Gu, W Zeng, J Zhang, L Ou, Y Wang
arXiv preprint arXiv:2304.12082, 2023
22023
Disentangled adversarial domain adaptation for phonation mode detection in singing and speech
Y Wang, W Wei, X Gu, X Guan, Y Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
12023
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
X Gu, L Ou, W Zeng, J Zhang, N Wong, Y Wang
ACM Transactions on Multimedia Computing, Communications and Applications, 2024
2024
Unsupervised Mismatch Localization in Cross-Modal Sequential Data with Application to Mispronunciations Localization
W Wei, H Huang, X Gu, H Wang, Y Wang
Transactions on Machine Learning Research, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–13