Yuan Gong

Cited by

	All	Since 2019
Citations	2039	2010
h-index	18	18
i10-index	21	21

840

420

210

630

201820192020202120222023202423 54 135 178 471 827 336

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Co-authors

James GlassMIT Computer Science and Artificial Intelligence LaboratoryVerified email at mit.edu
Christian PoellabauerProfessor, Florida International UniversityVerified email at cs.fiu.edu
Yu-An ChungFacebook AI Research (FAIR)Verified email at fb.com
Alexander H. LiuMassachusetts Institute of TechnologyVerified email at mit.edu
Leonid KarlinskyPrincipal Research Scientist, MIT-IBM Watson AI Lab, IBM ResearchVerified email at ibm.com
Andrew RouditchenkoPhD Student at MIT CSAILVerified email at mit.edu
Hongyin LuoMIT CSAILVerified email at mit.edu
Bryan (Ning) XiaResearch Scientist, MicrosoftVerified email at microsoft.com
Yizhe ZhangNanjing University of Science and TechnologyVerified email at njust.edu.cn
Sameer KhuranaMitsubishi Electric Research Lab (MERL); MIT PhDVerified email at mit.edu
Cheng-I Jeff LaiMassachusetts Institute of TechnologyVerified email at mit.edu
Jian YangResearch Scientist, MetaVerified email at meta.com
Yoon KimAssistant Professor, MITVerified email at mit.edu
Yung-Sung ChuangMassachusetts Institute of TechnologyVerified email at mit.edu
David HarwathThe University of Texas at AustinVerified email at utexas.edu
Hilde KuehneUniversity of Bonn , MIT-IBM Watson LabVerified email at uni-bonn.de
Yiyu ShiUniversity of Notre DameVerified email at nd.edu
Boyang LiUniversity of Notre DameVerified email at nd.edu
Peng ChangPAII Inc.Verified email at paii-labs.com
Jin YuTeradyne/Northeastern UniversityVerified email at northeastern.edu

Yuan Gong

Research Scientist, MIT CSAIL

Verified email at mit.edu - Homepage

Audio Processing Speech Processing Natural Language Processing Large Language Models


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
AST: Audio Spectrogram Transformer Y Gong, YA Chung, J Glass Interspeech 2021, 2021	675	2021
Second-order non-local attention networks for person re-identification BN Xia, Y Gong, Y Zhang, C Poellabauer ICCV 2019, 3760-3769, 2019	231	2019
SSAST: Self-Supervised Audio Spectrogram Transformer Y Gong, CIJ Lai, YA Chung, J Glass AAAI 2022, 2022	207	2022
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation Y Gong, YA Chung, J Glass IEEE Transactions on Audio, Speech, and Language Processing, 2021	141	2021
Topic modeling based multi-modal depression detection Y Gong, C Poellabauer Proceedings of the 7th annual workshop on Audio/Visual emotion challenge, 69-76, 2017	137	2017
Crafting adversarial examples for speech paralinguistics applications Y Gong, C Poellabauer Proceedings of 2018 DYnamic and Novel Advances in Machine Learning and …, 2017	118	2017
Contrastive Audio-Visual Masked Autoencoder Y Gong, A Rouditchenko, AH Liu, D Harwath, L Karlinsky, H Kuehne, ... ICLR 2023, 2022	69	2022
Real-time Adversarial Attacks Y Gong, B Li, C Poellabauer, Y Shi IJCAI 2019, 2019	59	2019
Transformer-based multi-aspect multi-granularity non-native english speaker pronunciation assessment Y Gong, Z Chen, IH Chu, P Chang, J Glass ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	45	2022
Listen, Think, and Understand Y Gong, H Luo, AH Liu, L Karlinsky, J Glass ICLR 2024, 2023	41	2023
ReMASC: realistic replay attack corpus for voice controlled systems Y Gong, J Yang, J Huber, M MacKnight, C Poellabauer Interspeech 2019, 2019	40	2019
An overview of vulnerabilities of voice controlled systems Y Gong, C Poellabauer 1st International Workshop on Security and Privacy for the Internet-of …, 2018	38	2018
Protecting voice controlled systems using sound source identification based on acoustic cues Y Gong, C Poellabauer 2018 27th International Conference on Computer Communication and Networks …, 2018	36	2018
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers Y Gong, S Khurana, L Karlinsky, J Glass Interspeech 2023, 2023	30	2023
Detecting replay attacks using multi-channel audio: A neural network-based method Y Gong, J Yang, C Poellabauer IEEE Signal Processing Letters 27, 920-924, 2020	27	2020
Cmkd: Cnn/transformer-based cross-model knowledge distillation for audio classification Y Gong, S Khurana, A Rouditchenko, J Glass arXiv preprint arXiv:2203.06760, 2022	25	2022
Impact of Aliasing on Deep CNN-Based End-to-End Acoustic Models Y Gong, C Poellabauer Interspeech 2018, 2698-2702, 2018	23	2018
Vocalsound: A dataset for improving human vocal sounds recognition Y Gong, J Yu, J Glass ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	20	2022
Automatic autism spectrum disorder detection using everyday vocalizations captured by smart devices Y Gong, H Yatawatte, C Poellabauer, S Schneider, S Latham Proceedings of the 2018 ACM international conference on bioinformatics …, 2018	17	2018
Search augmented instruction learning H Luo, T Zhang, YS Chuang, Y Gong, Y Kim, X Wu, H Meng, J Glass Findings of the Association for Computational Linguistics: EMNLP 2023, 3717-3729, 2023	15*	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors