Arda Senocak

Cited by

	All	Since 2019
Citations	591	576
h-index	9	9
i10-index	8	8

160

120

201820192020202120222023202412 38 62 84 103 140 149

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

In So KweonKAISTVerified email at kaist.ac.kr
Tae-Hyun OhAssociate Professor, POSTECHVerified email at csail.mit.edu
Junsik KimHarvard UniversityVerified email at seas.harvard.edu
Ming-Hsuan YangUniversity of California at Merced; Google DeepMindVerified email at ucmerced.edu
Hyeonggon RyuKAISTVerified email at kaist.ac.kr
Joon Son ChungKAISTVerified email at kaist.ac.kr
Andrew OwensAssistant Professor, University of MichiganVerified email at mit.edu
Dingzeyu LiSenior Research Scientist @ Adobe ResearchVerified email at adobe.com

Arda Senocak

KAIST

Verified email at kaist.ac.kr - Homepage

Computer Vision Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Learning to localize sound source in visual scenes A Senocak, TH Oh, J Kim, MH Yang, IS Kweon Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018	362	2018
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications A Senocak, TH Oh, J Kim, MH Yang, IS Kweon IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (5), 1605-1619, 2021	53	2021
Part-based Player Identification using Deep Convolutional Representation and Multi-scale Pooling A Senocak, TH Oh, J Kim, IS Kweon Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018	50	2018
Learning Sound Localization Better From Semantically Similar Samples A Senocak, H Ryu, J Kim*, IS Kweon ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2022	30	2022
Less can be more: Sound source localization with a classification model A Senocak, H Ryu, J Kim*, IS Kweon Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022	26	2022
Sound to visual scene generation by audio-to-visual latent alignment K Sung-Bin, A Senocak, H Ha, A Owens, TH Oh Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	23	2023
MarginNCE: Robust Sound Localization with a Negative Margin S Park, A Senocak, JS Chung ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023	12	2023
Sound source localization is all about cross-modal alignment A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	11	2023
Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding A Senocak, J Kim, TH Oh, D Li, IS Kweon Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023	9*	2023
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples H Ryu, A Senocak, IS Kweon, JS Chung ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023	6	2023
Can CLIP Help Sound Source Localization? S Park, A Senocak, JS Chung Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024	3	2024
FlexiAST: Flexibility is What AST Needs J Feng, MH Erol, JS Chung, A Senocak Interspeech, 2023	3	2023
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning MH Erol, A Senocak, J Feng, JS Chung arXiv preprint arXiv:2406.03344, 2024	2	2024
From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers J Feng, MH Erol, JS Chung, A Senocak ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	1	2024
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung arXiv preprint arXiv:2407.13676, 2024		2024
ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions J Feng, MH Erol, JS Chung, A Senocak arXiv preprint arXiv:2407.08691, 2024		2024
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning M Hamza Erol, A Senocak, J Feng, J Son Chung arXiv e-prints, arXiv: 2406.03344, 2024		2024
Speech Guided Masked Image Modeling for Visually Grounded Speech J Woo, H Ryu, A Senocak, JS Chung ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Learning audio-visual relationships and correspondences in the visual scenes A Senocak 한국과학기술원, 2022		2022
Nearly-Unsupervised Localization of Sound Sources in Videos A Senocak, TH Oh, J Kim, MH Yang, IS Kweon MMTC Communications–Review, 1-15, 2021		2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors