Arda Senocak
Verified email at kaist.ac.kr
Title
Cited by
Year
Learning to localize sound source in visual scenes
A Senocak, TH Oh, J Kim, MH Yang, IS Kweon
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by 362, 2018
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications
A Senocak, TH Oh, J Kim, MH Yang, IS Kweon
IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (5), 1605-1619, 2021
Cited by 53, 2021
Part-based Player Identification using Deep Convolutional Representation and Multi-scale Pooling
A Senocak, TH Oh, J Kim, IS Kweon
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by 50, 2018
Learning Sound Localization Better From Semantically Similar Samples
A Senocak*, H Ryu*, J Kim*, IS Kweon
ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2022
Cited by 30, 2022
Less can be more: Sound source localization with a classification model
A Senocak*, H Ryu*, J Kim*, IS Kweon
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022
Cited by 26, 2022
Sound to visual scene generation by audio-to-visual latent alignment
K Sung-Bin, A Senocak, H Ha, A Owens, TH Oh
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 23, 2023
MarginNCE: Robust Sound Localization with a Negative Margin
S Park*, A Senocak*, JS Chung
ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023
Cited by 12, 2023
Sound source localization is all about cross-modal alignment
A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by 11, 2023
Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding
A Senocak*, J Kim*, TH Oh, D Li, IS Kweon
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023
Cited by 9*, 2023
Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
H Ryu*, A Senocak*, IS Kweon, JS Chung
ICASSP IEEE International Conference on Acoustics, Speech and Signal …, 2023
Cited by 6, 2023
Can CLIP Help Sound Source Localization?
S Park, A Senocak, JS Chung
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
Cited by 3, 2024
FlexiAST: Flexibility is What AST Needs
J Feng, MH Erol, JS Chung, A Senocak
Interspeech, 2023
Cited by 3, 2023
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning
MH Erol, A Senocak, J Feng, JS Chung
arXiv preprint arXiv:2406.03344, 2024
Cited by 2, 2024
From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers
J Feng, MH Erol, JS Chung, A Senocak
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by 1, 2024
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment
A Senocak, H Ryu, J Kim, TH Oh, H Pfister, JS Chung
arXiv preprint arXiv:2407.13676, 2024
2024
ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions
J Feng, MH Erol, JS Chung, A Senocak
arXiv preprint arXiv:2407.08691, 2024
2024
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning
M Hamza Erol, A Senocak, J Feng, J Son Chung
arXiv e-prints, arXiv: 2406.03344, 2024
2024
Speech Guided Masked Image Modeling for Visually Grounded Speech
J Woo, H Ryu, A Senocak, JS Chung
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Learning audio-visual relationships and correspondences in the visual scenes
A Senocak
Korea Advanced Institute of Science and Technology (KAIST), 2022
2022
Nearly-Unsupervised Localization of Sound Sources in Videos
A Senocak, TH Oh, J Kim, MH Yang, IS Kweon
MMTC Communications–Review, 1-15, 2021
2021