Suivre
Neil Zeghidour
Neil Zeghidour
Kyutai
Adresse e-mail validée de kyutai.org
Titre
Citée par
Citée par
Année
Fader networks: Manipulating images by sliding attributes
G Lample, N Zeghidour, N Usunier, A Bordes, L Denoyer, MA Ranzato
Advances in Neural Information Processing Systems, 2017
5902017
Soundstream: An end-to-end neural audio codec
N Zeghidour, A Luebs, A Omran, J Skoglund, M Tagliasacchi
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 495-507, 2021
3052021
Musiclm: Generating music from text
A Agostinelli, TI Denk, Z Borsos, J Engel, M Verzetti, A Caillon, Q Huang, ...
arXiv preprint arXiv:2301.11325, 2023
2862023
Audiolm: a language modeling approach to audio generation
Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
2642023
Wavesplit: End-to-end speech separation by speaker clustering
N Zeghidour, D Grangier
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2840-2849, 2021
2522021
Contrastive learning of general-purpose audio representations
A Saeed, D Grangier, N Zeghidour
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
2352021
LEAF: A Learnable Frontend for Audio Classification
N Zeghidour, O Teboul, FC Quitry, M Tagliasacchi
ICLR 2021, 2021
1332021
Learning Filterbanks from Raw Speech for Phone Recognition
N Zeghidour, N Usunier, I Kokkinos, T Schatz, G Synnaeve, E Dupoux
ICASSP 2018, 2017
1252017
Fully convolutional speech recognition
N Zeghidour, Q Xu, V Liptchinsky, N Usunier, G Synnaeve, R Collobert
arXiv preprint arXiv:1812.06864, 2018
1092018
End-to-end speech recognition from the raw waveform
N Zeghidour, N Usunier, G Synnaeve, R Collobert, E Dupoux
Interspeech 2018, 2018
1082018
Sing: Symbol-to-instrument neural generator
A Défossez, N Zeghidour, N Usunier, L Bottou, F Bach
Advances in Neural Information Processing Systems, 2018
762018
Audiopalm: A large language model that can speak and listen
PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ...
arXiv preprint arXiv:2306.12925, 2023
682023
Joint learning of speaker and phonetic similarities with siamese networks.
N Zeghidour, G Synnaeve, N Usunier, E Dupoux
INTERSPEECH, 1295-1299, 2016
632016
Speak, read and prompt: High-fidelity text-to-speech with minimal supervision
E Kharitonov, D Vincent, Z Borsos, R Marinier, S Girgin, O Pietquin, ...
Transactions of the Association for Computational Linguistics 11, 1703-1718, 2023
592023
General-purpose, long-context autoregressive modeling with perceiver AR
C Hawthorne, A Jaegle, C Cangea, S Borgeaud, C Nash, M Malinowski, ...
International Conference on Machine Learning, 8535-8558, 2022
542022
A Deep Scattering Spectrum - Deep Siamese network Pipeline For Unsupervised Acoustic Modeling
N Zeghidour, G Synnaeve, M Versteegh, E Dupoux
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
482016
To reverse the gradient or not: An empirical comparison of adversarial and multi-task learning in speech recognition
Y Adi, N Zeghidour, R Collobert, N Usunier, V Liptchinsky, G Synnaeve
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
412019
Learning strides in convolutional neural networks
R Riad, O Teboul, D Grangier, N Zeghidour
arXiv preprint arXiv:2202.01653, 2022
402022
Multi-instrument music synthesis with spectrogram diffusion
C Hawthorne, I Simon, A Roberts, N Zeghidour, J Gardner, E Manilow, ...
arXiv preprint arXiv:2206.05408, 2022
382022
Deep multi-class learning from label proportions
G Dulac-Arnold, N Zeghidour, M Cuturi, L Beyer, JP Vert
arXiv preprint arXiv:1905.12909, 2019
352019
Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.
Articles 1–20