Follow
Zalán Borsos
Zalán Borsos
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
MusicLM: Generating Music From Text
A Agostinelli, TI Denk, Z Borsos, J Engel, M Verzetti, A Caillon, Q Huang, ...
arXiv preprint arXiv:2301.11325, 2023
2852023
Audiolm: a language modeling approach to audio generation
Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
2622023
Coresets via Bilevel Optimization for Continual Learning and Streaming
Z Borsos, M Mutný, A Krause
NeurIPS 2020 - Advances in Neural Information Processing Systems, 2020
1822020
AudioPaLM: A Large Language Model That Can Speak and Listen
PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ...
arXiv preprint arXiv:2306.12925, 2023
652023
Speak, read and prompt: High-fidelity text-to-speech with minimal supervision
E Kharitonov, D Vincent, Z Borsos, R Marinier, S Girgin, O Pietquin, ...
Transactions of the Association for Computational Linguistics 11, 1703-1718, 2023
592023
SoundStorm: Efficient Parallel Audio Generation
Z Borsos, M Sharifi, D Vincent, E Kharitonov, N Zeghidour, M Tagliasacchi
arXiv preprint arXiv:2305.09636, 2023
332023
Online Variance Reduction for Stochastic Optimization
Z Borsos, A Krause, KY Levy
Proceedings of the 31st Conference On Learning Theory 75, 324--357, 2018
292018
Dealing with overlap and imbalance: a new metric and approach
Z Borsos, C Lemnaru, R Potolea
Pattern Analysis and Applications, 1-15, 2016
242016
SpeechPainter: Text-conditioned Speech Inpainting
Z Borsos, M Sharifi, M Tagliasacchi
arXiv preprint arXiv:2202.07273, 2022
232022
Online Variance Reduction with Mixtures
Z Borsos, S Curi, KY Levy, A Krause
ICML 2019 - Proceedings of the 36th International Conference on Machine …, 2019
162019
Semi-supervised Batch Active Learning via Bilevel Optimization
Z Borsos, M Tagliasacchi, A Krause
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
152021
Implementing Modular FFTs in FPGAs--A Basic Block for Lattice-Based Cryptography
T Györfi, O Cret, Z Borsos
Digital System Design (DSD), 2013 Euromicro Conference on, 305-308, 2013
112013
Inference of the three-dimensional chromatin structure and its temporal behavior
BC Cristescu, Z Borsos, J Lygeros, MR Martínez, MA Rapsomaniki
arXiv preprint arXiv:1811.09619, 2018
82018
Data Summarization via Bilevel Optimization
Z Borsos, M Mutný, M Tagliasacchi, A Krause
arXiv preprint arXiv:2109.12534, 2021
72021
Disentangling speech from surroundings in a neural audio codec
A Omran, N Zeghidour, Z Borsos, F de Chaumont Quitry, M Slaney, ...
arXiv preprint ArXiv:2203.15578, 2022
62022
MicAugment: One-Shot Microphone Style Transfer
Z Borsos, Y Li, B Gfeller, M Tagliasacchi
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
62021
LMCodec: A Low Bitrate Speech Codec with Causal Transformer Models
T Jenrungrot, M Chinen, WB Kleijn, J Skoglund, Z Borsos, N Zeghidour, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
52023
Disentangling speech from surroundings with neural embeddings
A Omran, N Zeghidour, Z Borsos, F de Chaumont Quitry, M Slaney, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
32023
Transfer NAS: Knowledge Transfer between Search Spaces with Transformer Agents
Z Borsos, A Khorlin, A Gesmundo
arXiv preprint arXiv:1906.08102, 2019
32019
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
H Erdogan, S Wisdom, X Chang, Z Borsos, M Tagliasacchi, N Zeghidour, ...
arXiv preprint arXiv:2308.10415, 2023
22023
The system can't perform the operation now. Try again later.
Articles 1–20