Follow
Olivier Pietquin
Olivier Pietquin
Cohere | ex Google DeepMind (On leave - Professor at University of Lille)
Verified email at univ-lille.fr - Homepage
Title
Cited by
Cited by
Year
Deep q-learning from demonstrations
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
12102018
Noisy Networks for Exploration
SL Meire Fortunato, Mohammad Gheshlaghi Azar, Bilal Piot, Jacob Menick, Ian ...
International Conference on Learning Representations (ICLR), 2018
1163*2018
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards
M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ...
arXiv preprint arXiv:1707.08817, 2017
7742017
Modulating early visual processing by language
H De Vries, F Strub, J Mary, H Larochelle, O Pietquin, AC Courville
Advances in neural information processing systems 30, 2017
5482017
Guesswhat?! visual object discovery through multi-modal dialogue
H De Vries, F Strub, S Chandar, O Pietquin, H Larochelle, A Courville
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
4502017
Audiolm: a language modeling approach to audio generation
Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ...
IEEE/ACM transactions on audio, speech, and language processing 31, 2523-2533, 2023
3472023
Listen and translate: A proof of concept for end-to-end speech-to-text translation
A Bérard, O Pietquin, C Servan, L Besacier
arXiv preprint arXiv:1612.01744, 2016
3152016
A theory of regularized markov decision processes
M Geist, B Scherrer, O Pietquin
International Conference on Machine Learning, 2160-2169, 2019
3082019
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
2382020
What matters for on-policy deep actor-critic methods? A large-scale empirical study
M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ...
arXiv preprint arXiv:2006.05990, 2020
235*2020
End-to-end automatic speech translation of audiobooks
A Bérard, L Besacier, AC Kocabiyikoglu, O Pietquin
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
2182018
A probabilistic framework for dialog simulation and optimal strategy learning
O Pietquin, T Dutoit
IEEE Transactions on Audio, Speech, and Language Processing 14 (2), 589-599, 2006
2032006
Learning from demonstrations for real world reinforcement learning
T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ...
arXiv preprint arXiv:1704.03732, 2017, 2018
1892018
What matters for on-policy deep actor-critic methods? a large-scale study
M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ...
International conference on learning representations, 2020
1632020
Machine learning for spoken dialogue systems
O Lemon, O Pietquin
European Conference on Speech Communication and Technologies (Interspeech'07 …, 2007
1502007
A framework for unsupervised learning of dialogue strategies
O Pietquin
Presses univ. de Louvain, 2005
1482005
A survey on metrics for the evaluation of user simulations
O Pietquin, H Hastie
The knowledge engineering review 28 (1), 59-73, 2013
1402013
Observe and look further: Achieving consistent performance on atari
T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ...
arXiv preprint arXiv:1805.11593, 2018
1362018
Primal wasserstein imitation learning
R Dadashi, L Hussenot, M Geist, O Pietquin
arXiv preprint arXiv:2006.04678, 2020
1272020
Kalman temporal differences
M Geist, O Pietquin
Journal of artificial intelligence research 39, 483-532, 2010
1232010
The system can't perform the operation now. Try again later.
Articles 1–20