Authors
Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan
Publication date
2015
Conference
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
Pages
3156-3164
Description
Abstract: Automatically describing the content of an image is a fundamental problem in
artificial intelligence that connects computer vision and natural language processing. In this
paper, we present a generative model based on a deep recurrent architecture that combines
recent advances in computer vision and machine translation and that can be used to
generate natural sentences describing an image. The model is trained to maximize the
likelihood of the target description sentence given the training image. Experiments on ...