Authors
Jianmin Chen, Rajat Monga, Samy Bengio, Rafal Jozefowicz
Publication date
2016/4/4
Journal
arXiv preprint arXiv:1604.00981
Description
Abstract: The recent success of deep learning approaches for domains like speech
recognition (Hinton et al., 2012) and computer vision (Ioffe & Szegedy, 2015) stems from
many algorithmic improvements but also from the fact that the size of available training data
has grown significantly over the years, together with the computing power, in terms of both
CPUs and GPUs. While a single GPU often provides algorithmic simplicity and speed up to a
given scale of data and model, there exists an operating point where a distributed ...
Total citations
20161
Scholar articles
J Chen, R Monga, S Bengio, R Jozefowicz - arXiv preprint arXiv:1604.00981, 2016