Authors
Tianqi Chen, Ian Goodfellow, Jonathon Shlens
Publication date
2015/11/18
Journal
arXiv preprint arXiv:1511.05641
Description
Abstract: We introduce techniques for rapidly transferring the information stored in one
neural net into another neural net. The main purpose is to accelerate the training of a
significantly larger neural net. During real-world workflows, one often trains very many
different neural networks during the experimentation and design process. This is a wasteful
process in which each new model is trained from scratch. Our Net2Net technique
accelerates the experimentation process by instantaneously transferring the knowledge ...
Total citations
[Citations-per-year chart; years shown: 2015, 2016]
Scholar articles
T Chen, I Goodfellow, J Shlens - arXiv preprint arXiv:1511.05641, 2015