Approximate Fisher Information Matrix to Characterize the Training of Deep Neural Networks

In this paper, we introduce a novel methodology for characterizing the performance of deep learning networks (ResNets and DenseNet) with respect to training convergence and generalization as a function of mini-batch size and learning rate for image classification. This methodology is based on novel...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 42(2020), 1 vom: 16. Jan., Seite 15-26
1. Verfasser: Liao, Zhibin (VerfasserIn)
Weitere Verfasser: Drummond, Tom, Reid, Ian, Carneiro, Gustavo
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2020
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article Research Support, Non-U.S. Gov't