The Loss Surface of Deep Linear Networks Viewed Through the Algebraic Geometry Lens

By using the viewpoint of modern computational algebraic geometry, we explore properties of the optimization landscapes of deep linear neural network models. After providing clarification on the various definitions of "flat" minima, we show that the geometrically flat minima, which are mer...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 9 vom: 06. Sept., Seite 5664-5680
1. Verfasser: Mehta, Dhagash (VerfasserIn)
Weitere Verfasser: Chen, Tianran, Tang, Tingting, Hauenstein, Jonathan D
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2022
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article