Latent Weight Quantization for Integerized Training of Deep Neural Networks

Existing methods for integerized training speed up deep learning by using low-bitwidth integerized weights, activations, gradients, and optimizer buffers. However, they overlook the issue of full-precision latent weights, which consume excessive memory to accumulate gradient-based updates for optimi...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on pattern analysis and machine intelligence. - 1979. - 47(2025), 4 vom: 15. Apr., Seite 2816-2832
1. Verfasser:	Fei, Wen (VerfasserIn)
Weitere Verfasser:	Dai, Wenrui, Zhang, Liang, Zhang, Luoming, Li, Chenglin, Zou, Junni, Xiong, Hongkai
Format:	Online-Aufsatz
Sprache:	English
Veröffentlicht:	2025
Zugriff auf das übergeordnete Werk:	IEEE transactions on pattern analysis and machine intelligence
Schlagworte:	Journal Article

Online verfügbar	Volltext