SpotSDC : Revealing the Silent Data Corruption Propagation in High-Performance Computing Systems

The trend of rapid technology scaling is expected to make the hardware of high-performance computing (HPC) systems more susceptible to computational errors due to random bit flips. Some bit flips may cause a program to crash or have a minimal effect on the output, but others may lead to silent data...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on visualization and computer graphics. - 1996. - 27(2021), 10 vom: 30. Okt., Seite 3938-3952
1. Verfasser: Li, Zhimin (VerfasserIn)
Weitere Verfasser: Menon, Harshitha, Maljovec, Dan, Livnat, Yarden, Liu, Shusen, Mohror, Kathryn, Bremer, Peer-Timo, Pascucci, Valerio
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2021
Zugriff auf das übergeordnete Werk:IEEE transactions on visualization and computer graphics
Schlagworte:Journal Article