Massively parallel algorithm and implementation of RI-MP2 energy calculation for peta-scale many-core supercomputers

© 2016 Wiley Periodicals, Inc.

Bibliographic Details
Published in: Journal of computational chemistry. - 1984. - 37(2016), issue 30 (15 Nov.), pages 2623-2633
Main author: Katouda, Michio (Author)
Other authors: Naruse, Akira; Hirano, Yukihiko; Nakajima, Takahito
Format: Online article
Language: English
Published: 2016
Access to the parent work: Journal of computational chemistry
Subjects: Journal Article; Research Support, Non-U.S. Gov't; GPGPU; K computer; NTChem; TSUBAME 2.5; electron correlation theory; massively parallel algorithm; second-order Møller-Plesset perturbation theory
Description
Abstract:
A new parallel algorithm and its implementation for the RI-MP2 energy calculation utilizing peta-flop-class many-core supercomputers are presented. Two improvements over the previous algorithm (J. Chem. Theory Comput. 2013, 9, 5373) have been made: (1) a dual-level hierarchical parallelization scheme that enables the use of more than 10,000 Message Passing Interface (MPI) processes and (2) a new data communication scheme that reduces network communication overhead. A multi-node and multi-GPU implementation of the present algorithm is presented for calculations on a central processing unit (CPU)/graphics processing unit (GPU) hybrid supercomputer. Benchmark results of the new algorithm and its implementation using the K computer (CPU cluster system) and TSUBAME 2.5 (CPU/GPU hybrid system) demonstrate high efficiency. A peak performance of 3.1 PFLOPS is attained using 80,199 nodes of the K computer. The peak performance of the multi-node and multi-GPU implementation is 514 TFLOPS using 1349 nodes and 4047 GPUs of TSUBAME 2.5.
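Illustrative note: the abstract names a dual-level hierarchical parallelization scheme but does not say which loop indices are distributed at each level. The following is a minimal sketch of the general idea only, not the authors' scheme. It assumes the processes are divided into equally sized groups, that a hypothetical outer loop of `n_outer` items is split among groups, and that a hypothetical inner loop of `n_inner` items is split among the ranks inside each group. All function names and sizes are made up for illustration, and no MPI library is used.

```python
def split_evenly(n_items, n_parts, part):
    """Return the half-open range [start, stop) of items owned by `part`.

    The first (n_items % n_parts) parts receive one extra item each.
    """
    base, extra = divmod(n_items, n_parts)
    start = part * base + min(part, extra)
    stop = start + base + (1 if part < extra else 0)
    return start, stop


def two_level_assignment(rank, n_ranks, n_groups, n_outer, n_inner):
    """Map a flat rank to (group, local rank, outer range, inner range).

    Level 1: the outer-loop items are divided among the groups.
    Level 2: the inner-loop items are divided among the ranks of one group.
    Both loop names are assumptions; the abstract does not specify them.
    """
    if n_ranks % n_groups != 0:
        raise ValueError("n_ranks must be a multiple of n_groups")
    group_size = n_ranks // n_groups
    group, local = divmod(rank, group_size)
    outer = split_evenly(n_outer, n_groups, group)
    inner = split_evenly(n_inner, group_size, local)
    return group, local, outer, inner


if __name__ == "__main__":
    # Hypothetical sizes: 12 ranks in 3 groups, 10 outer and 7 inner items.
    for rank in range(12):
        print(rank, two_level_assignment(rank, 12, 3, 10, 7))
```

In an actual MPI program, the group index computed here would typically serve as the color argument of `MPI_Comm_split`, so that each group obtains its own communicator and collective operations inside a group involve far fewer processes than the global communicator.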
Description: Date Completed 19.07.2018
Date Revised 19.07.2018
published: Print-Electronic
Citation Status PubMed-not-MEDLINE
ISSN: 1096-987X
DOI: 10.1002/jcc.24491