The GPU-enabled divide-expand-consolidate RI-MP2 method (DEC-RI-MP2)

© 2016 Wiley Periodicals, Inc.

Bibliographic Details
Published in: Journal of computational chemistry. - 1984. - 38(2017), 4 of 05 Feb., pages 228-237
Main author: Bykov, Dmytro (Author)
Other authors: Kjaergaard, Thomas
Format: Online article
Language: English
Published: 2017
Access to the parent work: Journal of computational chemistry
Subjects: Journal Article; Research Support, Non-U.S. Gov't; MP2; graphic processing units; heterogeneous architectures; parallel Implementation
LEADER 01000caa a22002652 4500
001 NLM266887562
003 DE-627
005 20250221000156.0
007 cr uuu---uuuuu
008 231224s2017 xx |||||o 00| ||eng c
024 7 |a 10.1002/jcc.24678  |2 doi 
028 5 2 |a pubmed25n0889.xml 
035 |a (DE-627)NLM266887562 
035 |a (NLM)27925252 
040 |a DE-627  |b ger  |c DE-627  |e rakwb 
041 |a eng 
100 1 |a Bykov, Dmytro  |e verfasserin  |4 aut 
245 1 4 |a The GPU-enabled divide-expand-consolidate RI-MP2 method (DEC-RI-MP2) 
264 1 |c 2017 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
500 |a Date Completed 02.05.2017 
500 |a Date Revised 02.05.2017 
500 |a published: Print-Electronic 
500 |a Citation Status PubMed-not-MEDLINE 
520 |a © 2016 Wiley Periodicals, Inc. 
520 |a We report porting of the Divide-Expand-Consolidate Resolution of the Identity second-order Møller-Plesset perturbation (DEC-RI-MP2) method to the graphic processing units (GPUs) using OpenACC compiler directives. It is shown that the OpenACC compiler directives implementation efficiently accelerates the rate-determining step of the DEC-RI-MP2 method with minor implementation effort. Moreover, the GPU acceleration results in a better load balance and thus in an overall scaling improvement of the DEC algorithm. The resulting cross-platform hybrid MPI/OpenMP/OpenACC implementation has scalable and portable performance on heterogeneous HPC architectures. The GPU-enabled code was benchmarked using a reduced version of the S12L test set of Stefan Grimme (Grimme, Chem. Eur. J. 2012, 18, 9955) consisting of supramolecular complexes up to 158 atoms and 4292 contracted basis functions (cc-pVTZ). The test set results demonstrate the general applicability of the DEC-RI-MP2 method showing results consistent with the DEC-RI-MP2 introductory paper (Baudin et al., J. Chem. Phys. 2016, 144, 054102) on molecules of complicated electronic structures. 
650 4 |a Journal Article 
650 4 |a Research Support, Non-U.S. Gov't 
650 4 |a MP2 
650 4 |a graphic processing units 
650 4 |a heterogeneous architectures 
650 4 |a parallel Implementation 
700 1 |a Kjaergaard, Thomas  |e verfasserin  |4 aut 
773 0 8 |i Enthalten in  |t Journal of computational chemistry  |d 1984  |g 38(2017), 4 vom: 05. Feb., Seite 228-237  |w (DE-627)NLM098138448  |x 1096-987X  |7 nnns 
773 1 8 |g volume:38  |g year:2017  |g number:4  |g day:05  |g month:02  |g pages:228-237 
856 4 0 |u http://dx.doi.org/10.1002/jcc.24678  |3 Volltext 
912 |a GBV_USEFLAG_A 
912 |a SYSFLAG_A 
912 |a GBV_NLM 
912 |a GBV_ILN_350 
951 |a AR 
952 |d 38  |j 2017  |e 4  |b 05  |c 02  |h 228-237