Hybrid localized graph kernel for machine learning energy-related properties of molecules and solids

© 2021 Wiley Periodicals LLC.

Bibliographische Detailangaben
Veröffentlicht in:Journal of computational chemistry. - 1984. - 42(2021), 20 vom: 30. Juli, Seite 1390-1401
1. Verfasser: Casier, Bastien (VerfasserIn)
Weitere Verfasser: Chagas da Silva, Mauricio, Badawi, Michael, Pascale, Fabien, Bučko, Tomáš, Lebègue, Sébastien, Rocca, Dario
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2021
Zugriff auf das übergeordnete Werk:Journal of computational chemistry
Schlagworte:Journal Article QM7 and BA10 datasets energy-related properties graph kernel machine learning regression
Beschreibung
Zusammenfassung:© 2021 Wiley Periodicals LLC.
Nowadays, the coupling of electronic structure and machine learning techniques serves as a powerful tool to predict chemical and physical properties of a broad range of systems. With the aim of improving the accuracy of predictions, a large number of representations for molecules and solids for machine learning applications has been developed. In this work we propose a novel descriptor based on the notion of molecular graph. While graphs are largely employed in classification problems in cheminformatics or bioinformatics, they are not often used in regression problem, especially of energy-related properties. Our method is based on a local decomposition of atomic environments and on the hybridization of two kernel functions: a graph kernel contribution that describes the chemical pattern and a Coulomb label contribution that encodes finer details of the local geometry. The accuracy of this new kernel method in energy predictions of molecular and condensed phase systems is demonstrated by considering the popular QM7 and BA10 datasets. These examples show that the hybrid localized graph kernel outperforms traditional approaches such as, for example, the smooth overlap of atomic positions and the Coulomb matrices
Beschreibung:Date Revised 16.06.2021
published: Print-Electronic
Citation Status PubMed-not-MEDLINE
ISSN:1096-987X
DOI:10.1002/jcc.26550