Automated preparation of nanoscopic structures: Graph-based sequence analysis, mismatch detection, and pH-consistent protonation with uncertainty estimates

© 2023 The Authors. Journal of Computational Chemistry published by Wiley Periodicals LLC.

Bibliographic details
Published in: Journal of computational chemistry. - 1984. - 45(2024), 11, 30 March, pages 761-776
Main author: Csizi, Katja-Sophia (Author)
Other authors: Reiher, Markus
Format: Online article
Language: English
Published: 2024
Access to the parent work: Journal of computational chemistry
Subjects: Journal Article; Gaussian process; atomistic simulation; machine learning; protein structure
Description
Abstract:
Structure and function in nanoscale atomistic assemblies are tightly coupled, and every atom with its specific position and even every electron will have a decisive effect on the electronic structure, and hence, on the molecular properties. Molecular simulations of nanoscopic atomistic structures therefore require accurately resolved three-dimensional input structures. If extracted from experiment, these structures often suffer from severe uncertainties, of which the lack of information on hydrogen atoms is a prominent example. Hence, experimental structures require careful review and curation, which is a time-consuming and error-prone process. Here, we present a fast and robust protocol for the automated structure analysis and pH-consistent protonation, in short, ASAP. For biomolecules as a target, the ASAP protocol integrates sequence analysis and error assessment of a given input structure. ASAP allows for pKa prediction from reference data through Gaussian process regression including uncertainty estimation and connects to system-focused atomistic modeling described in Brunken and Reiher (J. Chem. Theory Comput. 16, 2020, 1646). Although focused on biomolecules, ASAP can be extended to other nanoscopic objects, because most of its design elements rely on a general graph-based foundation guaranteeing transferability. The modular character of the underlying pipeline supports different degrees of automation, which allows for (i) efficient feedback loops for human-machine interaction with a low entrance barrier and for (ii) integration into autonomous procedures such as automated force field parametrizations. This facilitates fast switching of the pH-state through on-the-fly system-focused reparametrization during a molecular simulation at virtually no extra computational cost.
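The abstract names Gaussian process regression with uncertainty estimation as the route to pKa values, followed by pH-consistent protonation. The sketch below illustrates that general idea only and is not the ASAP implementation. It makes several assumptions: the one-dimensional descriptor, the squared-exponential kernel, the hyperparameters, the training values (synthetic placeholders, not reference data), and the Henderson-Hasselbalch step used to turn a predicted pKa into a protonated fraction. The function names `rbf_kernel`, `gp_predict`, and `protonated_fraction` are hypothetical.

```python
import numpy as np

def rbf_kernel(a, b, length_scale=1.0, variance=1.0):
    """Squared-exponential kernel between point sets a (n, d) and b (m, d)."""
    sq_dist = (np.sum(a**2, axis=1)[:, None]
               + np.sum(b**2, axis=1)[None, :]
               - 2.0 * a @ b.T)
    return variance * np.exp(-0.5 * np.maximum(sq_dist, 0.0) / length_scale**2)

def gp_predict(x_train, y_train, x_test, noise=1e-2, length_scale=1.0, variance=1.0):
    """Posterior mean and standard deviation of a GP with a constant mean."""
    y_mean = y_train.mean()  # center targets so a zero-mean GP prior is reasonable
    k_tt = rbf_kernel(x_train, x_train, length_scale, variance)
    k_tt = k_tt + noise * np.eye(len(x_train))  # observation noise on the diagonal
    k_ts = rbf_kernel(x_train, x_test, length_scale, variance)
    k_ss = rbf_kernel(x_test, x_test, length_scale, variance)
    chol = np.linalg.cholesky(k_tt)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train - y_mean))
    mean = k_ts.T @ alpha + y_mean
    v = np.linalg.solve(chol, k_ts)
    var = np.diag(k_ss) - np.sum(v**2, axis=0)
    return mean, np.sqrt(np.maximum(var, 0.0))

def protonated_fraction(pka, ph):
    """Henderson-Hasselbalch fraction of the protonated form at a given pH."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))

# Synthetic placeholder data (assumption): one descriptor per site, one pKa each.
x_train = np.array([[0.0], [1.0], [2.0], [3.0]])
y_train = np.array([4.0, 4.5, 6.0, 6.5])

# Query one site inside the training range and one far outside it.
x_test = np.array([[1.0], [10.0]])
pka_mean, pka_std = gp_predict(x_train, y_train, x_test)

ph = 7.0
for mu, sigma in zip(pka_mean, pka_std):
    # Propagate the one-sigma pKa interval to a range of protonated fractions.
    low = protonated_fraction(mu - sigma, ph)
    high = protonated_fraction(mu + sigma, ph)
    print(f"pKa = {mu:.2f} +/- {sigma:.2f}, protonated fraction in [{low:.3f}, {high:.3f}]")
```

The predicted standard deviation grows for the query far from the training descriptors, which is the kind of signal that could flag a site for manual review in a human-machine feedback loop.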
Description: Date Revised 15.03.2024
published: Print-Electronic
Citation Status PubMed-not-MEDLINE
ISSN: 1096-987X
DOI: 10.1002/jcc.27276