15
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Diagnostics of Data-Driven Models: Uncertainty Quantification of PM7 Semi-Empirical Quantum Chemical Method

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We report an evaluation of a semi-empirical quantum chemical method PM7 from the perspective of uncertainty quantification. Specifically, we apply Bound-to-Bound Data Collaboration, an uncertainty quantification framework, to characterize (a) variability of PM7 model parameter values consistent with the uncertainty in the training data and (b) uncertainty propagation from the training data to the model predictions. Experimental heats of formation of a homologous series of linear alkanes are used as the property of interest. The training data are chemically accurate, i.e., they have very low uncertainty by the standards of computational chemistry. The analysis does not find evidence of PM7 consistency with the entire data set considered as no single set of parameter values is found that captures the experimental uncertainties of all training data. A set of parameter values for PM7 was able to capture the training data within ±1 kcal/mol, but not to the smaller level of uncertainty in the reported data. Nevertheless, PM7 was found to be consistent for subsets of the training data. In such cases, uncertainty propagation from the chemically accurate training data to the predicted values preserves error within bounds of chemical accuracy if predictions are made for the molecules of comparable size. Otherwise, the error grows linearly with the relative size of the molecules.

          Related collections

          Most cited references34

          • Record: found
          • Abstract: found
          • Article: not found

          Design of Density Functionals by Combining the Method of Constraint Satisfaction with Parametrization for Thermochemistry, Thermochemical Kinetics, and Noncovalent Interactions.

          We present a new hybrid meta exchange-correlation functional, called M05-2X, for thermochemistry, thermochemical kinetics, and noncovalent interactions. We also provide a full discussion of the new M05 functional, previously presented in a short communication. The M05 functional was parametrized including both metals and nonmetals, whereas M05-2X is a high-nonlocality functional with double the amount of nonlocal exchange (2X) that is parametrized only for nonmetals. In particular, M05 was parametrized against 35 data values, and M05-2X is parametrized against 34 data values. Both functionals, along with 28 other functionals, have been comparatively assessed against 234 data values:  the MGAE109/3 main-group atomization energy database, the IP13/3 ionization potential database, the EA13/3 electron affinity database, the HTBH38/4 database of barrier height for hydrogen-transfer reactions, five noncovalent databases, two databases involving metal-metal and metal-ligand bond energies, a dipole moment database, a database of four alkyl bond dissociation energies of alkanes and ethers, and three total energies of one-electron systems. We also tested the new functionals and 12 others for eight hydrogen-bonding and stacking interaction energies in nucleobase pairs, and we tested M05 and M05-2X and 19 other functionals for the geometry, dipole moment, and binding energy of HCN-BF3, which has recently been shown to be a very difficult case for density functional theory. We tested eight functionals for four more alkyl bond dissociation energies, and we tested 12 functionals for several additional bond energies with varying amounts of multireference character. On the basis of all the results for 256 data values in 18 databases in the present study, we recommend M05-2X, M05, PW6B95, PWB6K, and MPWB1K for general-purpose applications in thermochemistry, kinetics, and noncovalent interactions involving nonmetals and we recommend M05 for studies involving both metallic and nonmetallic elements. The M05 functional, essentially uniquely among the functionals with broad applicability to chemistry, also performs well not only for main-group thermochemistry and radical reaction barrier heights but also for transition-metal-transition-metal interactions. The M05-2X functional has the best performance for thermochemical kinetics, noncovalent interactions (especially weak interaction, hydrogen bonding, π···π stacking, and interactions energies of nucleobases), and alkyl bond dissociation energies and the best composite results for energetics, excluding metals.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Quantum-chemical insights from deep tensor neural networks

            Learning from data has led to paradigm shifts in a multitude of disciplines, including web, text and image search, speech recognition, as well as bioinformatics. Can machine learning enable similar breakthroughs in understanding quantum many-body systems? Here we develop an efficient deep learning approach that enables spatially and chemically resolved insights into quantum-mechanical observables of molecular systems. We unify concepts from many-body Hamiltonians with purpose-designed deep tensor neural networks, which leads to size-extensive and uniformly accurate (1 kcal mol−1) predictions in compositional and configurational chemical space for molecules of intermediate size. As an example of chemical relevance, the model reveals a classification of aromatic rings with respect to their stability. Further applications of our model for predicting atomic energies and local chemical potentials in molecules, reliable isomer energies, and molecules with peculiar electronic structure demonstrate the potential of machine learning for revealing insights into complex quantum-chemical systems.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Optimization of parameters for semiempirical methods VI: more modifications to the NDDO approximations and re-optimization of parameters

              Modern semiempirical methods are of sufficient accuracy when used in the modeling of molecules of the same type as used as reference data in the parameterization. Outside that subset, however, there is an abundance of evidence that these methods are of very limited utility. In an attempt to expand the range of applicability, a new method called PM7 has been developed. PM7 was parameterized using experimental and high-level ab initio reference data, augmented by a new type of reference data intended to better define the structure of parameter space. The resulting method was tested by modeling crystal structures and heats of formation of solids. Two changes were made to the set of approximations: a modification was made to improve the description of noncovalent interactions, and two minor errors in the NDDO formalism were rectified. Average unsigned errors (AUEs) in geometry and ΔH f for PM7 were reduced relative to PM6; for simple gas-phase organic systems, the AUE in bond lengths decreased by about 5 % and the AUE in ΔH f decreased by about 10 %; for organic solids, the AUE in ΔH f dropped by 60 % and the reduction was 33.3 % for geometries. A two-step process (PM7-TS) for calculating the heights of activation barriers has been developed. Using PM7-TS, the AUE in the barrier heights for simple organic reactions was decreased from values of 12.6 kcal/mol-1 in PM6 and 10.8 kcal/mol-1 in PM7 to 3.8 kcal/mol-1. The origins of the errors in NDDO methods have been examined, and were found to be attributable to inadequate and inaccurate reference data. This conclusion provides insight into how these methods can be improved. Electronic supplementary material The online version of this article (doi:10.1007/s00894-012-1667-x) contains supplementary material, which is available to authorized users.
                Bookmark

                Author and article information

                Contributors
                frenklach@berkeley.edu
                Journal
                Sci Rep
                Sci Rep
                Scientific Reports
                Nature Publishing Group UK (London )
                2045-2322
                5 September 2018
                5 September 2018
                2018
                : 8
                : 13248
                Affiliations
                [1 ]ISNI 0000 0001 2181 7878, GRID grid.47840.3f, Department of Mechanical Engineering, , University of California at Berkeley, ; Berkeley, California 94720-1740 USA
                [2 ]IBM Almaden Research Center, 650 Harry Road, San Jose, California, 95136 USA
                Article
                31677
                10.1038/s41598-018-31677-y
                6125339
                30185953
                d80961d9-6372-48d4-99e9-87d51dece6eb
                © The Author(s) 2018

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 19 June 2018
                : 22 August 2018
                Funding
                Funded by: FundRef https://doi.org/10.13039/100006168, DOE | National Nuclear Security Administration (NNSA);
                Award ID: DE-NA0002375
                Award ID: DE-NA0002375
                Award ID: DE-NA0002375
                Award ID: DE-NA0002375
                Award ID: DE-NA0002375
                Award Recipient :
                Categories
                Article
                Custom metadata
                © The Author(s) 2018

                Uncategorized
                Uncategorized

                Comments

                Comment on this article