9
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Enabling late-stage drug diversification by high-throughput experimentation with geometric deep learning

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Late-stage functionalization is an economical approach to optimize the properties of drug candidates. However, the chemical complexity of drug molecules often makes late-stage diversification challenging. To address this problem, a late-stage functionalization platform based on geometric deep learning and high-throughput reaction screening was developed. Considering borylation as a critical step in late-stage functionalization, the computational model predicted reaction yields for diverse reaction conditions with a mean absolute error margin of 4–5%, while the reactivity of novel reactions with known and unknown substrates was classified with a balanced accuracy of 92% and 67%, respectively. The regioselectivity of the major products was accurately captured with a classifier F-score of 67%. When applied to 23 diverse commercial drug molecules, the platform successfully identified numerous opportunities for structural diversification. The influence of steric and electronic information on model performance was quantified, and a comprehensive simple user-friendly reaction format was introduced that proved to be a key enabler for seamlessly integrating deep learning and high-throughput experimentation for late-stage functionalization.

          Abstract

          Late-stage functionalization of complex drug molecules is challenging. To address this problem, a discovery platform based on geometric deep learning and high-throughput experimentation was developed. The computational model predicts binary reaction outcome, reaction yield and regioselectivity with low error margins, enabling the functionalization of complex molecules without de novo synthesis.

          Related collections

          Most cited references64

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The FAIR Guiding Principles for scientific data management and stewardship

          There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders—representing academia, industry, funding agencies, and scholarly publishers—have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Balanced basis sets of split valence, triple zeta valence and quadruple zeta valence quality for H to Rn: Design and assessment of accuracy.

            Gaussian basis sets of quadruple zeta valence quality for Rb-Rn are presented, as well as bases of split valence and triple zeta valence quality for H-Rn. The latter were obtained by (partly) modifying bases developed previously. A large set of more than 300 molecules representing (nearly) all elements-except lanthanides-in their common oxidation states was used to assess the quality of the bases all across the periodic table. Quantities investigated were atomization energies, dipole moments and structure parameters for Hartree-Fock, density functional theory and correlated methods, for which we had chosen Møller-Plesset perturbation theory as an example. Finally recommendations are given which type of basis set is used best for a certain level of theory and a desired quality of results.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Long-range corrected hybrid density functionals with damped atom-atom dispersion corrections.

              We report re-optimization of a recently proposed long-range corrected (LC) hybrid density functional [J.-D. Chai and M. Head-Gordon, J. Chem. Phys., 2008, 128, 084106] to include empirical atom-atom dispersion corrections. The resulting functional, omegaB97X-D yields satisfactory accuracy for thermochemistry, kinetics, and non-covalent interactions. Tests show that for non-covalent systems, omegaB97X-D shows slight improvement over other empirical dispersion-corrected density functionals, while for covalent systems and kinetics it performs noticeably better. Relative to our previous functionals, such as omegaB97X, the new functional is significantly superior for non-bonded interactions, and very similar in performance for bonded interactions.
                Bookmark

                Author and article information

                Contributors
                david.konrad@cup.lmu.de
                uwe.grether@roche.com
                rainer_e.martin@roche.com
                gisbert@ethz.ch
                Journal
                Nat Chem
                Nat Chem
                Nature Chemistry
                Nature Publishing Group UK (London )
                1755-4330
                1755-4349
                23 November 2023
                23 November 2023
                2024
                : 16
                : 2
                : 239-248
                Affiliations
                [1 ]GRID grid.417570.0, ISNI 0000 0004 0374 1269, Roche Pharma Research and Early Development (pRED), , Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., ; Basel, Switzerland
                [2 ]Department of Pharmacy, Ludwig-Maximilians-Universität München, ( https://ror.org/05591te55) Munich, Germany
                [3 ]Department of Chemistry and Applied Biosciences, ETH Zurich, ( https://ror.org/05a28rw58) Zurich, Switzerland
                [4 ]GRID grid.417570.0, ISNI 0000 0004 0374 1269, Process Chemistry and Catalysis (PCC), , F. Hoffmann-La Roche Ltd., ; Basel, Switzerland
                [5 ]GRID grid.514054.1, ISNI 0000 0004 9450 5164, ETH Singapore SEC Ltd, ; Singapore, Singapore
                Author information
                http://orcid.org/0000-0002-0346-3786
                http://orcid.org/0000-0001-8063-9952
                http://orcid.org/0000-0002-1527-4325
                http://orcid.org/0000-0001-6682-1213
                http://orcid.org/0000-0001-7536-6144
                http://orcid.org/0000-0001-7338-4103
                http://orcid.org/0000-0003-2203-129X
                http://orcid.org/0000-0001-5718-8081
                http://orcid.org/0000-0002-3164-9270
                http://orcid.org/0000-0001-7895-497X
                http://orcid.org/0000-0001-6706-1084
                Article
                1360
                10.1038/s41557-023-01360-5
                10849962
                37996732
                54acf582-b944-4ff2-9fbc-1f4332738690
                © The Author(s) 2023

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 21 October 2022
                : 3 October 2023
                Funding
                Funded by: FundRef 501100001711, Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (Swiss National Science Foundation);
                Award ID: 205321_182176
                Award ID: 205321_182176
                Award Recipient :
                Funded by: Fonds der Chemischen Industrie (FCI)
                Categories
                Article
                Custom metadata
                © Springer Nature Limited 2024

                Chemistry
                machine learning,lead optimization,synthetic chemistry methodology,chemical engineering,automation

                Comments

                Comment on this article