35
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We describe a fully data driven model that learns to perform a retrosynthetic reaction prediction task, which is treated as a sequence-to-sequence mapping problem. The end-to-end trained model has an encoder–decoder architecture that consists of two recurrent neural networks, which has previously shown great success in solving other sequence-to-sequence prediction tasks such as machine translation. The model is trained on 50,000 experimental reaction examples from the United States patent literature, which span 10 broad reaction types that are commonly used by medicinal chemists. We find that our model performs comparably with a rule-based expert system baseline model, and also overcomes certain limitations associated with rule-based expert systems and with any machine learning approach that contains a rule-based expert system component. Our model provides an important first step toward solving the challenging problem of computational retrosynthetic analysis.

          Abstract

          A neural sequence-to-sequence model learns from patent data to perform retrosynthetic reaction prediction. The model is trained end-to-end, and eliminates the need for reaction rules and atom-mapping

          Related collections

          Most cited references43

          • Record: found
          • Abstract: not found
          • Article: not found

          Computer-Assisted Synthetic Planning: The End of the Beginning

            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            InChI - the worldwide chemical structure identifier standard

            Since its public introduction in 2005 the IUPAC InChI chemical structure identifier standard has become the international, worldwide standard for defined chemical structures. This article will describe the extensive use and dissemination of the InChI and InChIKey structure representations by and for the world-wide chemistry community, the chemical information community, and major publishers and disseminators of chemical and related scientific offerings in manuscripts and databases.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Neural-Symbolic Machine Learning for Retrosynthesis and Reaction Prediction.

              Reaction prediction and retrosynthesis are the cornerstones of organic chemistry. Rule-based expert systems have been the most widespread approach to computationally solve these two related challenges to date. However, reaction rules often fail because they ignore the molecular context, which leads to reactivity conflicts. Herein, we report that deep neural networks can learn to resolve reactivity conflicts and to prioritize the most suitable transformation rules. We show that by training our model on 3.5 million reactions taken from the collective published knowledge of the entire discipline of chemistry, our model exhibits a top10-accuracy of 95 % in retrosynthesis and 97 % for reaction prediction on a validation set of almost 1 million reactions.
                Bookmark

                Author and article information

                Journal
                ACS Cent Sci
                ACS Cent Sci
                oc
                acscii
                ACS Central Science
                American Chemical Society
                2374-7943
                2374-7951
                05 September 2017
                25 October 2017
                : 3
                : 10
                : 1103-1113
                Affiliations
                []Department of Chemistry, Stanford University , Stanford, California 94305, United States
                []Department of Computer Science, Stanford University , Stanford, California 94305, United States
                [§ ]Department of Chemical and Systems Biology, Stanford University , Stanford, California 94305, United States
                []Department of Structural Biology, Stanford University , Stanford, California 94305, United States
                Author notes
                Article
                10.1021/acscentsci.7b00303
                5658761
                29104927
                badb1483-3374-4743-9704-546a3e8545e4
                Copyright © 2017 American Chemical Society

                This is an open access article published under an ACS AuthorChoice License, which permits copying and redistribution of the article or any adaptations for non-commercial purposes.

                History
                : 11 July 2017
                Categories
                Research Article
                Custom metadata
                oc7b00303
                oc-2017-003038

                Comments

                Comment on this article