9
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space†

      research-article
      a
      Chemical Science
      Royal Society of Chemistry

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          This paper presents a comparison of a graph-based genetic algorithm (GB-GA) and machine learning (ML) results for the optimization of log  P values with a constraint for synthetic accessibility and shows that the GA is as good as or better than the ML approaches for this particular property.

          Abstract

          This paper presents a comparison of a graph-based genetic algorithm (GB-GA) and machine learning (ML) results for the optimization of log  P values with a constraint for synthetic accessibility and shows that the GA is as good as or better than the ML approaches for this particular property. The molecules found by the GB-GA bear little resemblance to the molecules used to construct the initial mating pool, indicating that the GB-GA approach can traverse a relatively large distance in chemical space using relatively few (50) generations. The paper also introduces a new non-ML graph-based generative model (GB-GM) that can be parameterized using very small data sets and combined with a Monte Carlo tree search (MCTS) algorithm. The results are comparable to previously published results ( Sci. Technol. Adv. Mater., 2017, 18, 972–976) using a recurrent neural network (RNN) generative model, and the GB-GM-based method is several orders of magnitude faster. The MCTS results seem more dependent on the composition of the training set than the GA approach for this particular property. Our results suggest that the performance of new ML-based generative models should be compared to that of more traditional, and often simpler, approaches such a GA.

          Related collections

          Author and article information

          Journal
          Chem Sci
          Chem Sci
          Chemical Science
          Royal Society of Chemistry
          2041-6520
          2041-6539
          11 February 2019
          28 March 2019
          : 10
          : 12
          : 3567-3572
          Affiliations
          [a ] Department of Chemistry , University of Copenhagen , Copenhagen , Denmark . Email: jhjensen@ 123456chem.ku.dk ; http://www.twitter.com/janhjensen
          Author information
          http://orcid.org/0000-0002-1465-1010
          Article
          c8sc05372c
          10.1039/c8sc05372c
          6438151
          30996948
          d53ff037-d785-4e65-bae1-0f195c83a99d
          This journal is © The Royal Society of Chemistry 2019

          This article is freely available. This article is licensed under a Creative Commons Attribution 3.0 Unported Licence (CC BY 3.0)

          History
          : 1 December 2018
          : 8 February 2019
          Categories
          Chemistry

          Notes

          †Electronic supplementary information (ESI) available: The codes used in this study can be found on GitHub: github.com/jensengroup/GB-GA/tree/v0.0 and github.com/jensengroup/GB-GM/tree/v0.0. See DOI: 10.1039/c8sc05372c


          Comments

          Comment on this article