Molecular de-novo design through deep reinforcement learning

Abstract

This work introduces a method to tune a sequence-based generative model for molecular de novo design so that, through augmented episodic likelihood, it learns to generate structures with specified desirable properties. We demonstrate how this model can execute a range of tasks, such as generating analogues to a query structure and generating compounds predicted to be active against a biological target. As a proof of principle, the model is first trained to generate molecules that do not contain sulphur. As a second example, the model is trained to generate analogues to the drug Celecoxib, a technique that could be used for scaffold hopping or library expansion starting from a single molecule. Finally, when tuned towards generating compounds predicted to be active against the dopamine receptor type 2, the model generates structures of which more than 95% are predicted to be active, including experimentally confirmed actives that were not included in either the generative model or the activity prediction model.
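The abstract names the augmented episodic likelihood but does not define it. As a rough illustration only, the sketch below assumes the form usually attributed to this method: the prior's log-likelihood of a generated sequence is shifted by a scaled score, and the agent is penalised by the squared difference between that target and its own log-likelihood. The function names, the value of sigma, and the simplified sulphur check (which, as an assumption, skips any validity check on the SMILES string) are all hypothetical and not taken from the article.

```python
import re

# Bracket atoms such as [Se], [SH], [s+], [35S]; the element symbol sits
# after an optional isotope number.
BRACKET_ATOM = re.compile(r"\[([^\]]*)\]")
ELEMENT = re.compile(r"\d*([A-Z][a-z]?|[a-z]{1,2})")


def contains_sulphur(smiles: str) -> bool:
    """Return True if the SMILES string has a sulphur atom.

    Assumption: the input is syntactically valid SMILES. Outside brackets
    only the organic subset is allowed, so any 'S' or 's' there is sulphur.
    Inside brackets the element symbol must be parsed so that Si, Se, Sn,
    and similar symbols are not mistaken for sulphur.
    """
    for body in BRACKET_ATOM.findall(smiles):
        match = ELEMENT.match(body)
        if match and match.group(1) in ("S", "s"):
            return True
    outside = BRACKET_ATOM.sub("", smiles)
    return "S" in outside or "s" in outside


def sulphur_free_score(smiles: str) -> float:
    """Hypothetical scoring function: +1 without sulphur, -1 with it."""
    return -1.0 if contains_sulphur(smiles) else 1.0


def augmented_log_likelihood(prior_ll: float, score: float, sigma: float) -> float:
    """Assumed form: log P_aug(A) = log P_prior(A) + sigma * S(A)."""
    return prior_ll + sigma * score


def agent_loss(prior_ll: float, agent_ll: float, score: float, sigma: float) -> float:
    """Squared gap between the augmented target and the agent's log-likelihood."""
    return (augmented_log_likelihood(prior_ll, score, sigma) - agent_ll) ** 2


# Celecoxib carries a sulfonamide group, so it scores -1 on this toy task.
CELECOXIB = "CC1=CC=C(C=C1)C2=CC(=NN2C3=CC=C(C=C3)S(=O)(=O)N)C(F)(F)F"

if __name__ == "__main__":
    print(sulphur_free_score(CELECOXIB))
    print(sulphur_free_score("c1ccccc1"))
    # Illustrative log-likelihoods and sigma, not values from the article.
    print(agent_loss(prior_ll=-20.0, agent_ll=-18.0, score=1.0, sigma=5.0))
```

In this reading, a sequence with a high score raises the target likelihood above the prior's, so minimising the loss pulls the agent towards such sequences while the prior term keeps it close to the chemistry it originally learned.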

Electronic supplementary material

The online version of this article (doi:10.1186/s13321-017-0235-x) contains supplementary material, which is available to authorized users.

Author and article information

Contributors

Marcus Olivecrona: m.olivecrona@gmail.com (ORCID: http://orcid.org/0000-0002-8177-2787)

Thomas Blaschke: thomas.blaschke@astrazeneca.com

Ola Engkvist: ola.engkvist@astrazeneca.com

Hongming Chen: hongming.chen@astrazeneca.com

Journal

Journal ID (nlm-ta): J Cheminform

Journal ID (iso-abbrev): J Cheminform

Title: Journal of Cheminformatics

Publisher: Springer International Publishing (Cham)

ISSN (Electronic): 1758-2946

Publication date (Electronic): 4 September 2017

Publication date PMC-release: 4 September 2017

Publication date Collection: 2017

Volume: 9

Electronic Location Identifier: 48

Affiliations

ISNI 0000 0001 1519 6403, GRID grid.418151.8, Hit Discovery, Discovery Sciences, Innovative Medicines and Early Development Biotech Unit, AstraZeneca R&D Gothenburg, 43183 Mölndal, Sweden

Article

Publisher ID: 235

DOI: 10.1186/s13321-017-0235-x

PMC ID: 5583141

PubMed ID: 29086083

SO-VID: 5e3a0ae7-8e92-4cdc-8514-b9300ea6c5a4

License:

Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

History

Date received: 14 May 2017

Date accepted: 23 August 2017

Funding

Funded by: H2020 Marie Skłodowska-Curie Actions (FundRef http://dx.doi.org/10.13039/100010665)

Award ID: 676434

Award Recipient: Thomas Blaschke

Custom metadata

ScienceOpen disciplines: Chemoinformatics

Keywords: de novo design, recurrent neural networks, reinforcement learning
