DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Identification of drug-target interactions (DTIs) plays a key role in drug discovery. The high cost and labor-intensive nature of in vitro and in vivo experiments have highlighted the importance of in silico-based DTI prediction approaches. In several computational models, conventional protein descriptors have been shown to not be sufficiently informative to predict accurate DTIs. Thus, in this study, we propose a deep learning based DTI prediction model capturing local residue patterns of proteins participating in DTIs. When we employ a convolutional neural network (CNN) on raw protein sequences, we perform convolution on various lengths of amino acids subsequences to capture local residue patterns of generalized protein classes. We train our model with large-scale DTI information and demonstrate the performance of the proposed model using an independent dataset that is not seen during the training phase. As a result, our model performs better than previous protein descriptor-based models. Also, our model performs better than the recently developed deep learning models for massive prediction of DTIs. By examining pooled convolution results, we confirmed that our model can detect binding sites of proteins for DTIs. In conclusion, our prediction model for detecting local residue patterns of target proteins successfully enriches the protein features of a raw protein sequence, yielding better prediction results than previous approaches. Our code is available at https://github.com/GIST-CSBL/DeepConv-DTI.

Author summary

Drugs work by interacting with target proteins to activate or inhibit a target’s biological process. Therefore, identification of DTIs is a crucial step in drug discovery. However, identifying drug candidates via biological assays is very time and cost consuming, which introduces the need for a computational prediction approach for the identification of DTIs. In this work, we constructed a novel DTI prediction model to extract local residue patterns of target protein sequences using a CNN-based deep learning approach. As a result, the detected local features of protein sequences perform better than other protein descriptors for DTI prediction and previous models for predicting PubChem independent test datasets. That is, our approach of capturing local residue patterns with CNN successfully enriches protein features from a raw sequence.

Related collections

Most cited references 29

Record: found
Abstract: not found
Article: not found

Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

Yoav Benjamini, Yosef Hochberg (1995)

0 comments Cited 23736 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Identification of common molecular subsequences.

T.F. Smith, M.S. Waterman (1981)

0 comments Cited 1697 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

Prediction of drug–target interaction networks from the integration of chemical and genomic spaces

Yoshihiro Yamanishi, Michihiro Araki, Alex Gutteridge … (2008)

Motivation: The identification of interactions between drugs and target proteins is a key area in genomic drug discovery. Therefore, there is a strong incentive to develop new methods capable of detecting these potential drug–target interactions efficiently. Results: In this article, we characterize four classes of drug–target interaction networks in humans involving enzymes, ion channels, G-protein-coupled receptors (GPCRs) and nuclear receptors, and reveal significant correlations between drug structure similarity, target sequence similarity and the drug–target interaction network topology. We then develop new statistical methods to predict unknown drug–target interaction networks from chemical structure and genomic sequence information simultaneously on a large scale. The originality of the proposed method lies in the formalization of the drug–target interaction inference as a supervised learning problem for a bipartite graph, the lack of need for 3D structure information of the target proteins, and in the integration of chemical and genomic spaces into a unified space that we call ‘pharmacological space’. In the results, we demonstrate the usefulness of our proposed method for the prediction of the four classes of drug–target interaction networks. Our comprehensively predicted drug–target interaction networks enable us to suggest many potential drug–target interactions and to increase research productivity toward genomic drug discovery. Availability: Softwares are available upon request. Contact: Yoshihiro.Yamanishi@ensmp.fr Supplementary information: Datasets and all prediction results are available at http://web.kuicr.kyoto-u.ac.jp/supp/yoshi/drugtarget/.

0 comments Cited 327 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Ingoo Lee:

ORCID: http://orcid.org/0000-0002-8958-2945

Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing

Jongsoo Keum: Role: ConceptualizationRole: Data curationRole: MethodologyRole: SoftwareRole: Validation

Hojung Nam:

ORCID: http://orcid.org/0000-0002-5109-9114

Role: ConceptualizationRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: Project administrationRole: ResourcesRole: SupervisionRole: ValidationRole: Writing – review & editing

James M. Briggs: Role: Editor

Journal

Journal ID (nlm-ta): PLoS Comput Biol

Journal ID (iso-abbrev): PLoS Comput. Biol

Journal ID (publisher-id): plos

Journal ID (pmc): ploscomp

Title: PLoS Computational Biology

Publisher: Public Library of Science (San Francisco, CA USA )

ISSN (Print): 1553-734X

ISSN (Electronic): 1553-7358

Publication date (Electronic): 14 June 2019

Publication date Collection: June 2019

Volume: 15

Issue: 6

Electronic Location Identifier: e1007129

Affiliations

[001]School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Buk-ku, Gwangju, Republic of Korea

University of Houston, UNITED STATES

Author notes

No authors have competing interests.

* E-mail: hjnam@ 123456gist.ac.kr

Author information

Ingoo Lee http://orcid.org/0000-0002-8958-2945

Hojung Nam http://orcid.org/0000-0002-5109-9114

Article

Publisher ID: PCOMPBIOL-D-18-01686

DOI: 10.1371/journal.pcbi.1007129

PMC ID: 6594651

PubMed ID: 31199797

SO-VID: fcfa5021-7c28-4c48-a15d-f82ff218158c

License:

This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

History

Date received : 1 October 2018

Date accepted : 24 May 2019

Page count

Figures: 7, Tables: 0, Pages: 21

Funding

Funded by: funder-id http://dx.doi.org/10.13039/501100003621, Ministry of Science, ICT and Future Planning;

Award ID: NRF-2018M3A9A7053266.

Award Recipient :

ORCID: http://orcid.org/0000-0002-5109-9114

Hojung Nam

Funded by: Bio-Synergy Research Project

Award ID: NRF-2017M3A9C4092978

Award Recipient :

ORCID: http://orcid.org/0000-0002-5109-9114

Hojung Nam

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (NRF-2018M3A9A7053266), the Bio-Synergy Research Project (NRF-2017M3A9C4092978) of the Ministry of Science and ICT through the National Research Foundation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Custom metadata

PLOS Publication Stage vor-update-to-uncorrected-proof

Publication Update 2019-06-26

Data Availability All code we used in manuscript are available from GitHub repository ( https://github.com/GIST-CSBL/DeepConv-DTI)

DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences

Read this article at

Abstract

Author summary

Related collections

Journal of Systems Thinking Preprints

Most cited references 29

Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

Identification of common molecular subsequences.

Prediction of drug–target interaction networks from the integration of chemical and genomic spaces

Author and article information

Contributors

Journal

Affiliations

Author notes

Author information

Article

History

Page count

Funding

Categories

Custom metadata

Comments

Comment on this article

Similar content 203

Cited by 127

Most referenced authors 2,542