Drug–drug interaction extraction via hierarchical RNNs on sequence and shortest dependency paths

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Motivation

Adverse events resulting from drug-drug interactions (DDI) pose a serious health issue. The ability to automatically extract DDIs described in the biomedical literature could further efforts for ongoing pharmacovigilance. Most of neural networks-based methods typically focus on sentence sequence to identify these DDIs, however the shortest dependency path (SDP) between the two entities contains valuable syntactic and semantic information. Effectively exploiting such information may improve DDI extraction.

Results

In this article, we present a hierarchical recurrent neural networks (RNNs)-based method to integrate the SDP and sentence sequence for DDI extraction task. Firstly, the sentence sequence is divided into three subsequences. Then, the bottom RNNs model is employed to learn the feature representation of the subsequences and SDP, and the top RNNs model is employed to learn the feature representation of both sentence sequence and SDP. Furthermore, we introduce the embedding attention mechanism to identify and enhance keywords for the DDI extraction task. We evaluate our approach using the DDI extraction 2013 corpus. Our method is competitive or superior in performance as compared with other state-of-the-art methods. Experimental results show that the sentence sequence and SDP are complementary to each other. Integrating the sentence sequence with SDP can effectively improve the DDI extraction performance.

Availability and implementation

The experimental data is available at https://github.com/zhangyijia1979/hierarchical-RNNs-model-for-DDI-extraction.

Supplementary information

Supplementary data are available at Bioinformatics online.

Related collections

Most cited references 15

Record: found
Abstract: found
Article: not found

The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions.

Thierry Declerck, Paloma Martínez, M Zazo … (2013)

The management of drug-drug interactions (DDIs) is a critical issue resulting from the overwhelming amount of information available on them. Natural Language Processing (NLP) techniques can provide an interesting way to reduce the time spent by healthcare professionals on reviewing biomedical literature. However, NLP techniques rely mostly on the availability of the annotated corpora. While there are several annotated corpora with biological entities and their relationships, there is a lack of corpora annotated with pharmacological substances and DDIs. Moreover, other works in this field have focused in pharmacokinetic (PK) DDIs only, but not in pharmacodynamic (PD) DDIs. To address this problem, we have created a manually annotated corpus consisting of 792 texts selected from the DrugBank database and other 233 Medline abstracts. This fined-grained corpus has been annotated with a total of 18,502 pharmacological substances and 5028 DDIs, including both PK as well as PD interactions. The quality and consistency of the annotation process has been ensured through the creation of annotation guidelines and has been evaluated by the measurement of the inter-annotator agreement between two annotators. The agreement was almost perfect (Kappa up to 0.96 and generally over 0.80), except for the DDIs in the MedLine database (0.55-0.72). The DDI corpus has been used in the SemEval 2013 DDIExtraction challenge as a gold standard for the evaluation of information extraction techniques applied to the recognition of pharmacological substances and the detection of DDIs from biomedical texts. DDIExtraction 2013 has attracted wide attention with a total of 14 teams from 7 different countries. For the task of recognition and classification of pharmacological names, the best system achieved an F1 of 71.5%, while, for the detection and classification of DDIs, the best result was F1 of 65.1%. These results show that the corpus has enough quality to be used for training and testing NLP techniques applied to the field of Pharmacovigilance. The DDI corpus and the annotation guidelines are free for use for academic research and are available at http://labda.inf.uc3m.es/ddicorpus. Copyright © 2013 Elsevier Inc. All rights reserved.

0 comments Cited 126 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval

Hamid Palangi, Li Deng, Yelong Shen … (2016)

0 comments Cited 115 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

A neural probabilistic language model

Y. Bengio, R Ducharme, P Vincent … (2003)

0 comments Cited 98 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Robert Murphy: Role: Associate Editor

Journal

Journal ID (nlm-ta): Bioinformatics

Journal ID (iso-abbrev): Bioinformatics

Journal ID (publisher-id): bioinformatics

Title: Bioinformatics

Publisher: Oxford University Press

ISSN (Print): 1367-4803

ISSN (Electronic): 1367-4811

Publication date (Print): 01 March 2018

Publication date (Electronic): 25 October 2017

Publication date PMC-release: 25 October 2017

Volume: 34

Issue: 5

Pages: 828-835

Affiliations

[1 ]College of Computer Science and Technology, Dalian University of Technology, Dalian, China

[2 ]Stanford Center for Biomedical Informatics Research, School of Medicine, Stanford University, Stanford, CA, USA

[3 ]College of Software, Dalian JiaoTong University, Dalian, China

[4 ]Institute of Data Science, Maastricht University, Maastricht, ER, The Netherlands

Author notes

To whom correspondence should be addressed. Email: zhyj@ 123456dlut.edu.cn or michel.dumontier@ 123456maastrichtuniversity.nl .

Article

Publisher ID: btx659

DOI: 10.1093/bioinformatics/btx659

PMC ID: 6030919

PubMed ID: 29077847

SO-VID: 0d31fe0d-6b20-42a9-835b-dc8ebb04f43c

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

History

Date received : 30 June 2017

Date revision received : 03 October 2017

Date accepted : 26 October 2017

Page count

Pages: 8

Funding

Funded by: Natural Science Foundation of China 10.13039/501100001809

Award ID: 61572098 and 61572102

Comments

Comment on this article

scite_

Cited by 52

See all cited by

Most referenced authors 516

See all reference authors

Drug–drug interaction extraction via hierarchical RNNs on sequence and shortest dependency paths

Read this article at

Abstract

Motivation

Results

Availability and implementation

Supplementary information

Related collections

Drug Repurposing Research Collection

Most cited references 15

The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions.

Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval

A neural probabilistic language model

Author and article information

Contributors

Journal

Affiliations

Author notes

Article

History

Page count

Funding

Categories

Comments

Comment on this article

Similar content 109

Cited by 52

Most referenced authors 516