22
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Drug–drug interaction extraction via hierarchical RNNs on sequence and shortest dependency paths

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Motivation

          Adverse events resulting from drug-drug interactions (DDI) pose a serious health issue. The ability to automatically extract DDIs described in the biomedical literature could further efforts for ongoing pharmacovigilance. Most of neural networks-based methods typically focus on sentence sequence to identify these DDIs, however the shortest dependency path (SDP) between the two entities contains valuable syntactic and semantic information. Effectively exploiting such information may improve DDI extraction.

          Results

          In this article, we present a hierarchical recurrent neural networks (RNNs)-based method to integrate the SDP and sentence sequence for DDI extraction task. Firstly, the sentence sequence is divided into three subsequences. Then, the bottom RNNs model is employed to learn the feature representation of the subsequences and SDP, and the top RNNs model is employed to learn the feature representation of both sentence sequence and SDP. Furthermore, we introduce the embedding attention mechanism to identify and enhance keywords for the DDI extraction task. We evaluate our approach using the DDI extraction 2013 corpus. Our method is competitive or superior in performance as compared with other state-of-the-art methods. Experimental results show that the sentence sequence and SDP are complementary to each other. Integrating the sentence sequence with SDP can effectively improve the DDI extraction performance.

          Availability and implementation

          The experimental data is available at https://github.com/zhangyijia1979/hierarchical-RNNs-model-for-DDI-extraction.

          Supplementary information

          Supplementary data are available at Bioinformatics online.

          Related collections

          Most cited references15

          • Record: found
          • Abstract: found
          • Article: not found

          The DDI corpus: an annotated corpus with pharmacological substances and drug-drug interactions.

          The management of drug-drug interactions (DDIs) is a critical issue resulting from the overwhelming amount of information available on them. Natural Language Processing (NLP) techniques can provide an interesting way to reduce the time spent by healthcare professionals on reviewing biomedical literature. However, NLP techniques rely mostly on the availability of the annotated corpora. While there are several annotated corpora with biological entities and their relationships, there is a lack of corpora annotated with pharmacological substances and DDIs. Moreover, other works in this field have focused in pharmacokinetic (PK) DDIs only, but not in pharmacodynamic (PD) DDIs. To address this problem, we have created a manually annotated corpus consisting of 792 texts selected from the DrugBank database and other 233 Medline abstracts. This fined-grained corpus has been annotated with a total of 18,502 pharmacological substances and 5028 DDIs, including both PK as well as PD interactions. The quality and consistency of the annotation process has been ensured through the creation of annotation guidelines and has been evaluated by the measurement of the inter-annotator agreement between two annotators. The agreement was almost perfect (Kappa up to 0.96 and generally over 0.80), except for the DDIs in the MedLine database (0.55-0.72). The DDI corpus has been used in the SemEval 2013 DDIExtraction challenge as a gold standard for the evaluation of information extraction techniques applied to the recognition of pharmacological substances and the detection of DDIs from biomedical texts. DDIExtraction 2013 has attracted wide attention with a total of 14 teams from 7 different countries. For the task of recognition and classification of pharmacological names, the best system achieved an F1 of 71.5%, while, for the detection and classification of DDIs, the best result was F1 of 65.1%. These results show that the corpus has enough quality to be used for training and testing NLP techniques applied to the field of Pharmacovigilance. The DDI corpus and the annotation guidelines are free for use for academic research and are available at http://labda.inf.uc3m.es/ddicorpus. Copyright © 2013 Elsevier Inc. All rights reserved.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              A neural probabilistic language model

                Bookmark

                Author and article information

                Contributors
                Role: Associate Editor
                Journal
                Bioinformatics
                Bioinformatics
                bioinformatics
                Bioinformatics
                Oxford University Press
                1367-4803
                1367-4811
                01 March 2018
                25 October 2017
                25 October 2017
                : 34
                : 5
                : 828-835
                Affiliations
                [1 ]College of Computer Science and Technology, Dalian University of Technology, Dalian, China
                [2 ]Stanford Center for Biomedical Informatics Research, School of Medicine, Stanford University, Stanford, CA, USA
                [3 ]College of Software, Dalian JiaoTong University, Dalian, China
                [4 ]Institute of Data Science, Maastricht University, Maastricht, ER, The Netherlands
                Author notes
                To whom correspondence should be addressed. Email: zhyj@ 123456dlut.edu.cn or michel.dumontier@ 123456maastrichtuniversity.nl .
                Article
                btx659
                10.1093/bioinformatics/btx659
                6030919
                29077847
                0d31fe0d-6b20-42a9-835b-dc8ebb04f43c
                © The Author 2017. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 30 June 2017
                : 03 October 2017
                : 26 October 2017
                Page count
                Pages: 8
                Funding
                Funded by: Natural Science Foundation of China 10.13039/501100001809
                Award ID: 61572098 and 61572102
                Categories
                Original Papers
                Data and Text Mining

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article