19
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Biomedical event trigger detection by dependency-based word embedding

      research-article
      1 , , 1 , 2 , 1 , 1 , 1 , 1
      BMC Medical Genomics
      BioMed Central
      IEEE International Conference on Bioinformatics and Biomedicine 2015
      9-12 November 2015
      Biomedical event extraction, Trigger detection, Dependency-based word embedding, Neural network

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          In biomedical research, events revealing complex relations between entities play an important role. Biomedical event trigger identification has become a research hotspot since its important role in biomedical event extraction. Traditional machine learning methods, such as support vector machines (SVM) and maxent classifiers, which aim to manually design powerful features fed to the classifiers, depend on the understanding of the specific task and cannot generalize to the new domain or new examples.

          Methods

          In this paper, we propose an approach which utilizes neural network model based on dependency-based word embedding to automatically learn significant features from raw input for trigger classification. First, we employ Word2vecf, the modified version of Word2vec, to learn word embedding with rich semantic and functional information based on dependency relation tree. Then neural network architecture is used to learn more significant feature representation based on raw dependency-based word embedding. Meanwhile, we dynamically adjust the embedding while training for adapting to the trigger classification task. Finally, softmax classifier labels the examples by specific trigger class using the features learned by the model.

          Results

          The experimental results show that our approach achieves a micro-averaging F1 score of 78.27 and a macro-averaging F1 score of 76.94 % in significant trigger classes, and performs better than baseline methods. In addition, we can achieve the semantic distributed representation of every trigger word.

          Related collections

          Most cited references18

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          NLTK

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Distributed Representations of Sentences and Documents

            Many machine learning algorithms require the input to be represented as a fixed-length feature vector. When it comes to texts, one of the most common fixed-length features is bag-of-words. Despite their popularity, bag-of-words features have two major weaknesses: they lose the ordering of the words and they also ignore semantics of the words. For example, "powerful," "strong" and "Paris" are equally distant. In this paper, we propose Paragraph Vector, an unsupervised algorithm that learns fixed-length feature representations from variable-length pieces of texts, such as sentences, paragraphs, and documents. Our algorithm represents each document by a dense vector which is trained to predict words in the document. Its construction gives our algorithm the potential to overcome the weaknesses of bag-of-words models. Empirical results show that Paragraph Vectors outperform bag-of-words models as well as other techniques for text representations. Finally, we achieve new state-of-the-art results on several text classification and sentiment analysis tasks.
              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Efficient Estimation of Word Representation in Vector Space

                Bookmark

                Author and article information

                Contributors
                wangjian@dlut.edu.cn
                Conference
                BMC Med Genomics
                BMC Med Genomics
                BMC Medical Genomics
                BioMed Central (London )
                1755-8794
                10 August 2016
                10 August 2016
                2016
                : 9
                Issue : Suppl 2 Issue sponsor : Publication of this supplement has not been supported by sponsorship. Information about the source of funding for publication charges can be found in the individual articles. The articles have undergone the journal's standard peer review process for supplements. The Supplement Editors declare that they have no competing interests.
                : 45
                Affiliations
                [1 ]School of Computer Science and Technology, Dalian University of Technology, Dalian, China
                [2 ]College of Computing & Informatics, Drexel University, Philadelphia, USA
                Article
                203
                10.1186/s12920-016-0203-8
                4980775
                27510445
                11c9e289-1541-4bcc-8348-8dfdba84b951
                © The Author(s). 2016

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                IEEE International Conference on Bioinformatics and Biomedicine 2015
                Washington, DC, USA
                9-12 November 2015
                History
                Categories
                Research
                Custom metadata
                © The Author(s) 2016

                Genetics
                biomedical event extraction,trigger detection,dependency-based word embedding,neural network

                Comments

                Comment on this article