44
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Biomedical ontology alignment: an approach based on representation learning

      research-article

      Read this article at

      ScienceOpenPublisherPMC
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          While representation learning techniques have shown great promise in application to a number of different NLP tasks, they have had little impact on the problem of ontology matching. Unlike past work that has focused on feature engineering, we present a novel representation learning approach that is tailored to the ontology matching task. Our approach is based on embedding ontological terms in a high-dimensional Euclidean space. This embedding is derived on the basis of a novel phrase retrofitting strategy through which semantic similarity information becomes inscribed onto fields of pre-trained word vectors. The resulting framework also incorporates a novel outlier detection mechanism based on a denoising autoencoder that is shown to improve performance.

          Results

          An ontology matching system derived using the proposed framework achieved an F-score of 94% on an alignment scenario involving the Adult Mouse Anatomical Dictionary and the Foundational Model of Anatomy ontology (FMA) as targets. This compares favorably with the best performing systems on the Ontology Alignment Evaluation Initiative anatomy challenge. We performed additional experiments on aligning FMA to NCI Thesaurus and to SNOMED CT based on a reference alignment extracted from the UMLS Metathesaurus. Our system obtained overall F-scores of 93.2% and 89.2% for these experiments, thus achieving state-of-the-art results.

          Conclusions

          Our proposed representation learning approach leverages terminological embeddings to capture semantic similarity. Our results provide evidence that the approach produces embeddings that are especially well tailored to the ontology matching task, demonstrating a novel pathway for the problem.

          Related collections

          Most cited references60

          • Record: found
          • Abstract: not found
          • Article: not found

          Extracting and composing robust features with denoising autoencoders

            • Record: found
            • Abstract: not found
            • Article: not found

            A STATISTICAL INTERPRETATION OF TERM SPECIFICITY AND ITS APPLICATION IN RETRIEVAL

              • Record: found
              • Abstract: not found
              • Article: not found

              SimLex-999: Evaluating Semantic Models With (Genuine) Similarity Estimation

                Author and article information

                Contributors
                prodromos.kolyvakis@epfl.ch
                alexandros.kalousis@unige.ch
                phismith@buffalo.edu
                dimitris.kiritsis@epfl.ch
                Journal
                J Biomed Semantics
                J Biomed Semantics
                Journal of Biomedical Semantics
                BioMed Central (London )
                2041-1480
                15 August 2018
                15 August 2018
                2018
                : 9
                : 21
                Affiliations
                [1 ]ISNI 0000000121839049, GRID grid.5333.6, École Polytechnique Fédérale de Lausanne (EPFL), ; Route Cantonale, Lausanne, 1015 Switzerland
                [2 ]ISNI 0000 0000 8718 2812, GRID grid.460104.7, Business Informatics Department, University of Applied Sciences, ; HES-SO, Western Switzerland Carouge, Switzerland
                [3 ]Department of Philosophy and Department of Biomedical Informatics, 104 Park Hall, University at Buffalo, Buffalo, 14260 NY USA
                Author information
                http://orcid.org/0000-0001-8173-744X
                Article
                187
                10.1186/s13326-018-0187-8
                6094585
                30111369
                04760b7b-6600-4698-bee0-2354e700addb
                © The Author(s) 2018

                Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 1 March 2018
                : 16 July 2018
                Funding
                Funded by: Staatssekretariat für Bildung, Forschung und Innovation (CH)
                Award ID: 15.0303
                Categories
                Research
                Custom metadata
                © The Author(s) 2018

                Bioinformatics & Computational biology
                ontology matching,semantic similarity,sentence embeddings,word embeddings,denoising autoencoder,outlier detection

                Comments

                Comment on this article

                Related Documents Log