3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      MeSH2Matrix: combining MeSH keywords and machine learning for biomedical relation classification based on PubMed

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Biomedical relation classification has been significantly improved by the application of advanced machine learning techniques on the raw texts of scholarly publications. Despite this improvement, the reliance on large chunks of raw text makes these algorithms suffer in terms of generalization, precision, and reliability. The use of the distinctive characteristics of bibliographic metadata can prove effective in achieving better performance for this challenging task. In this research paper, we introduce an approach for biomedical relation classification using the qualifiers of co-occurring Medical Subject Headings (MeSH). First of all, we introduce MeSH2Matrix, our dataset consisting of 46,469 biomedical relations curated from PubMed publications using our approach. Our dataset includes a matrix that maps associations between the qualifiers of subject MeSH keywords and those of object MeSH keywords. It also specifies the corresponding Wikidata relation type and the superclass of semantic relations for each relation. Using MeSH2Matrix, we build and train three machine learning models (Support Vector Machine [ SVM], a dense model [ D-Model], and a convolutional neural network [ C-Net]) to evaluate the efficiency of our approach for biomedical relation classification. Our best model achieves an accuracy of 70.78% for 195 classes and 83.09% for five superclasses. Finally, we provide confusion matrix and extensive feature analyses to better examine the relationship between the MeSH qualifiers and the biomedical relations being classified. Our results will hopefully shed light on developing better algorithms for biomedical ontology classification based on the MeSH keywords of PubMed publications. For reproducibility purposes, MeSH2Matrix, as well as all our source codes, are made publicly accessible at https://github.com/SisonkeBiotik-Africa/MeSH2Matrix.

          Supplementary Information

          The online version contains supplementary material available at 10.1186/s13326-024-00319-w.

          Related collections

          Most cited references40

          • Record: found
          • Abstract: not found
          • Article: not found

          ImageNet classification with deep convolutional neural networks

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Biopython: freely available Python tools for computational molecular biology and bioinformatics

            Summary: The Biopython project is a mature open source international collaboration of volunteer developers, providing Python libraries for a wide range of bioinformatics problems. Biopython includes modules for reading and writing different sequence file formats and multiple sequence alignments, dealing with 3D macro molecular structures, interacting with common tools such as BLAST, ClustalW and EMBOSS, accessing key online databases, as well as providing numerical methods for statistical learning. Availability: Biopython is freely available, with documentation and source code at www.biopython.org under the Biopython license. Contact: All queries should be directed to the Biopython mailing lists, see www.biopython.org/wiki/_Mailing_lists peter.cock@scri.ac.uk.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Wikidata

                Bookmark

                Author and article information

                Contributors
                turkiabdelwaheb@hotmail.fr
                Journal
                J Biomed Semantics
                J Biomed Semantics
                Journal of Biomedical Semantics
                BioMed Central (London )
                2041-1480
                2 October 2024
                2 October 2024
                2024
                : 15
                : 18
                Affiliations
                [1 ]Data Engineering and Semantics Research Unit, Faculty of Sciences of Sfax, University of Sfax, ( https://ror.org/04d4sd432) Sfax, Tunisia
                [2 ]GRID grid.510486.e, Mila Quebec AI Institute, ; Montreal, Canada
                [3 ]McGill University, ( https://ror.org/01pxwe438) Montreal, Canada
                [4 ]GRID grid.6936.a, ISNI 0000000123222966, Technical University of Munich, ; Munich, Germany
                [5 ]Faculty of Life Sciences, University of Ilorin, ( https://ror.org/032kdwk38) Ilorin, Nigeria
                [6 ]Laboratory of Probability and Statistics, Faculty of Sciences of Sfax, University of Sfax, ( https://ror.org/04d4sd432) Sfax, Tunisia
                Article
                319
                10.1186/s13326-024-00319-w
                11445994
                39354632
                6e8f6459-802b-4dad-887d-5ad8b3238454
                © The Author(s) 2024

                Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

                History
                : 14 November 2023
                : 31 August 2024
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100011049, Wikimedia Foundation;
                Award ID: Adapting Wikidata to support clinical practice using Data Science, Semantic Web and Machine Learning
                Award ID: Adapting Wikidata to support clinical practice using Data Science, Semantic Web and Machine Learning
                Award ID: Adapting Wikidata to support clinical practice using Data Science, Semantic Web and Machine Learning
                Categories
                Research
                Custom metadata
                © BioMed Central Ltd., part of Springer Nature 2024

                Bioinformatics & Computational biology
                biomedical relation classification,mesh keywords,pubmed records,mesh qualifiers,machine learning,integrated gradients,feature analysis

                Comments

                Comment on this article