5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      An improved random forest-based computational model for predicting novel miRNA-disease associations

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          A large body of evidence shows that miRNA regulates the expression of its target genes at post-transcriptional level and the dysregulation of miRNA is related to many complex human diseases. Accurately discovering disease-related miRNAs is conductive to the exploring of the pathogenesis and treatment of diseases. However, because of the limitation of time-consuming and expensive experimental methods, predicting miRNA-disease associations by computational models has become a more economical and effective mean.

          Results

          Inspired by the work of predecessors, we proposed an improved computational model based on random forest (RF) for identifying miRNA-disease associations (IRFMDA). First, the integrated similarity of diseases and the integrated similarity of miRNAs were calculated by combining the semantic similarity and Gaussian interaction profile kernel (GIPK) similarity of diseases, the functional similarity and GIPK similarity of miRNAs, respectively. Then, the integrated similarity of diseases and the integrated similarity of miRNAs were combined to represent each miRNA-disease relationship pair. Next, the miRNA-disease relationship pairs contained in the HMDD (v2.0) database were considered positive samples, and the randomly constructed miRNA-disease relationship pairs not included in HMDD (v2.0) were considered negative samples. Next, the feature selection based on the variable importance score of RF was performed to choose more useful features to represent samples to optimize the model’s ability of inferring miRNA-disease associations. Finally, a RF regression model was trained on reduced sample space to score the unknown miRNA-disease associations. The AUCs of IRFMDA under local leave-one-out cross-validation (LOOCV), global LOOCV and 5-fold cross-validation achieved 0.8728, 0.9398 and 0.9363, which were better than several excellent models for predicting miRNA-disease associations. Moreover, case studies on oesophageal cancer, lymphoma and lung cancer showed that 94 (oesophageal cancer), 98 (lymphoma) and 100 (lung cancer) of the top 100 disease-associated miRNAs predicted by IRFMDA were supported by the experimental data in the dbDEMC (v2.0) database.

          Conclusions

          Cross-validation and case studies demonstrated that IRFMDA is an excellent miRNA-disease association prediction model, and can provide guidance and help for experimental studies on the regulatory mechanism of miRNAs in complex human diseases in the future.

          Related collections

          Most cited references28

          • Record: found
          • Abstract: not found
          • Article: not found

          Medical Subject Headings (MeSH).

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            RWRMDA: predicting novel human microRNA-disease associations.

            Recently, more and more research has shown that microRNAs (miRNAs) play critical roles in the development and progression of various diseases, but it is not easy to predict potential human miRNA-disease associations from the vast amount of biological data. Computational methods for predicting potential disease-miRNA associations have gained a lot of attention based on their feasibility, guidance and effectiveness. Differing from traditional local network similarity measures, we adopted global network similarity measures and developed Random Walk with Restart for MiRNA-Disease Association (RWRMDA) to infer potential miRNA-disease interactions by implementing random walk on the miRNA-miRNA functional similarity network. We tested RWRMDA on 1616 known miRNA-disease associations based on leave-one-out cross-validation, and achieved an area under the ROC curve of 86.17%, which significantly improves previous methods. The method was also applied to three cancers for accuracy evaluation. As a result, 98% (Breast cancer), 74% (Colon cancer), and 88% (Lung cancer) of top 50 predicted miRNAs are confirmed by published experiments. These results suggest that RWRMDA will represent an important bioinformatics resource in biomedical research of both miRNAs and diseases.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              WBSMDA: Within and Between Score for MiRNA-Disease Association prediction

              Increasing evidences have indicated that microRNAs (miRNAs) are functionally associated with the development and progression of various complex human diseases. However, the roles of miRNAs in multiple biological processes or various diseases and their underlying molecular mechanisms still have not been fully understood yet. Predicting potential miRNA-disease associations by integrating various heterogeneous biological datasets is of great significance to the biomedical research. Computational methods could obtain potential miRNA-disease associations in a short time, which significantly reduce the experimental time and cost. Considering the limitations in previous computational methods, we developed the model of Within and Between Score for MiRNA-Disease Association prediction (WBSMDA) to predict potential miRNAs associated with various complex diseases. WBSMDA could be applied to the diseases without any known related miRNAs. The AUC of 0.8031 based on Leave-one-out cross validation has demonstrated its reliable performance. WBSMDA was further applied to Colon Neoplasms, Prostate Neoplasms, and Lymphoma for the identification of their potential related miRNAs. As a result, 90%, 84%, and 80% of predicted miRNA-disease pairs in the top 50 prediction list for these three diseases have been confirmed by recent experimental literatures, respectively. It is anticipated that WBSMDA would be a useful resource for potential miRNA-disease association identification.
                Bookmark

                Author and article information

                Contributors
                ydkvictory@hrbust.edu.cn
                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                3 December 2019
                3 December 2019
                2019
                : 20
                : 624
                Affiliations
                [1 ]ISNI 0000 0000 8621 1394, GRID grid.411994.0, School of Software and Microelectronics, , Harbin University of Science and Technology, ; Harbin, 150080 China
                [2 ]ISNI 0000 0004 1763 3496, GRID grid.484612.d, College of Computer Science and Technology, , Heilongjiang Institute of Technology, ; Harbin, 150050 China
                [3 ]ISNI 0000 0001 2224 0361, GRID grid.59025.3b, School of Computer Science and Engineering, , Nanyang Technological University, ; Singapore, 639798 Singapore
                Author information
                http://orcid.org/0000-0002-4974-3054
                Article
                3290
                10.1186/s12859-019-3290-7
                6889672
                31795954
                a6aa8cf3-6c85-4507-b61b-a8e5afe56248
                © The Author(s). 2019

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 18 September 2019
                : 21 November 2019
                Funding
                Funded by: Harbin Science and Technology Bureau (CN)
                Award ID: 2017RAQXJ027
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100004543, China Scholarship Council;
                Funded by: Fundamental Research Foundation for Universities of Heilongjiang Province
                Award ID: LGYC2018JQ003
                Award Recipient :
                Categories
                Research Article
                Custom metadata
                © The Author(s) 2019

                Bioinformatics & Computational biology
                disease,mirna,mirna-disease association prediction,feature selection,random forest

                Comments

                Comment on this article