1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A Framework for Implementing Machine Learning on Omics Data

      Preprint

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The potential benefits of applying machine learning methods to -omics data are becoming increasingly apparent, especially in clinical settings. However, the unique characteristics of these data are not always well suited to machine learning techniques. These data are often generated across different technologies in different labs, and frequently with high dimensionality. In this paper we present a framework for combining -omics data sets, and for handling high dimensional data, making -omics research more accessible to machine learning applications. We demonstrate the success of this framework through integration and analysis of multi-analyte data for a set of 3,533 breast cancers. We then use this data-set to predict breast cancer patient survival for individuals at risk of an impending event, with higher accuracy and lower variance than methods trained on individual data-sets. We hope that our pipelines for data-set generation and transformation will open up -omics data to machine learning researchers. We have made these freely available for noncommercial use at www.ccg.ai.

          Related collections

          Most cited references5

          • Record: found
          • Abstract: found
          • Article: not found

          A new genome-driven integrated classification of breast cancer and its implications.

          Breast cancer is a group of heterogeneous diseases that show substantial variation in their molecular and clinical characteristics. This heterogeneity poses significant challenges not only in breast cancer management, but also in studying the biology of the disease. Recently, rapid progress has been made in understanding the genomic diversity of breast cancer. These advances led to the characterisation of a new genome-driven integrated classification of breast cancer, which substantially refines the existing classification systems currently used. The novel classification integrates molecular information on the genomic and transcriptomic landscapes of breast cancer to define 10 integrative clusters, each associated with distinct clinical outcomes and providing new insights into the underlying biology and potential molecular drivers. These findings have profound implications both for the individualisation of treatment approaches, bringing us a step closer to the realisation of personalised cancer management in breast cancer, but also provide a new framework for studying the underlying biology of each novel subtype.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            Feature specific quantile normalization enables cross-platform classification of molecular subtypes using gene expression data

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found
              Is Open Access

              Random-projection ensemble classification

                Bookmark

                Author and article information

                Journal
                26 November 2018
                Article
                1811.10455
                1110ad56-7792-40fa-b0da-b8c1653c8684

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                ML4H/2018/102
                Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:cs/0101200
                cs.LG cs.AI q-bio.GN stat.ML

                Machine learning,Artificial intelligence,Genetics
                Machine learning, Artificial intelligence, Genetics

                Comments

                Comment on this article