180
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations associated with these tumor types. Although many important discoveries have been made by TCGA's research network, opportunities still exist to implement novel methods, thereby elucidating new biological pathways and diagnostic markers. However, mining the TCGA data presents several bioinformatics challenges, such as data retrieval and integration with clinical data and other molecular data types (e.g. RNA and DNA methylation). We developed an R/Bioconductor package called TCGAbiolinks to address these challenges and offer bioinformatics solutions by using a guided workflow to allow users to query, download and perform integrative analyses of TCGA data. We combined methods from computer science and statistics into the pipeline and incorporated methodologies developed in previous TCGA marker studies and in our own group. Using four different TCGA tumor types (Kidney, Brain, Breast and Colon) as examples, we provide case studies to illustrate examples of reproducibility, integrative analysis and utilization of different Bioconductor packages to advance and accelerate novel discoveries.

          Related collections

          Most cited references13

          • Record: found
          • Abstract: not found
          • Article: not found

          Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal.

            The cBioPortal for Cancer Genomics (http://cbioportal.org) provides a Web resource for exploring, visualizing, and analyzing multidimensional cancer genomics data. The portal reduces molecular profiling data from cancer tissues and cell lines into readily understandable genetic, epigenetic, gene expression, and proteomic events. The query interface combined with customized data storage enables researchers to interactively explore genetic alterations across samples, genes, and pathways and, when available in the underlying data, to link these to clinical outcomes. The portal provides graphical summaries of gene-level data from multiple platforms, network visualization and analysis, survival analysis, patient-centric queries, and software programmatic access. The intuitive Web interface of the portal makes complex cancer genomics profiles accessible to researchers and clinicians without requiring bioinformatics expertise, thus facilitating biological discoveries. Here, we provide a practical guide to the analysis and visualization features of the cBioPortal for Cancer Genomics.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin.

              Recent genomic analyses of pathologically defined tumor types identify "within-a-tissue" disease subtypes. However, the extent to which genomic signatures are shared across tissues is still unclear. We performed an integrative analysis using five genome-wide platforms and one proteomic platform on 3,527 specimens from 12 cancer types, revealing a unified classification into 11 major subtypes. Five subtypes were nearly identical to their tissue-of-origin counterparts, but several distinct cancer types were found to converge into common subtypes. Lung squamous, head and neck, and a subset of bladder cancers coalesced into one subtype typified by TP53 alterations, TP63 amplifications, and high expression of immune and proliferation pathway genes. Of note, bladder cancers split into three pan-cancer subtypes. The multiplatform classification, while correlated with tissue-of-origin, provides independent information for predicting clinical outcomes. All data sets are available for data-mining from a unified resource to support further biological discoveries and insights into novel therapeutic strategies. Copyright © 2014 Elsevier Inc. All rights reserved.
                Bookmark

                Author and article information

                Journal
                Nucleic Acids Res
                Nucleic Acids Res
                nar
                nar
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                05 May 2016
                23 December 2015
                23 December 2015
                : 44
                : 8
                : e71
                Affiliations
                [1 ]Interuniversity Institute of Bioinformatics in Brussels (IB) 2, Brussels, Belgium
                [2 ]Machine Learning Group (MLG), Department d'Informatique, Université libre de Bruxelles (ULB), Brussels, Belgium
                [3 ]Department of Genetics Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, São Paulo, Brazil
                [4 ]Center for Integrative Systems Biology - CISBi, NAP/USP, Ribeirão Preto, São Paulo, Brazil
                [5 ]Department of Science and Technology, University of Sannio, Benevento, Italy
                [6 ]Unlimited Software srl, Naples, Italy
                [7 ]Institute of Molecular Bioimaging and Physiology of the National Research Council (IBFM-CNR), Milan, Italy
                [8 ]Physics for Complex Systems, Department of Physics, University of Turin, Italy
                [9 ]Bioinformatics Laboratory, BIOGEM, Ariano Irpino, Avellino, Italy
                [10 ]Qatar Computing Research Institute (QCRI), HBKU, Doha, Qatar
                Author notes
                [* ]To whom correspondence should be addressed. Tel: +1 310 570 2362; Fax: +55 16 3315 0222; Email: houtan@ 123456usp.br
                Correspondence may also be addressed to Gianluca Bontempi. Tel: +32 2 650 55 91; Fax: +32 2 650 56 09; Email: gbonte@ 123456ulb.ac.be
                []These authors contributed equally to the paper as first authors.
                Author information
                http://orcid.org/0000-0001-8582-0865
                Article
                10.1093/nar/gkv1507
                4856967
                26704973
                b172cd81-1268-4acc-b79a-600e986cb6f3
                © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@ 123456oup.com

                History
                : 10 December 2015
                : 06 December 2015
                : 22 October 2015
                Page count
                Pages: 11
                Categories
                7
                24
                Methods Online
                Custom metadata
                05 May 2016

                Genetics
                Genetics

                Comments

                Comment on this article