186
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Integration of copy number and transcriptomics provides risk stratification in prostate cancer: A discovery and validation cohort study

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Understanding the heterogeneous genotypes and phenotypes of prostate cancer is fundamental to improving the way we treat this disease. As yet, there are no validated descriptions of prostate cancer subgroups derived from integrated genomics linked with clinical outcome.

          Methods

          In a study of 482 tumour, benign and germline samples from 259 men with primary prostate cancer, we used integrative analysis of copy number alterations (CNA) and array transcriptomics to identify genomic loci that affect expression levels of mRNA in an expression quantitative trait loci (eQTL) approach, to stratify patients into subgroups that we then associated with future clinical behaviour, and compared with either CNA or transcriptomics alone.

          Findings

          We identified five separate patient subgroups with distinct genomic alterations and expression profiles based on 100 discriminating genes in our separate discovery and validation sets of 125 and 103 men. These subgroups were able to consistently predict biochemical relapse (p = 0.0017 and p = 0.016 respectively) and were further validated in a third cohort with long-term follow-up (p = 0.027). We show the relative contributions of gene expression and copy number data on phenotype, and demonstrate the improved power gained from integrative analyses. We confirm alterations in six genes previously associated with prostate cancer ( MAP3K7, MELK, RCBTB2, ELAC2, TPD52, ZBTB4), and also identify 94 genes not previously linked to prostate cancer progression that would not have been detected using either transcript or copy number data alone. We confirm a number of previously published molecular changes associated with high risk disease, including MYC amplification, and NKX3-1, RB1 and PTEN deletions, as well as over-expression of PCA3 and AMACR, and loss of MSMB in tumour tissue. A subset of the 100 genes outperforms established clinical predictors of poor prognosis (PSA, Gleason score), as well as previously published gene signatures (p = 0.0001). We further show how our molecular profiles can be used for the early detection of aggressive cases in a clinical setting, and inform treatment decisions.

          Interpretation

          For the first time in prostate cancer this study demonstrates the importance of integrated genomic analyses incorporating both benign and tumour tissue data in identifying molecular alterations leading to the generation of robust gene sets that are predictive of clinical outcome in independent patient cohorts.

          Highlights

          • Integrated genomic profiling of 259 men with prostate cancer Cambridge discovery cohort and Stockholm validation cohort

          • 100-feature gene set that reliably differentiates five subgroups (iClusters) of prostate cancer

          • Prognostic gene signature that predicts relapse-free survival

          Studies in other cancers have shown the advantage of integrated genomic approaches in stratifying disease (e.g. Curtis et al., 2012, breast cancer) but such approaches have not yet been undertaken in prostate cancer. In this study we conducted a comprehensive integrated analysis of a Cambridge discovery and Stockholm validation cohort to stratify men into 5 molecular clusters of varying clinical risk based on copy number and gene expression profiling of 100 key genes. The study then showed that we could predict disease relapse based on a refined subgroup of this gene set and showed the superiority of this ‘signature’ compared to other available signatures. The study introduces to the prostate cancer research community 259 men who have been profiled with copy number, transcript expression, TMPRSS2:ERG gene fusion status and tissue microarray data.

          Related collections

          Most cited references66

          • Record: found
          • Abstract: found
          • Article: not found

          Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles

          Although genomewide RNA expression analysis has become a routine tool in biomedical research, extracting biological insight from such information remains a major challenge. Here, we describe a powerful analytical method called Gene Set Enrichment Analysis (GSEA) for interpreting gene expression data. The method derives its power by focusing on gene sets, that is, groups of genes that share common biological function, chromosomal location, or regulation. We demonstrate how GSEA yields insights into several cancer-related data sets, including leukemia and lung cancer. Notably, where single-gene analysis finds little similarity between two independent studies of patient survival in lung cancer, GSEA reveals many biological pathways in common. The GSEA method is embodied in a freely available software package, together with an initial database of 1,325 biologically defined gene sets.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Adjusting batch effects in microarray expression data using empirical Bayes methods.

            Non-biological experimental variation or "batch effects" are commonly observed across multiple batches of microarray experiments, often rendering the task of combining data from these batches difficult. The ability to combine microarray data sets is advantageous to researchers to increase statistical power to detect biological phenomena from studies where logistical considerations restrict sample size or in studies that require the sequential hybridization of arrays. In general, it is inappropriate to combine data sets without adjusting for batch effects. Methods have been proposed to filter batch effects from data, but these are often complicated and require large batch sizes ( > 25) to implement. Because the majority of microarray studies are conducted using much smaller sample sizes, existing methods are not sufficient. We propose parametric and non-parametric empirical Bayes frameworks for adjusting data for batch effects that is robust to outliers in small sample sizes and performs comparable to existing methods for large samples. We illustrate our methods using two example data sets and show that our methods are justifiable, easy to apply, and useful in practice. Software for our method is freely available at: http://biosun1.harvard.edu/complab/batch/.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found

              The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups

              The elucidation of breast cancer subgroups and their molecular drivers requires integrated views of the genome and transcriptome from representative numbers of patients. We present an integrated analysis of copy number and gene expression in a discovery and validation set of 997 and 995 primary breast tumours, respectively, with long-term clinical follow-up. Inherited variants (copy number variants and single nucleotide polymorphisms) and acquired somatic copy number aberrations (CNAs) were associated with expression in ~40% of genes, with the landscape dominated by cis- and trans-acting CNAs. By delineating expression outlier genes driven in cis by CNAs, we identified putative cancer genes, including deletions in PPP2R2A, MTAP and MAP2K4. Unsupervised analysis of paired DNA–RNA profiles revealed novel subgroups with distinct clinical outcomes, which reproduced in the validation cohort. These include a high-risk, oestrogen-receptor-positive 11q13/14 cis-acting subgroup and a favourable prognosis subgroup devoid of CNAs. Trans-acting aberration hotspots were found to modulate subgroup-specific gene networks, including a TCR deletion-mediated adaptive immune response in the ‘CNA-devoid’ subgroup and a basal-specific chromosome 5 deletion-associated mitotic network. Our results provide a novel molecular stratification of the breast cancer population, derived from the impact of somatic CNAs on the transcriptome.
                Bookmark

                Author and article information

                Contributors
                Journal
                EBioMedicine
                EBioMedicine
                EBioMedicine
                Elsevier
                2352-3964
                29 July 2015
                September 2015
                29 July 2015
                : 2
                : 9
                : 1133-1144
                Affiliations
                [a ]Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge CB2 0RE, UK
                [b ]Department of Urology, Addenbrooke's Hospital, Cambridge CB2 2QQ, UK
                [c ]Academic Urology Group, University of Cambridge, Cambridge, CB2 0QQ, UK
                [d ]Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
                [e ]Department of Oncology–Pathology, Karolinska Institutet, Stockholm, Sweden
                [f ]Nuffield Department of Surgical Sciences, University of Oxford, Roosevelt Drive, Oxford, UK
                [g ]Molecular Diagnostics and Therapeutics Group, University College London, WC1E 6BT, UK
                [h ]University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK
                [i ]Department of Pathology, Addenbrooke's Hospital, Cambridge CB2 2QQ, UK
                [j ]Prostate Cancer Research Group, Centre for Molecular Medicine Norway, Nordic EMBL Partnership, University of Oslo and Oslo University Hospital, N-0318 Oslo, Norway
                [k ]Department of Molecular Oncology, Institute of Cancer Research, Oslo University Hospitals, N-0424 Oslo, Norway
                [l ]Prostate Cancer UK/Movember Centre of Excellence for Prostate Cancer Research, Centre for Cancer Research and Cell Biology, Queen's University, Belfast, UK
                Author notes
                [* ]Corresponding author. Alastair.Lamb@ 123456cruk.cam.ac.uk
                [1]

                These authors contributed equally to this work.

                [2]

                List of participants and affiliations appear at the end of the paper.

                Article
                S2352-3964(15)30071-2
                10.1016/j.ebiom.2015.07.017
                4588396
                26501111
                ff78f597-3d0a-4199-8e2e-a992e0f9960e
                Crown Copyright © 2015 Published by Elsevier B.V.

                This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

                History
                : 12 June 2015
                : 10 July 2015
                : 14 July 2015
                Categories
                Research Paper

                prostate cancer,risk stratification,genomics,prognosis,gene signature,biochemical relapse,personalised medicine

                Comments

                Comment on this article