51
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Analyzing Illumina Gene Expression Microarray Data from Different Tissues: Methodological Aspects of Data Analysis in the MetaXpress Consortium

      research-article
      1 , 2 , 3 , 25 , 4 , 5 , 6 , 7 , 8 , 9 , 7 , 8 , 10 , 11 , 6 , 12 , 8 , 1 , 11 , 13 , 3 , 25 , 2 , 14 , 15 , 16 , 4 , 17 , 8 , 11 , 14 , 16 , 18 , 19 , 6 , 20 , 10 , 21 , 1 , 8 , 12 , 8 , 11 , 17 , 8 , 22 , 23 , 24 , 4 , 5 , 1 , * , 2 , 14 , 15 , * , 3 , 25 , *
      PLoS ONE
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Microarray profiling of gene expression is widely applied in molecular biology and functional genomics. Experimental and technical variations make meta-analysis of different studies challenging. In a total of 3358 samples, all from German population-based cohorts, we investigated the effect of data preprocessing and the variability due to sample processing in whole blood cell and blood monocyte gene expression data, measured on the Illumina HumanHT-12 v3 BeadChip array.

          Gene expression signal intensities were similar after applying the log 2 or the variance-stabilizing transformation. In all cohorts, the first principal component (PC) explained more than 95% of the total variation. Technical factors substantially influenced signal intensity values, especially the Illumina chip assignment (33–48% of the variance), the RNA amplification batch (12–24%), the RNA isolation batch (16%), and the sample storage time, in particular the time between blood donation and RNA isolation for the whole blood cell samples (2–3%), and the time between RNA isolation and amplification for the monocyte samples (2%). White blood cell composition parameters were the strongest biological factors influencing the expression signal intensities in the whole blood cell samples (3%), followed by sex (1–2%) in both sample types. Known single nucleotide polymorphisms (SNPs) were located in 38% of the analyzed probe sequences and 4% of them included common SNPs (minor allele frequency >5%). Out of the tested SNPs, 1.4% significantly modified the probe-specific expression signals (Bonferroni corrected p-value<0.05), but in almost half of these events the signal intensities were even increased despite the occurrence of the mismatch. Thus, the vast majority of SNPs within probes had no significant effect on hybridization efficiency.

          In summary, adjustment for a few selected technical factors greatly improved reliability of gene expression analyses. Such adjustments are particularly required for meta-analyses.

          Related collections

          Most cited references31

          • Record: found
          • Abstract: found
          • Article: not found

          Gene expression and benefit of chemotherapy in women with node-negative, estrogen receptor-positive breast cancer.

          The 21-gene recurrence score (RS) assay quantifies the likelihood of distant recurrence in women with estrogen receptor-positive, lymph node-negative breast cancer treated with adjuvant tamoxifen. The relationship between the RS and chemotherapy benefit is not known. The RS was measured in tumors from the tamoxifen-treated and tamoxifen plus chemotherapy-treated patients in the National Surgical Adjuvant Breast and Bowel Project (NSABP) B20 trial. Cox proportional hazards models were utilized to test for interaction between chemotherapy treatment and the RS. A total of 651 patients were assessable (227 randomly assigned to tamoxifen and 424 randomly assigned to tamoxifen plus chemotherapy). The test for interaction between chemotherapy treatment and RS was statistically significant (P = .038). Patients with high-RS (> or = 31) tumors (ie, high risk of recurrence) had a large benefit from chemotherapy (relative risk, 0.26; 95% CI, 0.13 to 0.53; absolute decrease in 10-year distant recurrence rate: mean, 27.6%; SE, 8.0%). Patients with low-RS (< 18) tumors derived minimal, if any, benefit from chemotherapy treatment (relative risk, 1.31; 95% CI, 0.46 to 3.78; absolute decrease in distant recurrence rate at 10 years: mean, -1.1%; SE, 2.2%). Patients with intermediate-RS tumors did not appear to have a large benefit, but the uncertainty in the estimate can not exclude a clinically important benefit. The RS assay not only quantifies the likelihood of breast cancer recurrence in women with node-negative, estrogen receptor-positive breast cancer, but also predicts the magnitude of chemotherapy benefit.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Oncomine 3.0: genes, pathways, and networks in a collection of 18,000 cancer gene expression profiles.

            DNA microarrays have been widely applied to cancer transcriptome analysis; however, the majority of such data are not easily accessible or comparable. Furthermore, several important analytic approaches have been applied to microarray analysis; however, their application is often limited. To overcome these limitations, we have developed Oncomine, a bioinformatics initiative aimed at collecting, standardizing, analyzing, and delivering cancer transcriptome data to the biomedical research community. Our analysis has identified the genes, pathways, and networks deregulated across 18,000 cancer gene expression microarrays, spanning the majority of cancer types and subtypes. Here, we provide an update on the initiative, describe the database and analysis modules, and highlight several notable observations. Results from this comprehensive analysis are available at http://www.oncomine.org.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              RankProd: a bioconductor package for detecting differentially expressed genes in meta-analysis.

              While meta-analysis provides a powerful tool for analyzing microarray experiments by combining data from multiple studies, it presents unique computational challenges. The Bioconductor package RankProd provides a new and intuitive tool for this purpose in detecting differentially expressed genes under two experimental conditions. The package modifies and extends the rank product method proposed by Breitling et al., [(2004) FEBS Lett., 573, 83-92] to integrate multiple microarray studies from different laboratories and/or platforms. It offers several advantages over t-test based methods and accepts pre-processed expression datasets produced from a wide variety of platforms. The significance of the detection is assessed by a non-parametric permutation test, and the associated P-value and false discovery rate (FDR) are included in the output alongside the genes that are detected by user-defined criteria. A visualization plot is provided to view actual expression levels for each gene with estimated significance measurements. RankProd is available at Bioconductor http://www.bioconductor.org. A web-based interface will soon be available at http://cactus.salk.edu/RankProd
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, USA )
                1932-6203
                2012
                7 December 2012
                : 7
                : 12
                : e50938
                Affiliations
                [1 ]Interfaculty Institute for Genetics and Functional Genomics, Ernst-Moritz-Arndt-University Greifswald, Greifswald, Germany
                [2 ]Institute of Human Genetics, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
                [3 ]Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany
                [4 ]Klinik für Allgemeine und Interventionelle Kardiologie, Universitäres Herzzentrum Hamburg, Hamburg, Germany
                [5 ]DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Hamburg, Germany
                [6 ]Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Düsseldorf, Germany
                [7 ]Department of Internal Medicine B, University Medicine Greifswald, Greifswald, Germany
                [8 ]DZHK (German Centre for Cardiovascular Research), partner site Greifswald, Greifswald, Germany
                [9 ]Institute of Anatomy and Cell Biology, University Medicine Greifswald, Greifswald, Germany
                [10 ]Institute of Genetic Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
                [11 ]Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
                [12 ]Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
                [13 ]Medical School Hannover, Hannover Unified Biobank, Hannover, Germany
                [14 ]DZHK (German Centre for Cardiovascular Research), partner site Munich, Munich, Germany
                [15 ]Institut für Humangenetik, Technische Universität München, München, Germany
                [16 ]Munich Heart Alliance, Munich, Germany
                [17 ]Institute of Clinical Chemistry and Laboratory Medicine, University Medicine Greifswald, Greifswald, Germany
                [18 ]Institute of Epidemiology II, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
                [19 ]Institute of Physiology, University Medicine Greifswald, Karlsburg, Germany
                [20 ]Department of Metabolic Diseases, University Hospital Düsseldorf, Heinrich-Heine University, Düsseldorf, Germany
                [21 ]Institute of Medical Informatics, Biometry and Epidemiology, Chair of Genetic Epidemiology, Ludwig-Maximilians-Universität, Munich, Germany
                [22 ]Center for Thrombosis and Hemostasis, University Medical Center of the Johannes Gutenberg-University Mainz, Mainz, Germany
                [23 ]Department of Medicine 2, University Medical Center of the Johannes Gutenberg-University Mainz, Mainz, Germany
                [24 ]DZHK (German Centre for Cardiovascular Research), partner site Rhine-Main, Mainz, Germany
                [25 ]DZHK (German Centre for Cardiovascular Research), partner site Hamburg/Kiel/Lübeck, Lübeck, Germany
                Ernst-Moritz-Arndt-University Greifswald, Germany
                Author notes

                ¶ These authors also contributed equally to this work.

                Competing Interests: The authors have read the journal's policy and have the following interest: They received funding from a commercial source (Siemens Healthcare, Erlangen, Germany, InterSystems GmbH, Boehringer Ingelheim, PHILIPS Medical Systems). There are no patents, products in development or marketed products to declare. This does not alter the authors' adherence to all the PLOS ONE policies on sharing data and materials. Co-authors Tanja Zeller and Christian Herder are PLOS ONE Editorial Board members. This does not alter their adherence to all the PLOS ONE policies on sharing data and materials.

                Conceived and designed the experiments: CS KH AT HP AZ. Analyzed the data: CS KH AS JK CM AT AZ. Wrote the paper: CS KH GH RR HW TZ AT HP AZ. Sample collection and preparation: SB MC MD SBF CG HG CH WH GH TI TM MN AP RR MR KS UV HV SW HW PSW TZ HP. Interpretation of data: CS KH AS SB KE CG GH JK TM CM RR UV TZ AT HP AZ. Review and revision of the manuscript: CS KH AS SB MC MD KE SBF CG HG CH WH GH TI JK TM CM MN AP RR MR KS UV HV SW HW PSW TZ AT HP AZ.

                Article
                PONE-D-12-28349
                10.1371/journal.pone.0050938
                3517598
                23236413
                b25412d5-2f59-404c-8ba4-0ce6d1a7ff86
                Copyright @ 2012

                This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 17 September 2012
                : 22 October 2012
                Page count
                Pages: 14
                Funding
                SHIP is part of the Community Medicine Research net of the University of Greifswald, Germany, which is funded by the BMBF (German Ministry of Education and Research, http://www.bmbf.de), the Ministry of Cultural Affairs ( http://www.regierung-mv.de/cms2/Regierungsportal_prod/Regierungsportal/de/bm/) as well as the Social Ministry of the Federal State of Mecklenburg-West Pomerania ( http://www.regierung-mv.de/cms2/Regierungsportal_prod/Regierungsportal/de/sm/). Analyses were supported by the “Greifswald Approach to Individualized Medicine (GANI_MED, http://www.gani-med.de/)” consortium funded by the BMBF (grant 03IS2061A). Genome-wide genotyping and expression data have been supported by the BMBF (grant no. 03ZIK012) and a joint grant from Siemens Healthcare, Erlangen, Germany ( http://www.siemens.com/) and the Federal State of Mecklenburg, West Pomerania ( http://www.regierung-mv.de/). The University of Greifswald is a member of the ‘Center of Knowledge Interchange’ program of the Siemens AG and the Caché Campus program of the InterSystems GmbH ( http://www.intersystems.com). The KORA research platform and the KORA Augsburg studies are financed by the Helmholtz Zentrum München, German Research Center for Environmental Health ( http://www.helmholtz-muenchen.de/), which is funded by the BMBF and by the State of Bavaria ( http://www.bayern.de/). The German Diabetes Center is funded by the German Federal Ministry of Health ( http://www.bmg.bund.de/) and the Ministry of School, Science and Research of the State of North-Rhine-Westphalia ( http://www.innovation.nrw.de/). The Diabetes Cohort Study was funded by a German Research Foundation ( http://www.dfg.de) project grant to W.R. (DFG; RA 459/2-1). This study was supported in part by a grant from the BMBF to the German Center for Diabetes Research (DZD e.V., http://www.dzd-ev.de/). This work was supported by the BMBF funded Systems Biology of Metabotypes grant (SysMBo#0315494A). Additional support was obtained from the BMBF (National Genome Research Network NGFNplus Atherogenomics, 01GS0834) and the Leibniz Association ( http://www.wgl.de/) (WGL Pakt für Forschung und Innovation). The Gutenberg Health Study is funded through the government of Rheinland-Pfalz ( http://www.rlp.de/) (“Stiftung Rheinland Pfalz für Innovation”, contract AZ 961–386261/733), the research programs “Wissen schafft Zukunft” and “Schwerpunkt Vaskuläre Prävention” of the Johannes Gutenberg-University of Mainz ( http://www.uni-mainz.de/), and its contract with Boehringer Ingelheim ( http://www.boehringer-ingelheim.de/) and PHILIPS Medical Systems ( http://www.healthcare.philips.com/), including an unrestricted grant for the Gutenberg Health Study. Specifically, the research reported in this article was supported by the National Genome Network “NGFNplus” ( http://www.ngfn.de/en/start.html) (contract 01GS0833 and 01GS0831) by the BMBF, and a joint funding grant from the BMBF, and the Agence Nationale de la Recherche, France ( http://www.agence-nationale-recherche.fr/) (contract BMBF 01KU0908A and ANR 09 GENO 106 01). This work was supported in part by the European Union ( http://europa.eu/) (HEALTH-2011-278913), the BMBF (grants 01KU0908A, 01KU0908B, 0315536F), and supported by the DZHK (Deutsches Zentrum für Herz-Kreislauf-Forschung – German Centre for Cardiovascular Research, http://www.bmbf.de/de/16542.php, http://www.dzhk.de) and by the BMBF. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology
                Computational Biology
                Genomics
                Genome Analysis Tools
                Genome-Wide Association Studies
                Molecular Genetics
                Gene Expression
                Microarrays
                Genetics
                Human Genetics
                Genome-Wide Association Studies
                Gene Expression
                Genome-Wide Association Studies
                Genomics
                Genome Analysis Tools
                Genome-Wide Association Studies
                Genome Sequencing
                Engineering
                Signal Processing
                Array Processing
                Medicine
                Clinical Research Design
                Statistical Methods

                Uncategorized
                Uncategorized

                Comments

                Comment on this article