188
views
0
recommends
+1 Recommend
0 collections
    8
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control consortium

      research-article
      SEQC Consortium
      Nature biotechnology

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          We present primary results from the Sequencing Quality Control (SEQC) project, coordinated by the United States Food and Drug Administration. Examining Illumina HiSeq, Life Technologies SOLiD and Roche 454 platforms at multiple laboratory sites using reference RNA samples with built-in controls, we assess RNA sequencing (RNA-seq) performance for junction discovery and differential expression profiling and compare it to microarray and quantitative PCR (qPCR) data using complementary metrics. At all sequencing depths, we discover unannotated exon-exon junctions, with >80% validated by qPCR. We find that measurements of relative expression are accurate and reproducible across sites and platforms if specific filters are used. In contrast, RNA-seq and microarrays do not provide accurate absolute measurements, and gene-specific biases are observed, for these and qPCR. Measurement performance depends on the platform and data analysis pipeline, and variation is large for transcript-level profiling. The complete SEQC data sets, comprising >100 billion reads (10Tb), provide unique resources for evaluating RNA-seq analyses for clinical and regulatory settings.

          Related collections

          Most cited references32

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy

          The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of genomic, transcript and protein sequence records. These records are selected and curated from public sequence archives and represent a significant reduction in redundancy compared to the volume of data archived by the International Nucleotide Sequence Database Collaboration. The database includes over 16 000 organisms, 2.4 × 106 genomic records, 13 × 106 proteins and 2 × 106 RNA records spanning prokaryotes, eukaryotes and viruses (RefSeq release 49, September 2011). The RefSeq database is maintained by a combined approach of automated analyses, collaboration and manual curation to generate an up-to-date representation of the sequence, its features, names and cross-links to related sources of information. We report here on recent growth, the status of curating the human RefSeq data set, more extensive feature annotation and current policy for eukaryotic genome annotation via the NCBI annotation pipeline. More information about the resource is available online (see http://www.ncbi.nlm.nih.gov/RefSeq/).
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Variance stabilization applied to microarray data calibration and to the quantification of differential expression.

            We introduce a statistical model for microarray gene expression data that comprises data calibration, the quantification of differential expression, and the quantification of measurement error. In particular, we derive a transformation h for intensity measurements, and a difference statistic Deltah whose variance is approximately constant along the whole intensity range. This forms a basis for statistical inference from microarray data, and provides a rational data pre-processing strategy for multivariate analyses. For the transformation h, the parametric form h(x)=arsinh(a+bx) is derived from a model of the variance-versus-mean dependence for microarray intensity data, using the method of variance stabilizing transformations. For large intensities, h coincides with the logarithmic transformation, and Deltah with the log-ratio. The parameters of h together with those of the calibration between experiments are estimated with a robust variant of maximum-likelihood estimation. We demonstrate our approach on data sets from different experimental platforms, including two-colour cDNA arrays and a series of Affymetrix oligonucleotide arrays.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Twenty-five years of quantitative PCR for gene expression analysis.

              Following its invention 25 years ago, PCR has been adapted for numerous molecular biology applications. Gene expression analysis by reverse-transcription quantitative PCR (RT-qPCR) has been a key enabling technology of the post-genome era. Since the founding of BioTechniques, this journal has been a resource for the improvements in qPCR technology, experimental design, and data analysis. qPCR and, more specifically, real-time qPCR has become a routine and robust approach for measuring the expression of genes of interest, validating microarray experiments, and monitoring biomarkers. The use of real-time qPCR has nearly supplanted other approaches (e.g., Northern blotting, RNase protection assays). This review examines the current state of qPCR for gene expression analysis now that the method has reached a mature stage of development and implementation. Specifically, the different fluorescent reporter technologies of real-time qPCR are discussed as well as the selection of endogenous controls. The conceptual framework for data analysis methods is also presented to demystify these analysis techniques. The future of qPCR remains bright as the technology becomes more rapid, cost-effective, easier to use, and capable of higher throughput.
                Bookmark

                Author and article information

                Journal
                9604648
                20305
                Nat Biotechnol
                Nat. Biotechnol.
                Nature biotechnology
                1087-0156
                1546-1696
                5 November 2014
                24 August 2014
                September 2014
                01 March 2015
                : 32
                : 9
                : 903-914
                Author notes
                Correspondence and requests for materials should be addressed to L.S. ( leming.shi@ 123456gmail.com ), C.E.M. ( chm2042@ 123456med.cornell.edu ) or D.P.K. ( david.kreil@ 123456boku.ac.at / d.kreil@ 123456warwick.ac.uk )
                Article
                NIHMS604610
                10.1038/nbt.2957
                4321899
                25150838
                c7c36332-d081-40c2-9856-044b88623147
                History
                Categories
                Article

                Biotechnology
                Biotechnology

                Comments

                Comment on this article