      Sequencing bias: comparison of different protocols of MicroRNA library construction

          Abstract

          Background

          MicroRNAs (miRNAs) are small RNAs of 18-25 nt that play critical roles in many biological processes. Most known miRNAs were discovered by conventional cloning followed by Sanger sequencing. Next-generation sequencing (NGS) technologies enable in-depth characterization of the global repertoire of miRNAs, and different protocols for miRNA library construction have been developed. However, the bias that different library preparation protocols may introduce into relative expression levels and sequences has rarely been explored.

          Results

          We assessed three miRNA library preparation protocols (SOLiD and Illumina versions 1 and 1.5) using cloning or SBS sequencing of total RNA samples extracted from skeletal muscle of Hu sheep and Dorper sheep, and then validated 9 miRNAs by qRT-PCR. Our results show that SBS sequencing data correlate highly with Illumina cloning data. Compared with the Illumina data, the SOLiD data show a more dispersed length distribution, greater frequency variation for nucleotides near the 3'- and 5'-ends, a higher frequency of reads containing end secondary structure (ESS), and a higher frequency of reads that do not map to known miRNAs. The qRT-PCR results correlated best with the SOLiD cloning data. The fold differences between Hu sheep and Dorper sheep measured by qRT-PCR and by SBS sequencing correlated well (r = 0.937), and the fold differences of miR-1 and miR-206 were similar across SOLiD cloning, qRT-PCR, and SBS sequencing data.
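          As an illustration of the kind of comparison reported here, the following is a minimal sketch of computing a between-breed fold difference from read counts and a Pearson correlation between two platforms. It is not the authors' pipeline: the function names, the reads-per-total normalization, and all numbers in the usage section are hypothetical placeholders, not data from the study.

```python
import math


def fold_difference(count_a, total_a, count_b, total_b):
    """Fold difference of one miRNA between two samples.

    Assumption (not stated in the abstract): counts are normalized
    by the total number of reads in each library before comparison.
    """
    if min(count_a, total_a, count_b, total_b) <= 0:
        raise ValueError("counts and totals must be positive")
    return (count_a / total_a) / (count_b / total_b)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical fold differences for the same miRNAs measured
    # by two methods. Placeholder values only.
    qpcr_fold = [1.8, 0.6, 2.4, 1.1, 0.9]
    seq_fold = [2.0, 0.5, 2.9, 1.0, 1.2]
    print(pearson_r(qpcr_fold, seq_fold))
```

          Fold differences are often compared on a log2 scale so that up- and down-regulation are symmetric; whether the study did so is not stated in the abstract.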

          Conclusions

          Sequencing depth can influence the quantitative measurement of miRNA abundance, but the discrepancy it caused was not statistically significant, as a high correlation was observed between Illumina cloning and SBS sequencing data. Bias in length distribution, sequence variation, and ESS was observed between data obtained with the different protocols. SOLiD cloning data differ from Illumina cloning data mainly because of distinct methods of adapter ligation. The good correlation between qRT-PCR results and SOLiD data might be due to the similarities of the hybridization-based methods. The fold difference analysis indicated that hybridization-based methods may be superior for quantitative measurement of miRNA abundance. Because the sheep genome sequence is not available, our data may not fully explain the bias in natural miRNA expression in sheep or other mammals; unbiased, artificially synthesized miRNAs would help in evaluating methods of miRNA library preparation.


                Author and article information

                Journal
                BMC Biotechnol
                BMC Biotechnology
                BioMed Central
                1472-6750
                2010
                6 September 2010
                Volume: 10
                Article number: 64
                Affiliations
                [1 ]Beijing Institute of Genomics, Chinese Academy of Science, Beijing 101300, China
                [2 ]The Graduate University of Chinese Academy of Sciences, Beijing 100062, China
                [3 ]Genome Research Institute, ShenZhen University Medical School, ShenZhen 518000, China
                [4 ]Beijing Genomics Institute, Shenzhen 518000, China
                [5 ]Institute of Human Genetics, University of Aarhus, Aarhus DK-8000, Denmark
                Article
                1472-6750-10-64
                10.1186/1472-6750-10-64
                2946280
                20815927
                Copyright ©2010 Tian et al; licensee BioMed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 7 August 2009
                : 6 September 2010
                Categories
                Research Article

                Biotechnology
