60
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Methylated DNA is over-represented in whole-genome bisulfite sequencing data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The development of whole-genome bisulfite sequencing (WGBS) has resulted in a number of exciting discoveries about the role of DNA methylation leading to a plethora of novel testable hypotheses. Methods for constructing sodium bisulfite-converted and amplified libraries have recently advanced to the point that the bottleneck for experiments that use WGBS has shifted to data analysis and interpretation. Here we present empirical evidence for an over-representation of reads from methylated DNA in WGBS. This enrichment for methylated DNA is exacerbated by higher cycles of PCR and is influenced by the type of uracil-insensitive DNA polymerase used for amplifying the sequencing library. Future efforts to computationally correct for this enrichment bias will be essential to increasing the accuracy of determining methylation levels for individual cytosines. It is especially critical for studies that seek to accurately quantify DNA methylation levels in populations that may segregate for allelic DNA methylation states.

          Related collections

          Most cited references34

          • Record: found
          • Abstract: found
          • Article: not found

          Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning.

          Cytosine DNA methylation is important in regulating gene expression and in silencing transposons and other repetitive sequences. Recent genomic studies in Arabidopsis thaliana have revealed that many endogenous genes are methylated either within their promoters or within their transcribed regions, and that gene methylation is highly correlated with transcription levels. However, plants have different types of methylation controlled by different genetic pathways, and detailed information on the methylation status of each cytosine in any given genome is lacking. To this end, we generated a map at single-base-pair resolution of methylated cytosines for Arabidopsis, by combining bisulphite treatment of genomic DNA with ultra-high-throughput sequencing using the Illumina 1G Genome Analyser and Solexa sequencing technology. This approach, termed BS-Seq, unlike previous microarray-based methods, allows one to sensitively measure cytosine methylation on a genome-wide scale within specific sequence contexts. Here we describe methylation on previously inaccessible components of the genome and analyse the DNA methylation sequence composition and distribution. We also describe the effect of various DNA methylation mutants on genome-wide methylation patterns, and demonstrate that our newly developed library construction and computational methods can be applied to large genomes such as that of mouse.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana.

            To take complete advantage of information on within-species polymorphism and divergence from close relatives, one needs to know the rate and the molecular spectrum of spontaneous mutations. To this end, we have searched for de novo spontaneous mutations in the complete nuclear genomes of five Arabidopsis thaliana mutation accumulation lines that had been maintained by single-seed descent for 30 generations. We identified and validated 99 base substitutions and 17 small and large insertions and deletions. Our results imply a spontaneous mutation rate of 7 x 10(-9) base substitutions per site per generation, the majority of which are G:C-->A:T transitions. We explain this very biased spectrum of base substitution mutations as a result of two main processes: deamination of methylated cytosines and ultraviolet light-induced mutagenesis.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands.

              The modulation of DNA-protein interactions by methylation of protein-binding sites in DNA and the occurrence in genomic imprinting, X chromosome inactivation, and fragile X syndrome of different methylation patterns in DNA of different chromosomal origin have underlined the need to establish methylation patterns in individual strands of particular genomic sequences. We report a genomic sequencing method that provides positive identification of 5-methylcytosine residues and yields strand-specific sequences of individual molecules in genomic DNA. The method utilizes bisulfite-induced modification of genomic DNA, under conditions whereby cytosine is converted to uracil, but 5-methylcytosine remains nonreactive. The sequence under investigation is then amplified by PCR with two sets of strand-specific primers to yield a pair of fragments, one from each strand, in which all uracil and thymine residues have been amplified as thymine and only 5-methylcytosine residues have been amplified as cytosine. The PCR products can be sequenced directly to provide a strand-specific average sequence for the population of molecules or can be cloned and sequenced to provide methylation maps of single DNA molecules. We tested the method by defining the methylation status within single DNA strands of two closely spaced CpG dinucleotides in the promoter of the human kininogen gene. During the analysis, we encountered in sperm DNA an unusual methylation pattern, which suggests that the high methylation level of single-copy sequences in sperm may be locally modulated by binding of protein factors in germ-line cells.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Genet
                Front Genet
                Front. Genet.
                Frontiers in Genetics
                Frontiers Media S.A.
                1664-8021
                21 October 2014
                2014
                : 5
                : 341
                Affiliations
                [1] 1Department of Genetics, University of Georgia, Athens, GA USA
                [2] 2Institute of Bioinformatics, University of Georgia, Athens, GA USA
                [3] 3Department of Microbiology, University of Georgia, Athens, GA USA
                [4] 4Department of Statistics, University of Georgia, Athens, GA USA
                Author notes

                Edited by: Mark D. Robinson, University of Zurich, Switzerland

                Reviewed by: Anne Parle-McDermott, Dublin City University, Ireland; Cuncong Zhong, J. Craig Venter Institute, USA; Michael B. Stadler, Friedrich Miescher Institute, Switzerland

                *Correspondence: Robert J. Schmitz, Department of Genetics, University of Georgia,120 East Green Street, Davison Life Sciences, B416 Athens, GA 30602, USA e-mail: schmitz@uga.edu

                This article was submitted to Bioinformatics and Computational Biology, a section of the journal Frontiers in Genetics.

                Article
                10.3389/fgene.2014.00341
                4204604
                25374580
                efc63082-6943-496a-adf1-768ced000bf7
                Copyright © 2014 Ji, Sasaki, Sun, Ma, Lewis and Schmitz.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 08 June 2014
                : 11 September 2014
                Page count
                Figures: 6, Tables: 1, Equations: 0, References: 54, Pages: 10, Words: 0
                Categories
                Genetics
                Original Research Article

                Genetics
                dna methylation,whole genome bisulfite sequencing,epigenomics,epigenetics,pcr bias
                Genetics
                dna methylation, whole genome bisulfite sequencing, epigenomics, epigenetics, pcr bias

                Comments

                Comment on this article