5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Differential methylation analysis of reduced representation bisulfite sequencing experiments using edgeR

      methods-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Studies in epigenetics have shown that DNA methylation is a key factor in regulating gene expression. Aberrant DNA methylation is often associated with DNA instability, which could lead to development of diseases such as cancer. DNA methylation typically occurs in CpG context. When located in a gene promoter, DNA methylation often acts to repress transcription and gene expression. The most commonly used technology of studying DNA methylation is bisulfite sequencing (BS-seq), which can be used to measure genomewide methylation levels on the single-nucleotide scale. Notably, BS-seq can also be combined with enrichment strategies, such as reduced representation bisulfite sequencing (RRBS), to target CpG-rich regions in order to save per-sample costs. A typical DNA methylation analysis involves identifying differentially methylated regions (DMRs) between different experimental conditions. Many statistical methods have been developed for finding DMRs in BS-seq data. In this workflow, we propose a novel approach of detecting DMRs using edgeR. By providing a complete analysis of RRBS profiles of epithelial populations in the mouse mammary gland, we will demonstrate that differential methylation analyses can be fit into the existing pipelines specifically designed for RNA-seq differential expression studies.

          In addition, the edgeR generalized linear model framework offers great flexibilities for complex experimental design, while still accounting for the biological variability. The analysis approach illustrated in this article can be applied to any BS-seq data that includes some replication, but it is especially appropriate for RRBS data with small numbers of biological replicates.

          Related collections

          Most cited references13

          • Record: found
          • Abstract: found
          • Article: not found

          Cancer epigenetics comes of age.

          The discovery of numerous hypermethylated promoters of tumour-suppressor genes, along with a better understanding of gene-silencing mechanisms, has moved DNA methylation from obscurity to recognition as an alternative mechanism of tumour-suppressor inactivation in cancer. Epigenetic events can also facilitate genetic damage, as illustrated by the increased mutagenicity of 5-methylcytosine and the silencing of the MLH1 mismatch repair gene by DNA methylation in colorectal tumours. We review here current mechanistic understanding of the role of DNA methylation in malignant transformation, and suggest Knudson's two-hit hypothesis should now be expanded to include epigenetic mechanisms of gene inactivation.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            A Bayesian hierarchical model to detect differentially methylated loci from single nucleotide resolution sequencing data

            DNA methylation is an important epigenetic modification that has essential roles in cellular processes including gene regulation, development and disease and is widely dysregulated in most types of cancer. Recent advances in sequencing technology have enabled the measurement of DNA methylation at single nucleotide resolution through methods such as whole-genome bisulfite sequencing and reduced representation bisulfite sequencing. In DNA methylation studies, a key task is to identify differences under distinct biological contexts, for example, between tumor and normal tissue. A challenge in sequencing studies is that the number of biological replicates is often limited by the costs of sequencing. The small number of replicates leads to unstable variance estimation, which can reduce accuracy to detect differentially methylated loci (DML). Here we propose a novel statistical method to detect DML when comparing two treatment groups. The sequencing counts are described by a lognormal-beta-binomial hierarchical model, which provides a basis for information sharing across different CpG sites. A Wald test is developed for hypothesis testing at each CpG site. Simulation results show that the proposed method yields improved DML detection compared to existing methods, particularly when the number of replicates is low. The proposed method is implemented in the Bioconductor package DSS.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found
              Is Open Access

              From reads to genes to pathways: differential expression analysis of RNA-Seq experiments using Rsubread and the edgeR quasi-likelihood pipeline

                Bookmark

                Author and article information

                Contributors
                Role: Data CurationRole: Formal AnalysisRole: MethodologyRole: SoftwareRole: VisualizationRole: Writing – Original Draft PreparationRole: Writing – Review & Editing
                Role: InvestigationRole: Writing – Review & Editing
                Role: Funding AcquisitionRole: SupervisionRole: Writing – Review & Editing
                Role: ConceptualizationRole: Formal AnalysisRole: Funding AcquisitionRole: MethodologyRole: Project AdministrationRole: SoftwareRole: SupervisionRole: Writing – Review & Editing
                Journal
                F1000Res
                F1000Res
                F1000Research
                F1000Research
                F1000 Research Limited (London, UK )
                2046-1402
                28 November 2017
                2017
                : 6
                : 2055
                Affiliations
                [1 ]Department of Medical Biology, The University of Melbourne, Melbourne, VIC, 3010, Australia
                [2 ]The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, 3052, Australia
                [3 ]School of Mathematics and Statistics, The University of Melbourne, Melbourne, VIC, 3010, Australia
                [1 ]Department of Biostatistics, Johns Hopkins University, Baltimore, MD, USA
                [1 ]Department of Environmental and Occupational Health Sciences, University of Washington, Seattle, WA, USA
                [1 ]Bioinformatics Group, Babraham Institute, Cambridge, UK
                Author notes

                No competing interests were disclosed.

                Competing interests: No competing interests were disclosed.

                Competing interests: No competing interests were disclosed.

                Competing interests: No competing interests were disclosed.

                Author information
                https://orcid.org/0000-0001-9221-2892
                Article
                10.12688/f1000research.13196.1
                5747346
                29333247
                70dacca8-6740-42af-b53e-783df25e6c82
                Copyright: © 2017 Chen Y et al.

                This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 21 November 2017
                Funding
                Funded by: Victorian State Government
                Funded by: National Health and Medical Research Council
                Award ID: 1058892
                Award ID: 1054618
                This work was supported by the National Health and Medical Research Council (Fellowship 1058892 and Program 1054618 to G.K.S, Independent Research Institutes Infrastructure Support to the Walter and Eliza Hall Institute) and by a Victorian State Government Operational Infrastructure Support Grant.
                The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Method Article
                Articles
                Bioinformatics
                Genomics

                Comments

                Comment on this article