10
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Characterizing chromatin folding coordinate and landscape with deep learning

      research-article
      , , *
      PLoS Computational Biology
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Genome organization is critical for setting up the spatial environment of gene transcription, and substantial progress has been made towards its high-resolution characterization. The underlying molecular mechanism for its establishment is much less understood. We applied a deep-learning approach, variational autoencoder (VAE), to analyze the fluctuation and heterogeneity of chromatin structures revealed by single-cell imaging and to identify a reaction coordinate for chromatin folding. This coordinate connects the seemingly random structures observed in individual cohesin-depleted cells as intermediate states along a folding pathway that leads to the formation of topologically associating domains (TAD). We showed that folding into wild-type-like structures remain energetically favorable in cohesin-depleted cells, potentially as a result of the phase separation between the two chromatin segments with active and repressive histone marks. The energetic stabilization, however, is not strong enough to overcome the entropic penalty, leading to the formation of only partially folded structures and the disappearance of TADs from contact maps upon averaging. Our study suggests that machine learning techniques, when combined with rigorous statistical mechanical analysis, are powerful tools for analyzing structural ensembles of chromatin.

          Author summary

          Chromatin folding, the dynamical process during which chromatin establishes its three-dimensional organization for proper function, is of critical importance. However, it is difficult to visualize and characterize due to challenges associated with live-cell imaging at high temporal and spatial resolution. Here, using a combination of deep learning and statistical mechanical theory, we demonstrate that great insight can be gained into the folding process by analyzing snapshots of chromatin structures taken across a population of cells. Though these static structures are not connected in time, prior research on chemical reactions suggests that fluctuation within the conformational ensemble provides valuable information for uncovering the reaction mechanism. Our analysis reconciles the seemingly contradictory results from different experimental techniques and supports the presence of multiple factors in organizing the chromatin. As single-cell experimental data are becoming routine, the approaches presented here could help with their interpretation to provide more insight into chromatin folding.

          Related collections

          Most cited references61

          • Record: found
          • Abstract: found
          • Article: not found

          Comprehensive mapping of long-range interactions reveals folding principles of the human genome.

          We describe Hi-C, a method that probes the three-dimensional architecture of whole genomes by coupling proximity-based ligation with massively parallel sequencing. We constructed spatial proximity maps of the human genome with Hi-C at a resolution of 1 megabase. These maps confirm the presence of chromosome territories and the spatial proximity of small, gene-rich chromosomes. We identified an additional level of genome organization that is characterized by the spatial segregation of open and closed chromatin to form two genome-wide compartments. At the megabase scale, the chromatin conformation is consistent with a fractal globule, a knot-free, polymer conformation that enables maximally dense packing while preserving the ability to easily fold and unfold any genomic locus. The fractal globule is distinct from the more commonly used globular equilibrium model. Our results demonstrate the power of Hi-C to map the dynamic conformations of whole genomes.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Spatial partitioning of the regulatory landscape of the X-inactivation centre.

            In eukaryotes transcriptional regulation often involves multiple long-range elements and is influenced by the genomic environment. A prime example of this concerns the mouse X-inactivation centre (Xic), which orchestrates the initiation of X-chromosome inactivation (XCI) by controlling the expression of the non-protein-coding Xist transcript. The extent of Xic sequences required for the proper regulation of Xist remains unknown. Here we use chromosome conformation capture carbon-copy (5C) and super-resolution microscopy to analyse the spatial organization of a 4.5-megabases (Mb) region including Xist. We discover a series of discrete 200-kilobase to 1 Mb topologically associating domains (TADs), present both before and after cell differentiation and on the active and inactive X. TADs align with, but do not rely on, several domain-wide features of the epigenome, such as H3K27me3 or H3K9me2 blocks and lamina-associated domains. TADs also align with coordinately regulated gene clusters. Disruption of a TAD boundary causes ectopic chromosomal contacts and long-range transcriptional misregulation. The Xist/Tsix sense/antisense unit illustrates how TADs enable the spatial segregation of oppositely regulated chromosomal neighbourhoods, with the respective promoters of Xist and Tsix lying in adjacent TADs, each containing their known positive regulators. We identify a novel distal regulatory region of Tsix within its TAD, which produces a long intervening RNA, Linx. In addition to uncovering a new principle of cis-regulatory architecture of mammalian chromosomes, our study sets the stage for the full genetic dissection of the X-inactivation centre.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              ChromHMM: automating chromatin-state discovery and characterization.

                Bookmark

                Author and article information

                Contributors
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: InvestigationRole: MethodologyRole: ResourcesRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: InvestigationRole: Writing – original draftRole: Writing – review & editing
                Role: ConceptualizationRole: Data curationRole: Formal analysisRole: Funding acquisitionRole: InvestigationRole: MethodologyRole: Project administrationRole: ResourcesRole: SoftwareRole: SupervisionRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS Comput Biol
                PLoS Comput Biol
                plos
                ploscomp
                PLoS Computational Biology
                Public Library of Science (San Francisco, CA USA )
                1553-734X
                1553-7358
                September 2020
                28 September 2020
                : 16
                : 9
                : e1008262
                Affiliations
                [001] Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
                Rutgers University, UNITED STATES
                Author notes

                The authors have declared that no competing interests exist.

                Author information
                http://orcid.org/0000-0002-3982-9305
                http://orcid.org/0000-0002-3685-7503
                Article
                PCOMPBIOL-D-20-00738
                10.1371/journal.pcbi.1008262
                7544120
                32986691
                f5a8e9ce-b237-4067-8758-b8e522b9605b
                © 2020 Xie et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 4 May 2020
                : 14 August 2020
                Page count
                Figures: 7, Tables: 0, Pages: 19
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/100000152, Division of Molecular and Cellular Biosciences;
                Award ID: MCB-1715859
                Award Recipient :
                Funded by: funder-id http://dx.doi.org/10.13039/100000057, National Institute of General Medical Sciences;
                Award ID: 1R35GM133580-01
                Award Recipient :
                This work was supported by the National Science Foundation (Grant MCB-1715859) and the National Institutes of Health (Grant 1R35GM133580-01). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology and Life Sciences
                Cell Biology
                Chromosome Biology
                Chromatin
                Biology and Life Sciences
                Genetics
                Epigenetics
                Chromatin
                Biology and Life Sciences
                Genetics
                Gene Expression
                Chromatin
                Physical Sciences
                Physics
                Thermodynamics
                Free Energy
                Physical Sciences
                Chemistry
                Polymer Chemistry
                Macromolecules
                Polymers
                Physical Sciences
                Materials Science
                Materials
                Polymers
                Physical Sciences
                Chemistry
                Polymer Chemistry
                Polymers
                Physical Sciences
                Mathematics
                Probability Theory
                Probability Distribution
                Physical Sciences
                Physics
                Thermodynamics
                Entropy
                Biology and Life Sciences
                Genetics
                Genomics
                Structural Genomics
                Biology and life sciences
                Biochemistry
                Proteins
                DNA-binding proteins
                Histones
                Computer and Information Sciences
                Artificial Intelligence
                Machine Learning
                Deep Learning
                Custom metadata
                vor-update-to-uncorrected-proof
                2020-10-08
                All source code for VAE model training and analysis are available from the Github repository: https://github.com/ZhangGroup-MITChemistry/chromVAE.

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article