Genome organization: experiments and modeling

Abstract

Understanding the spatial organization of chromosomes in 3D has been a key outstanding question in biology for some time. This is a crucial area of research, because the 3D organization of chromatin at the scale of tens to hundreds of kilobase pairs (kbp) underlies several aspects of gene regulation, and there is evidence that it plays an important role, for example, in development (Sproul et al. 2005), aging (Pal and Tyler 2016), and a number of genetic diseases (Misteli 2010).

Very recently, we have witnessed a dramatic and unprecedented rise in the number of contributions on this topic. This burst in activity has at least two causes. First, the development of chromosome conformation capture techniques, and especially of the high-throughput variants “Hi-C,” “CaptureC,” and “Capture-HiC” (Osborne and Mifsud 2017), has provided us with an impressive high-resolution catalog of chromosomal contacts in different cell types, tissues, and organisms, for healthy, senescent, and diseased cells. Originally, this data was at low resolution, but the lower cost of sequencing has enabled large datasets to be generated, providing a structural framework for genome organization in different cell types (Rao et al. 2014; Mifsud et al. 2015). Second, the refinement of polymer and statistical physics models for DNA and chromatin has made it possible to simulate the stochastic organization of genomic loci, or even entire chromosomes. These simulations are informed by existing data, but they can in turn stimulate further experiments via their predictions.

“Inverse” models (some of which are reviewed by Zhan et al. 2017 and by Bianco et al. 2017) start from the Hi-C data and work backwards, using sophisticated fitting procedures to infer a plausible polymer model; the model can then be used to make predictions on future experiments where, for instance, the genomic region of interest is edited. “Direct” models (some of which are reviewed by Bianco et al. 2017 and by Haddad et al. 2017) start instead from simple biological and biophysical assumptions to deliver computer simulations whose output can be compared directly to Hi-C contact maps (as well as other experiments); combined with experimental evidence, they are instrumental in providing mechanistic models for genome organization.

There are currently two main popular models for the organization of chromosomes within 3D nuclear space. Both have been prompted by the combination of insights from experimental evidence and from computer simulations. The first assumes that the organization is driven by transcription factors binding to active and inactive regions of the genome (Brackley et al. 2016b; Bianco et al. 2017; Haddad et al. 2017), whereas the second assumes that the main organizers are cohesin and condensin (Finn et al. 2017; Kalitsis et al. 2017). Both models can explain some key aspects of genome organization, but neither can rationalize all observations.

The transcription factor (TF) model is at the basis of the “strings-and-binders” (Bianco et al. 2017) and the “block copolymer” (Haddad et al. 2017) models. The underlying idea is that chromatin conformations arise as a result of the action of bivalent (or multivalent) factors (the “binders”) which can bind the chromatin fiber (the “string”) at multiple points, thereby forming chromosome bridges which stabilize genomic contacts. Chromatin regions bearing active and inactive marks recruit different kinds of proteins. For instance, inactive heterochromatic regions rich in H3K9me3 bind HP1, whereas active regions (rich in H3K4me3, H3K4me1, or H3K27ac) bind holoenzymes, polymerases, and transcription factors. There is evidence for the ability of the “binders” to form bridges.
HP1 is known to be multivalent, and this is also the case for other repressing factors (such as PRC1 and other polycomb-group proteins), while complexes of transcription factors and polymerases will also normally have multiple DNA-binding sites. Therefore, the TF model is based on broadly valid assumptions, and it can naturally explain the segregation between euchromatin and heterochromatin (Nishibuchi and Dejardin 2017), as well as the organization of the genome into A (active) and B (inactive) compartments. Because active and inactive factors have a generic tendency to cluster through the “bridging-induced attraction” (Brackley et al. 2013), the TF model further provides a natural framework to capture the biogenesis of nuclear bodies. The TF model is appealing because it relies on minimal input (the binding sites of active and inactive factors) and in principle no fitting, yet it delivers contact maps which are in good quantitative agreement with experiments (Brackley et al. 2016b).

The TF model also relates well to studies indicating that the local transcriptional environment impacts on structure and orchestrates chromosome organization. Hi-C-like techniques provide a structural basis for the genome but appear to be relatively insensitive to transcription. Instead, superimposing topological data (Naughton et al. 2013) onto Hi-C maps provides an additional level of domain-like organization, revealing the formation of over-wound and under-wound 100 kbp chromatin domains. These correspond well to high-resolution Hi-C maps (Rao et al. 2014) and indicate that the genome is organized into structural domains subdivided into functional domains regulated by transcription and topoisomerase activity. Our own simulations of these phenomena highlight how supercoiling and the implicit binding of transcriptional regulators can both control transcriptional activity and facilitate domain remodeling (Brackley et al. 2016a).
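As a purely illustrative sketch of how the minimal input of the TF model (a 1D pattern of active and inactive binding sites) could translate into a compartment-like contact pattern, the toy function below assigns a relative contact score to each pair of beads. It is not the simulation approach of the studies cited above: the ideal-chain decay of contacts with genomic separation (exponent 3/2), the multiplicative `bridge_gain` applied to beads carrying the same mark, and all names are assumptions made for this example.

```python
def toy_contact_map(marks, bridge_gain=4.0, exponent=1.5):
    """Return a toy relative contact score for every pair of beads.

    marks: string with one character per bead,
           'A' (active mark), 'B' (inactive mark) or 'N' (no binding site).
    bridge_gain: hypothetical factor by which bridging proteins enhance
                 contacts between two beads carrying the same mark.
    exponent: decay of contacts with 1D separation (3/2 for an ideal chain).
    """
    n = len(marks)
    # Diagonal entries are set to 1.0 by convention (a bead touches itself).
    contact = [[1.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            # Baseline: contacts decay as a power law of genomic separation.
            score = abs(i - j) ** (-exponent)
            # Same-mark beads share binders, so their contacts are enhanced.
            if marks[i] == marks[j] and marks[i] != "N":
                score *= bridge_gain
            contact[i][j] = score
    return contact


if __name__ == "__main__":
    cmap = toy_contact_map("AAAABBBBAAAABBBB")
    # Compare a same-mark pair with an opposite-mark pair at similar separation.
    print(cmap[0][8], cmap[0][7])
```

For a pattern such as `AAAABBBBAAAABBBB`, beads carrying the same mark score higher than beads of opposite mark at a comparable separation, which is a crude analog of the checkerboard pattern associated with A/B compartments.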
The main drawback of the TF model is that it cannot easily account for the striking observation, made through Hi-C, that chromosome loops between convergent CTCF binding sites are abundant, while those between divergent ones are virtually absent (Rao et al. 2014). This is because, at least in its simplest version, this model works in thermodynamic equilibrium, and under this condition, convergent and divergent loops share the same chromatin structure; as a result, the experimentally observed bias is inexplicable within this framework. There are also some outstanding open questions which the TF model prompts. For instance, the model works well given the 1D epigenetic patterning of histone modifications, as this is a good proxy for the binding landscape of bridging factors. But how is this 1D patterning set up in the first place, and how can it be changed reproducibly during development? The “living chromatin” model outlined by Haddad et al. 2017 considers a chromatin fiber where the epigenetic marks can be dynamically written and erased, and may provide the right avenue to quantitatively address this question in the future (Michieletto et al. 2016).

As anticipated, the second currently popular mechanistic model for chromatin organization instead assumes that cohesin and condensin are the main players. Condensin has long been known to be crucial for mitotic chromatin organization (see the review by Kalitsis et al. 2017); the idea that cohesin is fundamental to organization during interphase is the basis of the “loop extrusion” (LE) model (Fudenberg et al. 2016), reviewed by Finn et al. 2017. Like condensin, cohesin is a DNA-binding protein which topologically embraces DNA or chromatin upon binding and stabilizes chromosome loops (Finn et al. 2017). Details of the chromatin-bound structure are still debated.
A single cohesin monomer may embrace two chromatin fibers, or only one, in which case looping requires dimerization to form a molecular “hand-cuff” (Finn et al. 2017). Independently of these microscopic details, the LE model assumes that cohesin has a motor activity, which is either intrinsic or extrinsic to it, and which allows it to extrude progressively larger loops. These loops grow until they are halted by a boundary element, most likely a bound CTCF, which is known to interact with cohesin in a directional way (Fudenberg et al. 2016). The main strength of the LE model is, therefore, that it can naturally explain why almost all CTCF-mediated loops are convergent, and virtually none are divergent. This is because, as the loop grows, it can “sense” the orientation of CTCF: cohesin will only stall when it faces two convergent and occupied CTCF binding sites.

The main problem of the LE model is that it assumes that cohesin actively creates loops of hundreds of kilobase pairs (the typical size of CTCF-mediated loops) without dissociating. Existing experimental evidence suggests that the residence time of cohesin on DNA is about 20 min, so, in order to extrude a loop of 100 kbp, cohesin would have to move at about 5 kbp/min (100 kbp divided by 20 min), an impressive speed, as this is about five times as fast as an RNA polymerase. Yet there is currently no evidence of any motor activity of cohesin associated with unidirectional motion, as postulated in the loop extrusion model. The primary outstanding questions prompted by this model (Finn et al. 2017) are therefore: what are the dynamics of cohesin-mediated chromatin loops, how does cohesin translocate on chromatin, and how fast can it do so? While the sliding of a cohesin ring embracing a single DNA molecule has recently been studied (Stigler et al. 2016), it is now necessary to characterize mobility on chromatinized DNA, and ultimately to probe the dynamics of cohesin-mediated loops.
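The speed estimate above, and the rule that extrusion halts only at convergently oriented CTCF sites, can be sketched in a few lines. This is a minimal toy under stated assumptions, not the simulation of Fudenberg et al. (2016): the one-dimensional lattice, the deterministic two-sided extrusion, the `">"`/`"<"` encoding of CTCF motif orientation, and the function names are all hypothetical choices made for illustration.

```python
def required_speed_kbp_per_min(loop_size_kbp, residence_time_min):
    """Mean speed needed to extrude a loop before cohesin dissociates."""
    return loop_size_kbp / residence_time_min


def extrude(ctcf, start, length):
    """Toy two-sided loop extrusion on a 1D lattice of `length` sites.

    ctcf maps a lattice position to '>' (forward motif) or '<' (reverse motif).
    The left arm moves toward lower positions and is halted only by '>';
    the right arm moves toward higher positions and is halted only by '<'.
    Wrongly oriented CTCF sites are passed; the lattice ends also halt an arm.
    Returns the (left, right) positions of the two arms once both have halted.
    """
    if not 0 <= start < length:
        raise ValueError("start must lie on the lattice")
    left = right = start
    left_halted = right_halted = False
    while not (left_halted and right_halted):
        if not left_halted:
            if ctcf.get(left) == ">" or left == 0:
                left_halted = True
            else:
                left -= 1
        if not right_halted:
            if ctcf.get(right) == "<" or right == length - 1:
                right_halted = True
            else:
                right += 1
    return left, right


if __name__ == "__main__":
    # Figures quoted in the text: a 100 kbp loop within a 20 min residence time.
    print(required_speed_kbp_per_min(100, 20))
    sites = {10: ">", 30: "<", 40: "<", 50: ">"}
    print(extrude(sites, start=20, length=100))
    print(extrude(sites, start=45, length=100))
```

With these hypothetical sites, a cohesin loaded at position 20 yields the convergent loop anchored at 10 and 30, whereas one loaded at 45 passes through the divergent pair at 40 and 50 without stalling there, so no loop forms between them.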
One view is that the TF and LE models are competing, that there is a single main organizer, either transcription factors or cohesin/condensin, and that new research will tell which one it is. Another possible view is, however, that the TF and LE models are instead complementary. After all, the first explains A/B compartments, while the second explains the formation of convergent CTCF loops. Further research will then be needed to understand how the two kinds of organizers couple when they are active at the same time. For instance, are the two organizations independent of each other, or is there cross-talk between them?

Our discussion has so far been centered on large-scale (10–100 kbp) features of genome organization as explored by Hi-C, where computational models have proved very beneficial in facilitating the interpretation of existing experiments and have also stimulated the design of new ones. Computer-based modeling is, however, possibly even more important at smaller scales of chromatin organization, down to the single-nucleosome level. A different approach to studying genome organization has recently been further developed by the Greenleaf lab (Risca et al. 2016). It utilizes ionizing radiation-induced spatially correlated cleavage of DNA and sequencing (RICC-seq) to provide information about local (50–500 bp) nucleosomal interactions. At present, this data is relatively low resolution, but after significant computational analysis it gives evidence for a two-start helical chromatin fiber in heterochromatic regions of the genome, and for more disrupted fibers in regions of the genome with more open chromatin (Gilbert et al. 2004). These structures resemble those predicted by mesoscale modeling of chromatin folding at the scale of a few nucleosomes, where compaction is induced by proteins such as linker histones (Luque et al. 2014); this mesoscale modeling can then be further scaled up to simulate chromatin loops (Bascom et al. 2016).
In the near future, we expect that the combination of experimental and simulation techniques, which is the central theme of this special issue, will prove increasingly effective at addressing outstanding open questions such as those we have outlined above. Such a combination has the potential to yield a transformative tool in the field, because the two approaches have different strengths and weaknesses; they are hence highly complementary and can be used to ask questions which could not be answered by using either modeling or experiments alone.

Electronic supplementary material: Fig. S1 (PDF 1137 kb)

Most cited references 11

Epigenetics and aging

Sangita Pal, Jessica Tyler (2016)

Researchers review how random changes and our environment (for example, diet) determine our life span.

Transcription forms and remodels supercoiling domains unfolding large-scale chromatin structures

Catherine Naughton, Nicolaos Avlonitis, Samuel Corless … (2013)

DNA supercoiling is an inherent consequence of twisting DNA and is critical for regulating gene expression and DNA replication. However, DNA supercoiling at a genomic scale in human cells is uncharacterized. To map supercoiling we used biotinylated-trimethylpsoralen as a DNA structure probe to show the genome is organized into supercoiling domains. Domains are formed and remodeled by RNA polymerase and topoisomerase activities and are flanked by GC-AT boundaries and CTCF binding sites. Under-wound domains are transcriptionally active, enriched in topoisomerase I, “open” chromatin fibers and DNaseI sites, but are depleted of topoisomerase II. Furthermore DNA supercoiling impacts on additional levels of chromatin compaction as under-wound domains are cytologically decondensed, topologically constrained, and decompacted by transcription of short RNAs. We suggest that supercoiling domains create a topological environment that facilitates gene activation providing an evolutionary purpose for clustering genes along chromosomes.

Chromatin architecture of the human genome: gene-rich domains are enriched in open chromatin fibers.

Timothy W Bickmore, Kathryn Woodfine, Nick Gilbert … (2004)

We present an analysis of chromatin fiber structure across the human genome. Compact and open chromatin fiber structures were separated by sucrose sedimentation and their distributions analyzed by hybridization to metaphase chromosomes and genomic microarrays. We show that compact chromatin fibers originate from some sites of heterochromatin (C-bands), and G-bands (euchromatin). Open chromatin fibers correlate with regions of highest gene density, but not with gene expression since inactive genes can be in domains of open chromatin, and active genes in regions of low gene density can be embedded in compact chromatin fibers. Moreover, we show that chromatin fiber structure impacts on further levels of chromatin condensation. Regions of open chromatin fibers are cytologically decondensed and have a distinctive nuclear organization. We suggest that domains of open chromatin may create an environment that facilitates transcriptional activation and could provide an evolutionary constraint to maintain clusters of genes together along chromosomes.

Author and article information

Contributors

Nick Gilbert: nick.gilbert@ed.ac.uk

Davide Marenduzzo: dmarendu@ph.ed.ac.uk

Journal

Journal ID (nlm-ta): Chromosome Res

Journal ID (iso-abbrev): Chromosome Res

Title: Chromosome Research

Publisher: Springer Netherlands (Dordrecht)

ISSN (Print): 0967-3849

ISSN (Electronic): 1573-6849

Publication date (Electronic): 2 February 2017

Publication date PMC-release: 2 February 2017

Publication date (Print): 2017

Volume: 25

Issue: 1

Pages: 1-4

Affiliations

[1] ISNI 0000 0004 1936 7988, GRID grid.4305.2, MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, Crewe Rd, Edinburgh, EH4 2XR, UK

[2] ISNI 0000 0004 1936 7988, GRID grid.4305.2, SUPA, School of Physics & Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh, EH9 3FD, UK

Author notes

Responsible Editor: Beth A. Sullivan

Article

Publisher ID: 9551

DOI: 10.1007/s10577-017-9551-2

PMC ID: 5346143

PubMed ID: 28155082

SO-VID: 1b80e320-261f-41fa-96f1-0609e2e3e143

License:

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

History

Date received: 10 January 2017

Date accepted: 11 January 2017

Funding

Funded by: University of Edinburgh

Custom metadata

ScienceOpen disciplines: Genetics

Keywords: chromatin, DNA, genome organization, chromosome conformation capture, polymer modeling
