We recently used in situ Hi-C to create kilobase-resolution 3D maps of mammalian genomes. Here, we combine these maps with new Hi-C, microscopy, and genome-editing experiments to study the physical structure of chromatin fibers, domains, and loops. First, by examining the probability for short chromatin fragments to bend and form a cycle, we show that nuclear human chromatin is flexible at the kilobase scale, inconsistent with the widespread existence of 30-nm fibers in vivo. Next, we find that contact domains are inconsistent with the equilibrium state for an ordinary condensed polymer. Combining Hi-C data and novel mathematical theorems, we show that contact domains are also not consistent with a fractal globule. Instead, we use physical simulations to study two models of genome folding. In one, intermonomer attraction during polymer condensation leads to formation of an anisotropic “tension globule.” In the other, CCCTC-binding factor (CTCF) and cohesin act together to extrude unknotted loops during inter-phase. Both models are consistent with the observed contact domains and with the observation that contact domains tend to form inside loops. However, the extrusion model explains a far wider array of observations, such as why loops tend not to overlap, why the CTCF-binding motifs at pairs of loop anchors lie in the convergent orientation, and why edges of enhanced contacts occur at some domains. Furthermore, loop extrusion leads naturally to the formation of chromosome territories. Finally, we perform 13 genome-editing experiments examining the effect of altering CTCF-binding sites on chromatin folding. The convergent rule correctly predicts the affected loops in every case. Moreover, the extrusion model accurately predicts in silico the 3D maps resulting from each experiment using only the location of CTCF-binding sites in the wild-type. Thus, we show that it is possible to disrupt, restore, and move loops and domains using targeted mutations as small as a single base pair. Support or Funding Information This project is supported by NSF Grant PHY-1308264, NSF Grant PHY-1427654, NIH New Innovator Award 1DP2OD008540-01, Cancer Prevention Research Institute of Texas Scholar Award R1304, a McNair Medical Institute Scholar Award, and the President's Early Career Award in Science and Engineering, and funding from the Welch Foundation, International Business Machines, and Nvidia. (A)Summary of loop extrusion binding (i, ii), extruding (iii), and halting at motifs (iv). (B) 3D rendering of an extrusion globule. (C) Contact probability vs. distance within domains created in silico using loop extrusion. (D) Molecular simulations of loop extrusion based only on CTCF ChIP-Seq signals accurately recapitulate features observed in our Hi-C maps. (A) Results of CRISPR/Cas9-based editing experiments on chr 8. Simulations shown on left, experimental data shown on right. (B) Similar results on chr 1. (C) We disrupted two loops by inserting a single basepair. (D) Our data suggest that the region shown in (A) is typically found in one of two states. (E) Extrusion can explain the formation of exclusion domains.
We recently used in situ Hi-C to create kilobase-resolution 3D maps of mammalian genomes. Here, we combine these maps with new Hi-C, microscopy, and genome-editing experiments to study the physical structure of chromatin fibers, domains, and loops. We find that the observed contact domains are inconsistent with the equilibrium state for an ordinary condensed polymer. Combining Hi-C data and novel mathematical theorems, we show that contact domains are also not consistent with a fractal globule. Instead, we use physical simulations to study two models of genome folding. In one, intermonomer attraction during polymer condensation leads to formation of an anisotropic "tension globule." In the other, CCCTC-binding factor (CTCF) and cohesin act together to extrude unknotted loops during interphase. Both models are consistent with the observed contact domains and with the observation that contact domains tend to form inside loops. However, the extrusion model explains a far wider array of observations, such as why loops tend not to overlap and why the CTCF-binding motifs at pairs of loop anchors lie in the convergent orientation. Finally, we perform 13 genome-editing experiments examining the effect of altering CTCF-binding sites on chromatin folding. The convergent rule correctly predicts the affected loops in every case. Moreover, the extrusion model accurately predicts in silico the 3D maps resulting from each experiment using only the location of CTCF-binding sites in the WT. Thus, we show that it is possible to disrupt, restore, and move loops and domains using targeted mutations as small as a single base pair.
Adeno-associated virus (AAV)-based gene therapy could be facilitated by the development of molecular switches to control the magnitude and timing of expression of therapeutic transgenes. RNA interference (RNAi)-based approaches hold unique potential as a clinically proven modality to pharmacologically regulate AAV gene dosage in a sequence-specific manner. We present a generalizable RNAi-based rheostat wherein hepatocyte-directed AAV transgene expression is silenced using the clinically validated modality of chemically modified small interfering RNA (siRNA) conjugates or vectorized co-expression of short hairpin RNA (shRNA). For transgene induction, we employ REVERSIR technology, a synthetic high-affinity oligonucleotide complementary to the siRNA or shRNA guide strand to reverse RNAi activity and rapidly recover transgene expression. For potential clinical development, we report potent and specific siRNA sequences that may allow selective regulation of transgenes while minimizing unintended off-target effects. Our results establish a conceptual framework for RNAi-based regulatory switches with potential for infrequent dosing in clinical settings to dynamically modulate expression of virally-delivered gene therapies.
ABSTRACT Adeno-associated virus (AAV)-based gene therapy could be facilitated by the development of molecular switches to control the magnitude and timing of expression of therapeutic transgenes. RNA interference (RNAi)-based approaches hold unique potential as a clinically proven modality to pharmacologically regulate AAV gene dosage in a sequence-specific manner. We present a generalizable RNAi-based rheostat wherein AAV transgene expression is silenced using the clinically validated modality of chemically modified short interfering RNA (siRNA) conjugates or vectorized co-expression of short hairpin RNA (shRNA). For transgene induction, we employ REVERSIR technology, a synthetic high-affinity oligonucleotide complementary to the siRNA or shRNA guide strand to reverse RNAi activity and rapidly recover transgene expression. For potential clinical development, we report potent and specific siRNA sequences that may allow selective regulation of transgenes while minimizing unintended off-target effects. Our results establish a conceptual framework for RNAi-based regulatory switches with potential for infrequent dosing in clinical settings to dynamically modulate expression of virally-delivered gene therapies.
Cancer evolution is fueled by epigenetic as well as genetic diversity. In chronic lymphocytic leukemia (CLL), intra-tumoral DNA methylation (DNAme) heterogeneity empowers evolution. Here, to comprehensively study the epigenetic dimension of cancer evolution, we integrate DNAme analysis with histone modification mapping and single cell analyses of RNA expression and DNAme in 22 primary CLL and 13 healthy donor B lymphocyte samples. Our data reveal corrupted coherence across different layers of the CLL epigenome. This manifests in decreased mutual information across epigenetic modifications and gene expression attributed to cell-to-cell heterogeneity. Disrupted epigenetic-transcriptional coordination in CLL is also reflected in the dysregulation of the transcriptional output as a function of the combinatorial chromatin states, including incomplete Polycomb-mediated gene silencing. Notably, we observe unexpected co-mapping of typically mutually exclusive activating and repressing histone modifications, suggestive of intra-tumoral epigenetic diversity. Thus, CLL epigenetic diversification leads to decreased coordination across layers of epigenetic information, likely reflecting an admixture of cells with diverging cellular identities.
In normal mammalian development cytosine methylation is essential and is directed to specific regions of the genome. Despite notable advances through mapping its genome-wide distribution, studying the direct contribution of DNA methylation to gene and genome regulation has been limited by the lack of tools for its precise manipulation. Thus, combining the targeting capability of the CRISPR-Cas9 system with an epigenetic modifier has attracted interest in the scientific community. In contrast to profiling the genome-wide cleavage of a nuclease competent Cas9, tracing the global activity of a dead Cas9 (dCas9) methyltransferase fusion protein is challenging within a highly methylated genome. Here, we report the generation and use of an engineered, methylation depleted but maintenance competent mouse ES cell line and find surprisingly ubiquitous nuclear activity of dCas9-methyltransferases. Subsequent experiments in human somatic cells refine these observations and point to an important difference between genetic and epigenetic editing tools that require unique experimental considerations.