Significance We scanned soybean and Arabidopsis seed genomes for hypomethylated regions, or DNA methylation valleys (DMVs), present in mammalian cells. Seeds contain DMV regions that have <5% bulk DNA methylation or, in many cases, no detectable DNA methylation. Methylation levels of seed DMVs do not vary detectably during seed development and are present prior to fertilization. Seed DMVs are enriched in transcription factor (TF) genes and are decorated with histone marks that fluctuate developmentally, resembling their animal counterparts in significant ways. We conclude that many genes playing important roles in seed formation are regulated without detectable DNA methylation events and suggest that selective action of TFs, as well as chromatin epigenetic events, play important roles in making a seed.
Significance We describe the spatial and temporal profiles of soybean and Arabidopsis seed methylomes during development. CHH methylation increases globally from fertilization through dormancy in all seed parts, decreases following germination, and targets primarily transposons. By contrast, CG- and CHG-context methylation remains constant throughout seed development. Mutant seeds lacking non-CG methylation develop normally, but have a set of up-regulated transposon RNAs suggesting that the CHH methylation increase may be a failsafe mechanism to reinforce transposon silencing. Major classes of seed genes have similar methylation profiles, whether they are active or not. Our results suggest that soybean and Arabidopsis seed methylomes are similar, and that DNA methylation does not play a significant role in regulating many genes important for seed development.
Abstract The precise mechanisms that control gene activity during seed development remain largely unknown. Previously, we showed that several genes essential for seed development, including those encoding storage proteins, fatty acid biosynthesis enzymes, and transcriptional regulators, such as ABI3 and FUS3 , are located within hypomethylated regions of the soybean genome. These hypomethylated regions are similar to the DNA methylation valleys (DMVs), or canyons, found in mammalian cells. Here, we address the question of the extent to which DMVs are present within seed genomes, and what role they might play in seed development. We scanned soybean and Arabidopsis seed genomes from post-fertilization through dormancy and germination for regions that contain < 5% or < 0.4% bulk methylation in CG-, CHG-, and CHH-contexts over all developmental stages. We found that DMVs represent extensive portions of seed genomes, range in size from 5 to 76 kb, are scattered throughout all chromosomes, and are hypomethylated throughout the plant life cycle. Significantly, DMVs are enriched greatly in transcription factor genes, and other developmental genes, that play critical roles in seed formation. Many DMV genes are regulated with respect to seed stage, region, and tissue - and contain H3K4me3, H3K27me3, or bivalent marks that fluctuate during development. Our results indicate that DMVs are a unique regulatory feature of both plant and animal genomes, and that a large number of seed genes are regulated in the absence of methylation changes during development - probably by the action of specific transcription factors and epigenetic events at the chromatin level. Significance We scanned soybean and Arabidopsis seed genomes for hypomethylated regions, or DNA Methylation Valleys (DMVs), present in mammalian cells. A significant fraction of seed genomes contain DMV regions that have < 5% bulk DNA methylation, or, in many cases, no detectable DNA methylation. Methylation levels of seed DMVs do not vary detectably during seed development with respect to time, region, and tissue, and are present prior to fertilization. Seed DMVs are enriched in transcription factor genes and other genes critical for seed development, and are also decorated with histone marks that fluctuate with developmental stage, resembling in significant ways their animal counterparts. We conclude that many genes playing important roles in seed formation are regulated in the absence of detectable DNA methylation events, and suggest that selective action of transcriptional activators and repressors, as well as chromatin epigenetic events play important roles in making a seed - particularly embryo formation.
Seeds are complex structures composed of three regions, embryo, endosperm, and seed coat, with each further divided into subregions that consist of tissues, cell layers, and cell types. Although the seed is well characterized anatomically, much less is known about the genetic circuitry that dictates its spatial complexity. To address this issue, we profiled mRNAs from anatomically distinct seed subregions at several developmental stages. Analyses of these profiles showed that all subregions express similar diverse gene numbers and that the small gene numbers expressed subregion specifically provide information about the biological processes that occur in these seed compartments. In parallel, we profiled RNAs in individual nuclei and identified nuclei clusters representing distinct cell identities. Integrating single-nucleus RNA and subregion mRNA transcriptomes allowed most cell identities to be assigned to specific subregions and cell types and/or cell states. The number of cell identities exceeds the number of anatomically distinguishable cell types, emphasizing the spatial complexity of seeds. We defined gene coexpression networks that underlie distinct biological processes during seed development. We showed that network distribution among subregions and cell identities is highly variable. Some networks operate in single subregions and/or cell identities, and many coexpression networks operate in multiple subregions and/or cell identities. We also showed that single subregions and cell identities possess several networks. Together, our studies provide unique insights into the biological processes and genetic circuitry that underlie the spatial landscape of the seed.