Exploring the genetic basis of human population differences in DNA methylation and their causal impact on immune gene regulation
Lucas HusquinMaxime RotivalMaud FagnyHélène QuachNora ZidaneLisa M. McEwenJulia L. MacIsaacMichael S. KoborHugues AschardÉtienne PatinLluís Quintana‐Murci
114
Citation
112
Reference
10
Related Paper
Citation Trend
Abstract:
DNA methylation is influenced by both environmental and genetic factors and is increasingly thought to affect variation in complex traits and diseases. Yet, the extent of ancestry-related differences in DNA methylation, their genetic determinants, and their respective causal impact on immune gene regulation remain elusive. We report extensive population differences in DNA methylation between 156 individuals of African and European descent, detected in primary monocytes that are used as a model of a major innate immunity cell type. Most of these differences (~ 70%) are driven by DNA sequence variants nearby CpG sites, which account for ~ 60% of the variance in DNA methylation. We also identify several master regulators of DNA methylation variation in trans, including a regulatory hub nearby the transcription factor-encoding CTCF gene, which contributes markedly to ancestry-related differences in DNA methylation. Furthermore, we establish that variation in DNA methylation is associated with varying gene expression levels following mostly, but not exclusively, a canonical model of negative associations, particularly in enhancer regions. Specifically, we find that DNA methylation highly correlates with transcriptional activity of 811 and 230 genes, at the basal state and upon immune stimulation, respectively. Finally, using a Bayesian approach, we estimate causal mediation effects of DNA methylation on gene expression in ~ 20% of the studied cases, indicating that DNA methylation can play an active role in immune gene regulation. Using a system-level approach, our study reveals substantial ancestry-related differences in DNA methylation and provides evidence for their causal impact on immune gene regulation.Keywords:
Human genetics
Genome Biology
We polled the Editorial Board of Genome Biology to ask where they see genomics going in the next few years. Here are some of their responses.
Genome Biology
Personal genomics
Ask price
Human genetics
Computational genomics
Functional Genomics
Cite
Citations (21)
Genome Biology
Human genetics
Computational genomics
Personal genomics
Sequence (biology)
Comparative Genomics
Cite
Citations (3)
Abstract Background MicroRNAs (miRNAs) are established regulators of development, cell identity and disease. Although nearly two thousand human miRNA genes are known and new ones are continuously discovered, no attempt has been made to gauge the total miRNA content of the human genome. Results Employing an innovative computational method on massively pooled small RNA sequencing data, we report 2,469 novel human miRNA candidates of which 1,098 are validated by in-house and published experiments. Almost 300 candidates are robustly expressed in a neuronal cell system and are regulated during differentiation or when biogenesis factors Dicer, Drosha, DGCR8 or Ago2 are silenced. To improve expression profiling, we devised a quantitative miRNA capture system. In a kidney cell system, 400 candidates interact with DGCR8 at transcript positions that suggest miRNA hairpin recognition, and 1,000 of the new miRNA candidates interact with Ago1 or Ago2, indicating that they are directly bound by miRNA effector proteins. From kidney cell CLASH experiments, in which miRNA-target pairs are ligated and sequenced, we observe hundreds of interactions between novel miRNAs and mRNA targets. The novel miRNA candidates are specifically but lowly expressed, raising the possibility that not all may be functional. Interestingly, the majority are evolutionarily young and overrepresented in the human brain. Conclusions In summary, we present evidence that the complement of human miRNA genes is substantially larger than anticipated, and that more are likely to be discovered in the future as more tissues and experimental conditions are sequenced to greater depth.
Genome Biology
Human genetics
Computational genomics
Cite
Citations (246)
Activating mutations of fibroblast growth factor receptor 3 (FGFR3) cause various skeletal dysplasias and are also associated with certain cancers.Because there are no known specific pharmaceutical inhibitors of FGFR3, we established a cell-based protein translocation assay system that can monitor FGFR3 activity and be used for high throughput screening of complex mixtures.With this system we identified ethanol extract from a plant as a FGFR3 inhibitor and performed bioassay-guided fractionation to identify potent active fractions.The functionality of extract and active fractions were validated in vitro in FGFR3-activated primary multiple myeloma cells.The therapeutic efficacy and safety of the active fractions were further assessed in FGFR3 ACH mice, an achondroplasia mouse model.Oral administration significantly improved growth and dwarfism-related clinical features of the FGFR3 ACH mice.Our results demonstrate the applicability of this discovery approach.The identified plant extracts and active factions hold therapeutic potential for the treatment of FGFR3-activated skeletal dysplasias and cancers.
Human genetics
Genome Biology
Personal genomics
Cite
Citations (0)
Whole-genome analyses of human medulloblastomas show that the dominant clone at relapse is present as a rare subclone at primary diagnosis.
Human genetics
Genome Biology
clone (Java method)
Cancer genetics
Cite
Citations (2)
Abstract Background The core promoter region plays a critical role in the regulation of eukaryotic gene expression. We have determined the non-random distribution of DNA sequences relative to the transcriptional start site in Drosophila melanogaster promoters to identify sequences that may be biologically significant. We compare these results with those obtained for human promoters. Results We determined the distribution of all 65,536 octamer (8-mers) DNA sequences in 10,914 Drosophila promoters and two sets of human promoters aligned relative to the transcriptional start site. In Drosophila , 298 8-mers have highly significant ( p ≤ 1 × 10 -16 ) non-random distributions peaking within 100 base-pairs of the transcriptional start site. These sequences were grouped into 15 DNA motifs. Ten motifs, termed directional motifs, occur only on the positive strand while the remaining five motifs, termed non-directional motifs, occur on both strands. The only directional motifs to localize in human promoters are TATA, INR, and DPE. The directional motifs were further subdivided into those precisely positioned relative to the transcriptional start site and those that are positioned more loosely relative to the transcriptional start site. Similar numbers of non-directional motifs were identified in both species and most are different. The genes associated with all 15 DNA motifs, when they occur in the peak, are enriched in specific Gene Ontology categories and show a distinct mRNA expression pattern, suggesting that there is a core promoter code in Drosophila . Conclusion Drosophila and human promoters use different DNA sequences to regulate gene expression, supporting the idea that evolution occurs by the modulation of gene regulation.
Human genetics
Genome Biology
Comparative Genomics
Computational genomics
Functional Genomics
Cite
Citations (155)
Genome Biology
Human genetics
Computational genomics
Cite
Citations (1)
The GENCODE consortium was formed to identify and map all protein-coding genes within the ENCODE regions. This was achieved by a combination of initial manual annotation by the HAVANA team, experimental validation by the GENCODE consortium and a refinement of the annotation based on these experimental results.The GENCODE gene features are divided into eight different categories of which only the first two (known and novel coding sequence) are confidently predicted to be protein-coding genes. 5' rapid amplification of cDNA ends (RACE) and RT-PCR were used to experimentally verify the initial annotation. Of the 420 coding loci tested, 229 RACE products have been sequenced. They supported 5' extensions of 30 loci and new splice variants in 50 loci. In addition, 46 loci without evidence for a coding sequence were validated, consisting of 31 novel and 15 putative transcripts. We assessed the comprehensiveness of the GENCODE annotation by attempting to validate all the predicted exon boundaries outside the GENCODE annotation. Out of 1,215 tested in a subset of the ENCODE regions, 14 novel exon pairs were validated, only two of them in intergenic regions.In total, 487 loci, of which 434 are coding, have been annotated as part of the GENCODE reference set available from the UCSC browser. Comparison of GENCODE annotation with RefSeq and ENSEMBL show only 40% of GENCODE exons are contained within the two sets, which is a reflection of the high number of alternative splice forms with unique exons annotated. Over 50% of coding loci have been experimentally verified by 5' RACE for EGASP and the GENCODE collaboration is continuing to refine its annotation of 1% human genome with the aid of experimental validation.
ENCODE
Human genetics
Genome Biology
Cite
Citations (619)
The Forkhead (FKH) transcription factor FOXM1 is a key regulator of the cell cycle and is overexpressed in most types of cancer. FOXM1, similar to other FKH factors, binds to a canonical FKH motif in vitro. However, genome-wide mapping studies in different cell lines have shown a lack of enrichment of the FKH motif, suggesting an alternative mode of chromatin recruitment. We have investigated the role of direct versus indirect DNA binding in FOXM1 recruitment by performing ChIP-seq with wild-type and DNA binding deficient FOXM1. An in vitro fluorescence polarization assay identified point mutations in the DNA binding domain of FOXM1 that inhibit binding to a FKH consensus sequence. Cell lines expressing either wild-type or DNA binding deficient GFP-tagged FOXM1 were used for genome-wide mapping studies comparing the distribution of the DNA binding deficient protein to the wild-type. This shows that interaction of the FOXM1 DNA binding domain with target DNA is essential for recruitment. Moreover, analysis of the protein interactome of wild-type versus DNA binding deficient FOXM1 shows that the reduced recruitment is not due to inhibition of protein-protein interactions. A functional DNA binding domain is essential for FOXM1 chromatin recruitment. Even in FOXM1 mutants with almost complete loss of binding, the protein-protein interactions and pattern of phosphorylation are largely unaffected. These results strongly support a model whereby FOXM1 is specifically recruited to chromatin through co-factor interactions by binding directly to non-canonical DNA sequences.
Genome Biology
Human genetics
Computational genomics
Personal genomics
Cite
Citations (56)
Michael Snyder answers Genome Biology's questions on the human and professional stories underlying his Snyderome integrative omics project.
Genome Biology
Human genetics
Personal genomics
Computational genomics
Cite
Citations (5)