Familial combined hyperlipidemia (FCH) is a complex and common familial dyslipidemia characterized by elevated total cholesterol and/or triglyceride levels with over five-fold risk of coronary heart disease. The genetic architecture and contribution of rare Mendelian and common variants to FCH susceptibility is unknown. In 53 Finnish FCH families, we genotyped and imputed nine million variants in 715 family members with DNA available. We studied the enrichment of variants previously implicated with monogenic dyslipidemias and/or lipid levels in the general population by comparing allele frequencies between the FCH families and population samples. We also constructed weighted polygenic scores using 212 lipid-associated SNPs and estimated the relative contributions of Mendelian variants and polygenic scores to the risk of FCH in the families. We identified, across the whole allele frequency spectrum, an enrichment of variants known to elevate, and a deficiency of variants known to lower LDL-C and/or TG levels among both probands and affected FCH individuals. The score based on TG associated SNPs was particularly high among affected individuals compared to non-affected family members. Out of 234 affected FCH individuals across the families, seven (3%) carried Mendelian variants and 83 (35%) showed high accumulation of either known LDL-C or TG elevating variants by having either polygenic score over the 90th percentile in the population. The positive predictive value of high score was much higher for affected FCH individuals than for similar sporadic cases in the population. FCH is highly polygenic, supporting the hypothesis that variants across the whole allele frequency spectrum contribute to this complex familial trait. Polygenic SNP panels improve identification of individuals affected with FCH, but their clinical utility remains to be defined.
Background: Leigh syndrome is a rare, genetic, and severe mitochondrial disorder characterized by neuromuscular issues (ataxia, seizure, hypotonia, developmental delay, dystonia) and ocular abnormalities (nystagmus, atrophy, strabismus, ptosis). It is caused by pathogenic variants in either mitochondrial or nuclear DNA genes, with an estimated incidence rate of 1 per 40,000 live births. Case presentation: Herein, we present an infant male with nystagmus, hypotonia, and developmental delay who carried a clinical diagnosis of Leigh-like syndrome. Cerebral magnetic resonance imaging changes further supported the clinical evidence of an underlying mitochondrial disorder, but extensive diagnostic testing was negative. Trio exome sequencing under a research protocol uncovered compound-heterozygous missense variants in the HTRA2 gene (MIM: #606441): NM_013247.5:c.1037A>T:(p.Glu346Val) (maternal) and NM_013247.5:c.1172T>A:(p.Val391Glu) (paternal). Both variants are absent from public databases, making them extremely rare in the population. The maternal variant is adjacent to an exon-intron boundary and predicted to disrupt splicing, while the paternal variant alters a highly conserved amino acid and is predicted to be damaging by nearly all in silico tools. Biallelic variants in HTRA2 cause 3-methylglutaconic aciduria, type VIII (MGCA8), an extremely rare autosomal recessive disorder with fewer than ten families reported to date. Variant interpretation is challenging given the paucity of known disease-causing variants, and indeed we assess both paternal and maternal variants as Variants of Uncertain Significance under current American College of Medical Genetics guidelines. However, based on the inheritance pattern, suggestive evidence of pathogenicity, and significant clinical correlation with other reported MGCA8 patients, the clinical care team considers this a diagnostic result. Conclusion: Our findings ended the diagnostic odyssey for this family and provide further insights into the genetic and clinical spectrum of this critically under-studied disorder.
Early T-cell precursor acute lymphoblastic leukaemia (ETP ALL) is an aggressive malignancy of unknown genetic basis. We performed whole-genome sequencing of 12 ETP ALL cases and assessed the frequency of the identified somatic mutations in 94 T-cell acute lymphoblastic leukaemia cases. ETP ALL was characterized by activating mutations in genes regulating cytokine receptor and RAS signalling (67% of cases; NRAS, KRAS, FLT3, IL7R, JAK3, JAK1, SH2B3 and BRAF), inactivating lesions disrupting haematopoietic development (58%; GATA3, ETV6, RUNX1, IKZF1 and EP300) and histone-modifying genes (48%; EZH2, EED, SUZ12, SETD2 and EP300). We also identified new targets of recurrent mutation including DNM2, ECT2L and RELN. The mutational spectrum is similar to myeloid tumours, and moreover, the global transcriptional profile of ETP ALL was similar to that of normal and myeloid leukaemia haematopoietic stem cells. These findings suggest that addition of myeloid-directed therapies might improve the poor outcome of ETP ALL. This work shows that treatments used for acute myeloid leukaemia and targeted therapies could be used for early T-cell precursor acute lymphoblastic leukaemia. The early T-cell precursor (ETP) subtype of childhood acute lymphoblastic leukaemia (ALL) has a poor prognosis when treated with standard chemotherapy. Whole genome sequencing is used here to gain insights into the genetic basis of the condition. The results reveal a high frequency of activating mutations in genes regulating cytokine receptor and Ras signalling, lesions that disrupt haemopoiesis (many of which arise from chromosomal rearrangements that generate novel chimeric in-frame fusion genes), and inactivating mutations in histone modifying genes. This mutation pattern resembles that of myeloid malignancies, suggesting that myeloid-directed therapies such as high-dose cytarabine, or targeted therapies that inhibit cytokine receptor and JAK signalling, might be effective in ETP ALL.
The sensitivity of massively-parallel sequencing has confirmed that most cancers are oligoclonal, with subpopulations of neoplastic cells harboring distinct mutations. A fine resolution view of this clonal architecture provides insight into tumor heterogeneity, evolution, and treatment response, all of which may have clinical implications. Single tumor analysis already contributes to understanding these phenomena. However, cryptic subclones are frequently revealed by additional patient samples (e.g., collected at relapse or following treatment), indicating that accurately characterizing a tumor requires analyzing multiple samples from the same patient. To address this need, we present SciClone, a computational method that identifies the number and genetic composition of subclones by analyzing the variant allele frequencies of somatic mutations. We use it to detect subclones in acute myeloid leukemia and breast cancer samples that, though present at disease onset, are not evident from a single primary tumor sample. By doing so, we can track tumor evolution and identify the spatial origins of cells resisting therapy.
The soil nematodes Caenorhabditis briggsae and Caenorhabditis elegans diverged from a common ancestor roughly 100 million years ago and yet are almost indistinguishable by eye. They have the same chromosome number and genome sizes, and they occupy the same ecological niche. To explore the basis for this striking conservation of structure and function, we have sequenced the C. briggsae genome to a high-quality draft stage and compared it to the finished C. elegans sequence. We predict approximately 19,500 protein-coding genes in the C. briggsae genome, roughly the same as in C. elegans. Of these, 12,200 have clear C. elegans orthologs, a further 6,500 have one or more clearly detectable C. elegans homologs, and approximately 800 C. briggsae genes have no detectable matches in C. elegans. Almost all of the noncoding RNAs (ncRNAs) known are shared between the two species. The two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers. Operons, a distinctive feature of C. elegans, are highly conserved in C. briggsae, with the arrangement of genes being preserved in 96% of cases. The difference in size between the C. briggsae (estimated at approximately 104 Mbp) and C. elegans (100.3 Mbp) genomes is almost entirely due to repetitive sequence, which accounts for 22.4% of the C. briggsae genome in contrast to 16.5% of the C. elegans genome. Few, if any, repeat families are shared, suggesting that most were acquired after the two species diverged or are undergoing rapid evolution. Coclustering the C. elegans and C. briggsae proteins reveals 2,169 protein families of two or more members. Most of these are shared between the two species, but some appear to be expanding or contracting, and there seem to be as many as several hundred novel C. briggsae gene families. The C. briggsae draft sequence will greatly improve the annotation of the C. elegans genome. Based on similarity to C. briggsae, we found strong evidence for 1,300 new C. elegans genes. In addition, comparisons of the two genomes will help to understand the evolutionary forces that mold nematode genomes.
Retinoblastoma is an aggressive childhood cancer of the developing retina that is initiated by the biallelic loss of RB1. Tumours progress very quickly following RB1 inactivation but the underlying mechanism is not known. Here we show that the retinoblastoma genome is stable, but that multiple cancer pathways can be epigenetically deregulated. To identify the mutations that cooperate with RB1 loss, we performed whole-genome sequencing of retinoblastomas. The overall mutational rate was very low; RB1 was the only known cancer gene mutated. We then evaluated the role of RB1 in genome stability and considered non-genetic mechanisms of cancer pathway deregulation. For example, the proto-oncogene SYK is upregulated in retinoblastoma and is required for tumour cell survival. Targeting SYK with a small-molecule inhibitor induced retinoblastoma tumour cell death in vitro and in vivo. Thus, retinoblastomas may develop quickly as a result of the epigenetic deregulation of key cancer pathways as a direct or indirect result of RB1 loss.
A431 cells have an amplification of the epidermal growth factor (EGF) receptor gene, the cellular homolog of the v-erb B oncogene, and overproduce an aberrant 2.9-kilobase RNA that encodes a portion of the EGF receptor. A cDNA (pE15) for the aberrant RNA was cloned, sequenced, and used to analyze genomic DNA blots from A431 and normal cells. These data indicate that the aberrant RNA is created by a gene rearrangement within chromosome 7, resulting in a fusion of the 5' portion of the EGF receptor gene to an unidentified region of genomic DNA. The unidentified sequences are amplified to about the same degree (20- to 30-fold) as the EGF receptor sequences. In situ hybridization to chromosomes from normal cells and A431 cells show that both the EGF receptor gene and the unidentified DNA are localized to the p14-p12 region of chromosome 7. By using cDNA fragments to probe DNA blots from mouse-A431 somatic cell hybrids, the rearranged receptor gene was shown to be associated with translocation chromosome M4.
Maize is a major cereal crop and an important model system for basic biological research. Knowledge gained from maize research can also be used to genetically improve its grass relatives such as sorghum, wheat, and rice. The primary objective of the Maize Genome Sequencing Consortium (MGSC) was to generate a reference genome sequence that was integrated with both the physical and genetic maps. Using a previously published integrated genetic and physical map, combined with in-coming maize genomic sequence, new sequence-based genetic markers, and an optical map, we dynamically picked a minimum tiling path (MTP) of 16,910 bacterial artificial chromosome (BAC) and fosmid clones that were used by the MGSC to sequence the maize genome. The final MTP resulted in a significantly improved physical map that reduced the number of contigs from 721 to 435, incorporated a total of 8,315 mapped markers, and ordered and oriented the majority of FPC contigs. The new integrated physical and genetic map covered 2,120 Mb (93%) of the 2,300-Mb genome, of which 405 contigs were anchored to the genetic map, totaling 2,103.4 Mb (99.2% of the 2,120 Mb physical map). More importantly, 336 contigs, comprising 94.0% of the physical map (∼1,993 Mb), were ordered and oriented. Finally we used all available physical, sequence, genetic, and optical data to generate a golden path (AGP) of chromosome-based pseudomolecules, herein referred to as the B73 Reference Genome Sequence version 1 (B73 RefGen_v1).