Abstract Studies of complex disorders benefit from integrative analyses of multiple omics data. Yet, sample mix-ups frequently occur in multi-omics studies, weakening statistical power and risking false findings. Accurately aligning sample information, genotype, and corresponding omics data is critical for integrative analyses. We developed DRAMS ( https://github.com/Yi-Jiang/DRAMS ) to Detect and Re-Align Mixed-up Samples to address the sample mix-up problem. It uses a logistic regression model followed by a modified topological sorting algorithm to identify the potential true IDs based on data relationships of multi-omics. According to tests using simulated data, the more types of omics data used or the smaller the proportion of mix-ups, the better that DRAMS performs. Applying DRAMS to real data from the PsychENCODE BrainGVEX project, we detected and corrected 201 (12.5% of total data generated) mix-ups. Of the 21 mix-ups involving errors of racial identity, DRAMS re-assigned all samples to the correct racial cluster in the 1000 Genomes project. In doing so, quantitative trait loci (QTL) (FDR<0.01) increased by an average of 1.62-fold. The use of DRAMS in multi-omics studies will strengthen statistical power of the study and improve quality of the results. Even though very limited studies have multi-omics data in place, we expect such data will increase quickly with the needs of DRAMS. Author summary Sample mix-up happens inevitably during sample collection, processing, and data management. It leads to reduced statistical power and sometimes false findings. It is of great importance to correct mixed-up samples before conducting any downstream analyses. We developed DRAMS to detect and re-align mixed-up samples in multi-omics studies. The basic idea of DRAMS is to align the data and labels for each sample leveraging the genetic information of multi-omics data. DRAMS corrects sample IDs following a two-step strategy. At first, it estimates pairwise genetic relatedness among all the data generated from all the individuals. Because the different data generated from the same individual should share the same genetics, we can cluster all the highly related data and consider that the data from one cluster have only one potential ID. Then, we used a “majority vote” strategy to infer the potential ID for individuals in each cluster. Other information, such as match of genetics-based and reported sexes, omics priorority, etc., were also used to direct identifying the potential IDs. It has been proved that DRAMS performs very well in both simulation and PsychENCODE BrainGVEX multi-omics data.
Most genetic risk for psychiatric disease lies in regulatory regions, implicating pathogenic dysregulation of gene expression and splicing. However, comprehensive assessments of transcriptomic organization in diseased brains are limited. In this work, we integrated genotypes and RNA sequencing in brain samples from 1695 individuals with autism spectrum disorder (ASD), schizophrenia, and bipolar disorder, as well as controls. More than 25% of the transcriptome exhibits differential splicing or expression, with isoform-level changes capturing the largest disease effects and genetic enrichments. Coexpression networks isolate disease-specific neuronal alterations, as well as microglial, astrocyte, and interferon-response modules defining previously unidentified neural-immune mechanisms. We integrated genetic and genomic data to perform a transcriptome-wide association study, prioritizing disease loci likely mediated by cis effects on brain expression. This transcriptome-wide characterization of the molecular pathology across three major psychiatric disorders provides a comprehensive resource for mechanistic insight and therapeutic development.
INTRODUCTION The brain is responsible for cognition, behavior, and much of what makes us uniquely human. The development of the brain is a highly complex process, and this process is reliant on precise regulation of molecular and cellular events grounded in the spatiotemporal regulation of the transcriptome. Disruption of this regulation can lead to neuropsychiatric disorders. RATIONALE The regulatory, epigenomic, and transcriptomic features of the human brain have not been comprehensively compiled across time, regions, or cell types. Understanding the etiology of neuropsychiatric disorders requires knowledge not just of endpoint differences between healthy and diseased brains but also of the developmental and cellular contexts in which these differences arise. Moreover, an emerging body of research indicates that many aspects of the development and physiology of the human brain are not well recapitulated in model organisms, and therefore it is necessary that neuropsychiatric disorders be understood in the broader context of the developing and adult human brain. RESULTS Here we describe the generation and analysis of a variety of genomic data modalities at the tissue and single-cell levels, including transcriptome, DNA methylation, and histone modifications across multiple brain regions ranging in age from embryonic development through adulthood. We observed a widespread transcriptomic transition beginning during late fetal development and consisting of sharply decreased regional differences. This reduction coincided with increases in the transcriptional signatures of mature neurons and the expression of genes associated with dendrite development, synapse development, and neuronal activity, all of which were temporally synchronous across neocortical areas, as well as myelination and oligodendrocytes, which were asynchronous. Moreover, genes including MEF2C , SATB2 , and TCF4 , with genetic associations to multiple brain-related traits and disorders, converged in a small number of modules exhibiting spatial or spatiotemporal specificity. CONCLUSION We generated and applied our dataset to document transcriptomic and epigenetic changes across human development and then related those changes to major neuropsychiatric disorders. These data allowed us to identify genes, cell types, gene coexpression modules, and spatiotemporal loci where disease risk might converge, demonstrating the utility of the dataset and providing new insights into human development and disease. Spatiotemporal dynamics of human brain development and neuropsychiatric risks. Human brain development begins during embryonic development and continues through adulthood (top). Integrating data modalities (bottom left) revealed age- and cell type–specific properties and global patterns of transcriptional dynamics, including a late fetal transition (bottom middle). We related the variation in gene expression (brown, high; purple, low) to regulatory elements in the fetal and adult brains, cell type–specific signatures, and genetic loci associated with neuropsychiatric disorders (bottom right; gray circles indicate enrichment for corresponding features among module genes). Relationships depicted in this panel do not correspond to specific observations. CBC, cerebellar cortex; STR, striatum; HIP, hippocampus; MD, mediodorsal nucleus of thalamus; AMY, amygdala.
The impact of genetic variants on gene expression has been intensely studied at the transcription level, yielding in valuable insights into the association between genes and the risk of complex disorders, such as schizophrenia (SCZ). However, the downstream impact of these variants and the molecular mechanisms connecting transcription variation to disease risk are not well understood.We quantitated ribosome occupancy in prefrontal cortex samples of the BrainGVEX cohort. Together with transcriptomics and proteomics data from the same cohort, we performed cis-Quantitative Trait Locus (QTL) mapping and identified 3,253 expression QTLs (eQTLs), 1,344 ribosome occupancy QTLs (rQTLs), and 657 protein QTLs (pQTLs) out of 7,458 genes quantitated in all three omics types from 185 samples. Of the eQTLs identified, only 34% have their effects propagated to the protein level. Further analysis on the effect size of prefrontal cortex eQTLs identified from an independent dataset showed clear post-transcriptional attenuation of eQTL effects. To investigate the biological relevance of the attenuated eQTLs, we identified 70 expression-specific QTLs (esQTLs), 51 ribosome-occupancy-specific QTLs (rsQTLs), and 107 protein-specific QTLs (psQTLs). Five of these omics-specific QTLs showed strong colocalization with SCZ GWAS signals, three of them are esQTLs. The limited number of GWAS colocalization discoveries from omics-specific QTLs and the apparent prevalence of eQTL attenuation prompted us to take a complementary approach to investigate the functional relevance of attenuated eQTLs. Using S-PrediXcan we identified 74 SCZ risk genes, 34% of which were novel, and 67% of these risk genes were replicated in a MR-Egger test. Notably, 52 out of 74 risk genes were identified using eQTL data and 70% of these SCZ-risk-gene-driving eQTLs show little to no evidence of driving corresponding variations at the protein level.The effect of eQTLs on gene expression in the prefrontal cortex is commonly attenuated post-transcriptionally. Many of the attenuated eQTLs still correlate with SCZ GWAS signal. Further investigation is needed to elucidate a mechanistic link between attenuated eQTLs and SCZ disease risk.
The clinical similarity among different neuropsychiatric disorders (NPDs) suggested a shared genetic basis. We catalogued 23,109 coding de novo mutations (DNMs) from 6511 patients with autism spectrum disorder (ASD), 4,293 undiagnosed developmental disorder (UDD), 933 epileptic encephalopathy (EE), 1022 intellectual disability (ID), 1094 schizophrenia (SCZ), and 3391 controls. We evaluated that putative functional DNMs contribute to 38.11%, 34.40%, 33.31%, 10.98% and 6.91% of patients with ID, EE, UDD, ASD and SCZ, respectively. Consistent with phenotype similarity and heterogeneity in different NPDs, they show different degree of genetic association. Cross-disorder analysis of DNMs prioritized 321 candidate genes (FDR < 0.05) and showed that genes shared in more disorders were more likely to exhibited specific expression pattern, functional pathway, genetic convergence, and genetic intolerance.
Unicellular S. cerevisiae cells switch from the yeast form to pseudohyphal or filamentous form in response to environmental cues. We report that wild-type BY diploids (in which yeast ORFs have been systematically deleted) undergo normal HU-induced filamentous growth and discernable nitrogen starvation-induced filamentous growth, despite their perceived filamentation-deficient S288C genetic background. This finding allowed us to perform a genome-wide survey for non-essential genes that are required for filamentous growth with the homozygous deletion strains. We report that genes involved in endocytosis are required for both HU-induced and nitrogen starvation-induced filamentous growth. Surprisingly, no known genes involved in exocytosis are required. Despite the fact that polarized growth involves transport of vesicles to the site of growth, we failed to obtain genetic/genomic evidence that exocytosis plays an essential role in filamentous growth. A possible key role of polarized endocytosis (from the growth tip) is consistent with the proposed biological function of filamentous growth as a foraging behaviour. In addition, BUD8 that encodes the distal landmark in yeast-form bipolar budding is required for nitrogen starvation-induced but not HU-induced filamentous growth. Moreover, BUD5, SPA2, PEA2 and BUD6 that regulate bipolar bud site selection do not regulate the unipolar distal budding pattern in HU-induced filamentous growth.
Abstract Background Cancer/testis antigens (CTAs) participate in the regulation of malignant biological behaviors in breast cancer. However, the function and mechanism of KK-LC-1, a member of the CTA family, in breast cancer are still unclear. Methods Bioinformatic tools, immunohistochemistry, and western blotting were utilized to detect the expression of KK-LC-1 in breast cancer and to explore the prognostic effect of KK-LC-1 expression in breast cancer patients. Cell function assays, animal assays, and next-generation sequencing were utilized to explore the function and mechanism of KK-LC-1 in the malignant biological behaviors of triple-negative breast cancer. Small molecular compounds targeting KK-LC-1 were also screened and drug susceptibility testing was performed. Results KK-LC-1 was significantly highly expressed in triple-negative breast cancer tissues than in normal breast tissues. KK-LC-1 high expression was related to poor survival outcomes in patients with breast cancer. In vitro studies suggested that KK-LC-1 silencing can inhibit triple-negative breast cancer cell proliferation, invasion, migration, and scratch healing ability, increase cell apoptosis ratio, and arrest the cell cycle in the G0–G1 phase. In vivo studies have suggested that KK-LC-1 silencing decreases tumor weight and volume in nude mice. Results showed that KK-CL-1 can regulate the malignant biological behaviors of triple-negative breast cancer via the MAL2/MUC1-C/PI3K/AKT/mTOR pathway. The small-molecule compound Z839878730 had excellent KK-LC-1 targeting ability and cancer cell killing ability. The EC 50 value was 9.7 μM for MDA-MB-231 cells and 13.67 µM for MDA-MB-468 cells. Besides, Z839878730 has little tumor-killing effect on human normal mammary epithelial cells MCF10A and can inhibit the malignant biological behaviors of triple-negative breast cancer cells by MAL2/MUC1-C/PI3K/AKT/mTOR pathway. Conclusions Our findings suggest that KK-LC-1 may serve as a novel therapeutic target for triple-negative breast cancer. Z839878730, which targets KK-LC-1, presents a new path for breast cancer clinical treatment.