    1288 Spatial transcriptomics-enabled integrated morphology-transcriptome tumor cell phenotyping using machine learning
    Abstract:

    Background

    In routine cancer diagnosis, pathologists manually characterize tumor cells from hematoxylin and eosin (H&E)-stained images. Separately, transcriptome-based molecular subtypes of tumors have been shown to be associated with important clinical features, including tumorigenesis and prognosis. Leveraging recent developments in spatial transcriptomics (ST), which allows in situ transcriptomic profiling of tissues [1], we aim to develop a first-of-its-kind machine learning (ML)-enabled approach for integrated morphology-transcriptome phenotyping of single tumor cells.

    Methods

    Two tissue sections each from the tumor and adjacent-normal areas of a hepatocellular carcinoma (HCC) patient were profiled using the 10× Visium ST platform. Using the companion H&E image, individual epithelial cells were segmented (StarDist algorithm) and 53 morphological and staining features were extracted (QuPath v0.3.2). These cells were clustered without supervision using an encoder-based ensemble method, in which the optimal clustering solution was selected by a consensus score of three clustering metrics. Phenotypic gene signatures of the cell clusters were determined by deconvoluting the ST data. Gene ontology (GO) analysis was performed using single-sample gene set enrichment, based on the Molecular Signatures Database.
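
The selection of an optimal clustering solution by a consensus of three metrics can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not name the three metrics or say how they are combined, so the silhouette, Calinski-Harabasz, and Davies-Bouldin indices, the min-max scaling, and the unweighted mean are all assumptions. The metric values in `example` are made-up placeholders, not results from the study.

```python
from statistics import mean


def min_max(values, higher_is_better=True):
    """Scale values to [0, 1]; flip the scale when lower raw values are better."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All candidates tie on this metric, so it carries no information.
        return [0.5] * len(values)
    scaled = [(v - lo) / (hi - lo) for v in values]
    return scaled if higher_is_better else [1.0 - s for s in scaled]


def consensus_scores(metrics_by_k):
    """Average three scaled metrics for each candidate cluster number k.

    Assumed metrics (hypothetical choice): silhouette and Calinski-Harabasz
    (higher is better) and Davies-Bouldin (lower is better).
    """
    ks = sorted(metrics_by_k)
    sil = min_max([metrics_by_k[k]["silhouette"] for k in ks])
    ch = min_max([metrics_by_k[k]["calinski_harabasz"] for k in ks])
    db = min_max(
        [metrics_by_k[k]["davies_bouldin"] for k in ks], higher_is_better=False
    )
    return {k: mean(triple) for k, triple in zip(ks, zip(sil, ch, db))}


def best_k(metrics_by_k):
    """Return the candidate k with the highest consensus score."""
    scores = consensus_scores(metrics_by_k)
    return max(scores, key=scores.get)


# Made-up illustrative values for candidate solutions with 2 to 5 clusters.
example = {
    2: {"silhouette": 0.30, "calinski_harabasz": 900.0, "davies_bouldin": 1.40},
    3: {"silhouette": 0.34, "calinski_harabasz": 1100.0, "davies_bouldin": 1.20},
    4: {"silhouette": 0.41, "calinski_harabasz": 1250.0, "davies_bouldin": 0.95},
    5: {"silhouette": 0.36, "calinski_harabasz": 1150.0, "davies_bouldin": 1.10},
}

print(best_k(example))
```

Scaling each metric before averaging keeps a metric with a large numeric range, such as Calinski-Harabasz, from dominating the consensus; a rank-based aggregation would be an equally plausible alternative.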

    Results

    At the optimal clustering setting, 4 epithelial cell clusters, characterized by differential nuclear size, were detected individually in each HCC tissue (figure 1). Manual inspection by a pathologist (YZ) confirmed that the tumor epithelial cells had different nuclear sizes. It also revealed that the two smaller-cell clusters appeared relatively well-differentiated, with ~1% found outside the tumor nest, suggesting potential epithelial-to-mesenchymal transition (EMT) activity, whereas the two larger-cell clusters were moderately differentiated and showed hyperchromatic nuclei and pleomorphism. GO analysis confirmed the upregulation of EMT in the smallest cluster in both tumor tissues. Although epithelial cells in the two adjacent-normal tissues appeared morphologically non-cancerous, the corresponding cell clusters made up cell fractions similar to those in the tumor tissues; the two smaller-cell clusters accounted for ~70% of the total cells across all tissues (figure 2). Cell clusters with similar nuclear size shared 30%-65% of the top 20 pathways across tissues, indicating inter-tissue phenotypic consistency. Cells were found nearest to cells of their own type, followed by cell types of similar size, suggesting preferential spatial clustering of similar phenotypes (figure 3).
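
The "shared fraction of the top 20 pathways" comparison can be illustrated with a short sketch. It assumes each cluster has one enrichment score per pathway and that the top pathways are simply those with the highest scores; the abstract does not specify the ranking rule. The pathway names and scores below are hypothetical placeholders, and the toy call uses n=4 rather than 20 to stay readable.

```python
def top_pathways(scores, n=20):
    """Return the n pathways with the highest scores (ties broken by name)."""
    ranked = sorted(scores, key=lambda pathway: (-scores[pathway], pathway))
    return set(ranked[:n])


def shared_fraction(scores_a, scores_b, n=20):
    """Fraction of the top-n pathways that two clusters have in common."""
    shared = top_pathways(scores_a, n) & top_pathways(scores_b, n)
    return len(shared) / n


# Hypothetical enrichment scores for one cluster in two different tissues.
cluster_in_tissue_1 = {"PW_A": 0.9, "PW_B": 0.8, "PW_C": 0.7, "PW_D": 0.6, "PW_E": 0.1}
cluster_in_tissue_2 = {"PW_A": 0.5, "PW_B": 0.9, "PW_E": 0.8, "PW_F": 0.7, "PW_C": 0.1}

print(shared_fraction(cluster_in_tissue_1, cluster_in_tissue_2, n=4))
```

In this toy case the top-4 sets share PW_A and PW_B, so the shared fraction is 0.5. The denominator is fixed at n, so inputs with fewer than n pathways will understate the overlap.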

    Conclusions

    Our ML approach revealed four morphologically and transcriptomically distinct tumor cell subsets in the HCC tissues, with the smallest cells appearing EMT-like. We revealed intra-patient tumor cell heterogeneity, yet phenotypic consistency across tissue sampling sites. Altogether, our proposed approach would enable more refined tumor cell phenotyping, advancing our understanding of tumor biology.

    Acknowledgements

    I would like to thank the NTU Undergraduate Research Experience on Campus (URECA) program for giving me the opportunity to work on this project over the past year.

    Reference

    Nerurkar SN, Goh D, Cheung CCL, Nga PQY, Lim JCT, Yeong JPS. Transcriptional spatial profiling of cancer tissues in the era of immunotherapy: the potential and promise. Cancers. 2020;12:2572.

    Ethics Approval

    This study was approved by the SingHealth Centralized Institutional Review Board (reference numbers: 2018/3045 and 2019/2653).

    Consent

    The patients provided their written informed consent to participate in this study.