Charting an organs’ biological atlas requires us to spatially resolve the entire single-cell transcriptome, and to relate such cellular features to the anatomical scale. Single-cell and single-nucleus RNA-seq (sc/snRNA-seq) can profile cells comprehensively, but lose spatial information. Spatial transcriptomics allows for spatial measurements, but at lower resolution and with limited sensitivity. Targeted in situ technologies solve both issues, but are limited in gene throughput. To overcome these limitations we present Tangram, a method that aligns sc/snRNA-seq data to various forms of spatial data collected from the same region, including MERFISH, STARmap, smFISH, Spatial Transcriptomics (Visium) and histological images. Tangram can map any type of sc/snRNA-seq data, including multimodal data such as those from SHARE-seq, which we used to reveal spatial patterns of chromatin accessibility. We demonstrate Tangram on healthy mouse brain tissue, by reconstructing a genome-wide anatomically integrated spatial map at single-cell resolution of the visual and somatomotor areas. Tangram is a versatile tool for aligning single-cell and single-nucleus RNA-seq data to spatially resolved transcriptomics data using deep learning.
ABSTRACT The COVID-19 pandemic, caused by the novel coronavirus SARS-CoV-2, creates an urgent need for identifying molecular mechanisms that mediate viral entry, propagation, and tissue pathology. Cell membrane bound angiotensin-converting enzyme 2 (ACE2) and associated proteases, transmembrane protease serine 2 (TMPRSS2) and Cathepsin L (CTSL), were previously identified as mediators of SARS-CoV2 cellular entry. Here, we assess the cell type-specific RNA expression of ACE2 , TMPRSS2 , and CTSL through an integrated analysis of 107 single-cell and single-nucleus RNA-Seq studies, including 22 lung and airways datasets (16 unpublished), and 85 datasets from other diverse organs. Joint expression of ACE2 and the accessory proteases identifies specific subsets of respiratory epithelial cells as putative targets of viral infection in the nasal passages, airways, and alveoli. Cells that co-express ACE2 and proteases are also identified in cells from other organs, some of which have been associated with COVID-19 transmission or pathology, including gut enterocytes, corneal epithelial cells, cardiomyocytes, heart pericytes, olfactory sustentacular cells, and renal epithelial cells. Performing the first meta-analyses of scRNA-seq studies, we analyzed 1,176,683 cells from 282 nasal, airway, and lung parenchyma samples from 164 donors spanning fetal, childhood, adult, and elderly age groups, associate increased levels of ACE2 , TMPRSS2 , and CTSL in specific cell types with increasing age, male gender, and smoking, all of which are epidemiologically linked to COVID-19 susceptibility and outcomes. Notably, there was a particularly low expression of ACE2 in the few young pediatric samples in the analysis. Further analysis reveals a gene expression program shared by ACE2 + TMPRSS2 + cells in nasal, lung and gut tissues, including genes that may mediate viral entry, subtend key immune functions, and mediate epithelial-macrophage cross-talk. Amongst these are IL6, its receptor and co-receptor, IL1R , TNF response pathways, and complement genes. Cell type specificity in the lung and airways and smoking effects were conserved in mice. Our analyses suggest that differences in the cell type-specific expression of mediators of SARS-CoV-2 viral entry may be responsible for aspects of COVID-19 epidemiology and clinical course, and point to putative molecular pathways involved in disease susceptibility and pathogenesis.
Unfortunately, the original version of this article [1] contained an error. Figures 2, ,44 and and55 were incorrect and the captions for Figs. 4 and and55 were incorrect. Below are the correct figures and captions:
Fig. 2
Histogram of k-mer relative abundances. Both 20- and 25-mer relative abundance densities appear log-laplacian. These data included 20- and 25-mers found in all tumor cells. a Histogram of 20-mer relative abundances in log10 scale. b Histogram of 25-mer ...
Fig. 4
20-mer bootstrap consensus neighbor-joining tree built from T10 primary breast tumor cells (prefix C), T16 primary (prefix P) and metastatic data (prefix M). Distinct groupings of cells are labeled as clusters
Fig. 5
20-mer bootstrap consensus neighbor-joining tree built from T16 primary (prefix P) and metastatic data (prefix M). Distinct groupings of cells are labeled as clusters
Abstract The balance between T helper type 1 (Th1) cells and other Th cells is critical for antiviral and anti-tumor responses, but how this fine balance is achieved remains poorly understood. Here, we dissected the dynamic regulation of Th1 cell differentiation during in vitro polarization, as well as in vivo differentiation upon acute viral infection, using scRNA-seq and multiple in vitro and in vivo CRISPR screens. We confirmed the role of known regulators and identified 5 novel regulators for Th1 differentiation. Among the novel regulators the neuropeptide receptor RAMP3, which is a component of the receptor for calcitonin gene-related peptide (CGRP), plays a cell-intrinsic role in Th1 cell-fate determination. Using a unique Th1/Th2 dichotomous culture system, we identified that RAMP3-CGRP interaction directly restricted the differentiation of Th2 cells but promoted Th1 differentiation through activation of downstream cyclic AMP (cAMP) signaling in T cells. Mechanistically, RAMP3 and cAMP signaling resulted in global changes in chromatin accessibility, blocking Th2 genes and specific induction of Th1 programs through activation of IFNγ-STAT1 pathway. Furthermore, both CGRP and RAMP3 were required for inducing effective anti-viral T cell responses to control acute viral infection. Our work reveals a novel neuro-immune circuit in which tissue itself participates in T cell fate determination by producing a neuropeptide during acute viral infection, which acts on RAMP3 expressing T cells to induce an effective anti-viral Th1 response.
Abstract Quality control (QC) of cells, a critical step in single-cell RNA sequencing data analysis, has largely relied on arbitrarily fixed data-agnostic thresholds on QC metrics such as gene complexity and fraction of reads mapping to mitochondrial genes. The few existing data-driven approaches perform QC at the level of samples or studies without accounting for biological variation in the commonly used QC criteria. We demonstrate that the QC metrics vary both at the tissue and cell state level across technologies, study conditions, and species. We propose data-driven QC ( ddqc ), an unsupervised adaptive quality control framework that performs flexible and data-driven quality control at the level of cell states while retaining critical biological insights and improved power for downstream analysis. On applying ddqc to 6,228,212 cells and 835 mouse and human samples, we retain a median of 39.7% more cells when compared to conventional data-agnostic QC filters. With ddqc , we recover biologically meaningful trends in gene complexity and ribosomal expression among cell-types enabling exploration of cell states with minimal transcriptional diversity or maximum ribosomal protein expression. Moreover, ddqc allows us to retain cell-types often lost by conventional QC such as metabolically active parenchymal cells, and specialized cells such as neutrophils or gastric chief cells. Taken together, our work proposes a revised paradigm to quality filtering best practices - iterative QC, providing a data-driven quality control framework compatible with observed biological diversity.