Uncovering Transcriptional Dark Matter via Gene Annotation Independent Single-Cell RNA Sequencing Analysis

2020 
Single-cell RNA sequencing (scRNA-seq) enables the study of cell biology with high resolution. scRNA-seq expression analyses rely on the availability of a high quality annotation of genes in the genome. Yet, as we show here with scRNA-seq experiments and analyses spanning human, mouse, chicken, mole rat, lemur and sea urchin, gene annotations often fail to cover the full transcriptome of every cell type at every stage of development, in particular for organisms that are not routinely studied. To overcome this hurdle, we created a scRNA-seq analysis routine that recovers biologically relevant information beyond the scope of the best available gene annotation. This is achieved by performing single-cell expression analysis on any region in the genome for which transcriptional products are detected. Our routine identifies transcriptionally active regions (TARs) using a hidden Markov model, generates a matrix of expression levels for all TARs across all cells in a dataset, performs single-cell TAR expression analysis to identify TARs that are biologically significant, and then annotates biologically significant TARs using gene homology analysis. This procedure leverages single-cell expression analyses as a filter to direct annotation efforts to biologically significant transcripts in complex tissues and thereby uncovers biology to which scRNA-seq would otherwise be in the dark.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    40
    References
    0
    Citations
    NaN
    KQI
    []