Meta-transcriptomic analysis reveals the gene expression and novel conserved sub-genomic RNAs in SARS-CoV-2 and MERS-CoV

2021 
Fundamental to viral biology is identification and annotation of viral genes and their function. Determining the level of coronavirus gene expression is inherently difficult due to the positive stranded RNA genome and the identification of subgenomic RNAs (sgRNAs) that are required for expression of most viral genes. We developed a bioinformatic pipeline to analyze meta-transcriptomic data from 20 independent studies encompassing 588 individual samples covering more than 10 coronavirus species. Our result showed that SARS-CoV, SARS-CoV-2 and MERS-CoV each had a core sgRNA repertoire generated via a canonical mechanism. Novel sgRNAs that encode peptides with evolutionarily conserved structures were identified in several coronaviruses and were expressed in vitro and in vivo, greatly expanding the predicted number of coronaviruses proteins. Two novel peptides may have direct functional relevance to disease, by alluding interferon responses and disrupting IL17E (IL25) signaling. Relevant to coronavirus infectivity and transmission, we also observed that the level of Spike sgRNAs were significantly higher in-vivo than in-vitro, while the opposite held true for the Nucleocapside protein. Our methods and findings shed new light on coronavirus biology and provides a valuable resource for future development of therapeutics.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    34
    References
    2
    Citations
    NaN
    KQI
    []