Mutational insights into the envelope protein of SARS-CoV-2

2021 
Abstract The ongoing mutations in the structural proteins of SARS-CoV-2 are the major impediment for prevention and control of the COVID-19 disease. Presently we focused on evolution of the envelope (E) protein, one of the most enigmatic and less studied protein among the four structural proteins (S, E, M and N) associated with multitude of immunopathological functions of SARS-CoV-2. In the present study, we comprehensively analyzed 81,818 high quality E protein sequences of SARS-CoV-2 globally available in the GISAID database as of 20 August 2020. Compared to Wuhan reference strain, our mutational analysis explored only 1.2 % (982/81818) mutant strains undergoing a total of 115 unique amino acid (aa) substitutions in the E protein, highlighting the fact that most (98.8 %) of the E protein of SARS-CoV-2 strains are highly conserved. Moreover, we found 58.77 % (134 of 228) nucleotides (nt) positions of SARS-CoV-2 E gene encountering a total of 176 unique nt-level mutations globally, which may affect the efficacy of real time RT-PCR-based molecular detection of COVID-19. Importantly, higher aa variations observed in the C-terminal domain (CTD) of the E protein, particularly at Ser55-Phe56, Arg69 and the C-terminal end (DLLV: 72–75) may alter the binding of SARS-CoV-2 Envelope protein to tight junction-associated PALS1 and thus could play a key role in COVID-19 pathogenesis. Furthermore, this study revealed the V25A mutation in the transmembrane domain which is a key factor for the homopentameric conformation of E protein. Our analysis also observed a triple cysteine motif harboring mutation (L39M, A41S, A41V, C43F, C43R, C43S, C44Y, N45R) which may hinder the binding of E protein with spike glycoprotein. These results therefore suggest the continuous monitoring of the structural proteins including the envelope protein of SARS-CoV-2 since the number of genome sequences from across the world are continuously increasing.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    52
    References
    16
    Citations
    NaN
    KQI
    []