Structural and Functional Implications of Non-synonymous Mutations in the Spike protein of 2,954 SARS-CoV-2 Genomes

2020 
SARS-CoV-2, the causative agent of the COVID-2019 pandemic is an RNA virus prone to mutations. Information on mutations within the circulating strains of the virus is pivotal to understand disease spread and dynamics. Here, we analyse the mutations associated with 2,954 globally reported high quality genomes of SARS-CoV-2 with special emphasis on genomes of viral strains from India. Molecular phylogenetic analysis suggests that SARS-CoV-2 strains circulating in India form five distinct phyletic clades designated R1-R5. These clades categorize into the previously reported S, G as well as a new unclassified subtype. A detailed analysis of the gene encoding the spike (S) protein in the strains across the globe shows non-synonymous mutations on 54 amino acid residues. Among these, we pinpoint 4 novel mutations in the region that interacts with the human ACE2 receptor (RBD). Further in silico molecular docking analyses suggest that these RBD mutations could alter the binding affinity of S-protein with ACE2 that may lead to changes in SARS-CoV-2 infectivity. Strikingly, one of these RBD mutations (S438F) is unique to a subset within the R4 clade suggesting intrinsic S-protein variations in strains currently circulating in India. Together, our findings reveal a unique pattern of SARS-CoV-2 evolution that may alert vaccine and therapeutic development.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    46
    References
    2
    Citations
    NaN
    KQI
    []