Meta-Analysis of the Dynamics of the Emergence of Mutations and Variants of SARS-CoV-2
Summary (1 min read)
INTRODUCTION
- In late December 2019, a new betacoronavirus known as Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-COV-2) emerged in the city of Wuhan in the province of Hubei, China (1).
- Coronaviruses are therefore expected to evolve through genetic drift much slower than other RNA viruses that do not have this ability, such as influenza viruses (8,10).
- Genetic variants are therefore the rare successful offshoots of the reference strain.
- In addition to the N501Y mutation, both the South African and Brazil variants possesses RBD mutations K417N(T) and E484K, which are also associated with further increased Nabs escape capabilities (24,25).
MATERIALS AND METHODS
- Data collection and mutational analysis Genomes uploaded to the GISAID EpiCoVTM server database were analyzed from December 1st, 2019, to December 31st, 2020, and selected viral sequences with submission dates from December 1st, 2019 to January 6, 2021.
- The authors filtered through 309,962 genomes for the analysis of selected mutations.
- For the analysis of the mutations in B.1.1.7, B.1.351, and P.1 variants, the authors used the GISAID EpiCoVTM server database.
- Only complete SARS-CoV-2 genomes (28 to 30 Kbps) isolated from human hosts were analyzed.
Structural modeling
- Mutations in the spike protein in complex with hACE2 were analyzed using a mutagenesis tool for PyMOL (PDB: 7A94).
- Figures and rendering were prepared with PyMOL.
RESULTS
- Identification of emerging mutations in various SARS-CoV-2 genes.
- The authors also demonstrate that most genes in SARS-CoV-2 have mutations with overall frequencies lower than 10% (Fig.1).
- Therefore, mutations presented here vastly underrepresent the global landscape of mutation frequency dynamics.
- These results allow us to better understand the frequencies, localization, and interactions of mutations in the S protein of the B.1.1.7 variant.
DISCUSSION
- The emergence of new genetic variants that are more transmissible, virulent and resistant to antibody neutralization have highlighted the importance of studying the function of mutations in the viral genome.
- The Oxford-AstraZeneca vaccine has displayed compromised efficacy against the B.1.351 variant with only 10% vaccine efficacy (36).
- The combination of D614G with other mutations in S can enable Nabs escape (19).
- The global frequency of mutations and variants in the database is therefore biased to represent the genetic landscape of the virus in the countries doing the most testing, sequencing, and data sharing.
- A global approach to analyzing both transmitted variants and non-transmitted sub-variants and quasispecies could provide a better understanding of the effects of SARS-CoV-2 mutations.
FIGURE LEGENDS
- Variations in mutations and mutation frequencies in SARS-CoV-2 genes, also known as Figure 1.
- Graphs were generated using RStudio and Biorender.
- G) Genome of the SARS-CoV-2 P.1 variant with identified nucleotides substitution, deletions, and insertions.
Did you find this useful? Give us your feedback
Citations
67 citations
58 citations
10 citations
3 citations
2 citations
References
21,455 citations
10,274 citations
9,231 citations
8,336 citations
7,219 citations
Related Papers (5)
Frequently Asked Questions (17)
Q2. What is the etiological agent for the COVID-19 pandemic?
SARS-CoV-2 is the etiological viral agent for the worldwide COVID-19 pandemic resulting in more than 162 million infected and 3.4 million deaths worldwide as of April, 2021 (2,3).
Q3. What are the mutations in the genome of SARS-CoV-2?
Mutations occur in the genome of SARS-CoV-2 during viral replication and affect viral infectivity, transmissibility, and virulence.
Q4. What is the importance of tracking mutations and the evolution of the SARS-CoV-2?
Tracking mutations and the evolution of the SARS-CoV-2 genome is critical for the development and deployment of effective treatments and vaccines.
Q5. What is the tool for tracking the emergence of mutations?
GISAID is a formidable tool for tracking the emergence of mutations, identifying the geographic region where it emerged, and tracking its spread around the globe.
Q6. What is the caveat in analyzing sequences in the GISAID database?
Another caveat in analyzing sequences in the GISAID database is that consensus sequences are uploaded, but subsequences and quasispecies are generally not included.
Q7. What is the effect of the K417T mutation on the C102 Nab?
The K417T mutation reduces interactions with neighboring residues in the C102 Nab and therefore the model predicts, as with K417N in B.1.1.7, an increased ability to escape neutralization (Fig. 7D).
Q8. What is the efficacy of the vaccine against the B.1.351 variant?
The recently approved Johnson & Johnson adenovirus-based vaccine only requires a single dose, in comparison to two for the mRNA vaccines and Oxford-AstraZeneca vaccine, and has an efficacy of 66% against the original Wuhan reference strain, 52.0 to 64.0% against the B.1.351 variant and 66% against the P.1 variant (43).
Q9. What is the effect of the P.1 vaccine on the immune system?
antibodies induced by the PfizerBioNTech and Moderna vaccines appear to display a 6.7-fold, and 4.5-fold decrease in neutralization efficacy against the P.1 variant (Table 6) (39).
Q10. What is the effect of the N501Y mutation on the hACE2?
This mutation is reported to cause increased resistance to Nabs, increased transmissibility, and increased virulence in animal models (27).
Q11. How many sequenced viral genomes were uploaded to the GISAID database?
The number of sequenced viral genomes uploaded to the GISAID database grew rapidly from 131,417 at the end of September, 2020 to 451,913 by January 30th, 2021 (28).
Q12. What is the association between N501Y and hACE2?
The N501Y mutation is associated with an increased affinity to hACE2, along with an increase in infectivity and virulence (Table 2).
Q13. What is the recent discovery of the B.1.1.7 variant in the UK?
A new variant was discovered in late 2020 in the UK that displayed increased affinity to hACE2, virulence and Nabs escape capabilities (Fig. 4) (24,25,30,31).
Q14. What is the frequency of the mutations in the SARS-CoV-2 genome?
Here the authors present a retrospective metadata analysis of mutations throughout the SARS-CoV-2 genome that reached at least a 1% worldwide frequency between December 2019 and January 2021.
Q15. What are the prevalent mutations in the B.1.1.7 variant?
The authors also observe the emergence of several mutations in S and N present in the B.1.1.7 variant, which now have a frequency higher than 65% (Fig. 2C, 2D and Table 2).
Q16. What is the reason why new variants may go undetected?
Emergence of new variants may therefore go undetected until they leave their point of origin and enter countries with high testing and sequencing rates.
Q17. What is the frequency of the recurring mutations in the SARS-CoV-2 genome?
The authors first selected recurring mutations that were present in more than 500 reported genomes by August 2020, and another selection was made in January 2021 to capture recurring mutations present in more than 4000 reported genomes in GISAID.