The ongoing evolution of variants of concern and interest of SARS-CoV-2 in Brazil revealed by convergent indels in the amino (N)-terminal domain of the Spike protein
Summary (2 min read)
Introduction
- Recurrent deletions in the amino (N)-terminal domain (NTD) of the spike (S) glycoprotein of SARS-CoV-2 have been identified during long-term infection of immunocompromised patients 1–4 as well as during extended human-to-human transmission 3.
- The VOCs B.1.1.7 and B.1.351 are resistant to neutralization by several anti-NTD monoclonal antibodies (mAbs) and NTD deletions at RDR2 and RDR4 are important for such phenotype 9–14.
- With the exception of N.10, none of the other variants described in Brazil displayed indels in the NTD.
Results and Discussion
- The authors genomic survey identified 35 SARS-CoV-2 sequences from seven different Brazilian states that harbor a variable combination of mutations in the RBD (K417T, E484K, N501Y) and indels in the NTD region of the S protein.
- The ML phylogenetic analyses further confirm that all sequences belonging to variants P.1-like ins214ANRN (Fig. 1A) and VOI N.10 (Fig. 1B) branched in highly supported (aLRT > 99%) monophyletic clades.
- Most P.1 𝚫144, 𝚫141-144 and 𝚫189-190 sequences were detected in the Amazonas or were recovered from individuals that were transferred from or that reported a travel history to the Amazonas state, like all P.1 𝚫144 sequences from the Bahia state described previously 21 and in the present study (Table 1).
- These findings suggest that NTD indels detected here probably abrogate the binding of NAb directed against the antigenic-supersite and other epitopes.
- Finally, a recent longitudinal analysis of intra-host SARS-CoV-2 evolution during acute infection in one immunocompetent individual revealed the emergence of virus haplotypes bearing deletions 𝚫144 and 𝚫141-144 in the NTD following the development of autologous anti-NTD specific antibodies 39.
Material and Methods
- The SARS-CoV-2 genomes were recovered using Illumina sequencing protocols as previously described 43,44.
- The alignment was refined using the InDels and Structural Variants module.
- Additionally, the same reads were imported in a different pipeline 45 based on Bowtie2 and bcftools 46 mapping and consensus generation allowing us to further confirm the indels supported by paired-end reads, removing putative indels with less than 10x of sequencing depth and with mapping read quality score below to 10 for all samples sequenced in this study.
Structural Modeling
- The resolved crystallographic structure of SARS-CoV-2 NTD protein bound to the neutralizing antibody 2-51 was retrieved from the Protein Databank (PDB) under the accession code 7L2C 33.
- This structure was then used as a template to model the NTD variants using the Swiss-Model webserver.
- The modeled structures of the NTDs variants were superimposed onto the coordinates of the PDB ID 7L2C to visualize the differences between the NTD-antibody binding interfaces.
- Image rendering was carried out using Visual Molecular Dynamics (VMD) software 51. 54 and the InterfaceAnalyzer protocol of the Rosetta package interfaced with the RosettaScripts scripting language 55.
Epitope prediction
- Epitopes in the NTD region were predicted by the ElliPro Antibody Epitope Prediction server 56.
- NTD are shown as predicted linear epitopes when using PDB accession codes 6VXX 57 and 6VSB 58, (structural coordinates corresponding to the entire S protein), along with a minimum score of 0.9, i.e., a highly strict criterion.
Did you find this useful? Give us your feedback
Citations
169 citations
81 citations
10 citations
10 citations
8 citations
References
46,130 citations
25,325 citations
13,668 citations
8,377 citations
7,324 citations
Related Papers (5)
Frequently Asked Questions (15)
Q2. What can be altered by the SARS-CoV-2 Spike?
Mutations at both the receptor-binding domain (RBD) and the amino (N)-terminal domain (NTD) of the SARS-CoV-2 Spike (S) glycoprotein can alter its antigenicity and promote immune escape.
Q3. Where do some bats SC2r-CoV lineages display deletions?
Deletions 𝚫242-244 and 𝚫256-258 occur immediately upstream and downstream to IR-5, respectively, where some bat and pangolin SC2r-CoV lineages also displayed deletions.
Q4. What is the mechanism of evolution of SARS-CoV-2?
Several studies of SARS-CoV-2 evolution in vitro and ex vivo also support thatNTD indels here observed in Brazilian SARS-CoV-2 VOC and VOI represent a mechanism of ongoing adaptive evolution to escape from dominant neutralizing antibodies directed against the NTD.
Q5. What is the role of the NTD in the evolution of SARS-CoV-2?
In vitro co-incubation of SARS-CoV-2 with highly neutralizing plasma form COVID-19 convalescent patient, has revealed an incremental resistance to neutralization followed by the stepwise acquisition of indels at N3/N5 loops 35.
Q6. What is the role of the NTD in the transmission of SARS-CoV-2?
The authors identified that SARS-CoV-2 lineages circulating in Brazil with mutations of concern in the RBD independently acquired convergent deletions and insertions in the NTD of the S protein, which altered the NTD antigenic-supersite and other predicted epitopes at this region.
Q7. What was the sequence alignment used to determine the sequences of SARS-CoV-2?
The S amino acid sequences from selected SARS-CoV-2 and SC2r-CoV lineages available in the EpiCoV database were also aligned using Clustal W 49 adjusted by visual inspection.
Q8. What are the main findings of this study?
These findings support that the ongoing widespread transmission of SARS-CoV-2 in Brazil is generating new viral lineages that might be more resistant to antibody neutralization than parental variants of concern.
Q9. What is the role of the Spike gene in the evolution of SARS-CoV-2?
Studies of intra-host SARS-CoV-2 evolution in immuno-compromised hosts revealed the emergence of viral variants with NTD deletions at RDR1 (𝚫69-70), RDR2 (𝚫144 and 𝚫141-144) and RDR4 (𝚫243-244) following therapy with convalescent plasma 1,3,4,36,37.
Q10. What is the criterion for the prediction of NTD?
NTD are shown as predicted linear epitopes when using PDB accession codes 6VXX 57 and 6VSB 58, (structural coordinates corresponding to the entire S protein), along with a minimum score of 0.9, i.e., a highly strict criterion.
Q11. What is the sequence of the SARS-CoV-2?
Their search of SARS-CoV-2 sequences available at EpiCoV database in the GISAID (https://www.gisaid.org/) on March 1st retrieved only 146 SARS-CoV-2 sequences of lineages A.2.4 (n = 52), B (n = 3), B.1 (n = 7), B.1.1.7 (n = 1), B.1.177 (n = 1), B.1.2 (n = 1), B.1.214 (n = 80) and B.1.429 (n = 1) that displayed an insert motif of three to four amino acids (AKKN, KLGB, AQER, AAG, KFH, KRI, and TDR) in position 214 (Appendix Table 1).
Q12. What is the aLRT of the other sub-clade?
The other sub-clade (aLRT = 85%) was characterized by the synonymous mutations G29781A and T29834A and comprises seven sequences: the three P.1 𝚫69-70 from São Paulo state plus four P.1 sequencesfrom São Paulo, Amazonas and Tocantins states.
Q13. What is the significance of the lack of monophyletic clustering of P.1 ?
The lack of monophyletic clustering of P.1 𝚫69-70 sequences from Santa Catarina, however, shouldbe interpreted with caution due to the paucity of synapomorphic mutations within diversity of lineage P.1.
Q14. What are the main characteristics of the SARS-CoV-2 genome?
Keywords: COVID-19, pandemics, antibody escape, coronavirus, communitary transmissionRecurrent deletions in the amino (N)-terminal domain (NTD) of the spike (S)glycoprotein of SARS-CoV-2 have been identified during long-term infection of immunocompromised patients 1–4 as well as during extended human-to-human transmission 3.
Q15. What is the hypothesis that drives the evolution of SARS-CoV-2?
One hypothesis is that such a major selection pressure shift on the virus genome is driven by the increasing worldwide human population immunity acquired from natural SARS-CoV-2 infection that might also select for convergent deletions at NTD.