Altered Sub-Genomic RNA Expression in SARS-CoV-2 B.1.1.7 Infections
Summary (1 min read)
Main Text
- Reasons for the potential viral load increase and enhanced mortality are currently unclear ( 9 ) .
- It is possible that the increased GAT>CTA, resulting in an amino acid substitution, D3L, in the nucleocapsid protein.
- The authors propose that this represents the ORF9b sgRNA, which was detected at low levels in 13 3A&C ).
- Finally the authors believe that sgRNA expression analysis should be carried out on all compatible genomic surveillance platforms to give an instant readout of altered expression profiles in emerging SARS-CoV-2 variants.
Methods
- Testing SARS-CoV-2 positivity was determined from nose/throat swabs diagnostically by Sheffield Teaching Hospitals NHS Foundation trust either using the Hologic Panther to generate Relative Light Units (RLU) ( 26 ) or an in house dual E/RdRp real time PCR assay to generate a cycle threshold (ECT or RCT respectively) ( 16) .
- Sample Preparation, ARTIC Network PCR and Nanopore Sequencing RNA was extracted from viral transport medium (VTM) with Qiagen QIAamp MinElute Virus Spin Kit (50).
- Resultant RNA was subject to the ARTIC network ( 10) tiled amplicon protocol and subsequently sequenced on an Oxford Nanopore GridION X5.
- Bases were called with the default basecaller in MinKNOW (currently guppy v4) with --require-both-ends set for de-multiplexing.
Subgenomic Read Classification and Normalisation
- Briefly, reads containing the leader sequence at their start are identified by a local alignment, the quality of this alignment determines which quality "bin" sgRNA reads are placed (HQ, LQ or LLQ).
- The amplicon from which the sgRNA evidence was generated is determined and a count of genomic reads for this amplicon used to normalise the raw sgRNA read counts.
- Samples were excluded from the subsequent analysis if: Their consensus coverage was < 0.9 .
- They had less than 50,000 mapped reads (the authors have previously shown that fewer reads produce a less robust analysis).
Phylogenetic Tree Generation
- The grapevine pipeline ( https://github.com/COG-UK/grapevine ) was used for generating the phylogeny based on all data available on GISAID and COG-UK up until 16th February 2021.
- First by randomly selecting one sequence per country per epi week followed by random sampling of the remaining sequences to generate a sample of 4000 sequences.
- The global tree was then pruned using code adapted from the tree-manip package ( https://github.com/josephhughes/tree-manip ).
- The authors then identified samples with D3L mutations and colour coded these tips according to their lineages.
- The visualisation was produced using R/ape, R/ggplot2, R/ggtree, R/treeio, R/phangorn, R/stringr, R/dplyr, R/aplot.
Did you find this useful? Give us your feedback
Citations
593 citations
85 citations
56 citations
31 citations
29 citations
References
10,798 citations
7,298 citations
2,093 citations
1,626 citations
593 citations
Related Papers (5)
Frequently Asked Questions (6)
Q2. Why do noncanonical sgRNA tend to be enriched around canonical?
These noncanonical sgRNA tend to be enriched around canonical sites, presumably due to the frequency of RdRp template switching that occurs in close proximity.
Q3. What could have led to the GAT > CTA mutation in the genome?
This could have led to a transcriptionally driven recombination event resulting in the GAT > CTA mutation in the genomic sequence (Figure 3E panel 2&3).
Q4. who has a license to display the preprint in perpetuity?
4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
Q5. who has a license to display the preprint?
This demonstrates the importance of stratifying sequences by lineage when studying sgRNA..CC-BY-NC-ND 4.0 International licensemade available under a (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity.
Q6. What is the sgRNA that is upstream of the ORF9b ATG?
Of note, this site is upstream of the ORF9b ATG and this noncanonical sgRNA retains the full coding region of the ORF9b, but lacks the canonical start codon of N.