scispace - formally typeset
Search or ask a question
Posted ContentDOI

Emergence of a novel SARS-CoV-2 strain in Southern California, USA

20 Jan 2021-medRxiv (Cold Spring Harbor Laboratory Press)-
TL;DR: In this paper, a novel strain of SARS-CoV-2 emerging in Southern California is reported, which is defined by multiple mutations in the S protein, a characteristic shared with both the UK and South African strains, both of which are of significant clinical and scientific interest.
Abstract: Since October 2020, novel strains of SARS-CoV-2 including B.1.1.7, have been identified to be of global significance from an infection and surveillance perspective. While this strain (B.1.1.7) may play an important role in increased COVID rates in the UK, there are still no reported strains to account for the spike of cases in Los Angeles (LA) and California as a whole, which currently has some of the highest absolute and per-capita COVID transmission rates in the country. From the early days of the pandemic when LA only had a single viral genome uploaded onto GISAID we have been at the forefront of generating and analyzing the SARS-CoV-2 sequencing data from the LA region. We report a novel strain emerging in Southern California. Most current cases in the catchment population in LA fall into two distinct subclades: 1) 20G (24% of total) is the predominant subclade currently in the United States 2) a relatively novel strain in clade 20C, CAL.20C strain (∼36% of total) is defined by five concurrent mutations. After an analysis of all of the publicly available data and a comparison to our recent sequences, we see a dramatic growth in the relative percentage of the CAL.20C strain beginning in November of 2020. The predominance of this strain coincides with the increased positivity rate seen in this region. Unlike 20G, this novel strain CAL.20C is defined by multiple mutations in the S protein, a characteristic it shares with both the UK and South African strains, both of which are of significant clinical and scientific interest

Summary (2 min read)

Jump to: [Introduction:][Methods:][Results:] and [Discussion:]

Introduction:

  • The first systematic genetic analysis of SARS-CoV-2 from Southern California showed most isolates originated from clade 20C that likely emerged from New York via Europe early in the COVID-19 pandemic (1) .
  • Since October 2020, novel strains of SARS-CoV-2 including B.1.1.7(UK) and B.1.351(South Africa) have become globally important because of their local dominance in COVID-19 positive cases (2) .
  • While the B.1.1.7 strain may play an important role in increased COVID rates in the UK and Europe, there are still no reports to account for the current spike of cases in Los Angeles and California as a whole that began in early November 2020.

Methods:

  • Regulatory review was completed by the CSMC Office of Research Compliance and Quality Improvement (IRB # STUDY629).
  • In total, 192 SARS-CoV-2 positive nasopharyngeal samples collected between.
  • Sequencing libraries were made using 200 ng of cDNA and a viral respiratory panel(RVOPV2).
  • This data was combined with all publicly available sequences(N = 4,337) from Southern California(Imperial, Kern, Los Angeles, Orange, Riverside, San Bernardino, San Diego, San Luis Obispo, Santa Barbara, and Ventura Counties) and phylogenetic analysis was performed as previously reported (1) .

Results:

  • This strain, CAL.20C, was first observed in July 2020 in 1/1230 samples from LA county and not detected in Southern California again until October.
  • Since then, this strain's prevalence has increased absolutely and relatively in Southern California, where by December it accounted for 24% of all samples and 36.4% (66/181) of their local Los Angeles cohort.
  • Unlike clade 20G, which is currently the largest reported clade in North America, this clade is defined by multiple mutations in the S protein (similar to B.1.1.7/B.1.351) and represents a separate subclade of 20C .
  • Currently the CAL.20C strain is primarily found in Southern California, however the authors have detected multiple recent isolates in Northern California, New York and Washington DC.

Discussion:

  • These findings add to their current understanding of COVID-19 transmission within the US, specifically that the recent surge in COVID-19 positive cases in Southern California coincides with the emergence of a unique strain, CAL.20C.
  • Given the independent emergence of geographically isolated, clinically relevant strains such as B.1.1.7 (UK) and B.1.351 (South Africa), the CAL.20C strain may be partially responsible for the magnitude of the surge in COVID-19 on the West Coast of the US.
  • The S protein L452R mutation is within a known receptor binding domain that has been found to be markedly resistant to certain monoclonal antibodies to the spike protein (4) .
  • The functional effect of this mutation in concert with other detected mutations in CAL.20C, both in terms of infectivity and antibody resistance is unknown at this time.
  • (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Did you find this useful? Give us your feedback

Content maybe subject to copyright    Report

Emergence of a novel SARS-CoV-2 strain in Southern California, USA
Wenjuan Zhang, PhD
1
, Brian D Davis, BSc
2,3
, Stephanie S Chen, BSc
2,3
, Jorge M Sincuir
Martinez
1
, Jasmine T Plummer, PhD
2,3#¶
, Eric Vail, MD
1#¶
1
Molecular Pathology Laboratory, Department of Pathology and Laboratory Medicine,
Cedars-Sinai Medical Center, Los Angeles, CA, USA
2
Center for Bioinformatics and Functional Genomics, Department of Biomedical
Sciences, Cedars-Sinai Medical Center, Los Angeles, CA, USA
3
Applied Genomics, Computation and Translational Core, Cedars-Sinai Cancer Center,
Los Angeles, CA, USA
Corresponding authors: jasmine.plummer@cshs.org and eric.vail@cshs.org
Word count 613
Abstract:
Since October 2020, novel strains of SARS-CoV-2 including B.1.1.7, have been identified to be
of global significance from an infection and surveillance perspective. While this strain (B.1.1.7)
may play an important role in increased COVID rates in the UK, there are still no reported
strains to account for the spike of cases in Los Angeles (LA) and California as a whole, which
currently has some of the highest absolute and per-capita COVID transmission rates in the
country. From the early days of the pandemic when LA only had a single viral genome uploaded
onto GISAID we have been at the forefront of generating and analyzing the SARS-CoV-2
sequencing data from the LA region. We report a novel strain emerging in Southern California.
Most current cases in the catchment population in LA fall into two distinct subclades: 1) 20G
(24% of total) is the predominant subclade currently in the United States 2) a relatively novel
strain in clade 20C, CAL.20C strain (~36% of total) is defined by five concurrent mutations. After
an analysis of all of the publicly available data and a comparison to our recent sequences, we
see a dramatic growth in the relative percentage of the CAL.20C strain beginning in November
of 2020. The predominance of this strain coincides with the increased positivity rate seen in this
region. Unlike 20G, this novel strain CAL.20C is defined by multiple mutations in the S protein, a
characteristic it shares with both the UK and South African strains, both of which are of
significant clinical and scientific interest
All rights reserved. No reuse allowed without permission.
(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprintthis version posted January 20, 2021. ; https://doi.org/10.1101/2021.01.18.21249786doi: medRxiv preprint
NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice.

Introduction: The first systematic genetic analysis of SARS-CoV-2 from Southern California
showed most isolates originated from clade 20C that likely emerged from New York via Europe
early in the COVID-19 pandemic(1)
. Since October 2020, novel strains of SARS-CoV-2
including B.1.1.7(UK) and B.1.351(South Africa) have become globally important because of
their local dominance in COVID-19 positive cases(2)
. While the B.1.1.7 strain may play an
important role in increased COVID rates in the UK and Europe, there are still no reports to
account for the current spike of cases in Los Angeles and California as a whole that began in
early November 2020. We report the existence of a novel strain CAL.20C that is currently
increasing in numbers in Southern California.
Methods: Regulatory review was completed by the CSMC Office of Research Compliance and
Quality Improvement (IRB # STUDY629). In total, 192 SARS-CoV-2 positive nasopharyngeal
samples collected between 11/22/2020 and 12/28/2020 were processed for NGS and as per our
protocol (1)
with the following updates. Total RNA(100ng) was processed for cDNA synthesis
using the Illumina RNA preparation enrichment tagmentation kit. Sequencing libraries were
made using 200 ng of cDNA and a viral respiratory panel(RVOPV2). Samples were pooled and
sequenced on an Illumina NovaSeq(2X100 bp). All sequencing reads were mapped to SARS-
CoV-2 genome and data deposited to GISAID(3)
. This data was combined with all publicly
available sequences(N = 4,337) from Southern California(Imperial, Kern, Los Angeles, Orange,
Riverside, San Bernardino, San Diego, San Luis Obispo, Santa Barbara, and Ventura Counties)
and phylogenetic analysis was performed as previously reported(1).
Results: We detected a novel strain descended from cluster 20C and defined by five mutations
(ORF1a: I4205V, ORF1b:D1183Y, S: S13I;W152C;L452R)(Figure 1). This strain, CAL.20C, was
first observed in July 2020 in 1/1230 samples from LA county and not detected in Southern
California again until October. Since then, this strain’s prevalence has increased absolutely and
relatively in Southern California, where by December it accounted for 24% of all samples
(Figure 2A) and 36.4% (66/181) of our local Los Angeles cohort. The emerging predominance of
this strain temporally tracks to a time at or before the onset of the current spike in Southern
California(Figure 2B). Unlike clade 20G, which is currently the largest reported clade in North
America, this clade is defined by multiple mutations in the S protein (similar to
B.1.1.7/B.1.351) and represents a separate subclade of 20C (Figure 3). Currently the CAL.20C
strain is primarily found in Southern California, however we have detected multiple recent
isolates in Northern California, New York and Washington DC. Additionally, our analysis reveals
a small number of cases found outside of the US in Oceania in the past month.
Discussion:
These findings add to our current understanding of COVID-19 transmission within the US,
specifically that the recent surge in COVID-19 positive cases in Southern California coincides
with the emergence of a unique strain, CAL.20C. Given the independent emergence of
geographically isolated, clinically relevant strains such as B.1.1.7 (UK) and B.1.351 (South
Africa), the CAL.20C strain may be partially responsible for the magnitude of the surge in
COVID-19 on the West Coast of the US. The S protein L452R mutation is within a known
receptor binding domain that has been found to be markedly resistant to certain monoclonal
antibodies to the spike protein(4). Mutations in this domain may be resistant to polyclonal sera
as seen in convalescent patients or those post vaccination(5). The functional effect of this
mutation in concert with other detected mutations in CAL.20C, both in terms of infectivity and
antibody resistance is unknown at this time. The identification of this novel strain is important to
frontline and global surveillance of this constantly evolving virus and has a direct impact on
public health, especially vaccination efforts.
All rights reserved. No reuse allowed without permission.
(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprintthis version posted January 20, 2021. ; https://doi.org/10.1101/2021.01.18.21249786doi: medRxiv preprint

Figure 1. Phylogenetic classification of SARS-CoV-2 genomes.
Phylogenetic analysis using 192 Los Angeles isolates and a global subsampling from May 2020
to January 2021 reveal a novel Southern California strain CAL.20C.
Figure 2. Timeline for the emergence of a novel Southern California strain,
CAL.20C, amongst all SARS-CoV-2 strains observed.
a) Diagrammatic representation of isolates from Southern California corresponding to collection
time points spanning emergence of first isolates (3/2020) to present COVID positive cases
(12/2020).
b) 7 day average of confirmed COVID-19 cases in the Southern California region.
Figure 3. Phylogenetic tree of SARS-CoV-2 genomes.
References
1. Zhang W, Govindavari JP, Davis BD, Chen SS, Kim JT, Song J, et al. Analysis of Genomic
Characteristics and Transmission Routes of Patients With Confirmed SARS-CoV-2 in Southern
California During the Early Stage of the US COVID-19 Pandemic. JAMA Netw Open. 2020 Oct
1;3(10):e2024191.
2. Washington NL, White S, Schiabor Barrett K m, Cirulli ET, Bolze A, Lu JT. S gene dropout
patterns in SARS-CoV-2 tests suggest spread of the H69del/V70del mutation in the US.
medRxiv. 2020 Dec 30;
3. Shu Y, McCauley J. GISAID: Global initiative on sharing all influenza data - from vision to
reality. Euro Surveill. 2017 Mar 30;22(13):30494.
4. Li Q, Wu J, Nie J, Zhang L, Hao H, Liu S, Zhao C, Zhang Q, Liu H, Nie L, Qin H. The impact of
mutations in SARS-CoV-2 spike on viral infectivity and antigenicity. Cell. 2020 Sep
3;182(5):1284-94.
5. Greaney, A. J., et al. (2021). "Comprehensive mapping of mutations to the SARS-CoV-2
receptor-binding domain that affect recognition by polyclonal human serum antibodies." Biorxiv:
2020.2012.2031.425021.
All rights reserved. No reuse allowed without permission.
(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprintthis version posted January 20, 2021. ; https://doi.org/10.1101/2021.01.18.21249786doi: medRxiv preprint

CAL.20C
All rights reserved. No reuse allowed without permission.
(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprintthis version posted January 20, 2021. ; https://doi.org/10.1101/2021.01.18.21249786doi: medRxiv preprint

0%
20%
40%
60%
80%
100%
2020-03
2020-04
2020-05
2020-06
2020-07
2020-08
2020-09
2020-10
2020-11
2020-12
19A
19B
20A
20B
20C
20D
20E
20G
20I/ 501Y.V1
CAL .20C
0
10
20
30
40
50
60
3/1/2020 4/1/2020 5/1/2020 6/1/2020 7/1/2020 8/1/2020 9/1/2020 10/1/2020 11/1/2020 12/ 1/2020 1/1/2021
Thousands
All rights reserved. No reuse allowed without permission.
(which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
The copyright holder for this preprintthis version posted January 20, 2021. ; https://doi.org/10.1101/2021.01.18.21249786doi: medRxiv preprint

Citations
More filters
Journal ArticleDOI
TL;DR: In this paper, the authors show that B.1.7 remains sensitive to neutralization, albeit at moderately reduced levels (∼sim;2-fold), by serum samples from convalescent individuals and recipients of an mRNA vaccine (mRNA-1273, Moderna) and a protein nanoparticle vaccine (NVX-CoV2373, Novavax).

295 citations

Posted ContentDOI
01 Mar 2021-bioRxiv
TL;DR: In this article, the authors performed a comprehensive analysis of SARS-CoV-2-specific CD4+ and CD8+ T cell responses from COVID-19 convalescent subjects recognizing the ancestral strain.
Abstract: The emergence of SARS-CoV-2 variants highlighted the need to better understand adaptive immune responses to this virus. It is important to address whether also CD4+ and CD8+ T cell responses are affected, because of the role they play in disease resolution and modulation of COVID-19 disease severity. Here we performed a comprehensive analysis of SARS-CoV-2-specific CD4+ and CD8+ T cell responses from COVID-19 convalescent subjects recognizing the ancestral strain, compared to variant lineages B.1.1.7, B.1.351, P.1, and CAL.20C as well as recipients of the Moderna (mRNA-1273) or Pfizer/BioNTech (BNT162b2) COVID-19 vaccines. Similarly, we demonstrate that the sequences of the vast majority of SARS-CoV-2 T cell epitopes are not affected by the mutations found in the variants analyzed. Overall, the results demonstrate that CD4+ and CD8+ T cell responses in convalescent COVID-19 subjects or COVID-19 mRNA vaccinees are not substantially affected by mutations found in the SARS-CoV-2 variants.

150 citations

Journal ArticleDOI
TL;DR: As the global severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic continues to spread, several variants have emerged as discussed by the authors, including B.1.7 and B.2.351.
Abstract: As the global severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic continues to spread, several variants have emerged. Variants B.1.1.7 and B.1.351 have attracted significant atten...

129 citations

References
More filters
Journal ArticleDOI
TL;DR: This poster presents a poster presenting a poster presented at the 2016 International Conference of the Association for the Study of Viral Influenza and its Disruption in China, where it was presented for the first time.
Abstract: Yuelong Shu1, John McCauley2 1. WHO Collaborating Center for Reference and Research on Influenza, Chinese National Influenza Center, National Institute for Viral Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China 2. WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, the Francis Crick Institute, London, United Kingdom

2,306 citations

Journal ArticleDOI
03 Sep 2020-Cell
TL;DR: Most variants with amino acid change at receptor binding domain were less infectious but variants including A475V, L452R, V483A and F490L became resistant to some neutralizing antibodies, while deletion of both N331 and N343 glycosylation drastically reduced infectivity, revealing the importance of gly cosylation for viral infectivity.

1,282 citations

Journal ArticleDOI
TL;DR: In this article, the authors map how convalescent plasma antibodies are impacted by all mutations to the spike's receptor-binding domain (RBD), the main target of plasma neutralizing activity.

912 citations

Posted ContentDOI
30 Dec 2020-medRxiv
TL;DR: In this article, the authors analyzed 2 million RT-PCR SARS-CoV-2 tests performed at Helix to identify the rate of S gene dropout, which has been recently shown to occur in tests from individuals infected with strains of SARS CoV2 that carry the H69del/V70del mutation.
Abstract: Recently, multiple novel strains of SARS-CoV-2 have been found to share the same deletion of amino acids H69 and V70 in the virus S gene. This includes strain B.1.1.7 / SARS-CoV-2 VUI 202012/01, which has been found to be more infectious than other strains of SARS-CoV-2, and its increasing presence has resulted in new lockdowns in and travel restrictions leaving the UK. Here, we analyze 2 million RT-PCR SARS-CoV-2 tests performed at Helix to identify the rate of S gene dropout, which has been recently shown to occur in tests from individuals infected with strains of SARS-CoV-2 that carry the H69del/V70del mutation. We observe a rise in S gene dropout in the US starting in early October, with 0.25% of our daily SARS-CoV-2-positive tests exhibiting this pattern during the first week. The rate of positive samples with S gene dropout has grown slowly over time, with last week exhibiting the highest level yet, at 0.5%. Focusing on the 14 states for which we have sufficient sample size to assess the frequency of this rare event (n>1000 SARS-CoV-2-positive samples), we see a recent expansion in the Eastern part of the US, concentrated in MA, OH, and FL. However, we cannot say from these data whether the S gene dropout samples we observe here represent the B.1.1.7. strain. Only with an expansion of genomic surveillance sequencing in the US will we know for certain the prevalence of the B.1.1.7 strain in the US.

36 citations

Journal ArticleDOI
01 Oct 2020
TL;DR: It is suggested that SARS-CoV-2 genomes in Los Angeles were predominantly related to the isolates originating from Europe, which are similar to viral strain distributions in New York, New York; a smaller subgroup of Sars- CoV- 2 genomes shared similarities to those from originating from Asia, indicating multiple sources of viral introduction within the Los Angeles community.
Abstract: Importance In late December 2019, an outbreak of a novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in Wuhan, China. Data on the routes of transmission to Los Angeles, California, the US West Coast epicenter for coronavirus disease 2019 (COVID-19), and subsequent community spread are limited. Objective To determine the transmission routes of SARS-CoV-2 to Southern California and elucidate local community spread within the Los Angeles metropolitan area. Design, Setting, and Participants This case series included 192 consecutive patients with reverse transcription–polymerase chain reaction (RT-PCR) test results positive for SARS-CoV-2 who were evaluated at Cedars-Sinai Medical Center in Los Angeles, California, from March 22 to April 15, 2020. Data analysis was performed from April to May 2020. Main Outcomes and Measures SARS-CoV-2 viral genomes were sequenced. Los Angeles isolates were compared with genomes from global subsampling and from New York, New York; Washington state; and China to determine potential sources of viral dissemination. Demographic data and outcomes were collected. Results The cohort included 192 patients (median [interquartile range] age, 59.5 [43-75] years; 110 [57.3%] men). The genetic characterization of SARS-CoV-2 isolates in the Los Angeles population pinpointed community transmission of 13 patients within a 3.81 km2radius. Variation landscapes of this case series also revealed a cluster of 10 patients that contained 5 residents at a skilled nursing facility, 1 resident of a nearby skilled nursing facility, 3 health care workers, and a family member of a resident of one of the skilled nursing facilities. Person-to-person transmission was detected in a cluster of 5 patients who shared the same single-nucleotide variation in their SARS-CoV-2 genomes. High viral genomic diversity was identified: 20 Los Angeles isolates (15.0%) resembled SARS-CoV-2 genomes from Asia, while 109 Los Angeles isolates (82.0%) were similar to isolates originating from Europe. Analysis of other common respiratory viral pathogens did not reveal coinfection in the cohort. Conclusions and Relevance These findings highlight the precision of detecting person-to-person transmission and accurate contact tracing directly through SARS-CoV-2 genome isolation and sequencing. Development and application of phylogenetic analyses from the Los Angeles population established connections between COVID-19 clusters locally and throughout the US.

27 citations

Related Papers (5)
Frequently Asked Questions (7)
Q1. What are the contributions in "Emergence of a novel sars-cov-2 strain in southern california, usa" ?

While this strain ( B. 1. 1. 7 ) may play an important role in increased COVID rates in the UK, there are still no reported strains to account for the spike of cases in Los Angeles ( LA ) and California as a whole, which currently has some of the highest absolute and per-capita COVID transmission rates in the country. The authors report a novel strain emerging in Southern California. ( which was not certified by peer review ) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice. While the B. 1. 1. 7 strain may play an important role in increased COVID rates in the UK and Europe, there are still no reports to account for the current spike of cases in Los Angeles and California as a whole that began in early November 2020. The authors report the existence of a novel strain CAL. 11/22/2020 and 12/28/2020 were processed for NGS and as per their protocol ( 1 ) with the following updates. This data was combined with all publicly available sequences ( N = 4,337 ) from Southern California ( Imperial, Kern, Los Angeles, Orange, Riverside, San Bernardino, San Diego, San Luis Obispo, Santa Barbara, and Ventura Counties ) and phylogenetic analysis was performed as previously reported ( 1 ). Unlike clade 20G, which is currently the largest reported clade in North America, this clade is defined by multiple mutations in the S protein ( similar to B. 1. ( which was not certified by peer review ) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. 

The S protein L452R mutation is within a known receptor binding domain that has been found to be markedly resistant to certain monoclonal antibodies to the spike protein(4). 

Since October 2020, novel strains of SARS-CoV-2 including B.1.1.7, have been identified to be of global significance from an infection and surveillance perspective. 

Since October 2020, novel strains of SARS-CoV-2 including B.1.1.7(UK) and B.1.351(South Africa) have become globally important because of their local dominance in COVID-19 positive cases(2). 

The first systematic genetic analysis of SARS-CoV-2 from Southern California showed most isolates originated from clade 20C that likely emerged from New York via Europe early in the COVID-19 pandemic(1). 

Unlike clade 20G, which is currently the largest reported clade in North America, this clade is defined by multiple mutations in the S protein (similar to B.1.1.7/B.1.351) and represents a separate subclade of 20C (Figure 3). 

Currently the CAL.20C strain is primarily found in Southern California, however the authors have detected multiple recent isolates in Northern California, New York and Washington DC.