Sequence Analysis of 20,453 SARS-CoV-2 Genomes from the Houston
1
Metropolitan Area Identifies the Emergence and Widespread
2
Distributi on of Multiple Isol ates of All Major Variant s of Concern
3
4
S. Wesley Long,
a,b,1
Randall J. Olsen,
a,b,1
Paul A. Christensen,
a
Sishir Subedi,
a
Robert Olson,
c,d
5
James J. Davis,
c,d
Matthew Ojeda Saavedra,
a
Prasanti Yerramilli,
a
Layne Pruitt,
a
Kristina 6
Reppond,
a
Madison N. Shyer,
a
Jessica Cambric,
a
Ilya J. Finkelstein,
e
Jimmy Gollihar,
a,f
and 7
James M. Musser
a,b#
8
9
From the
a
Center for Molecular and Translational Human Infectious Diseases Research,
10
Department of Pathology and Genomic Medicine, Houston Methodist Research Institute and
11
Houston Methodist Hospital, 6565 Fannin Street, Houston, Texas, 77030
12
b
Departments of Pathology and Laboratory Medicine, and Microbiology and Immunology, Weill
13
Cornell Medical College, 1300 York Avenue, New York, New York, 10065
14
c
Consortium for Advanced Science and Engineering, 22 University of Chicago, 5801 South Ellis
15
Avenue, Chicago, Illinois, 60637
16
d
Computing, Environment and Life Sciences, Argonne National Laboratory, 9700 South Cass
17
Avenue, Lemont, Illinois, 60439
18
e
Department of Molecular Biosciences and Institute of Molecular Biosciences, The University of
19
Texas at Austin, Austin, Texas 78712
20
f
CCDC Army Research Laboratory-South, University of Texas, Austin, Texas 78712
21
. CC-BY-NC-ND 4.0 International licenseIt is made available under a
is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)
The copyright holder for this preprint this version posted March 2, 2021. ; https://doi.org/10.1101/2021.02.26.21252227doi: medRxiv preprint
NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice.
22
Number of text pages: 12 23
Number of tables: 1 24
Number of figures: 2 25
Running head (40 characters or less): SARS-CoV-2 variants of concern in Houston, TX 26
27
# Address correspondence to James M. Musser, M.D., Ph.D., Department of Pathology and 28
Genomic Medicine, Houston Methodist Research Institute, 6565 Fannin Street, Suite B490, 29
Houston, Texas 77030. Tel: 713.441.5 890, E- mail: jmmusser@houstonmethodist.org. 30
31
Disclosures: None. 32
33
34
35
36
37
38
39
40
41
42
43
. CC-BY-NC-ND 4.0 International licenseIt is made available under a
is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)
The copyright holder for this preprint this version posted March 2, 2021. ; https://doi.org/10.1101/2021.02.26.21252227doi: medRxiv preprint
[Abstract (220 words)] 44
Since the beginning of the SARS-CoV-2 pandemic, there has been international concern about 45
the emergence of virus variants with mutations that increase transmissibility, enhance escape 46
from the human immune response, or otherwise alter biologically important phenotypes. In 47
late 2020, several “variants of concern” emerged globally, including the UK variant (B.1.1.7), 48
South Africa variant (B.1.351), Brazil variants (P.1 and P.2), and two related California “variants 49
of interest” (B.1.429 and B.1.427). These variants are believed to have enhanced 50
transmissibility capacity. For the South Africa and Brazil variants, there is evidence that 51
mutations in spike protein permit it to escape from some vaccines and therapeutic monoclonal 52
antibodies. Based on our extensive genome sequencing program involving 20,453 virus 53
specimens from COVID-19 patients dating from March 2020, we report identification of all 54
important SARS-CoV-2 variants among Houston Methodist Hospital patients residing in the 55
greater metropolitan area. Although these variants are currently at relatively low frequency in 56
the population, they are geographically widespread. Houston is the first city in the United 57
States to have all variants documented by genome sequencing. As vaccine deployment 58
accelerates worldwide, increased genomic surveillance of SARS-CoV-2 is essential to 59
understanding the presence and frequency of consequential variants and their patterns and 60
trajectory of dissemination. This information is critical for medical and public health efforts to 61
effectively address and mitigate this global crisis. 62
63
64
. CC-BY-NC-ND 4.0 International licenseIt is made available under a
is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)
The copyright holder for this preprint this version posted March 2, 2021. ; https://doi.org/10.1101/2021.02.26.21252227doi: medRxiv preprint
[Introduction] 65
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the causative agent of 66
coronavirus disease 2019 (COVID-19). Since first being identified in December 2019,
1-4
the virus 67
has spread globally and is responsible for massive human morbidity and mortality worldwide.
5-9
68
At the onset of the pandemic, effective treatments for COVID-19 were lacking. But as a result of 69
intense global research efforts, monoclonal antibody (mAbs) therapies
10, 11
and several 70
vaccines,
12, 13
primarily directed against the spike protein, have been developed to treat and 71
prevent SARS-CoV-2 infection. 72
In late 2020 the international research community described several SARS-CoV-2 73
“variants of concern” that warranted special scrutiny. These include the United Kingdom (UK) 74
variant (B.1.1.7), South Africa variant (B.1.351), Brazil variants (P.1 and P.2) and two California 75
variants (B.1.429/CAL.20C and B.1.427/CAL.20C).
14-22
These virus variants were designated as 76
“concerning” predominantly due to their reported enhanced person-to-person transmission in 77
some geographic areas, and they have since been detected in several countries worldwide. For 78
example, the UK B.1.1.7 variant spread rapidly in southeast England where it caused large 79
numbers of COVID-19 cases,
14
and was identified shortly thereafter in the United States (US) 80
[CDC; https://www.cdc.gov/coronavirus/2019-ncov/transmission/variant.html].
23
More than 81
1,600 cases have since been documented in the US, and at least one large outbreak recently 82
was reported in a Michigan prison.
24, 25
There is concern at the Centers for Disease Control and 83
Prevention (CDC) that it could become the dominant variant causing disease in the US by 84
March.
23, 24, 26
Moreover, the UK B.1.1.7 variant may be linked to an increased death rate 85
compared to other virus types, adding further concern.
18, 21, 27, 28
86
. CC-BY-NC-ND 4.0 International licenseIt is made available under a
is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)
The copyright holder for this preprint this version posted March 2, 2021. ; https://doi.org/10.1101/2021.02.26.21252227doi: medRxiv preprint
Similarly, the South Africa and Brazil variants caused large disease outbreaks in their 87
respective countries.
19, 20
These variants also are of concern because they contain a mutation 88
(E484K) in the spike protein that decreases efficacy of some therapeutic mAbs, decreases
in
89
vitr
o virus neutralization, and may result in potential escape from immunity induced by natural 90
infection or vaccination.
29-37
All three variants (UK B.1.1.7, Brazil P.1, and South Africa B.1.351) 91
also have a N501Y mutation in spike protein that is associated with stronger binding to the 92
ACE2 receptor, possibly contributing to increased transmissibility.
38,39
93
The Houston metropolitan area is the fourth largest and most ethnically diverse city in 94
the US, with a population of approximately 7 million (https://www.houston.org/houston-
95
data
).
40
The 2,400-bed Houston Methodist health system has eight hospitals and cares for a 96
large, multiethnic, and geographically and socioeconomically diverse patient population 97
throughout greater Houston. The eight Houston Methodist hospitals have a single central 98
molecular diagnostic laboratory, which means that all RT-PCR-specimens can readily be 99
identified, banked, and subjected to further study as needed. In addition, the Department of 100
Pathology and Genomic Medicine has a long-standing record of integrating genome sequencing 101
efforts into clinical care and research, especially related to microbial pathogens infecting our 102
patients.
41-49
In the aggregate, strategic co-localization of these diagnostic attributes coupled 103
with a contiguous research institute building seamlessly facilitates comprehensive population 104
genomic studies of SARS-CoV-2 viruses causing infections in the Houston metropolitan region. 105
46, 49
106
Before the SARS-CoV-2 virus arrived in Houston, we planned an integrated strategy to 107
confront and mitigate this microbial threat to our patients. In addition to rapidly validating an 108
. CC-BY-NC-ND 4.0 International licenseIt is made available under a
is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review)
The copyright holder for this preprint this version posted March 2, 2021. ; https://doi.org/10.1101/2021.02.26.21252227doi: medRxiv preprint