scispace - formally typeset
Search or ask a question
Author

Zachary H. Strasser

Bio: Zachary H. Strasser is an academic researcher from Harvard University. The author has contributed to research in topics: Medicine & Coronavirus disease 2019 (COVID-19). The author has an hindex of 4, co-authored 8 publications receiving 65 citations.

Papers
More filters
Journal ArticleDOI
04 Feb 2021
TL;DR: In this paper, the authors used a combination of computational methods and clinical expertise to predict death after COVID-19 using only the past medical information collected in electronic health records (EHRs) and understand the differences in risk factors across age groups.
Abstract: This study aims to predict death after COVID-19 using only the past medical information routinely collected in electronic health records (EHRs) and to understand the differences in risk factors across age groups. Combining computational methods and clinical expertise, we curated clusters that represent 46 clinical conditions as potential risk factors for death after a COVID-19 infection. We trained age-stratified generalized linear models (GLMs) with component-wise gradient boosting to predict the probability of death based on what we know from the patients before they contracted the virus. Despite only relying on previously documented demographics and comorbidities, our models demonstrated similar performance to other prognostic models that require an assortment of symptoms, laboratory values, and images at the time of diagnosis or during the course of the illness. In general, we found age as the most important predictor of mortality in COVID-19 patients. A history of pneumonia, which is rarely asked in typical epidemiology studies, was one of the most important risk factors for predicting COVID-19 mortality. A history of diabetes with complications and cancer (breast and prostate) were notable risk factors for patients between the ages of 45 and 65 years. In patients aged 65-85 years, diseases that affect the pulmonary system, including interstitial lung disease, chronic obstructive pulmonary disease, lung cancer, and a smoking history, were important for predicting mortality. The ability to compute precise individual-level risk scores exclusively based on the EHR is crucial for effectively allocating and distributing resources, such as prioritizing vaccination among the general population.

74 citations

Journal ArticleDOI
TL;DR: In this paper, the authors applied a computational framework for knowledge discovery from clinical data, MLHO, to identify phenotypes that positively associate with a past positive reverse transcription-polymerase chain reaction (RT-PCR) test for COVID-19.
Abstract: For some SARS-CoV-2 survivors, recovery from the acute phase of the infection has been grueling with lingering effects. Many of the symptoms characterized as the post-acute sequelae of COVID-19 (PASC) could have multiple causes or are similarly seen in non-COVID patients. Accurate identification of PASC phenotypes will be important to guide future research and help the healthcare system focus its efforts and resources on adequately controlled age- and gender-specific sequelae of a COVID-19 infection. In this retrospective electronic health record (EHR) cohort study, we applied a computational framework for knowledge discovery from clinical data, MLHO, to identify phenotypes that positively associate with a past positive reverse transcription-polymerase chain reaction (RT-PCR) test for COVID-19. We evaluated the post-test phenotypes in two temporal windows at 3–6 and 6–9 months after the test and by age and gender. Data from longitudinal diagnosis records stored in EHRs from Mass General Brigham in the Boston Metropolitan Area was used for the analyses. Statistical analyses were performed on data from March 2020 to June 2021. Study participants included over 96 thousand patients who had tested positive or negative for COVID-19 and were not hospitalized. We identified 33 phenotypes among different age/gender cohorts or time windows that were positively associated with past SARS-CoV-2 infection. All identified phenotypes were newly recorded in patients’ medical records 2 months or longer after a COVID-19 RT-PCR test in non-hospitalized patients regardless of the test result. Among these phenotypes, a new diagnosis record for anosmia and dysgeusia (OR 2.60, 95% CI [1.94–3.46]), alopecia (OR 3.09, 95% CI [2.53–3.76]), chest pain (OR 1.27, 95% CI [1.09–1.48]), chronic fatigue syndrome (OR 2.60, 95% CI [1.22–2.10]), shortness of breath (OR 1.41, 95% CI [1.22–1.64]), pneumonia (OR 1.66, 95% CI [1.28–2.16]), and type 2 diabetes mellitus (OR 1.41, 95% CI [1.22–1.64]) is one of the most significant indicators of a past COVID-19 infection. Additionally, more new phenotypes were found with increased confidence among the cohorts who were younger than 65. The findings of this study confirm many of the post-COVID-19 symptoms and suggest that a variety of new diagnoses, including new diabetes mellitus and neurological disorder diagnoses, are more common among those with a history of COVID-19 than those without the infection. Additionally, more than 63% of PASC phenotypes were observed in patients under 65 years of age, pointing out the importance of vaccination to minimize the risk of debilitating post-acute sequelae of COVID-19 among younger adults.

68 citations

Posted ContentDOI
10 Jul 2021-medRxiv
TL;DR: In this paper, the authors applied a computational framework for knowledge discovery from clinical data, MLHO, to identify phenotypes that positively associate with a past positive reverse transcription-polymerase chain reaction (RT-PCR) test for COVID-19.
Abstract: For some SARS-CoV-2 survivors, recovery from the acute phase of the infection has been grueling with lingering effects. Many of the symptoms characterized as the post-acute sequelae of COVID-19 (PASC) could have multiple causes or are similarly seen in non-COVID patients. Accurate identification of phenotypes will be important to guide future research and help the healthcare system focus its efforts and resources on adequately controlled age- and gender-specific sequelae of a COVID-19 infection. In this retrospective electronic health records (EHR) cohort study, we applied a computational framework for knowledge discovery from clinical data, MLHO, to identify phenotypes that positively associate with a past positive reverse transcription-polymerase chain reaction (RT-PCR) test for COVID-19. We evaluated the post-test phenotypes in two temporal windows at 3-6 and 6-9 months after the test and by age and gender. Data from longitudinal diagnosis records stored in EHRs from Mass General Brigham in the Boston metropolitan area was used for the analyses. Statistical analyses were performed on data from March 2020 to June 2021. Study participants included over 96 thousand patients who had tested positive or negative for COVID-19 and were not hospitalized. We identified 33 phenotypes among different age/gender cohorts or time windows that were positively associated with past SARS-CoV-2 infection. All identified phenotypes were newly recorded in patients’ medical records two months or longer after a COVID-19 RT-PCR test in non-hospitalized patients regardless of the test result. Among these phenotypes, a new diagnosis record for anosmia and dysgeusia (OR: 2.60, 95% CI [1.94 - 3.46]), alopecia (OR: 3.09, 95% CI [2.53 - 3.76]), chest pain (OR: 1.27, 95% CI [1.09 - 1.48]), chronic fatigue syndrome (OR 2.60, 95% CI [1.22-2.10]), shortness of breath (OR 1.41, 95% CI [1.22 - 1.64]), pneumonia (OR 1.66, 95% CI [1.28 - 2.16]), and type 2 diabetes mellitus (OR 1.41, 95% CI [1.22 - 1.64]) are some of the most significant indicators of a past COVID-19 infection. Additionally, more new phenotypes were found with increased confidence among the cohorts who were younger than 65. Our approach avoids a flood of false positive discoveries while offering a more robust probabilistic approach compared to the standard linear phenome-wide association study (PheWAS). The findings of this study confirm many of the post-COVID symptoms and suggest that a variety of new diagnoses, including new diabetes mellitus and neurological disorder diagnoses, are more common among those with a history of COVID-19 than those without the infection. Additionally, more than 63 percent of PASC phenotypes were observed in patients under 65 years of age, pointing out the importance of vaccination to minimize the risk of debilitating post-acute sequelae of COVID-19 among younger adults.

49 citations

Journal ArticleDOI
TL;DR: The results demonstrated that while demographic variables (namely age) are important predictors of adverse outcomes after a COVID-19 infection, the incorporation of the past clinical records are vital for a reliable prediction model.
Abstract: We developed MLHO (pronounced as melo), an end-to-end Machine Learning framework that leverages iterative feature and algorithm selection to predict Health Outcomes. MLHO implements iterative sequential representation mining, and feature and model selection, for predicting the patient-level risk of hospitalization, ICU admission, need for mechanical ventilation, and death. It bases this prediction on data from patients' past medical records (before their COVID-19 infection). MLHO's architecture enables a parallel and outcome-oriented model calibration, in which different statistical learning algorithms and vectors of features are simultaneously tested to improve the prediction of health outcomes. Using clinical and demographic data from a large cohort of over 13,000 COVID-19-positive patients, we modeled the four adverse outcomes utilizing about 600 features representing patients' pre-COVID health records and demographics. The mean AUC ROC for mortality prediction was 0.91, while the prediction performance ranged between 0.80 and 0.81 for the ICU, hospitalization, and ventilation. We broadly describe the clusters of features that were utilized in modeling and their relative influence for predicting each outcome. Our results demonstrated that while demographic variables (namely age) are important predictors of adverse outcomes after a COVID-19 infection, the incorporation of the past clinical records are vital for a reliable prediction model. As the COVID-19 pandemic unfolds around the world, adaptable and interpretable machine learning frameworks (like MLHO) are crucial to improve our readiness for confronting the potential future waves of COVID-19, as well as other novel infectious diseases that may emerge.

30 citations

Journal ArticleDOI
TL;DR: To quantify the frequency of incidental hospitalizations over the first 15 months of the pandemic in multiple hospital systems in the United States and apply electronic phenotyping techniques to automatically improve COVID-19 hospitalization classification, a large proportion of SARS-CoV-2 PCR-positive admissions were incidental.
Abstract: Background Admissions are generally classified as COVID-19 hospitalizations if the patient has a positive SARS-CoV-2 polymerase chain reaction (PCR) test. However, because 35% of SARS-CoV-2 infections are asymptomatic, patients admitted for unrelated indications with an incidentally positive test could be misclassified as a COVID-19 hospitalization. Electronic health record (EHR)–based studies have been unable to distinguish between a hospitalization specifically for COVID-19 versus an incidental SARS-CoV-2 hospitalization. Although the need to improve classification of COVID-19 versus incidental SARS-CoV-2 is well understood, the magnitude of the problems has only been characterized in small, single-center studies. Furthermore, there have been no peer-reviewed studies evaluating methods for improving classification. Objective The aims of this study are to, first, quantify the frequency of incidental hospitalizations over the first 15 months of the pandemic in multiple hospital systems in the United States and, second, to apply electronic phenotyping techniques to automatically improve COVID-19 hospitalization classification. Methods From a retrospective EHR-based cohort in 4 US health care systems in Massachusetts, Pennsylvania, and Illinois, a random sample of 1123 SARS-CoV-2 PCR-positive patients hospitalized from March 2020 to August 2021 was manually chart-reviewed and classified as “admitted with COVID-19” (incidental) versus specifically admitted for COVID-19 (“for COVID-19”). EHR-based phenotyping was used to find feature sets to filter out incidental admissions. Results EHR-based phenotyped feature sets filtered out incidental admissions, which occurred in an average of 26% of hospitalizations (although this varied widely over time, from 0% to 75%). The top site-specific feature sets had 79%-99% specificity with 62%-75% sensitivity, while the best-performing across-site feature sets had 71%-94% specificity with 69%-81% sensitivity. Conclusions A large proportion of SARS-CoV-2 PCR-positive admissions were incidental. Straightforward EHR-based phenotypes differentiated admissions, which is important to assure accurate public health reporting and research.

29 citations


Cited by
More filters
Book ChapterDOI
01 Jan 2010

5,842 citations

01 Jan 2020
TL;DR: Prolonged viral shedding provides the rationale for a strategy of isolation of infected patients and optimal antiviral interventions in the future.
Abstract: Summary Background Since December, 2019, Wuhan, China, has experienced an outbreak of coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Epidemiological and clinical characteristics of patients with COVID-19 have been reported but risk factors for mortality and a detailed clinical course of illness, including viral shedding, have not been well described. Methods In this retrospective, multicentre cohort study, we included all adult inpatients (≥18 years old) with laboratory-confirmed COVID-19 from Jinyintan Hospital and Wuhan Pulmonary Hospital (Wuhan, China) who had been discharged or had died by Jan 31, 2020. Demographic, clinical, treatment, and laboratory data, including serial samples for viral RNA detection, were extracted from electronic medical records and compared between survivors and non-survivors. We used univariable and multivariable logistic regression methods to explore the risk factors associated with in-hospital death. Findings 191 patients (135 from Jinyintan Hospital and 56 from Wuhan Pulmonary Hospital) were included in this study, of whom 137 were discharged and 54 died in hospital. 91 (48%) patients had a comorbidity, with hypertension being the most common (58 [30%] patients), followed by diabetes (36 [19%] patients) and coronary heart disease (15 [8%] patients). Multivariable regression showed increasing odds of in-hospital death associated with older age (odds ratio 1·10, 95% CI 1·03–1·17, per year increase; p=0·0043), higher Sequential Organ Failure Assessment (SOFA) score (5·65, 2·61–12·23; p Interpretation The potential risk factors of older age, high SOFA score, and d-dimer greater than 1 μg/mL could help clinicians to identify patients with poor prognosis at an early stage. Prolonged viral shedding provides the rationale for a strategy of isolation of infected patients and optimal antiviral interventions in the future. Funding Chinese Academy of Medical Sciences Innovation Fund for Medical Sciences; National Science Grant for Distinguished Young Scholars; National Key Research and Development Program of China; The Beijing Science and Technology Project; and Major Projects of National Science and Technology on New Drug Creation and Development.

4,408 citations

Journal ArticleDOI
TL;DR: In this article , the authors show that Omicron variant infections were associated with substantially reduced risk of progression to severe clinical outcomes relative to time-matched Delta (B.1.2) variant infections within a large, integrated healthcare system in Southern California.
Abstract: Epidemiologic surveillance has revealed decoupling of Coronavirus Disease 2019 (COVID-19) hospitalizations and deaths from case counts after emergence of the Omicron (B.1.1.529) severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variant globally. However, assessment of the relative severity of Omicron variant infections presents challenges because of differential acquired immune protection against Omicron and prior variants and because longer-term changes have occurred in testing and healthcare practices. Here we show that Omicron variant infections were associated with substantially reduced risk of progression to severe clinical outcomes relative to time-matched Delta (B.1.617.2) variant infections within a large, integrated healthcare system in Southern California. Adjusted hazard ratios (aHRs) for any hospital admission, symptomatic hospital admission, intensive care unit admission, mechanical ventilation and death comparing individuals with Omicron versus Delta variant infection were 0.59 (95% confidence interval: 0.51–0.69), 0.59 (0.51–0.68), 0.50 (0.29–0.87), 0.36 (0.18–0.72) and 0.21 (0.10–0.44), respectively. This reduced severity could not be explained by differential history of prior infection among individuals with Omicron or Delta variant infection and was starkest among individuals not previously vaccinated against COVID-19 (aHR = 0.40 (0.33–0.49) for any hospital admission and 0.14 (0.07–0.28) for death). Infections with the Omicron BA.2 subvariant were not associated with differential risk of severe outcomes in comparison to BA.1/BA.1.1 subvariant infections. Lower risk of severe clinical outcomes among individuals with Omicron variant infection should inform public health response amid establishment of the Omicron variant as the dominant SARS-CoV-2 lineage globally. Comparison of outcomes of SARS-CoV-2 Delta and Omicron infections shows reduced severity of Omicron infections, most notably in unvaccinated individuals, and no differential risk of severe outcomes between subvariants BA.1 and BA.2. The findings highlight the importance of continual assessment of clinical outcomes associated with SARS-CoV-2 variants of concern to inform both medical interventions and healthcare resource management.

203 citations

Journal ArticleDOI
TL;DR: In this article , a systematic search of 1458 articles, 18 studies, encompassing a total of 10,530 patients, were analyzed to determine the prevalence of neurological and neuropsychiatric symptoms reported 12 weeks (3 months) or more after acute COVID-19 onset in adults.

200 citations

Journal ArticleDOI
Ittai Dayan1, Holger R. Roth2, Aoxiao Zhong1, Ahmed Harouni2, Amilcare Gentili, Anas Z. Abidin2, Andrew Liu2, Anthony Costa3, Bradford J. Wood4, Chien-Sung Tsai5, Chih-Hung Wang5, Chun-Nan Hsu6, C. K. Lee2, Peiying Ruan2, Daguang Xu2, Dufan Wu1, Eddie Huang2, Felipe Kitamura7, Griffin Lacey2, Gustavo César de Antônio Corradi7, Gustavo Nino, Hao-Hsin Shin8, Hirofumi Obinata, Hui Ren1, Jason C. Crane9, Jesse Tetreault2, Jiahui Guan2, John Garrett10, Joshua D. Kaggie11, Jung Gil Park12, Keith J. Dreyer1, Krishna Juluru8, Kristopher Kersten2, Marcio Aloisio Bezerra Cavalcanti Rockenbach, Marius George Linguraru4, Marius George Linguraru13, Masoom A. Haider14, Masoom A. Haider15, Meena AbdelMaseeh15, Nicola Rieke2, Pablo F. Damasceno9, Pedro Mário Cruz e Silva2, Pochuan Wang16, Sheng Xu4, Shuichi Kawano, Sira Sriswasdi17, Soo-Young Park18, Thomas M. Grist10, Varun Buch, Watsamon Jantarabenjakul19, Watsamon Jantarabenjakul17, Weichung Wang16, Won Young Tak18, Xiang Li1, Xihong Lin1, Young Joon Kwon3, Abood Quraini2, Andrew Feng2, Andrew N. Priest11, Baris Turkbey4, Benjamin S. Glicksberg3, Bernardo Bizzo, Byung Seok Kim20, Carlos Tor-Díez4, Chia-Cheng Lee5, Chia-Jung Hsu5, Chin Lin5, Chiu-Ling Lai, Christopher P. Hess9, Colin B. Compas2, Deepeksha Bhatia2, Eric K. Oermann, Evan Leibovitz, Hisashi Sasaki, Hitoshi Mori, Isaac Yang2, Jae Ho Sohn9, Krishna Nand Keshava Murthy8, Li-Chen Fu16, Matheus Ribeiro Furtado de Mendonça7, Mike Fralick, Min Kyu Kang12, Mohammad Adil2, Natalie Gangai8, Peerapon Vateekul17, Pierre Elnajjar8, Sarah E Hickman11, Sharmila Majumdar9, Shelley McLeod14, Sheridan Reed4, Stefan Gräf11, Stephanie Harmon4, Tatsuya Kodama, Thanyawee Puthanakit19, Thanyawee Puthanakit17, Tony Mazzulli21, Tony Mazzulli14, Vitor Lavor7, Yothin Rakvongthai17, Yu Rim Lee18, Yuhong Wen2, Fiona J. Gilbert11, Mona Flores2, Quanzheng Li1 
TL;DR: In this article, the authors used federated learning to predict future oxygen requirements of symptomatic patients with COVID-19 using inputs of vital signs, laboratory data and chest X-rays.
Abstract: Federated learning (FL) is a method used for training artificial intelligence models with data from multiple sources while maintaining data anonymity, thus removing many barriers to data sharing. Here we used data from 20 institutes across the globe to train a FL model, called EXAM (electronic medical record (EMR) chest X-ray AI model), that predicts the future oxygen requirements of symptomatic patients with COVID-19 using inputs of vital signs, laboratory data and chest X-rays. EXAM achieved an average area under the curve (AUC) >0.92 for predicting outcomes at 24 and 72 h from the time of initial presentation to the emergency room, and it provided 16% improvement in average AUC measured across all participating sites and an average increase in generalizability of 38% when compared with models trained at a single site using that site’s data. For prediction of mechanical ventilation treatment or death at 24 h at the largest independent test site, EXAM achieved a sensitivity of 0.950 and specificity of 0.882. In this study, FL facilitated rapid data science collaboration without data exchange and generated a model that generalized across heterogeneous, unharmonized datasets for prediction of clinical outcomes in patients with COVID-19, setting the stage for the broader use of FL in healthcare. Federated learning, a method for training artificial intelligence algorithms that protects data privacy, was used to predict future oxygen requirements of symptomatic patients with COVID-19 using data from 20 different institutes across the globe.

162 citations