Author

Claire Duvallet

Bio: Claire Duvallet is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research on topics including Population and Wastewater. The author has an h-index of 13 and has co-authored 45 publications receiving 5678 citations.

Papers
Journal ArticleDOI
Evan Bolyen1, Jai Ram Rideout1, Matthew R. Dillon1, Nicholas A. Bokulich1, Christian C. Abnet2, Gabriel A. Al-Ghalith3, Harriet Alexander4, Harriet Alexander5, Eric J. Alm6, Manimozhiyan Arumugam7, Francesco Asnicar8, Yang Bai9, Jordan E. Bisanz10, Kyle Bittinger11, Asker Daniel Brejnrod7, Colin J. Brislawn12, C. Titus Brown5, Benjamin J. Callahan13, Andrés Mauricio Caraballo-Rodríguez14, John Chase1, Emily K. Cope1, Ricardo Silva14, Christian Diener15, Pieter C. Dorrestein14, Gavin M. Douglas16, Daniel M. Durall17, Claire Duvallet6, Christian F. Edwardson, Madeleine Ernst14, Madeleine Ernst18, Mehrbod Estaki17, Jennifer Fouquier19, Julia M. Gauglitz14, Sean M. Gibbons15, Sean M. Gibbons20, Deanna L. Gibson17, Antonio Gonzalez14, Kestrel Gorlick1, Jiarong Guo21, Benjamin Hillmann3, Susan Holmes22, Hannes Holste14, Curtis Huttenhower23, Curtis Huttenhower24, Gavin A. Huttley25, Stefan Janssen26, Alan K. Jarmusch14, Lingjing Jiang14, Benjamin D. Kaehler25, Benjamin D. Kaehler27, Kyo Bin Kang28, Kyo Bin Kang14, Christopher R. Keefe1, Paul Keim1, Scott T. Kelley29, Dan Knights3, Irina Koester14, Tomasz Kosciolek14, Jorden Kreps1, Morgan G. I. Langille16, Joslynn S. Lee30, Ruth E. Ley31, Ruth E. Ley32, Yong-Xin Liu, Erikka Loftfield2, Catherine A. Lozupone19, Massoud Maher14, Clarisse Marotz14, Bryan D Martin20, Daniel McDonald14, Lauren J. McIver24, Lauren J. McIver23, Alexey V. Melnik14, Jessica L. Metcalf33, Sydney C. Morgan17, Jamie Morton14, Ahmad Turan Naimey1, Jose A. Navas-Molina34, Jose A. Navas-Molina14, Louis-Félix Nothias14, Stephanie B. Orchanian, Talima Pearson1, Samuel L. Peoples35, Samuel L. Peoples20, Daniel Petras14, Mary L. Preuss36, Elmar Pruesse19, Lasse Buur Rasmussen7, Adam R. Rivers37, Michael S. Robeson38, Patrick Rosenthal36, Nicola Segata8, Michael Shaffer19, Arron Shiffer1, Rashmi Sinha2, Se Jin Song14, John R. Spear39, Austin D. Swafford, Luke R. Thompson40, Luke R. Thompson41, Pedro J. 
Torres29, Pauline Trinh20, Anupriya Tripathi14, Peter J. Turnbaugh10, Sabah Ul-Hasan42, Justin J. J. van der Hooft43, Fernando Vargas, Yoshiki Vázquez-Baeza14, Emily Vogtmann2, Max von Hippel44, William A. Walters32, Yunhu Wan2, Mingxun Wang14, Jonathan Warren45, Kyle C. Weber37, Kyle C. Weber46, Charles H. D. Williamson1, Amy D. Willis20, Zhenjiang Zech Xu14, Jesse R. Zaneveld20, Yilong Zhang47, Qiyun Zhu14, Rob Knight14, J. Gregory Caporaso1 
TL;DR: QIIME 2 development was primarily funded by NSF Awards 1565100 to J.G.C. and 1565057 to R.K.; partial support was also provided by NIH grants U54CA143925 and U54MD012388.
Abstract: QIIME 2 development was primarily funded by NSF Awards 1565100 to J.G.C. and 1565057 to R.K. Partial support was also provided by the following: grants NIH U54CA143925 (J.G.C. and T.P.) and U54MD012388 (J.G.C. and T.P.); grants from the Alfred P. Sloan Foundation (J.G.C. and R.K.); ERCSTG project MetaPG (N.S.); the Strategic Priority Research Program of the Chinese Academy of Sciences QYZDB-SSW-SMC021 (Y.B.); the Australian National Health and Medical Research Council APP1085372 (G.A.H., J.G.C., Von Bing Yap and R.K.); the Natural Sciences and Engineering Research Council (NSERC) to D.L.G.; and the State of Arizona Technology and Research Initiative Fund (TRIF), administered by the Arizona Board of Regents, through Northern Arizona University. All NCI coauthors were supported by the Intramural Research Program of the National Cancer Institute. S.M.G. and C. Diener were supported by the Washington Research Foundation Distinguished Investigator Award.

8,821 citations

Posted ContentDOI
Evan Bolyen1, Jai Ram Rideout1, Matthew R. Dillon1, Nicholas A. Bokulich1, Christian C. Abnet, Gabriel A. Al-Ghalith2, Harriet Alexander3, Harriet Alexander4, Eric J. Alm5, Manimozhiyan Arumugam6, Francesco Asnicar7, Yang Bai8, Jordan E. Bisanz9, Kyle Bittinger10, Asker Daniel Brejnrod6, Colin J. Brislawn11, C. Titus Brown4, Benjamin J. Callahan12, Andrés Mauricio Caraballo-Rodríguez13, John Chase1, Emily K. Cope1, Ricardo Silva13, Pieter C. Dorrestein13, Gavin M. Douglas14, Daniel M. Durall15, Claire Duvallet5, Christian F. Edwardson16, Madeleine Ernst13, Mehrbod Estaki15, Jennifer Fouquier17, Julia M. Gauglitz13, Deanna L. Gibson15, Antonio Gonzalez18, Kestrel Gorlick1, Jiarong Guo19, Benjamin Hillmann2, Susan Holmes20, Hannes Holste18, Curtis Huttenhower21, Curtis Huttenhower22, Gavin A. Huttley23, Stefan Janssen24, Alan K. Jarmusch13, Lingjing Jiang18, Benjamin D. Kaehler23, Kyo Bin Kang25, Kyo Bin Kang13, Christopher R. Keefe1, Paul Keim1, Scott T. Kelley26, Dan Knights2, Irina Koester13, Irina Koester18, Tomasz Kosciolek18, Jorden Kreps1, Morgan G. I. Langille14, Joslynn S. Lee27, Ruth E. Ley28, Ruth E. Ley29, Yong-Xin Liu8, Erikka Loftfield, Catherine A. Lozupone17, Massoud Maher18, Clarisse Marotz18, Bryan D Martin30, Daniel McDonald18, Lauren J. McIver22, Lauren J. McIver21, Alexey V. Melnik13, Jessica L. Metcalf31, Sydney C. Morgan15, Jamie Morton18, Ahmad Turan Naimey1, Jose A. Navas-Molina18, Jose A. Navas-Molina32, Louis-Félix Nothias13, Stephanie B. Orchanian18, Talima Pearson1, Samuel L. Peoples30, Samuel L. Peoples33, Daniel Petras13, Mary L. Preuss34, Elmar Pruesse17, Lasse Buur Rasmussen6, Adam R. Rivers35, Ii Michael S Robeson36, Patrick Rosenthal34, Nicola Segata7, Michael Shaffer17, Arron Shiffer1, Rashmi Sinha, Se Jin Song18, John R. Spear37, Austin D. Swafford18, Luke R. Thompson38, Luke R. Thompson39, Pedro J. Torres26, Pauline Trinh30, Anupriya Tripathi13, Anupriya Tripathi18, Peter J. Turnbaugh9, Sabah Ul-Hasan40, Justin J. J. 
van der Hooft41, Fernando Vargas18, Yoshiki Vázquez-Baeza18, Emily Vogtmann, Max von Hippel42, William A. Walters28, Yunhu Wan, Mingxun Wang13, Jonathan Warren43, Kyle C. Weber44, Kyle C. Weber35, Chase Hd Williamson1, Amy D. Willis30, Zhenjiang Zech Xu18, Jesse R. Zaneveld30, Yilong Zhang45, Rob Knight18, J. Gregory Caporaso1 
24 Oct 2018-PeerJ
TL;DR: QIIME 2 provides new features that will drive the next generation of microbiome research, including interactive spatial and temporal analysis and visualization tools, support for metabolomics and shotgun metagenomics analysis, and automated data provenance tracking to ensure reproducible, transparent microbiome data science.
Abstract: We present QIIME 2, an open-source microbiome data science platform accessible to users spanning the microbiome research ecosystem, from scientists and engineers to clinicians and policy makers. QIIME 2 provides new features that will drive the next generation of microbiome research. These include interactive spatial and temporal analysis and visualization tools, support for metabolomics and shotgun metagenomics analysis, and automated data provenance tracking to ensure reproducible, transparent microbiome data science.

875 citations

Journal ArticleDOI
TL;DR: The MicrobiomeHD database, which includes 28 published case–control gut microbiome studies spanning ten diseases, is introduced, and a cross-disease meta-analysis of these studies using standardized methods finds consistent patterns characterizing disease-associated microbiome changes.
Abstract: Hundreds of clinical studies have demonstrated associations between the human microbiome and disease, yet fundamental questions remain on how we can generalize this knowledge. Results from individual studies can be inconsistent, and comparing published data is further complicated by a lack of standard processing and analysis methods. Here we introduce the MicrobiomeHD database, which includes 28 published case–control gut microbiome studies spanning ten diseases. We perform a cross-disease meta-analysis of these studies using standardized methods. We find consistent patterns characterizing disease-associated microbiome changes. Some diseases are associated with over 50 genera, while most show only 10–15 genus-level changes. Some diseases are marked by the presence of potentially pathogenic microbes, whereas others are characterized by a depletion of health-associated bacteria. Furthermore, we show that about half of genera associated with individual studies are bacteria that respond to more than one disease. Thus, many associations found in case–control studies are likely not disease-specific but rather part of a non-specific, shared response to health and disease. Reported associations between the human microbiome and disease are often inconsistent. Here, Duvallet et al. perform a meta-analysis of 28 gut microbiome studies spanning ten diseases, and find associations that are likely not disease-specific but potentially part of a shared response to disease.

641 citations

Journal ArticleDOI
25 Aug 2020
TL;DR: A laboratory protocol is developed to quantify viral titers in raw sewage via qPCR analysis and to validate the results with sequencing analysis. The results suggest that the number of positive cases estimated from wastewater viral titers is orders of magnitude greater than the number of confirmed clinical cases, which may significantly impact efforts to understand the case fatality rate and progression of disease.
Abstract: Wastewater surveillance represents a complementary approach to clinical surveillance to measure the presence and prevalence of emerging infectious diseases like the novel coronavirus SARS-CoV-2. This innovative data source can improve the precision of epidemiological modeling to understand the penetrance of SARS-CoV-2 in specific vulnerable communities. Here, we tested wastewater collected at a major urban treatment facility in Massachusetts and detected SARS-CoV-2 RNA from the N gene at significant titers (57 to 303 copies per ml of sewage) in the period from 18 to 25 March 2020 using RT-qPCR. We validated detection of SARS-CoV-2 by Sanger sequencing the PCR product from the S gene. Viral titers observed were significantly higher than expected based on clinically confirmed cases in Massachusetts as of 25 March. Our approach is scalable and may be useful in modeling the SARS-CoV-2 pandemic and future outbreaks. IMPORTANCE Wastewater-based surveillance is a promising approach for proactive outbreak monitoring. SARS-CoV-2 is shed in stool early in the clinical course and infects a large asymptomatic population, making it an ideal target for wastewater-based monitoring. In this study, we develop a laboratory protocol to quantify viral titers in raw sewage via qPCR analysis and validate results with sequencing analysis. Our results suggest that the number of positive cases estimated from wastewater viral titers is orders of magnitude greater than the number of confirmed clinical cases and therefore may significantly impact efforts to understand the case fatality rate and progression of disease. These data may help inform decisions surrounding the advancement or scale-back of social distancing and quarantine efforts based on dynamic wastewater catchment-level estimations of prevalence.

612 citations

Posted ContentDOI
07 Apr 2020-medRxiv
TL;DR: Wastewater surveillance at a major urban treatment facility in Massachusetts found the presence of SARS-CoV-2 at high titers in the period from March 18 - 25 using RT-qPCR, and the identity of the PCR product was confirmed by direct DNA sequencing.
Abstract: Wastewater surveillance may represent a complementary approach to measure the presence and even prevalence of infectious diseases when the capacity for clinical testing is limited. Moreover, aggregate, population-wide data can help inform modeling efforts. We tested wastewater collected at a major urban treatment facility in Massachusetts and found the presence of SARS-CoV-2 at high titers in the period from March 18 - 25 using RT-qPCR. We then confirmed the identity of the PCR product by direct DNA sequencing. Viral titers observed were significantly higher than expected based on clinically confirmed cases in Massachusetts as of March 25. The reason for the discrepancy is not yet clear, however, and until further experiments are complete, these data do not necessarily indicate that clinical estimates are incorrect. Our approach is scalable and may be useful in modeling the SARS-CoV-2 pandemic and future outbreaks.

358 citations


Cited by
Journal ArticleDOI
TL;DR: Some notable features of IQ-TREE version 2 are described and the key advantages over other software are highlighted.
Abstract: IQ-TREE (http://www.iqtree.org, last accessed February 6, 2020) is a user-friendly and widely used software package for phylogenetic inference using maximum likelihood. Since the release of version 1 in 2014, we have continuously expanded IQ-TREE to integrate a plethora of new models of sequence evolution and efficient computational approaches of phylogenetic inference to deal with genomic data. Here, we describe notable features of IQ-TREE version 2 and highlight the key advantages over other software.

4,337 citations

Journal Article
TL;DR: This volume is keyed to high resolution electron microscopy, a sophisticated form of structural analysis that is really morphology in a modern guise; the physical and mechanical background of the instrument and its ancillary tools is simply and well presented.
Abstract: I read this book the same weekend that the Packers took on the Rams, and the experience of the latter event, obviously, colored my judgment. Although I abhor anything that smacks of being a handbook (like, "How to Earn a Merit Badge in Neurosurgery") because too many volumes in biomedical science already evince a boyscout-like approach, I must confess that parts of this volume are fast, scholarly, and significant, with certain reservations. I like parts of this well-illustrated book because Dr. Sjöstrand, without so stating, develops certain subjects on technique in relation to the acquisition of judgment and sophistication. And this is important! So, given that the author (like all of us) is somewhat deficient in some areas, and biased in others, the book is still valuable if the uninitiated reader swallows it in a general fashion, realizing full well that what will be required from the reader is a modulation to fit his vision, proprioception, adaptation and response, and the kind of problem he is undertaking. A major deficiency of this book is revealed by comparison of its use of physics and of chemistry to provide understanding and background for the application of high resolution electron microscopy to problems in biology. Since the volume is keyed to high resolution electron microscopy, which is a sophisticated form of structural analysis, but really morphology in a modern guise, the physical and mechanical background of the instrument and its ancillary tools are simply and well presented. The potential use of chemical or cytochemical information as it relates to biological fine structure, however, is quite deficient. I wonder when even sophisticated morphologists will consider fixation a reaction and not a technique; only then will the fundamentals become self-evident and predictable and this sine qua non will become less mystical. Staining reactions (the most inadequate chapter) ought to be something more than a technique to selectively enhance contrast of morphological elements; it ought to give the structural addresses of some of the chemical residents of cell components. Is it pertinent that autoradiography gets singled out for more complete coverage than other significant aspects of cytochemistry by a high resolution microscopist, when it has a built-in minimal error of 1,000 Å in standard practice? I don't mean to blind-side (in strict football terminology) Dr. Sjöstrand's efforts for what is "routinely used in our laboratory"; what is done is usually well done. It's just that …

3,197 citations

Journal Article
TL;DR: FastTree stores sequence profiles of internal nodes in the tree instead of a distance matrix, uses these profiles to implement neighbor-joining with heuristics that quickly identify candidate joins, and then uses nearest-neighbor interchanges to reduce the length of the tree.
Abstract: Gene families are growing rapidly, but standard methods for inferring phylogenies do not scale to alignments with over 10,000 sequences. We present FastTree, a method for constructing large phylogenies and for estimating their reliability. Instead of storing a distance matrix, FastTree stores sequence profiles of internal nodes in the tree. FastTree uses these profiles to implement neighbor-joining and uses heuristics to quickly identify candidate joins. FastTree then uses nearest-neighbor interchanges to reduce the length of the tree. For an alignment with N sequences, L sites, and a different characters, a distance matrix requires O(N^2) space and O(N^2 L) time, but FastTree requires just O( NLa + N sqrt(N) ) memory and O( N sqrt(N) log(N) L a ) time. To estimate the tree's reliability, FastTree uses local bootstrapping, which gives another 100-fold speedup over a distance matrix. For example, FastTree computed a tree and support values for 158,022 distinct 16S ribosomal RNAs in 17 hours and 2.4 gigabytes of memory. Just computing pairwise Jukes-Cantor distances and storing them, without inferring a tree or bootstrapping, would require 17 hours and 50 gigabytes of memory. In simulations, FastTree was slightly more accurate than neighbor joining, BIONJ, or FastME; on genuine alignments, FastTree's topologies had higher likelihoods. FastTree is available at http://microbesonline.org/fasttree.
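
To make the scaling claim in this abstract concrete, the following is a rough back-of-the-envelope sketch in Python, not anything taken from FastTree itself. It compares only the N-dependent growth terms quoted above, N^2 for a distance matrix versus N sqrt(N) for FastTree, at the N = 158,022 sequences mentioned in the abstract. Constant factors and the L and a terms are ignored, and the function names are hypothetical, so only the ratio of growth terms is meaningful.

```python
import math


def distance_matrix_term(n: int) -> int:
    """Growth term for storing all pairwise distances: N^2."""
    return n * n


def fasttree_term(n: int) -> float:
    """Dominant N-dependent growth term quoted for FastTree: N * sqrt(N)."""
    return n * math.sqrt(n)


# Number of distinct 16S rRNA sequences cited in the abstract.
n_sequences = 158_022

# The ratio N^2 / (N * sqrt(N)) simplifies to sqrt(N).
ratio = distance_matrix_term(n_sequences) / fasttree_term(n_sequences)
print(f"N^2 grows about {ratio:.0f}x faster than N*sqrt(N) at N={n_sequences}")
```

The ratio is simply sqrt(N), a factor of a few hundred at this N. It says nothing about absolute run time or memory, which depend on constants the abstract does not give.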

2,436 citations

01 Mar 2001
TL;DR: Using singular value decomposition in transforming genome-wide expression data from genes × arrays space to reduced diagonalized "eigengenes" × "eigenarrays" space gives a global picture of the dynamics of gene expression, in which individual genes and arrays appear to be classified into groups of similar regulation and function, or similar cellular state and biological phenotype.
Abstract: We describe the use of singular value decomposition in transforming genome-wide expression data from genes × arrays space to reduced diagonalized "eigengenes" × "eigenarrays" space, where the eigengenes (or eigenarrays) are unique orthonormal superpositions of the genes (or arrays). Normalizing the data by filtering out the eigengenes (and eigenarrays) that are inferred to represent noise or experimental artifacts enables meaningful comparison of the expression of different genes across different arrays in different experiments. Sorting the data according to the eigengenes and eigenarrays gives a global picture of the dynamics of gene expression, in which individual genes and arrays appear to be classified into groups of similar regulation and function, or similar cellular state and biological phenotype, respectively. After normalization and sorting, the significant eigengenes and eigenarrays can be associated with observed genome-wide effects of regulators, or with measured samples, in which these regulators are overactive or underactive, respectively.
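
As an illustration of the decomposition described here, the sketch below finds only the dominant singular triplet of a tiny genes × arrays matrix by power iteration, in plain Python. It is a minimal sketch under stated assumptions, not the authors' procedure: a full SVD routine from a numerical library would normally be used, the function name and toy data are hypothetical, and the toy matrix is deliberately rank one so the result is easy to check by hand. Following the usual convention, the right singular vector (a pattern across arrays) is labeled the eigengene and the left singular vector (a pattern across genes) the eigenarray.

```python
import math


def top_singular_triplet(matrix, iterations=50):
    """Power iteration on E^T E; returns (sigma, eigenarray u, eigengene v)."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    # Hypothetical starting vector: the first basis vector in array space.
    # It must not be orthogonal to the dominant eigengene.
    v = [1.0] + [0.0] * (n_cols - 1)
    for _ in range(iterations):
        # ev = E v (one value per gene)
        ev = [sum(row[j] * v[j] for j in range(n_cols)) for row in matrix]
        # w = E^T (E v) (one value per array)
        w = [sum(matrix[i][j] * ev[i] for i in range(n_rows)) for j in range(n_cols)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    ev = [sum(row[j] * v[j] for j in range(n_cols)) for row in matrix]
    sigma = math.sqrt(sum(x * x for x in ev))
    u = [x / sigma for x in ev]
    return sigma, u, v


# Toy expression matrix: 2 genes (rows) x 3 arrays (columns), rank one by design.
expression = [
    [3.0, 6.0, 6.0],
    [4.0, 8.0, 8.0],
]

sigma, eigenarray, eigengene = top_singular_triplet(expression)
print("singular value:", round(sigma, 6))
print("eigenarray (over genes):", [round(x, 6) for x in eigenarray])
print("eigengene (over arrays):", [round(x, 6) for x in eigengene])
```

For this toy matrix the singular value is approximately 15, with eigenarray close to (0.6, 0.8) and eigengene close to (1/3, 2/3, 2/3). In real data, several eigengenes would be inspected and those attributed to noise or artifacts filtered out, as the abstract describes.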

1,815 citations

25 Apr 2017
TL;DR: This presentation is a case study taken from the travel and holiday industry and describes the effectiveness of various techniques as well as the performance of Python-based libraries such as Python Data Analysis Library (Pandas), and Scikit-learn (built on NumPy, SciPy and matplotlib).
Abstract: This presentation is a case study taken from the travel and holiday industry. Paxport/Multicom, based in the UK and Sweden, have recently adopted a recommendation system for holiday accommodation bookings. Machine learning techniques such as Collaborative Filtering have been applied using Python (3.5.1), with Jupyter (4.0.6) as the main framework. Data scale and sparsity present significant challenges in the case study, and so the effectiveness of various techniques is described, as well as the performance of Python-based libraries such as Python Data Analysis Library (Pandas) and Scikit-learn (built on NumPy, SciPy and matplotlib). The presentation is suitable for all levels of programmers.

1,338 citations