Home
/
Authors
/
Sharvari Gujja

Author

Sharvari Gujja

Other affiliations: University of Massachusetts Medical School, Massachusetts Institute of Technology

Bio: Sharvari Gujja is an academic researcher from Broad Institute. The author has contributed to research in topics: Genome & Human microbiome. The author has an hindex of 23, co-authored 37 publications receiving 18424 citations. Previous affiliations of Sharvari Gujja include University of Massachusetts Medical School & Massachusetts Institute of Technology.

Topics: Genome, Human microbiome, Microbiome, Genome evolution, Unconventional computing ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Structure, function and diversity of the healthy human microbiome

[...]

Curtis Huttenhower¹, Curtis Huttenhower², Dirk Gevers², Rob Knight³ +250 more•Institutions (42)

14 Jun 2012-Nature

TL;DR: The Human Microbiome Project Consortium reported the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome as discussed by the authors.

...read moreread less

Abstract: The Human Microbiome Project Consortium reports the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome.

...read moreread less

8,410 citations

Journal Article•

Structure, function and diversity of the healthy human microbiome

[...]

Curtis Huttenhower, Dirk Gevers, Rob Knight, Sahar Abubucker +244 more

01 Jun 2012-PubMed Central

TL;DR: The Human Microbiome Project has analysed the largest cohort and set of distinct, clinically relevant body habitats so far, finding the diversity and abundance of each habitat’s signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals.

...read moreread less

Abstract: Studies of the human microbiome have revealed that even healthy individuals differ remarkably in the microbes that occupy habitats such as the gut, skin and vagina. Much of this diversity remains unexplained, although diet, environment, host genetics and early microbial exposure have all been implicated. Accordingly, to characterize the ecology of human-associated microbial communities, the Human Microbiome Project has analysed the largest cohort and set of distinct, clinically relevant body habitats so far. We found the diversity and abundance of each habitat’s signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals. The project encountered an estimated 81–99% of the genera, enzyme families and community configurations occupied by the healthy Western microbiome. Metagenomic carriage of metabolic pathways was stable among individuals despite variation in community structure, and ethnic/racial background proved to be one of the strongest associations of both pathways and microbes with clinical metadata. These results thus delineate the range of structural and functional configurations normal in the microbial communities of a healthy population, enabling future characterization of the epidemiology, ecology and translational applications of the human microbiome.

...read moreread less

6,350 citations

Journal Article•DOI•

A framework for human microbiome research

[...]

Barbara A. Methé¹, Karen E. Nelson¹, Mihai Pop², Heather Huot Creasy³ +250 more•Institutions (42)

14 Jun 2012-Nature

TL;DR: The Human Microbiome Project (HMP) Consortium has established a population-scale framework which catalyzed significant development of metagenomic protocols resulting in a broad range of quality-controlled resources and data including standardized methods for creating, processing and interpreting distinct types of high-throughput metagenomics data available to the scientific community as mentioned in this paper.

...read moreread less

Abstract: A variety of microbial communities and their genes (microbiome) exist throughout the human body, playing fundamental roles in human health and disease. The NIH funded Human Microbiome Project (HMP) Consortium has established a population-scale framework which catalyzed significant development of metagenomic protocols resulting in a broad range of quality-controlled resources and data including standardized methods for creating, processing and interpreting distinct types of high-throughput metagenomic data available to the scientific community. Here we present resources from a population of 242 healthy adults sampled at 15 to 18 body sites up to three times, which to date, have generated 5,177 microbial taxonomic profiles from 16S rRNA genes and over 3.5 Tb of metagenomic sequence. In parallel, approximately 800 human-associated reference genomes have been sequenced. Collectively, these data represent the largest resource to date describing the abundance and variety of the human microbiome, while providing a platform for current and future studies.

...read moreread less

2,172 citations

Journal Article•DOI•

A Catalog of Reference Genomes from the Human Microbiome

[...]

Karen E. Nelson, George M. Weinstock, Sarah K. Highlander, Kim C. Worley, Heather Huot Creasy, Jennifer R. Wortman, Douglas B. Rusch, Makedonka Mitreva, Erica Sodergren, Asif T. Chinwalla, Michael Feldgarden, Dirk Gevers, Brian J. Haas, Ramana Madupu, Doyle V. Ward, Bruce W. Birren, Richard A. Gibbs, Barbara A. Methé, Joseph F. Petrosino, Robert L. Strausberg, Granger G. Sutton, Owen White, Richard K. Wilson, Scott Durkin, Michelle G. Giglio, Sharvari Gujja, Clint Howarth, Chinnappa D. Kodira, Nikos C. Kyrpides, Teena Mehta, Donna M. Muzny, Matthew D. Pearson, Kymberlie H. Pepin, Amrita Pati, Xiang Qin, Chandri N. Yandava, Qiandong Zeng, Lan Zhang, Aaron M. Berlin, Lei Chen, Theresa A. Hepburn, Justin Johnson, Jamison McCorrison, Jason R. Miller, Patrick Minx, Chad Nusbaum, Carsten Russ, Sean M. Sykes, Chad Tomlinson, Sarah Young, Wesley C. Warren, Jonathan H. Badger, Jonathan Crabtree, Victor Markowitz, Joshua Orvis, Andrew Cree, Steve Ferriera, Lucinda Fulton, Robert S. Fulton, Marcus Gillis, Lisa Hemphill, Vandita Joshi, Christie Kovar, Manolito Torralba, Kris A. Wetterstrand, Amr Abouellleil, Aye Wollam, Christian J. Buhay, Yan Ding, Shannon Dugan, Michael Fitzgerald, Mike Holder, Jessica B. Hostetler, Sandra W. Clifton, Emma Allen-Vercoe, Ashlee M. Earl, Candace N. Farmer, Konstantinos Liolios, Michael G. Surette, Qiang Xu, Craig Pohl, Katarzyna Wilczek-Boney, Dianhui Zhu - Show less +79 more

21 May 2010-Science

TL;DR: Results from an initial reference genome sequencing of 178 microbial genomes allow for ~40% of random sequences from the microbiome of the gastrointestinal tract to be associated with organisms based on the match criteria used, suggesting that the authors are still far from saturating microbial species genetic data sets.

...read moreread less

Abstract: The human microbiome refers to the community of microorganisms, including prokaryotes, viruses, and microbial eukaryotes, that populate the human body. The National Institutes of Health launched an initiative that focuses on describing the diversity of microbial species that are associated with health and disease. The first phase of this initiative includes the sequencing of hundreds of microbial reference genomes, coupled to metagenomic sequencing from multiple body sites. Here we present results from an initial reference genome sequencing of 178 microbial genomes. From 547,968 predicted polypeptides that correspond to the gene complement of these strains, previously unidentified ("novel") polypeptides that had both unmasked sequence length greater than 100 amino acids and no BLASTP match to any nonreference entry in the nonredundant subset were defined. This analysis resulted in a set of 30,867 polypeptides, of which 29,987 (approximately 97%) were unique. In addition, this set of microbial genomes allows for approximately 40% of random sequences from the microbiome of the gastrointestinal tract to be associated with organisms based on the match criteria used. Insights into pan-genome analysis suggest that we are still far from saturating microbial species genetic data sets. In addition, the associated metrics and standards used by our group for quality assurance are presented.

...read moreread less

649 citations

Journal Article•DOI•

Comparative functional genomics of the fission yeasts

[...]

Nicholas Rhind¹, Zehua Chen², Moran Yassour³, Moran Yassour², Dawn Thompson², Brian J. Haas², Naomi Habib³, Ilan Wapinski², Ilan Wapinski⁴, Sushmita Roy², Michael F. Lin², David I. Heiman², Sarah Young², Kanji Furuya⁵, Yabin Guo⁶, Alison L. Pidoux⁷, Huei Mei Chen⁸, Barbara Robbertse⁹, Jonathan M. Goldberg², Keita Aoki⁵, Elizabeth H. Bayne⁷, Aaron M. Berlin², Christopher A. Desjardins², Edward Dobbs⁷, Livio Dukaj¹, Lin Fan², Michael Fitzgerald², Courtney French³, Sharvari Gujja², Klavs R. Hansen¹⁰, Daniel Keifenheim¹, Joshua Z. Levin², Rebecca A. Mosher¹¹, Carolin A. Müller¹², Jenna Pfiffner², Margaret Priest², Carsten Russ², Agata Smialowska¹³, Agata Smialowska¹⁴, Peter Swoboda¹⁴, Sean M. Sykes², Matthew W. Vaughn¹⁰, Sonya Vengrova¹⁵, Ryan J. Yoder⁹, Qiandong Zeng², Robin C. Allshire⁷, David C. Baulcombe¹¹, Bruce W. Birren², William Brown¹², Karl Ekwall¹³, Karl Ekwall¹⁴, Manolis Kellis², Janet Leatherwood⁸, Henry L. Levin⁶, Hanah Margalit³, Robert A. Martienssen¹⁰, Conrad A. Nieduszynski¹², Joseph W. Spatafora⁹, Nir Friedman³, Jacob Z. Dalgaard¹⁵, Peter Baumann¹⁶, Peter Baumann¹⁷, Peter Baumann¹⁸, Hironori Niki⁵, Aviv Regev², Aviv Regev¹⁸, Chad Nusbaum² - Show less +63 more•Institutions (18)

University of Massachusetts Medical School¹, Massachusetts Institute of Technology², Hebrew University of Jerusalem³, Harvard University⁴, National Institute of Genetics⁵, National Institutes of Health⁶, University of Edinburgh⁷, State University of New York System⁸, Oregon State University⁹, Cold Spring Harbor Laboratory¹⁰, University of Cambridge¹¹, University of Nottingham¹², Södertörn University¹³, Karolinska Institutet¹⁴, University of Warwick¹⁵, University of Kansas¹⁶, Stowers Institute for Medical Research¹⁷, Howard Hughes Medical Institute¹⁸

20 May 2011-Science

TL;DR: Differences in gene content and regulation explain why, unlike the budding yeast of Saccharomycotina, fission yeasts cannot use ethanol as a primary carbon source and provide tools for investigation across the Schizosaccharomyces clade.

...read moreread less

Abstract: The fission yeast clade--comprising Schizosaccharomyces pombe, S. octosporus, S. cryophilus, and S. japonicus--occupies the basal branch of Ascomycete fungi and is an important model of eukaryote biology. A comparative annotation of these genomes identified a near extinction of transposons and the associated innovation of transposon-free centromeres. Expression analysis established that meiotic genes are subject to antisense transcription during vegetative growth, which suggests a mechanism for their tight regulation. In addition, trans-acting regulators control new genes within the context of expanded functional modules for meiosis and stress response. Differences in gene content and regulation also explain why, unlike the budding yeast of Saccharomycotina, fission yeasts cannot use ethanol as a primary carbon source. These analyses elucidate the genome structure and gene regulation of fission yeast and provide tools for investigation across the Schizosaccharomyces clade.

...read moreread less

474 citations

1
2
3
4
…
5
6
7
8

Collapse

Cited by

PDF

Open Access

More filters

疟原虫var基因转换速率变化导致抗原变异[英]／Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A

[...]

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfPMP1）与感染红细胞、树突状组胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作�ly.

...read moreread less

Abstract: 抗原变异可使得多种致病微生物易于逃避宿主免疫应答。表达在感染红细胞表面的恶性疟原虫红细胞表面蛋白1（PfPMP1）与感染红细胞、内皮细胞、树突状细胞以及胎盘的单个或多个受体作用，在黏附及免疫逃避中起关键的作用。每个单倍体基因组var基因家族编码约60种成员，通过启动转录不同的var基因变异体为抗原变异提供了分子基础。

...read moreread less

18,940 citations

Journal Article•DOI•

Full-length transcriptome assembly from RNA-Seq data without a reference genome.

[...]

Manfred Grabherr¹, Brian J. Haas¹, Moran Yassour², Moran Yassour¹, Joshua Z. Levin¹, Dawn Thompson¹, Ido Amit¹, Xian Adiconis¹, Lin Fan¹, Raktima Raychowdhury¹, Qiandong Zeng¹, Zehua Chen¹, Evan Mauceli¹, Nir Hacohen¹, Andreas Gnirke¹, Nicholas Rhind³, Federica Di Palma¹, Bruce W. Birren¹, Chad Nusbaum¹, Kerstin Lindblad-Toh¹, Kerstin Lindblad-Toh⁴, Nir Friedman², Aviv Regev¹ - Show less +19 more•Institutions (4)

Massachusetts Institute of Technology¹, Hebrew University of Jerusalem², University of Massachusetts Medical School³, Science for Life Laboratory⁴

01 Jul 2011-Nature Biotechnology

TL;DR: The Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available, providing a unified solution for transcriptome reconstruction in any sample.

...read moreread less

Abstract: Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.

...read moreread less

15,665 citations

Journal Article•DOI•

DADA2: High-resolution sample inference from Illumina amplicon data

[...]

Benjamin J. Callahan¹, Paul J. McMurdie, Michael J. Rosen¹, Andrew W. Han, Amy Jo A. Johnson, Susan Holmes¹ - Show less +2 more•Institutions (1)

Stanford University¹

01 Jul 2016-Nature Methods

TL;DR: The open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors is presented, revealing a diversity of previously undetected Lactobacillus crispatus variants.

...read moreread less

Abstract: We present the open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors (https://github.com/benjjneb/dada2). DADA2 infers sample sequences exactly and resolves differences of as little as 1 nucleotide. In several mock communities, DADA2 identified more real variants and output fewer spurious sequences than other methods. We applied DADA2 to vaginal samples from a cohort of pregnant women, revealing a diversity of previously undetected Lactobacillus crispatus variants.

...read moreread less

14,505 citations

Journal Article•DOI•

UPARSE: highly accurate OTU sequences from microbial amplicon reads

[...]

Robert C. Edgar

01 Oct 2013-Nature Methods

TL;DR: The UPARSE pipeline reports operational taxonomic unit (OTU) sequences with ≤1% incorrect bases in artificial microbial community tests, compared with >3% correct bases commonly reported by other methods.

...read moreread less

Abstract: Amplified marker-gene sequences can be used to understand microbial community structure, but they suffer from a high level of sequencing and amplification artifacts. The UPARSE pipeline reports operational taxonomic unit (OTU) sequences with ≤1% incorrect bases in artificial microbial community tests, compared with >3% incorrect bases commonly reported by other methods. The improved accuracy results in far fewer OTUs, consistently closer to the expected number of species in a community.

...read moreread less

11,329 citations

Journal Article•DOI•

phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.

[...]

Paul J. McMurdie¹, Susan Holmes¹•Institutions (1)

Stanford University¹

22 Apr 2013-PLOS ONE

TL;DR: The phyloseq project for R is a new open-source software package dedicated to the object-oriented representation and analysis of microbiome census data in R, which supports importing data from a variety of common formats, as well as many analysis techniques.

...read moreread less

Abstract: Background The analysis of microbial communities through DNA sequencing brings many challenges: the integration of different types of data with methods from ecology, genetics, phylogenetics, multivariate statistics, visualization and testing. With the increased breadth of experimental designs now being pursued, project-specific statistical analyses are often needed, and these analyses are often difficult (or impossible) for peer researchers to independently reproduce. The vast majority of the requisite tools for performing these analyses reproducibly are already implemented in R and its extensions (packages), but with limited support for high throughput microbiome census data. Results Here we describe a software project, phyloseq, dedicated to the object-oriented representation and analysis of microbiome census data in R. It supports importing data from a variety of common formats, as well as many analysis techniques. These include calibration, filtering, subsetting, agglomeration, multi-table comparisons, diversity analysis, parallelized Fast UniFrac, ordination methods, and production of publication-quality graphics; all in a manner that is easy to document, share, and modify. We show how to apply functions from other R packages to phyloseq-represented data, illustrating the availability of a large number of open source analysis techniques. We discuss the use of phyloseq with tools for reproducible research, a practice common in other fields but still rare in the analysis of highly parallel microbiome census data. We have made available all of the materials necessary to completely reproduce the analysis and figures included in this article, an example of best practices for reproducible research. Conclusions The phyloseq project for R is a new open-source software package, freely available on the web from both GitHub and Bioconductor.

...read moreread less

11,272 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse