Home
/
Authors
/
Yuanqing Wu

Author

Yuanqing Wu

Other affiliations: University of Texas MD Anderson Cancer Center, Human Genome Sequencing Center

Bio: Yuanqing Wu is an academic researcher from Baylor College of Medicine. The author has contributed to research in topics: Population & Genome-wide association study. The author has an hindex of 18, co-authored 19 publications receiving 20944 citations. Previous affiliations of Yuanqing Wu include University of Texas MD Anderson Cancer Center & Human Genome Sequencing Center.

Topics: Population, Genome-wide association study, Genome, Exome sequencing, Genomics ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Structure, function and diversity of the healthy human microbiome

[...]

Curtis Huttenhower¹, Curtis Huttenhower², Dirk Gevers², Rob Knight³ +250 more•Institutions (42)

14 Jun 2012-Nature

TL;DR: The Human Microbiome Project Consortium reported the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome as discussed by the authors.

...read moreread less

Abstract: The Human Microbiome Project Consortium reports the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome.

...read moreread less

8,410 citations

Journal Article•

Structure, function and diversity of the healthy human microbiome

[...]

Curtis Huttenhower, Dirk Gevers, Rob Knight, Sahar Abubucker +244 more

01 Jun 2012-PubMed Central

TL;DR: The Human Microbiome Project has analysed the largest cohort and set of distinct, clinically relevant body habitats so far, finding the diversity and abundance of each habitat’s signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals.

...read moreread less

Abstract: Studies of the human microbiome have revealed that even healthy individuals differ remarkably in the microbes that occupy habitats such as the gut, skin and vagina. Much of this diversity remains unexplained, although diet, environment, host genetics and early microbial exposure have all been implicated. Accordingly, to characterize the ecology of human-associated microbial communities, the Human Microbiome Project has analysed the largest cohort and set of distinct, clinically relevant body habitats so far. We found the diversity and abundance of each habitat’s signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals. The project encountered an estimated 81–99% of the genera, enzyme families and community configurations occupied by the healthy Western microbiome. Metagenomic carriage of metabolic pathways was stable among individuals despite variation in community structure, and ethnic/racial background proved to be one of the strongest associations of both pathways and microbes with clinical metadata. These results thus delineate the range of structural and functional configurations normal in the microbial communities of a healthy population, enabling future characterization of the epidemiology, ecology and translational applications of the human microbiome.

...read moreread less

6,350 citations

Journal Article•DOI•

Patterns and rates of exonic de novo mutations in autism spectrum disorders

[...]

Benjamin M. Neale¹, Yan Kou², Li Liu³, Avi Ma'ayan², Kaitlin E. Samocha⁴, Kaitlin E. Samocha¹, Aniko Sabo⁵, Chiao-Feng Lin⁶, Christine Stevens⁴, Li-San Wang⁶, Vladimir Makarov², Paz Polak⁷, Paz Polak⁴, Seungtai Yoon², Jared Maguire⁴, Emily L. Crawford⁸, Nicholas G. Campbell⁸, Evan T. Geller⁶, Otto Valladares⁶, Chad M. Schafer³, Han Liu⁹, Tuo Zhao⁹, Guiqing Cai², Jayon Lihm², Ruth Dannenfelser², Omar Jabado², Zuleyma Peralta², Uma Nagaswamy⁵, Donna M. Muzny⁵, Jeffrey G. Reid⁵, Irene Newsham⁵, Yuanqing Wu⁵, Lora Lewis⁵, Yi Han⁵, Benjamin F. Voight⁶, Benjamin F. Voight⁴, Elaine T. Lim¹, Elaine T. Lim⁴, Elizabeth J. Rossin⁴, Elizabeth J. Rossin¹, Andrew Kirby¹, Andrew Kirby⁴, Jason Flannick⁴, Menachem Fromer¹, Menachem Fromer⁴, Khalid Shakir⁴, Timothy Fennell⁴, Kiran V. Garimella⁴, Eric Banks⁴, Ryan Poplin⁴, Stacey Gabriel⁴, Mark A. DePristo⁴, Jack R. Wimbish, Braden E. Boone, Shawn Levy, Catalina Betancur¹⁰, Shamil R. Sunyaev⁷, Shamil R. Sunyaev⁴, Eric Boerwinkle¹¹, Eric Boerwinkle⁵, Joseph D. Buxbaum, Edwin H. Cook¹², Bernie Devlin¹³, Richard A. Gibbs⁵, Kathryn Roeder³, Gerard D. Schellenberg⁶, James S. Sutcliffe⁸, Mark J. Daly⁴, Mark J. Daly¹ - Show less +65 more•Institutions (13)

Harvard University¹, Icahn School of Medicine at Mount Sinai², Carnegie Mellon University³, Broad Institute⁴, Baylor College of Medicine⁵, University of Pennsylvania⁶, Brigham and Women's Hospital⁷, Vanderbilt University⁸, Johns Hopkins University⁹, French Institute of Health and Medical Research¹⁰, University of Texas Health Science Center at Houston¹¹, University of Illinois at Chicago¹², University of Pittsburgh¹³

04 Apr 2012-Nature

TL;DR: Results from de novo events and a large parallel case–control study provide strong evidence in favour of CHD8 and KATNAL2 as genuine autism risk factors and support polygenic models in which spontaneous coding mutations in any of a large number of genes increases risk by 5- to 20-fold.

...read moreread less

Abstract: Autism spectrum disorders (ASD) are believed to have genetic and environmental origins, yet in only a modest fraction of individuals can specific causes be identified. To identify further genetic risk factors, here we assess the role of de novo mutations in ASD by sequencing the exomes of ASD cases and their parents (n = 175 trios). Fewer than half of the cases (46.3%) carry a missense or nonsense de novo variant, and the overall rate of mutation is only modestly higher than the expected rate. In contrast, the proteins encoded by genes that harboured de novo missense or nonsense mutations showed a higher degree of connectivity among themselves and to previous ASD genes as indexed by protein-protein interaction screens. The small increase in the rate of de novo events, when taken together with the protein interaction results, are consistent with an important but limited role for de novo point mutations in ASD, similar to that documented for de novo copy number variants. Genetic models incorporating these data indicate that most of the observed de novo events are unconnected to ASD; those that do confer risk are distributed across many genes and are incompletely penetrant (that is, not necessarily sufficient for disease). Our results support polygenic models in which spontaneous coding mutations in any of a large number of genes increases risk by 5- to 20-fold. Despite the challenge posed by such models, results from de novo events and a large parallel case-control study provide strong evidence in favour of CHD8 and KATNAL2 as genuine autism risk factors.

...read moreread less

1,700 citations

Journal Article•DOI•

Exome sequencing of head and neck squamous cell carcinoma reveals inactivating mutations in NOTCH1

[...]

Nishant Agrawal¹, Mitchell J. Frederick², Curtis R. Pickering², Chetan Bettegowda¹, Kyle Chang³, Ryan J. Li¹, Carole Fakhry¹, Tong Xin Xie², Jiexin Zhang², Jing Wang², Nianxiang Zhang², Adel K. El-Naggar², Samar A. Jasser², John N. Weinstein², Lisa R. Trevino³, Jennifer Drummond³, Donna M. Muzny³, Yuanqing Wu³, Laura D. Wood¹, Ralph H. Hruban¹, William H. Westra¹, Wayne M. Koch¹, Joseph A. Califano¹, Joseph A. Califano⁴, Richard A. Gibbs³, Richard A. Gibbs⁴, David Sidransky¹, Bert Vogelstein¹, Victor E. Velculescu¹, Nickolas Papadopoulos¹, David A. Wheeler³, Kenneth W. Kinzler¹, Jeffrey N. Myers² - Show less +29 more•Institutions (4)

Johns Hopkins University¹, University of Texas MD Anderson Cancer Center², Baylor College of Medicine³, Greater Baltimore Medical Center⁴

26 Aug 2011-Science

TL;DR: To explore the genetic origins of head and neck squamous cell carcinoma, whole-exome sequencing and gene copy number analyses were used to study 32 primary tumors and identified mutations in FBXW7 and NotCH1, suggesting that NOTCH1 may function as a tumor suppressor gene rather than an oncogene in this tumor type.

...read moreread less

Abstract: Head and neck squamous cell carcinoma (HNSCC) is the sixth most common cancer worldwide. To explore the genetic origins of this cancer, we used whole-exome sequencing and gene copy number analyses to study 32 primary tumors. Tumors from patients with a history of tobacco use had more mutations than did tumors from patients who did not use tobacco, and tumors that were negative for human papillomavirus (HPV) had more mutations than did HPV-positive tumors. Six of the genes that were mutated in multiple tumors were assessed in up to 88 additional HNSCCs. In addition to previously described mutations in TP53, CDKN2A, PIK3CA, and HRAS, we identified mutations in FBXW7 and NOTCH1. Nearly 40% of the 28 mutations identified in NOTCH1 were predicted to truncate the gene product, suggesting that NOTCH1 may function as a tumor suppressor gene rather than an oncogene in this tumor type.

...read moreread less

1,613 citations

Journal Article•DOI•

The Drosophila melanogaster Genetic Reference Panel

[...]

Trudy F. C. Mackay¹, Stephen Richards², Eric A. Stone¹, Antonio Barbadilla, Julien F. Ayroles³, Julien F. Ayroles¹, Dianhui Zhu², Sònia Casillas, Yi Han², Michael M. Magwire¹, Julie M. Cridland⁴, Mark F. Richardson⁵, Robert R. H. Anholt¹, Maite G. Barrón, Crystal Bess², Kerstin P. Blankenburg², Mary Anna Carbone¹, David Castellano, Lesley S. Chaboub², Laura H Duncan¹, Zeke Harris¹, Mehwish Javaid², Joy Jayaseelan², Shalini N. Jhangiani², Katherine W. Jordan¹, Fremiet Lara², Faye Lawrence¹, Sandra L. Lee², Pablo Librado⁶, Raquel S. Linheiro⁵, Richard F. Lyman¹, Aaron J. Mackey⁷, Mala Munidasa², Donna M. Muzny², Lynne V. Nazareth², Irene Newsham, Lora Perales², Ling-Ling Pu², Carson Qu², Miquel Ràmia, Jeffrey G. Reid², Stephanie M. Rollmann⁸, Stephanie M. Rollmann¹, Julio Rozas⁶, Nehad Saada², Lavanya Turlapati¹, Kim C. Worley², Yuanqing Wu², Akihiko Yamamoto¹, Yiming Zhu², Casey M. Bergman⁵, Kevin R. Thornton⁴, David Mittelman⁹, Richard A. Gibbs² - Show less +50 more•Institutions (9)

North Carolina State University¹, Baylor College of Medicine², Harvard University³, University of California, Irvine⁴, University of Manchester⁵, University of Barcelona⁶, University of Virginia⁷, University of Cincinnati⁸, Virginia Bioinformatics Institute⁹

09 Feb 2012-Nature

TL;DR: The Drosophila melanogaster Genetic Reference Panel is described, a community resource for analysis of population genomics and quantitative traits, which reveals reduced polymorphism in centromeric autosomal regions and the X chromosomes, evidence for positive and negative selection, and rapid evolution of the X chromosome.

...read moreread less

Abstract: A major challenge of biology is understanding the relationship between molecular genetic variation and variation in quantitative traits, including fitness. This relationship determines our ability to predict phenotypes from genotypes and to understand how evolutionary forces shape variation within and between species. Previous efforts to dissect the genotype-phenotype map were based on incomplete genotypic information. Here, we describe the Drosophila melanogaster Genetic Reference Panel (DGRP), a community resource for analysis of population genomics and quantitative traits. The DGRP consists of fully sequenced inbred lines derived from a natural population. Population genomic analyses reveal reduced polymorphism in centromeric autosomal regions and the X chromosome, evidence for positive and negative selection, and rapid evolution of the X chromosome. Many variants in novel genes, most at low frequency, are associated with quantitative traits and explain a large fraction of the phenotypic variance. The DGRP facilitates genotype-phenotype mapping using the power of Drosophila genetics.

...read moreread less

1,568 citations

1
2
3
4
…

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

DADA2: High-resolution sample inference from Illumina amplicon data

[...]

Benjamin J. Callahan¹, Paul J. McMurdie, Michael J. Rosen¹, Andrew W. Han, Amy Jo A. Johnson, Susan Holmes¹ - Show less +2 more•Institutions (1)

Stanford University¹

01 Jul 2016-Nature Methods

TL;DR: The open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors is presented, revealing a diversity of previously undetected Lactobacillus crispatus variants.

...read moreread less

Abstract: We present the open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors (https://github.com/benjjneb/dada2). DADA2 infers sample sequences exactly and resolves differences of as little as 1 nucleotide. In several mock communities, DADA2 identified more real variants and output fewer spurious sequences than other methods. We applied DADA2 to vaginal samples from a cohort of pregnant women, revealing a diversity of previously undetected Lactobacillus crispatus variants.

...read moreread less

14,505 citations

Journal Article•

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

[...]

Fumio Tajima¹•Institutions (1)

Kyushu University¹

30 Oct 1989-Genomics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

...read moreread less

11,521 citations

Journal Article•DOI•

UPARSE: highly accurate OTU sequences from microbial amplicon reads

[...]

Robert C. Edgar

01 Oct 2013-Nature Methods

TL;DR: The UPARSE pipeline reports operational taxonomic unit (OTU) sequences with ≤1% incorrect bases in artificial microbial community tests, compared with >3% correct bases commonly reported by other methods.

...read moreread less

Abstract: Amplified marker-gene sequences can be used to understand microbial community structure, but they suffer from a high level of sequencing and amplification artifacts. The UPARSE pipeline reports operational taxonomic unit (OTU) sequences with ≤1% incorrect bases in artificial microbial community tests, compared with >3% incorrect bases commonly reported by other methods. The improved accuracy results in far fewer OTUs, consistently closer to the expected number of species in a community.

...read moreread less

11,329 citations

Journal Article•DOI•

phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.

[...]

Paul J. McMurdie¹, Susan Holmes¹•Institutions (1)

Stanford University¹

22 Apr 2013-PLOS ONE

TL;DR: The phyloseq project for R is a new open-source software package dedicated to the object-oriented representation and analysis of microbiome census data in R, which supports importing data from a variety of common formats, as well as many analysis techniques.

...read moreread less

Abstract: Background The analysis of microbial communities through DNA sequencing brings many challenges: the integration of different types of data with methods from ecology, genetics, phylogenetics, multivariate statistics, visualization and testing. With the increased breadth of experimental designs now being pursued, project-specific statistical analyses are often needed, and these analyses are often difficult (or impossible) for peer researchers to independently reproduce. The vast majority of the requisite tools for performing these analyses reproducibly are already implemented in R and its extensions (packages), but with limited support for high throughput microbiome census data. Results Here we describe a software project, phyloseq, dedicated to the object-oriented representation and analysis of microbiome census data in R. It supports importing data from a variety of common formats, as well as many analysis techniques. These include calibration, filtering, subsetting, agglomeration, multi-table comparisons, diversity analysis, parallelized Fast UniFrac, ordination methods, and production of publication-quality graphics; all in a manner that is easy to document, share, and modify. We show how to apply functions from other R packages to phyloseq-represented data, illustrating the availability of a large number of open source analysis techniques. We discuss the use of phyloseq with tools for reproducible research, a practice common in other fields but still rare in the analysis of highly parallel microbiome census data. We have made available all of the materials necessary to completely reproduce the analysis and figures included in this article, an example of best practices for reproducible research. Conclusions The phyloseq project for R is a new open-source software package, freely available on the web from both GitHub and Bioconductor.

...read moreread less

11,272 citations

Journal Article•DOI•

Signatures of mutational processes in human cancer

[...]

Ludmil B. Alexandrov¹, Serena Nik-Zainal², Serena Nik-Zainal³, David C. Wedge¹, Samuel Aparicio⁴, Sam Behjati⁵, Sam Behjati¹, Andrew V. Biankin, Graham R. Bignell¹, Niccolo Bolli⁵, Niccolo Bolli¹, Åke Borg³, Anne Lise Børresen-Dale⁶, Anne Lise Børresen-Dale⁷, Sandrine Boyault⁸, Birgit Burkhardt⁸, Adam Butler¹, Carlos Caldas⁹, Helen Davies¹, Christine Desmedt, Roland Eils⁵, Jorunn E. Eyfjord¹⁰, John A. Foekens¹¹, Mel Greaves¹², Fumie Hosoda¹³, Barbara Hutter⁵, Tomislav Ilicic¹, Sandrine Imbeaud¹⁴, Sandrine Imbeaud¹⁵, Marcin Imielinsk¹⁵, Natalie Jäger⁵, David T. W. Jones¹⁶, David T. Jones¹, Stian Knappskog¹⁷, Stian Knappskog¹¹, Marcel Kool¹¹, Sunil R. Lakhani¹⁸, Carlos López-Otín¹⁸, Sancha Martin¹, Nikhil C. Munshi¹⁹, Nikhil C. Munshi²⁰, Hiromi Nakamura¹³, Paul A. Northcott¹⁶, Marina Pajic²¹, Elli Papaemmanuil¹, Angelo Paradiso²², John V. Pearson²³, Xose S. Puente¹⁸, Keiran Raine¹, Manasa Ramakrishna¹, Andrea L. Richardson²², Andrea L. Richardson²⁰, Julia Richter²², Philip Rosenstiel²², Matthias Schlesner⁵, Ton N. Schumacher²⁴, Paul N. Span²⁵, Jon W. Teague¹, Yasushi Totoki¹³, Andrew Tutt²⁴, Rafael Valdés-Mas¹⁸, Marit M. van Buuren²⁵, Laura van ’t Veer²⁶, Anne Vincent-Salomon²⁷, Nicola Waddell²³, Lucy R. Yates¹, Icgc PedBrain²⁴, Jessica Zucman-Rossi¹⁴, Jessica Zucman-Rossi¹⁵, P. Andrew Futreal¹, Ultan McDermott¹, Peter Lichter²⁴, Matthew Meyerson²⁰, Matthew Meyerson¹⁵, Sean M. Grimmond²³, Reiner Siebert²², Elias Campo²⁸, Tatsuhiro Shibata¹³, Stefan M. Pfister¹¹, Stefan M. Pfister¹⁶, Peter J. Campbell²⁹, Peter J. Campbell², Peter J. Campbell³⁰, Michael R. Stratton², Michael R. Stratton³¹ - Show less +81 more•Institutions (31)

Wellcome Trust Sanger Institute¹, Wellcome Trust², Cambridge University Hospitals NHS Foundation Trust³, University of British Columbia⁴, University of Cambridge⁵, Oslo University Hospital⁶, The Breast Cancer Research Foundation⁷, University of Oslo⁸, University of Münster⁹, Université libre de Bruxelles¹⁰, German Cancer Research Center¹¹, University of Iceland¹², Erasmus University Rotterdam¹³, Paris Descartes University¹⁴, French Institute of Health and Medical Research¹⁵, University of Paris¹⁶, Broad Institute¹⁷, University of Bergen¹⁸, University of Oviedo¹⁹, University of Queensland²⁰, University of Glasgow²¹, Harvard University²², United States Department of Veterans Affairs²³, Netherlands Cancer Institute²⁴, University of Kiel²⁵, Radboud University Nijmegen²⁶, King's College London²⁷, Curie Institute²⁸, Bankstown Lidcombe Hospital²⁹, University of New South Wales³⁰, University of Barcelona³¹

22 Aug 2013-Nature

TL;DR: It is shown that hypermutation localized to small genomic regions, ‘kataegis’, is found in many cancer types, and this results reveal the diversity of mutational processes underlying the development of cancer.

...read moreread less

Abstract: All cancers are caused by somatic mutations; however, understanding of the biological processes generating these mutations is limited. The catalogue of somatic mutations from a cancer genome bears the signatures of the mutational processes that have been operative. Here we analysed 4,938,362 mutations from 7,042 cancers and extracted more than 20 distinct mutational signatures. Some are present in many cancer types, notably a signature attributed to the APOBEC family of cytidine deaminases, whereas others are confined to a single cancer class. Certain signatures are associated with age of the patient at cancer diagnosis, known mutagenic exposures or defects in DNA maintenance, but many are of cryptic origin. In addition to these genome-wide mutational signatures, hypermutation localized to small genomic regions, 'kataegis', is found in many cancer types. The results reveal the diversity of mutational processes underlying the development of cancer, with potential implications for understanding of cancer aetiology, prevention and therapy.

...read moreread less

7,904 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse