Home
/
Authors
/
Yinlong Xie

Author

Yinlong Xie

Other affiliations: University of Hong Kong, South China University of Technology

Bio: Yinlong Xie is an academic researcher from Beijing Genomics Institute. The author has contributed to research in topics: Sequence assembly & Genomics. The author has an hindex of 19, co-authored 26 publications receiving 20600 citations. Previous affiliations of Yinlong Xie include University of Hong Kong & South China University of Technology.

Topics: Sequence assembly, Genomics, Phylogenomics, RNA-Seq, Genome ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

A human gut microbial gene catalogue established by metagenomic sequencing

[...]

Junjie Qin¹, Ruiqiang Li¹, Jeroen Raes², Manimozhiyan Arumugam, Kristoffer Sølvsten Burgdorf, Chaysavanh Manichanh, Trine Nielsen, Nicolas Pons³, Florence Levenez³, Takuji Yamada, Daniel R. Mende, Junhua Li¹, Junming Xu¹, Shaochuan Li¹, Dongfang Li¹, Jianjun Cao¹, Bo Wang¹, Huiqing Liang¹, Huisong Zheng¹, Yinlong Xie¹, Julien Tap³, Patricia Lepage³, Marcelo Bertalan, Jean-Michel Batto³, Torben Hansen, Denis Le Paslier, Allan Linneberg, H. Bjørn Nielsen, Eric Pelletier, Pierre Renault³, Thomas Sicheritz-Pontén, Keith Turner⁴, Hongmei Zhu¹, Chang Yu¹, Shengting Li¹, Min Jian¹, Yan Zhou¹, Yingrui Li¹, Xiuqing Zhang¹, Songgang Li¹, Nan Qin¹, Huanming Yang¹, Jian Wang¹, Søren Brunak, Joël Doré³, Francisco Guarner⁵, Karsten Kristiansen, Oluf Pedersen, Julian Parkhill, Jean Weissenbach, Peer Bork, S. Dusko Ehrlich³, Jun Wang¹ - Show less +49 more•Institutions (5)

Beijing Genomics Institute¹, Vrije Universiteit Brussel², Institut national de la recherche agronomique³, Wellcome Trust Sanger Institute⁴, Hebron University⁵

04 Mar 2010-Nature

TL;DR: The Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals are described, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species.

...read moreread less

Abstract: To understand the impact of gut microbes on human health and well-being it is crucial to assess their genetic potential. Here we describe the Illumina-based metagenomic sequencing, assembly and characterization of 3.3 million non-redundant microbial genes, derived from 576.7 gigabases of sequence, from faecal samples of 124 European individuals. The gene set, ~150 times larger than the human gene complement, contains an overwhelming majority of the prevalent (more frequent) microbial genes of the cohort and probably includes a large proportion of the prevalent human intestinal microbial genes. The genes are largely shared among individuals of the cohort. Over 99% of the genes are bacterial, indicating that the entire cohort harbours between 1,000 and 1,150 prevalent bacterial species and each individual at least 160 such species, which are also largely shared. We define and describe the minimal gut metagenome and the minimal gut bacterial genome in terms of functions present in all individuals and most bacteria, respectively

...read moreread less

9,268 citations

Journal Article•DOI•

SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler

[...]

Ruibang Luo¹, Binghang Liu¹, Yinlong Xie¹, Yinlong Xie², Zhenyu Li¹, Weihua Huang, Jianying Yuan, Guangzhu He, Yanxiang Chen, Qi Pan, Yunjie Liu, Jingbo Tang, Gengxiong Wu, Hao Zhang, Yujian Shi, Yong Liu, Chang Yu, Bo Wang, Yao Lu, Changlei Han, David W. Cheung¹, Siu-Ming Yiu¹, Shaoliang Peng³, Zhu Xiao-qian³, Guangming Liu³, Xiangke Liao³, Yingrui Li¹, Huanming Yang, Jian Wang, Tak-Wah Lam¹, Jun Wang - Show less +27 more•Institutions (3)

University of Hong Kong¹, South China University of Technology², National University of Defense Technology³

27 Dec 2012-GigaScience

TL;DR: This work provides an updated assembly version of the 2008 Asian genome using SOAPdenovo2, a new algorithm design that reduces memory consumption in graph construction, resolves more repeat regions in contig assembly, increases coverage and length in scaffold construction, improves gap closing, and optimizes for large genome.

...read moreread less

Abstract: There is a rapidly increasing amount of de novo genome assembly using next-generation sequencing (NGS) short reads; however, several big challenges remain to be overcome in order for this to be efficient and accurate. SOAPdenovo has been successfully applied to assemble many published genomes, but it still needs improvement in continuity, accuracy and coverage, especially in repeat regions. To overcome these challenges, we have developed its successor, SOAPdenovo2, which has the advantage of a new algorithm design that reduces memory consumption in graph construction, resolves more repeat regions in contig assembly, increases coverage and length in scaffold construction, improves gap closing, and optimizes for large genome. Benchmark using the Assemblathon1 and GAGE datasets showed that SOAPdenovo2 greatly surpasses its predecessor SOAPdenovo and is competitive to other assemblers on both assembly length and accuracy. We also provide an updated assembly version of the 2008 Asian (YH) genome using SOAPdenovo2. Here, the contig and scaffold N50 of the YH genome were ~20.9 kbp and ~22 Mbp, respectively, which is 3-fold and 50-fold longer than the first published version. The genome coverage increased from 81.16% to 93.91%, and memory consumption was ~2/3 lower during the point of largest memory consumption.

...read moreread less

4,284 citations

A global reference for human genetic variation

[...]

Adam Auton, Gonçalo R. Abecasis, David Altshuler, Richard Durbin +476 more

01 Oct 2015

TL;DR: The 1000 Genomes Project as mentioned in this paper provided a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and reported the completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole genome sequencing, deep exome sequencing and dense microarray genotyping.

...read moreread less

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

...read moreread less

3,247 citations

Journal Article•DOI•

Phylogenomics resolves the timing and pattern of insect evolution

[...]

Bernhard Misof, Shanlin Liu, Karen Meusemann¹, Ralph S. Peters, Alexander Donath, Christoph Mayer, Paul B. Frandsen², Jessica L. Ware², Tomas Flouri³, Rolf G. Beutel⁴, Oliver Niehuis, Malte Petersen, Fernando Izquierdo-Carrasco³, Torsten Wappler⁵, Jes Rust⁵, Andre J. Aberer³, Ulrike Aspöck⁶, Ulrike Aspöck⁷, Horst Aspöck⁶, Daniela Bartel⁶, Alexander Blanke⁸, Simon Berger³, Alexander Böhm⁶, Thomas R. Buckley⁹, Brett Calcott¹⁰, Junqing Chen, Frank Friedrich¹¹, Makiko Fukui¹², Mari Fujita⁸, Carola Greve, Peter Grobe, Shengchang Gu, Ying Huang, Lars S. Jermiin¹, Akito Y. Kawahara¹³, Lars Krogmann¹⁴, Martin Kubiak¹¹, Robert Lanfear¹⁵, Robert Lanfear¹⁶, Robert Lanfear¹⁷, Harald Letsch⁶, Yiyuan Li, Zhenyu Li, Jiguang Li, Haorong Lu, Ryuichiro Machida⁸, Yuta Mashimo⁸, Pashalia Kapli¹⁸, Pashalia Kapli³, Duane D. McKenna¹⁹, Guanliang Meng, Yasutaka Nakagaki⁸, José Luis Navarrete-Heredia²⁰, Michael Ott²¹, Yanxiang Ou, Günther Pass⁶, Lars Podsiadlowski⁵, Hans Pohl⁴, Björn M. von Reumont²², Kai Schütte¹¹, Kaoru Sekiya⁸, Shota Shimizu⁸, Adam Slipinski¹, Alexandros Stamatakis³, Alexandros Stamatakis²³, Wenhui Song, Xu Su, Nikolaus U. Szucsich⁶, Meihua Tan, Xuemei Tan, Min Tang, Jingbo Tang, Gerald Timelthaler⁶, Shigekazu Tomizuka⁸, Michelle D. Trautwein²⁴, Xiaoli Tong²⁵, Toshiki Uchifune⁸, Manfred Walzl⁶, Brian M. Wiegmann²⁶, Jeanne Wilbrandt, Benjamin Wipfler⁴, Thomas K. F. Wong¹, Qiong Wu, Gengxiong Wu, Yinlong Xie, Shenzhou Yang, Qing Yang, David K. Yeates¹, Kazunori Yoshizawa²⁷, Qing Zhang, Rui Zhang, Wenwei Zhang, Yunhui Zhang, Jing Zhao, Chengran Zhou, Lili Zhou, Tanja Ziesmann, Shijie Zou, Yingrui Li, Xun Xu, Yong Zhang, Huanming Yang, Jian Wang, Jun Wang, Karl M. Kjer², Xin Zhou - Show less +102 more•Institutions (27)

Commonwealth Scientific and Industrial Research Organisation¹, Rutgers University², Heidelberg Institute for Theoretical Studies³, University of Jena⁴, University of Bonn⁵, University of Vienna⁶, Naturhistorisches Museum⁷, University of Tsukuba⁸, Landcare Research⁹, Johns Hopkins University¹⁰, University of Hamburg¹¹, Ehime University¹², Florida Museum of Natural History¹³, Staatliches Museum für Naturkunde Stuttgart¹⁴, Australian National University¹⁵, Macquarie University¹⁶, National Evolutionary Synthesis Center¹⁷, American Museum of Natural History¹⁸, University of Memphis¹⁹, University of Guadalajara²⁰, Bavarian Academy of Sciences and Humanities²¹, Natural History Museum²², Karlsruhe Institute of Technology²³, California Academy of Sciences²⁴, South China Agricultural University²⁵, North Carolina State University²⁶, Hokkaido University²⁷

07 Nov 2014-Science

TL;DR: The phylogeny of all major insect lineages reveals how and when insects diversified and provides a comprehensive reliable scaffold for future comparative analyses of evolutionary innovations among insects.

...read moreread less

Abstract: Insects are the most speciose group of animals, but the phylogenetic relationships of many major lineages remain unresolved. We inferred the phylogeny of insects from 1478 protein-coding genes. Phylogenomic analyses of nucleotide and amino acid sequences, with site-specific nucleotide or domain-specific amino acid substitution models, produced statistically robust and congruent results resolving previously controversial phylogenetic relations hips. We dated the origin of insects to the Early Ordovician [~479 million years ago (Ma)], of insect flight to the Early Devonian (~406 Ma), of major extant lineages to the Mississippian (~345 Ma), and the major diversification of holometabolous insects to the Early Cretaceous. Our phylogenomic study provides a comprehensive reliable scaffold for future comparative analyses of evolutionary innovations among insects.

...read moreread less

1,998 citations

Journal Article•DOI•

The oyster genome reveals stress adaptation and complexity of shell formation

[...]

Guofan Zhang¹, Xiaodong Fang, Ximing Guo², Li Li, Ruibang Luo, Fei Xu, Pengcheng Yang, Linlin Zhang, Xiaotong Wang, Haigang Qi, Zhiqiang Xiong, Huayong Que, Yinlong Xie, Peter W. H. Holland³, Jordi Paps³, Yabing Zhu, Fucun Wu, Yuanxin Chen, Jiafeng Wang, Chunfang Peng, Jie Meng, Lan Yang, Jun Liu, Bo Wen, Na Zhang, Zhiyong Huang, Qihui Zhu, Yue Feng, Andrew S. Mount⁴, Dennis Hedgecock⁵, Zhe Xu⁶, Yunjie Liu, Tomislav Domazet-Lošo, Yishuai Du, Xiaoqing Sun, Shoudu Zhang, Binghang Liu, Peizhou Cheng, Xuanting Jiang, Juan Li, Dingding Fan, Wei Wang, Wenjing Fu, Tong Wang, Bo Wang, Jibiao Zhang, Zhiyu Peng, Yingxiang Li, Na Li, Jinpeng Wang, Maoshan Chen, Yan He², Fengji Tan, Xiaorui Song, Qiumei Zheng, Ronglian Huang, Hailong Yang, Du Xuedi, Li Chen, Mei Yang, Patrick M. Gaffney⁷, Shan Wang², Longhai Luo, Zhicai She, Yao Ming, Huang Wen, Shu Zhang, Baoyu Huang, Yong Zhang, Tao Qu, Peixiang Ni, Guoying Miao, Junyi Wang, Qiang Wang, Christian E. W. Steinberg⁸, Haiyan Wang, Ning Li, Lumin Qian², Guojie Zhang, Yingrui Li, Huanming Yang, Xiao Liu, Jian Wang, Ye Yin, Jun Wang⁹ - Show less +81 more•Institutions (9)

Chinese Academy of Sciences¹, Rutgers University², University of Oxford³, Clemson University⁴, University of Southern California⁵, Atlantic Cape Community College⁶, University of Delaware⁷, Humboldt University of Berlin⁸, University of Copenhagen⁹

04 Oct 2012-Nature

TL;DR: The sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy and transcriptomes of development and stress response and the proteome of the shell are reported, showing that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes.

...read moreread less

Abstract: The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa.

...read moreread less

1,806 citations

1
2
3
4
…
5
6

Collapse

Cited by

PDF

Open Access

More filters

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

[...]

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

...read moreread less

10,124 citations

Journal Article•DOI•

Structure, function and diversity of the healthy human microbiome

[...]

Curtis Huttenhower¹, Curtis Huttenhower², Dirk Gevers², Rob Knight³ +250 more•Institutions (42)

14 Jun 2012-Nature

TL;DR: The Human Microbiome Project Consortium reported the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome as discussed by the authors.

...read moreread less

Abstract: The Human Microbiome Project Consortium reports the first results of their analysis of microbial communities from distinct, clinically relevant body habitats in a human cohort; the insights into the microbial communities of a healthy population lay foundations for future exploration of the epidemiology, ecology and translational applications of the human microbiome.

...read moreread less

8,410 citations

Journal Article•

Structure, function and diversity of the healthy human microbiome

[...]

Curtis Huttenhower, Dirk Gevers, Rob Knight, Sahar Abubucker +244 more

01 Jun 2012-PubMed Central

TL;DR: The Human Microbiome Project has analysed the largest cohort and set of distinct, clinically relevant body habitats so far, finding the diversity and abundance of each habitat’s signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals.

...read moreread less

Abstract: Studies of the human microbiome have revealed that even healthy individuals differ remarkably in the microbes that occupy habitats such as the gut, skin and vagina. Much of this diversity remains unexplained, although diet, environment, host genetics and early microbial exposure have all been implicated. Accordingly, to characterize the ecology of human-associated microbial communities, the Human Microbiome Project has analysed the largest cohort and set of distinct, clinically relevant body habitats so far. We found the diversity and abundance of each habitat’s signature microbes to vary widely even among healthy subjects, with strong niche specialization both within and among individuals. The project encountered an estimated 81–99% of the genera, enzyme families and community configurations occupied by the healthy Western microbiome. Metagenomic carriage of metabolic pathways was stable among individuals despite variation in community structure, and ethnic/racial background proved to be one of the strongest associations of both pathways and microbes with clinical metadata. These results thus delineate the range of structural and functional configurations normal in the microbial communities of a healthy population, enabling future characterization of the epidemiology, ecology and translational applications of the human microbiome.

...read moreread less

6,350 citations

Journal Article•DOI•

Human gut microbiome viewed across age and geography

[...]

Tanya Yatsunenko¹, Federico E. Rey¹, Mark J. Manary², Mark J. Manary¹, Indi Trehan¹, Indi Trehan², Maria Gloria Dominguez-Bello³, Monica Contreras⁴, Magda Magris, Glida Hidalgo, Robert N. Baldassano⁵, Andrey P. Anokhin¹, Andrew C. Heath¹, Barbara B. Warner¹, Jens Reeder⁶, Justin Kuczynski⁶, J. Gregory Caporaso⁷, Catherine A. Lozupone⁶, Christian L. Lauber⁶, Jose C. Clemente⁶, Dan Knights⁶, Rob Knight⁶, Jeffrey I. Gordon¹ - Show less +19 more•Institutions (7)

Washington University in St. Louis¹, University of Malawi², University of Puerto Rico³, Venezuelan Institute for Scientific Research⁴, University of Pennsylvania⁵, University of Colorado Boulder⁶, Northern Arizona University⁷

14 Jun 2012-Nature

TL;DR: The need to consider the microbiome when evaluating human development, nutritional needs, physiological variations and the impact of westernization is underscored, as distinctive features of the functional maturation of the gut microbiome are evident in early infancy as well as adulthood.

...read moreread less

Abstract: Gut microbial communities represent one source of human genetic and metabolic diversity. To examine how gut microbiomes differ among human populations, here we characterize bacterial species in fecal samples from 531 individuals, plus the gene content of 110 of them. The cohort encompassed healthy children and adults from the Amazonas of Venezuela, rural Malawi and US metropolitan areas and included mono- and dizygotic twins. Shared features of the functional maturation of the gut microbiome were identified during the first three years of life in all three populations, including age-associated changes in the genes involved in vitamin biosynthesis and metabolism. Pronounced differences in bacterial assemblages and functional gene repertoires were noted between US residents and those in the other two countries. These distinctive features are evident in early infancy as well as adulthood. Our findings underscore the need to consider the microbiome when evaluating human development, nutritional needs, physiological variations and the impact of westernization.

...read moreread less

6,047 citations

Journal Article•DOI•

Cd-hit

[...]

Limin Fu¹, Beifang Niu¹, Zhengwei Zhu¹, Sitao Wu¹, Weizhong Li¹ - Show less +1 more•Institutions (1)

University of California, San Diego¹

01 Dec 2012-Bioinformatics

TL;DR: A new CD-HIT program accelerated with a novel parallelization strategy and some other techniques to allow efficient clustering of such datasets to reduce sequence redundancy and improve the performance of other sequence analyses is developed.

...read moreread less

Abstract: Summary: CD-HIT is a widely used program for clustering biological sequences to reduce sequence redundancy and improve the performance of other sequence analyses. In response to the rapid increase in the amount of sequencing data produced by the next-generation sequencing technologies, we have developed a new CD-HIT program accelerated with a novel parallelization strategy and some other techniques to allow efficient clustering of such datasets. Our tests demonstrated very good speedup derived from the parallelization for up to ~24 cores and a quasi-linear speedup for up to ~8 cores. The enhanced CD-HIT is capable of handling very large datasets in much shorter time than previous versions. Availability: http://cd-hit.org. Contact: [email protected] Supplementary information:Supplementary data are available at Bioinformatics online.

...read moreread less

5,959 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse