Home
/
Authors
/
Xin Zhou

Author

Xin Zhou

Other affiliations: Beijing Genomics Institute, Rutgers University, Chinese Academy of Sciences ...read more

Bio: Xin Zhou is an academic researcher from China Agricultural University. The author has contributed to research in topics: DNA barcoding & Phylogenetic tree. The author has an hindex of 44, co-authored 140 publications receiving 9603 citations. Previous affiliations of Xin Zhou include Beijing Genomics Institute & Rutgers University.

Topics: DNA barcoding, Phylogenetic tree, Genome, Mitochondrial DNA, Metagenomics ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2007
1995

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Phylogenomics resolves the timing and pattern of insect evolution

[...]

Bernhard Misof, Shanlin Liu, Karen Meusemann¹, Ralph S. Peters, Alexander Donath, Christoph Mayer, Paul B. Frandsen², Jessica L. Ware², Tomas Flouri³, Rolf G. Beutel⁴, Oliver Niehuis, Malte Petersen, Fernando Izquierdo-Carrasco³, Torsten Wappler⁵, Jes Rust⁵, Andre J. Aberer³, Ulrike Aspöck⁶, Ulrike Aspöck⁷, Horst Aspöck⁷, Daniela Bartel⁷, Alexander Blanke⁸, Simon Berger³, Alexander Böhm⁷, Thomas R. Buckley⁹, Brett Calcott¹⁰, Junqing Chen, Frank Friedrich¹¹, Makiko Fukui¹², Mari Fujita⁸, Carola Greve, Peter Grobe, Shengchang Gu, Ying Huang, Lars S. Jermiin¹, Akito Y. Kawahara¹³, Lars Krogmann¹⁴, Martin Kubiak¹¹, Robert Lanfear¹⁵, Robert Lanfear¹⁶, Robert Lanfear¹⁷, Harald Letsch⁷, Yiyuan Li, Zhenyu Li, Jiguang Li, Haorong Lu, Ryuichiro Machida⁸, Yuta Mashimo⁸, Pashalia Kapli³, Pashalia Kapli¹⁸, Duane D. McKenna¹⁹, Guanliang Meng, Yasutaka Nakagaki⁸, José Luis Navarrete-Heredia²⁰, Michael Ott²¹, Yanxiang Ou, Günther Pass⁷, Lars Podsiadlowski⁵, Hans Pohl⁴, Björn M. von Reumont²², Kai Schütte¹¹, Kaoru Sekiya⁸, Shota Shimizu⁸, Adam Slipinski¹, Alexandros Stamatakis³, Alexandros Stamatakis²³, Wenhui Song, Xu Su, Nikolaus U. Szucsich⁷, Meihua Tan, Xuemei Tan, Min Tang, Jingbo Tang, Gerald Timelthaler⁷, Shigekazu Tomizuka⁸, Michelle D. Trautwein²⁴, Xiaoli Tong²⁵, Toshiki Uchifune⁸, Manfred Walzl⁷, Brian M. Wiegmann²⁶, Jeanne Wilbrandt, Benjamin Wipfler⁴, Thomas K. F. Wong¹, Qiong Wu, Gengxiong Wu, Yinlong Xie, Shenzhou Yang, Qing Yang, David K. Yeates¹, Kazunori Yoshizawa²⁷, Qing Zhang, Rui Zhang, Wenwei Zhang, Yunhui Zhang, Jing Zhao, Chengran Zhou, Lili Zhou, Tanja Ziesmann, Shijie Zou, Yingrui Li, Xun Xu, Yong Zhang, Huanming Yang, Jian Wang, Jun Wang, Karl M. Kjer², Xin Zhou - Show less +102 more•Institutions (27)

Commonwealth Scientific and Industrial Research Organisation¹, Rutgers University², Heidelberg Institute for Theoretical Studies³, University of Jena⁴, University of Bonn⁵, Naturhistorisches Museum⁶, University of Vienna⁷, University of Tsukuba⁸, Landcare Research⁹, Johns Hopkins University¹⁰, University of Hamburg¹¹, Ehime University¹², Florida Museum of Natural History¹³, Staatliches Museum für Naturkunde Stuttgart¹⁴, Australian National University¹⁵, National Evolutionary Synthesis Center¹⁶, Macquarie University¹⁷, American Museum of Natural History¹⁸, University of Memphis¹⁹, University of Guadalajara²⁰, Bavarian Academy of Sciences and Humanities²¹, Natural History Museum²², Karlsruhe Institute of Technology²³, California Academy of Sciences²⁴, South China Agricultural University²⁵, North Carolina State University²⁶, Hokkaido University²⁷

07 Nov 2014-Science

TL;DR: The phylogeny of all major insect lineages reveals how and when insects diversified and provides a comprehensive reliable scaffold for future comparative analyses of evolutionary innovations among insects.

...read moreread less

Abstract: Insects are the most speciose group of animals, but the phylogenetic relationships of many major lineages remain unresolved. We inferred the phylogeny of insects from 1478 protein-coding genes. Phylogenomic analyses of nucleotide and amino acid sequences, with site-specific nucleotide or domain-specific amino acid substitution models, produced statistically robust and congruent results resolving previously controversial phylogenetic relations hips. We dated the origin of insects to the Early Ordovician [~479 million years ago (Ma)], of insect flight to the Early Devonian (~406 Ma), of major extant lineages to the Mississippian (~345 Ma), and the major diversification of holometabolous insects to the Early Cretaceous. Our phylogenomic study provides a comprehensive reliable scaffold for future comparative analyses of evolutionary innovations among insects.

...read moreread less

1,998 citations

Journal Article•DOI•

SOAPdenovo-Trans: de novo transcriptome assembly with short RNA-Seq reads

[...]

Yinlong Xie¹, Yinlong Xie², Gengxiong Wu, Jingbo Tang³, Ruibang Luo², Jordan Patterson⁴, Shanlin Liu, Weihua Huang, Guangzhu He, Shengchang Gu, Shengkang Li, Xin Zhou, Tak-Wah Lam², Yingrui Li, Xun Xu, Gane Ka-Shu Wong⁴, Jun Wang - Show less +13 more•Institutions (4)

South China University of Technology¹, University of Hong Kong², Central South University³, University of Alberta⁴

15 Jun 2014-Bioinformatics

TL;DR: The conclusion is that SOAPdenovo-Trans provides higher contiguity, lower redundancy and faster execution, compared with two other popular transcriptome assemblers.

...read moreread less

Abstract: Motivation: Transcriptome sequencing has long been the favored method for quickly and inexpensively obtaining a large number of gene sequences from an organism with no reference genome. Owing to the rapid increase in throughputs and decrease in costs of next- generation sequencing, RNA-Seq in particular has become the method of choice. However, the very short reads (e.g. 2 � 90 bp paired ends) from next generation sequencing makes de novo assembly to recover complete or full-length transcript sequences an algorithmic challenge. Results: Here, we present SOAPdenovo-Trans, a de novo transcriptome assembler designed specifically for RNA-Seq. We evaluated its performance on transcriptome datasets from rice and mouse. Using as our benchmarks the known transcripts from these wellannotated genomes (sequenced a decade ago), we assessed how SOAPdenovo-Trans and two other popular transcriptome assemblers handled such practical issues as alternative splicing and variable expression levels. Our conclusion is that SOAPdenovo-Trans provides higher contiguity, lower redundancy and faster execution. Availability and implementation: Source code and user manual are available at http://sourceforge.net/projects/soapdenovotrans/. Contact: xieyl@genomics.cn or bgi-soap@googlegroups.com Supplementary information: Supplementary data are available at Bioinformatics online.

...read moreread less

730 citations

Posted Content•

SOAPdenovo-Trans: De novo transcriptome assembly with short RNA-Seq reads

[...]

Yinlong Xie¹, Yinlong Xie², Gengxiong Wu, Jingbo Tang³, Ruibang Luo¹, Jordan Patterson⁴, Shanlin Liu, Weihua Huang, Guangzhu He, Shengchang Gu, Shengkang Li, Xin Zhou, Tak-Wah Lam¹, Yingrui Li, Xun Xu, Gane Ka-Shu Wong⁴, Jun Wang - Show less +13 more•Institutions (4)

University of Hong Kong¹, South China University of Technology², Central South University³, University of Alberta⁴

29 May 2013-arXiv: Genomics

TL;DR: SOAPdenovo-Trans as mentioned in this paper is a de novo transcriptome assembler designed specifically for RNA-Seq that provides higher contiguity, lower redundancy, and faster execution.

...read moreread less

Abstract: Motivation: Transcriptome sequencing has long been the favored method for quickly and inexpensively obtaining the sequences for a large number of genes from an organism with no reference genome. With the rapidly increasing throughputs and decreasing costs of next generation sequencing, RNA-Seq has gained in popularity; but given the typically short reads (e.g. 2 x 90 bp paired ends) of this technol- ogy, de novo assembly to recover complete or full-length transcript sequences remains an algorithmic challenge. Results: We present SOAPdenovo-Trans, a de novo transcriptome assembler designed specifically for RNA-Seq. Its performance was evaluated on transcriptome datasets from rice and mouse. Using the known transcripts from these well-annotated genomes (sequenced a decade ago) as our benchmark, we assessed how SOAPdenovo- Trans and two other popular software handle the practical issues of alternative splicing and variable expression levels. Our conclusion is that SOAPdenovo-Trans provides higher contiguity, lower redundancy, and faster execution. Availability and Implementation: Source code and user manual are at this http URL Contact: xieyl@genomics.cn or bgi-soap@googlegroups.com

...read moreread less

615 citations

Journal Article•DOI•

Evolutionary History of the Hymenoptera

[...]

Ralph S. Peters, Lars Krogmann¹, Christoph Mayer, Alexander Donath, Simon Gunkel, Karen Meusemann², Alexey M. Kozlov³, Lars Podsiadlowski⁴, Malte Petersen, Robert Lanfear⁵, Patricia A. Diez⁶, John M. Heraty⁷, Karl M. Kjer⁸, Seraina Klopfstein⁹, Rudolf Meier¹⁰, Carlo Polidori¹¹, Thomas Schmitt¹², Shanlin Liu¹³, Xin Zhou¹⁴, Torsten Wappler, Jes Rust, Bernhard Misof, Oliver Niehuis², Oliver Niehuis¹⁵ - Show less +20 more•Institutions (15)

Staatliches Museum für Naturkunde Stuttgart¹, University of Freiburg², Heidelberg Institute for Theoretical Studies³, University of Bonn⁴, Australian National University⁵, National Scientific and Technical Research Council⁶, University of California, Riverside⁷, Rutgers University⁸, Naturhistorisches Museum⁹, National University of Singapore¹⁰, University of Castilla–La Mancha¹¹, University of Würzburg¹², University of Copenhagen¹³, China Agricultural University¹⁴, Arizona State University¹⁵

03 Apr 2017-Current Biology

TL;DR: The results reveal that the extant sawfly diversity is largely the result of a previously unrecognized major radiation of phytophagous Hymenoptera that did not lead to wood-dwelling and parasitoidism.

...read moreread less

549 citations

Journal Article•DOI•

Corrigendum: Genome-wide adaptive complexes to underground stresses in blind mole rats Spalax.

[...]

Xiaodong Fang, Eviatar Nevo, Lijuan Han, Erez Y. Levanon, Jing Zhao, Aaron Avivi, Denis M. Larkin, Xuanting Jiang, Sergey Feranchuk, Yabing Zhu, Alla Fishman, Yue Feng, Noa Sher, Zhiqiang Xiong, Thomas Hankeln, Zhiyong Huang, Vera Gorbunova, Lu Zhang, Wei Zhao, Derek E. Wildman¹, Derek E. Wildman², Yingqi Xiong, Andrei V. Gudkov, Qiumei Zheng, Gideon Rechavi, Sanyang Liu, Lily Bazak, Jie Chen², Jie Chen¹, Binyamin A. Knisbacher, Yao Lu, Imad Shams, Krzysztof Gajda, Marta Farré, Jaebum Kim, Harris A. Lewin, Jian Ma, Mark Band, Anne Bicker, Angela Kranz, Tobias Mattheus, Hanno Schmidt, Andrei Seluanov, Jorge Azpurua, Michael R. McGowen, Eshel Ben Jacob, Kexin Li, Shaoliang Peng, Xiaoqian Zhu, Xiangke Liao, Shuai Cheng Li, Anders Krogh, Xin Zhou, Leonid Brodsky, Jun Wang - Show less +51 more•Institutions (2)

University of Illinois at Urbana–Champaign¹, Illinois College²

12 Aug 2015-Nature Communications

TL;DR: In this paper, a Genome-Wide Adaptive Complex to underground stresses in blind mole rats Spalax is described, where the adaptive complexes are based on adaptive complexes to underground stress.

...read moreread less

Abstract: Corrigendum: Genome-wide adaptive complexes to underground stresses in blind mole rats Spalax

...read moreread less

527 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•

Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

[...]

Fumio Tajima¹•Institutions (1)

Kyushu University¹

30 Oct 1989-Genomics

TL;DR: It is suggested that the natural selection against large insertion/deletion is so weak that a large amount of variation is maintained in a population.

...read moreread less

11,521 citations

SPAdes, a new genome assembly algorithm and its applications to single-cell sequencing ( 7th Annual SFAF Meeting, 2012)

[...]

Glenn Tesler

01 Jun 2012

TL;DR: SPAdes as mentioned in this paper is a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler and on popular assemblers Velvet and SoapDeNovo (for multicell data).

...read moreread less

Abstract: The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online ( http://bioinf.spbau.ru/spades ). It is distributed as open source software.

...read moreread less

10,124 citations

Journal Article•DOI•

ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R.

[...]

Emmanuel Paradis¹, Klaus Schliep²•Institutions (2)

Centre national de la recherche scientifique¹, University of Massachusetts Boston²

01 Feb 2019-Bioinformatics

TL;DR: Efforts have been put to improve efficiency, flexibility, support for 'big data' (R's long vectors), ease of use and quality check before a new release of ape.

...read moreread less

Abstract: Summary After more than fifteen years of existence, the R package ape has continuously grown its contents, and has been used by a growing community of users The release of version 50 has marked a leap towards a modern software for evolutionary analyses Efforts have been put to improve efficiency, flexibility, support for 'big data' (R's long vectors), ease of use and quality check before a new release These changes will hopefully make ape a useful software for the study of biodiversity and evolution in a context of increasing data quantity Availability and implementation ape is distributed through the Comprehensive R Archive Network: http://cranr-projectorg/package=ape Further information may be found at http://ape-packageirdfr/

...read moreread less

4,303 citations

Journal Article•DOI•

Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation

[...]

Nuala A. O'Leary¹, Mathew W. Wright¹, J. Rodney Brister¹, Stacy Ciufo¹, Diana Haddad¹, Richard McVeigh¹, Bhanu Rajput¹, Barbara Robbertse¹, Brian Smith-White¹, Danso Ako-adjei¹, Alexander Astashyn¹, Azat Badretdin¹, Yiming Bao¹, Olga Blinkova¹, Vyacheslav Brover¹, Vyacheslav Chetvernin¹, Jinna Choi¹, Eric Cox¹, Olga Ermolaeva¹, Catherine M. Farrell¹, Tamara Goldfarb¹, Tripti Gupta¹, Daniel H. Haft¹, Eneida L. Hatcher¹, Wratko Hlavina¹, Vinita Joardar¹, Vamsi K. Kodali¹, Wenjun Li¹, Donna Maglott¹, Patrick Masterson¹, Kelly M. McGarvey¹, Michael R. Murphy¹, Kathleen O'Neill¹, Shashikant Pujar¹, Sanjida H. Rangwala¹, Daniel Rausch¹, Lillian D. Riddick¹, Conrad L. Schoch¹, Andrei Shkeda¹, Susan S. Storz¹, Hanzhen Sun¹, Françoise Thibaud-Nissen¹, Igor Tolstoy¹, Raymond E. Tully¹, Anjana R. Vatsan¹, Craig Wallin¹, David Webb¹, Wendy Wu¹, Melissa J. Landrum¹, Avi Kimchi¹, Tatiana Tatusova¹, Michael DiCuccio¹, Paul Kitts¹, Terence Murphy¹, Kim D. Pruitt¹ - Show less +51 more•Institutions (1)

National Institutes of Health¹

04 Jan 2016-Nucleic Acids Research

TL;DR: The approach to utilizing available RNA-Seq and other data types in the authors' manual curation process for vertebrate, plant, and other species is summarized, and a new direction for prokaryotic genomes and protein name management is described.

...read moreread less

Abstract: The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.

...read moreread less

4,104 citations

Journal Article•DOI•

Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown

[...]

Mihaela Pertea¹, Daehwan Kim¹, Geo Pertea¹, Jeffrey T. Leek¹, Steven L. Salzberg¹ - Show less +1 more•Institutions (1)

Johns Hopkins University¹

01 Sep 2016-Nature Protocols

TL;DR: This protocol describes all the steps necessary to process a large set of raw sequencing reads and create lists of gene transcripts, expression levels, and differentially expressed genes and transcripts.

...read moreread less

Abstract: High-throughput sequencing of mRNA (RNA-seq) has become the standard method for measuring and comparing the levels of gene expression in a wide variety of species and conditions. RNA-seq experiments generate very large, complex data sets that demand fast, accurate and flexible software to reduce the raw read data to comprehensible results. HISAT (hierarchical indexing for spliced alignment of transcripts), StringTie and Ballgown are free, open-source software tools for comprehensive analysis of RNA-seq experiments. Together, they allow scientists to align reads to a genome, assemble transcripts including novel splice variants, compute the abundance of these transcripts in each sample and compare experiments to identify differentially expressed genes and transcripts. This protocol describes all the steps necessary to process a large set of raw sequencing reads and create lists of gene transcripts, expression levels, and differentially expressed genes and transcripts. The protocol's execution time depends on the computing resources, but it typically takes under 45 min of computer time. HISAT, StringTie and Ballgown are available from http://ccb.jhu.edu/software.shtml.

...read moreread less

3,755 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse