Author

Hui Guo

Other affiliations: Sun Yat-sen University, North China University of Science and Technology, University of Georgia

Bio: Hui Guo is an academic researcher at the Plant Genome Mapping Laboratory who has contributed to research on the topics Genome and Gene. The author has an h-index of 22 and has co-authored 39 publications that have received 9,996 citations. Previous affiliations of Hui Guo include Sun Yat-sen University and North China University of Science and Technology.

Topics: Genome, Gene, Quantitative trait locus, Genome evolution, Genomics

Papers

Journal Article

MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity

Yupeng Wang¹, Haibao Tang¹, Jeremy D. DeBarry¹, Xu-fei Tan¹, Jingping Li¹, Xiyin Wang¹, Tae-Ho Lee¹, Huizhe Jin¹, Barry S. Marler¹, Hui Guo¹, Jessica C. Kissinger¹, Andrew H. Paterson¹

Plant Genome Mapping Laboratory¹

01 Apr 2012-Nucleic Acids Research

TL;DR: The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses.

Abstract: MCScan is an algorithm able to scan multiple genomes or subgenomes in order to identify putative homologous chromosomal regions, and align these regions using genes as anchors. The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses. Applications of MCScanX to several sequenced plant genomes and gene families are shown as examples. MCScanX can be used to effectively analyze chromosome structural changes, and reveal the history of gene family expansions that might contribute to the adaptation of lineages and taxa. An integrated view of various modes of gene duplication can supplement the traditional gene tree analysis in specific families. The source code and documentation of MCScanX are freely available at http://chibba.pgml.uga.edu/mcscan2/.

3,388 citations

Journal Article

The tomato genome sequence provides insights into fleshy fruit evolution

Shusei Sato, Satoshi Tabata, Hideki Hirakawa, Erika Asamizu, +320 more (51 institutions)

31 May 2012-Nature

TL;DR: A high-quality genome sequence of domesticated tomato and a draft sequence of its closest wild relative, Solanum pimpinellifolium, are presented, and the two tomato genomes are compared to each other and to the potato genome.

Abstract: Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium, and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness.

2,687 citations

Journal Article

The genome of the mesopolyploid crop species Brassica rapa

Xiaowu Wang¹, Hanzhong Wang, Jun Wang²,³,⁴, Rifei Sun, Jian Wu, Shengyi Liu, Yinqi Bai³, Jeong-Hwan Mun⁵, Ian Bancroft⁶, Feng Cheng, Sanwen Huang, Xixiang Li, Wei Hua, Junyi Wang³, Xiyin Wang⁷,⁸, Michael Freeling⁹, J. Chris Pires¹⁰, Andrew H. Paterson⁷, Boulos Chalhoub, Bo Wang³, Alice Hayward¹¹,¹², Andrew G. Sharpe¹³, Beom-Seok Park⁵, Bernd Weisshaar¹⁴, Binghang Liu³, Bo Li³, Bo Liu, Chaobo Tong, Chi Song³, Chris Duran¹²,¹⁵, Chunfang Peng³, Geng Chunyu³, Chushin Koh¹³, Chuyu Lin³, David Edwards¹²,¹⁵, Desheng Mu³, Di Shen, Eleni Soumpourou⁶, Fei Li, Fiona Fraser⁶, Gavin C. Conant¹⁰, Gilles Lassalle¹⁶, Graham J.W. King⁴, Guusje Bonnema¹⁷, Haibao Tang⁹, Haiping Wang, Harry Belcram, Heling Zhou³, Hideki Hirakawa, Hiroshi Abe, Hui Guo⁷, Hui Wang, Huizhe Jin⁷, Isobel A. P. Parkin¹⁸, Jacqueline Batley¹¹,¹², Jeong-Sun Kim⁵, Jérémy Just, Jianwen Li³, Jiaohui Xu³, Jie Deng, Jin A Kim⁵, Jingping Li⁷, Jingyin Yu, Jinling Meng¹⁹, Jinpeng Wang⁸, Jiumeng Min³, Julie Poulain²⁰, Katsunori Hatakeyama, Kui Wu³, Li Wang⁸, Lu Fang, Martin Trick⁶, Matthew G. Links¹⁸, Meixia Zhao, Mina Jin⁵, Nirala Ramchiary²¹, Nizar Drou²², Paul J. Berkman¹²,¹⁵, Qingle Cai³, Quanfei Huang³, Ruiqiang Li³, Satoshi Tabata, Shifeng Cheng³, Shu Zhang³, Shujiang Zhang, Shunmou Huang, Shusei Sato, Silong Sun, Soo-Jin Kwon⁵, Su-Ryun Choi²¹, Tae-Ho Lee⁷, Wei Fan³, Xiang Zhao³, Xu Tan⁷, Xun Xu³, Yan Wang, Yang Qiu, Ye Yin³, Yingrui Li³, Yongchen Du, Yongcui Liao, Yong Pyo Lim²¹, Yoshihiro Narusaka, Yupeng Wang⁸, Zhenyi Wang⁸, Zhenyu Li³, Zhiwen Wang³, Zhiyong Xiong¹⁰, Zhonghua Zhang

Chinese Academy of Agricultural Sciences¹, University of Copenhagen², Beijing Institute of Genomics³, Rothamsted Research⁴, Rural Development Administration⁵, John Innes Centre⁶, University of Georgia⁷, North China University of Science and Technology⁸, University of California, Berkeley⁹, University of Missouri¹⁰, Australian Research Council¹¹, University of Queensland¹², National Research Council¹³, Bielefeld University¹⁴, Australian Centre for Plant Functional Genomics¹⁵, University of Rennes¹⁶, Wageningen University and Research Centre¹⁷, Agriculture and Agri-Food Canada¹⁸, Huazhong Agricultural University¹⁹, French Alternative Energies and Atomic Energy Commission²⁰, Chungnam National University²¹, Norwich Research Park²²

01 Oct 2011-Nature Genetics

TL;DR: The annotation and analysis of the draft genome sequence of Brassica rapa accession Chiifu-401-42, a Chinese cabbage, are reported, with Arabidopsis thaliana used as an outgroup for investigating the consequences of genome triplication, such as structural and functional evolution.

Abstract: We report the annotation and analysis of the draft genome sequence of Brassica rapa accession Chiifu-401-42, a Chinese cabbage. We modeled 41,174 protein coding genes in the B. rapa genome, which has undergone genome triplication. We used Arabidopsis thaliana as an outgroup for investigating the consequences of genome triplication, such as structural and functional evolution. The extent of gene loss (fractionation) among triplicated genome segments varies, with one of the three copies consistently retaining a disproportionately large fraction of the genes expected to have been present in its ancestor. Variation in the number of members of gene families present in the genome may contribute to the remarkable morphological plasticity of Brassica species. The B. rapa genome sequence provides an important resource for studying the evolution of polyploid genomes and underpins the genetic improvement of Brassica oil and vegetable crops.

1,811 citations

Journal Article

Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome

Boulos Chalhoub¹, Shengyi Liu², Isobel A. P. Parkin³, Haibao Tang⁴,⁵, Xiyin Wang⁶, Julien Chiquet¹, Harry Belcram¹, Chaobo Tong², Birgit Samans⁷, Margot Correa⁸, Corinne Da Silva⁸, Jérémy Just¹, Cyril Falentin⁹, Chu Shin Koh¹⁰, Isabelle Le Clainche¹, Maria Bernard⁸, Pascal Bento⁸, Benjamin Noel⁸, Karine Labadie⁸, Adriana Alberti⁸, Mathieu Charles⁹, Dominique Arnaud¹, Hui Guo⁶, Christian Daviaud, Salman Alamery¹¹, Kamel Jabbari¹,¹², Meixia Zhao¹³, Patrick P. Edger¹⁴, Houda Chelaifa¹, David C. Tack¹⁵, Gilles Lassalle⁹, Imen Mestiri¹, Nicolas Schnel⁹, Marie-Christine Le Paslier⁹, Guangyi Fan, Victor Renault¹⁶, Philippe E. Bayer¹¹, Agnieszka A. Golicz¹¹, Sahana Manoli¹¹, Tae-Ho Lee⁶, Vinh Ha Dinh Thi¹, Smahane Chalabi¹, Qiong Hu², Chuchuan Fan¹⁷, Reece Tollenaere¹¹, Yunhai Lu¹, Christophe Battail⁸, Jinxiong Shen¹⁷, Christine Sidebottom¹⁰, Xinfa Wang², Aurélie Canaguier¹, Aurélie Chauveau⁹, Aurélie Bérard⁹, G. Deniot⁹, Mei Guan¹⁸, Zhongsong Liu¹⁸, Fengming Sun, Yong Pyo Lim¹⁹, Eric Lyons²⁰, Christopher D. Town⁴, Ian Bancroft²¹, Xiaowu Wang, Jinling Meng¹⁷, Jianxin Ma¹³, J. Chris Pires²², Graham J.W. King²³, Dominique Brunel⁹, Régine Delourme⁹, Michel Renard⁹, Jean-Marc Aury⁸, Keith L. Adams¹⁵, Jacqueline Batley¹¹,²⁴, Rod J. Snowdon⁷, Jörg Tost, David Edwards¹¹,²⁴, Yongming Zhou¹⁷, Wei Hua², Andrew G. Sharpe¹⁰, Andrew H. Paterson⁶, Chunyun Guan¹⁸, Patrick Wincker¹,⁸,²⁵

University of Évry Val d'Essonne¹, Crops Research Institute², Agriculture and Agri-Food Canada³, J. Craig Venter Institute⁴, Fujian Agriculture and Forestry University⁵, Plant Genome Mapping Laboratory⁶, University of Giessen⁷, French Alternative Energies and Atomic Energy Commission⁸, Institut national de la recherche agronomique⁹, National Research Council¹⁰, Australian Centre for Plant Functional Genomics¹¹, University of Cologne¹², Purdue University¹³, University of California, Berkeley¹⁴, University of British Columbia¹⁵, Fondation Jean Dausset Centre d'Etude du Polymorphisme Humain¹⁶, Huazhong Agricultural University¹⁷, Hunan Agricultural University¹⁸, Chungnam National University¹⁹, University of Arizona²⁰, University of York²¹, University of Missouri²², Southern Cross University²³, University of Western Australia²⁴, Centre national de la recherche scientifique²⁵

22 Aug 2014-Science

TL;DR: The polyploid genome of Brassica napus, which originated from a combination of two distinct genomes approximately 7500 years ago and gave rise to the oilseed rape crop, is sequenced.

Abstract: Oilseed rape (Brassica napus L.) was formed ~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent An and Cn subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.

1,743 citations

Journal Article

Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres

Andrew H. Paterson¹, Jonathan F. Wendel², Heidrun Gundlach, Hui Guo¹, Jerry Jenkins³, Dianchuan Jin, Danny J. Llewellyn⁴, Kurtis C. Showmaker⁵, Shengqiang Shu³, Joshua A. Udall⁶, Mi-Jeong Yoo², Robert L. Byers⁶, Wei Chen, Adi Doron-Faigenboim, Mary V. Duke⁷, Lei Gong², Jane Grimwood³, Corrinne E. Grover², Kara Grupp², Guanjing Hu², Tae-Ho Lee¹, Jingping Li¹, Lifeng Lin¹, Tao Liu, Barry S. Marler¹, Justin T. Page⁶, Alison W. Roberts⁸, Elisson Romanel⁹, William S. Sanders⁵, Emmanuel Szadkowski², Xu Tan¹, Haibao Tang¹,¹⁰, Chunming Xu²,¹¹, Jinpeng Wang, Zining Wang¹, Dong Zhang¹, Lan Zhang, Hamid Ashrafi¹², Frank Bedon⁴, John E. Bowers¹, Curt L. Brubaker⁴,¹³, Peng W. Chee¹⁴, Sayan Das¹, Alan R. Gingle¹, Candace H. Haigler¹⁵, David B. Harker⁶, Lucia Vieira Hoffmann¹⁶, Ran Hovav, Don C. Jones¹⁷, Cornelia Lemke¹, Shahid Mansoor¹,¹⁸, Mehboob-ur Rahman¹⁸, Lisa N. Rainville¹, Aditi Rambani⁶, Umesh K. Reddy¹⁹, Junkang Rong¹, Yehoshua Saranga²⁰, Brian E. Scheffler⁷, Jodi A. Scheffler⁷, David M. Stelly²¹, Barbara A. Triplett⁷, Allen Van Deynze¹², Maite F S Vaslin⁹, V. N. Waghmare²², Sally A. Walford⁴, Robert J. Wright²³, Essam A. Zaki, Tianzhen Zhang²⁴, Elizabeth S. Dennis⁴, Klaus F. X. Mayer, Daniel G. Peterson⁵, Daniel S. Rokhsar³, Xiyin Wang¹, Jeremy Schmutz³

Plant Genome Mapping Laboratory¹, Iowa State University², Joint Genome Institute³, Commonwealth Scientific and Industrial Research Organisation⁴, Mississippi State University⁵, Brigham Young University⁶, Agricultural Research Service⁷, University of Rhode Island⁸, Federal University of Rio de Janeiro⁹, J. Craig Venter Institute¹⁰, Northeast Normal University¹¹, University of California, Davis¹², Bayer¹³, University of Georgia¹⁴, North Carolina State University¹⁵, Empresa Brasileira de Pesquisa Agropecuária¹⁶, Cotton Incorporated¹⁷, National Institute for Biotechnology and Genetic Engineering¹⁸, West Virginia State University¹⁹, Hebrew University of Jerusalem²⁰, Texas A&M University²¹, Central Institute for Cotton Research²², Texas Tech University²³, Nanjing Agricultural University²⁴

20 Dec 2012-Nature

TL;DR: It is shown that an abrupt five- to sixfold ploidy increase approximately 60 million years (Myr) ago, and allopolyploidy reuniting divergent Gossypium genomes approximately 1–2 Myr ago, conferred about 30–36-fold duplication of ancestral angiosperm genes in elite cottons, genetic complexity equalled only by Brassica among sequenced angiosperms.

Abstract: Polyploidy often confers emergent properties, such as the higher fibre productivity and quality of tetraploid cottons than diploid cottons bred for the same environments. Here we show that an abrupt five- to sixfold ploidy increase approximately 60 million years (Myr) ago, and allopolyploidy reuniting divergent Gossypium genomes approximately 1-2 Myr ago, conferred about 30-36-fold duplication of ancestral angiosperm (flowering plant) genes in elite cottons (Gossypium hirsutum and Gossypium barbadense), genetic complexity equalled only by Brassica among sequenced angiosperms. Nascent fibre evolution, before allopolyploidy, is elucidated by comparison of spinnable-fibred Gossypium herbaceum A and non-spinnable Gossypium longicalyx F genomes to one another and the outgroup D genome of non-spinnable Gossypium raimondii. The sequence of a G. hirsutum A(t)D(t) (in which 't' indicates tetraploid) cultivar reveals many non-reciprocal DNA exchanges between subgenomes that may have contributed to phenotypic innovation and/or other emergent properties such as ecological adaptation by polyploids. Most DNA-level novelty in G. hirsutum recombines alleles from the D-genome progenitor native to its New World habitat and the Old World A-genome progenitor in which spinnable fibre evolved. Coordinated expression changes in proximal groups of functionally distinct genes, including a nuclear mitochondrial DNA block, may account for clusters of cotton-fibre quantitative trait loci affecting diverse traits. Opportunities abound for dissecting emergent properties of other polyploids, particularly angiosperms, by comparison to diploid progenitors and outgroups.

1,015 citations

Cited by

Journal Article

Integrative genomics viewer

James T. Robinson¹, Helga Thorvaldsdottir¹, Wendy Winckler¹, Mitchell Guttman¹, Eric S. Lander¹,², Gad Getz¹, Jill P. Mesirov¹

Massachusetts Institute of Technology¹, Harvard University²

10 Jan 2011-Nature Biotechnology

TL;DR: In this article, the authors address the need for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.

Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

10,798 citations

Integrative Genomics Viewer

James T. Robinson¹, Helga Thorvaldsdottir¹, Wendy Winckler¹, Mitchell Guttman¹, Eric S. Lander¹,², Gad Getz¹, Jill P. Mesirov¹

Massachusetts Institute of Technology¹, Harvard University²

01 Jan 2011

TL;DR: The sheer volume and scope of genome-wide data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.

2,187 citations

Journal Article

Shifting the limits in wheat research and breeding using a fully annotated reference genome

Rudi Appels¹,², Kellye Eversole, Nils Stein³, +204 more (45 institutions)

17 Aug 2018-Science

TL;DR: This annotated reference sequence of wheat is a resource that can now drive disruptive innovation in wheat improvement, as this community resource establishes the foundation for accelerating wheat research and application through improved understanding of wheat biology and genomics-assisted breeding.

Abstract: An annotated reference sequence representing the hexaploid bread wheat genome in 21 pseudomolecules has been analyzed to identify the distribution and genomic context of coding and noncoding elements across the A, B, and D subgenomes. With an estimated coverage of 94% of the genome and containing 107,891 high-confidence gene models, this assembly enabled the discovery of tissue- and developmental stage-related coexpression networks by providing a transcriptome atlas representing major stages of wheat development. Dynamics of complex gene families involved in environmental adaptation and end-use quality were revealed at subgenome resolution and contextualized to known agronomic single-gene or quantitative trait loci. This community resource establishes the foundation for accelerating wheat research and application through improved understanding of wheat biology and genomics-assisted breeding.

2,118 citations

Journal Article

Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome

Boulos Chalhoub¹, Shengyi Liu², Isobel A. P. Parkin³, Haibao Tang⁴,⁵, Xiyin Wang⁶, Julien Chiquet¹, Harry Belcram¹, Chaobo Tong², Birgit Samans⁷, Margot Correa⁸, Corinne Da Silva⁸, Jérémy Just¹, Cyril Falentin⁹, Chu Shin Koh¹⁰, Isabelle Le Clainche¹, Maria Bernard⁸, Pascal Bento⁸, Benjamin Noel⁸, Karine Labadie⁸, Adriana Alberti⁸, Mathieu Charles⁹, Dominique Arnaud¹, Hui Guo⁶, Christian Daviaud, Salman Alamery¹¹, Kamel Jabbari¹,¹², Meixia Zhao¹³, Patrick P. Edger¹⁴, Houda Chelaifa¹, David C. Tack¹⁵, Gilles Lassalle⁹, Imen Mestiri¹, Nicolas Schnel⁹, Marie-Christine Le Paslier⁹, Guangyi Fan, Victor Renault¹⁶, Philippe E. Bayer¹¹, Agnieszka A. Golicz¹¹, Sahana Manoli¹¹, Tae-Ho Lee⁶, Vinh Ha Dinh Thi¹, Smahane Chalabi¹, Qiong Hu², Chuchuan Fan¹⁷, Reece Tollenaere¹¹, Yunhai Lu¹, Christophe Battail⁸, Jinxiong Shen¹⁷, Christine Sidebottom¹⁰, Xinfa Wang², Aurélie Canaguier¹, Aurélie Chauveau⁹, Aurélie Bérard⁹, G. Deniot⁹, Mei Guan¹⁸, Zhongsong Liu¹⁸, Fengming Sun, Yong Pyo Lim¹⁹, Eric Lyons²⁰, Christopher D. Town⁴, Ian Bancroft²¹, Xiaowu Wang, Jinling Meng¹⁷, Jianxin Ma¹³, J. Chris Pires²², Graham J.W. King²³, Dominique Brunel⁹, Régine Delourme⁹, Michel Renard⁹, Jean-Marc Aury⁸, Keith L. Adams¹⁵, Jacqueline Batley¹¹,²⁴, Rod J. Snowdon⁷, Jörg Tost, David Edwards¹¹,²⁴, Yongming Zhou¹⁷, Wei Hua², Andrew G. Sharpe¹⁰, Andrew H. Paterson⁶, Chunyun Guan¹⁸, Patrick Wincker¹,⁸,²⁵

22 Aug 2014-Science

1,743 citations

Automated Eukaryotic Gene Structure Annotation Using EVidenceModeler and the Program to Assemble Spliced Alignments

Brian J. Haas, Steven L. Salzberg, Wei Zhu, Mihaela Pertea, Jonathan E. Allen, Joshua Orvis, Owen White, C R Buell, Jennifer R. Wortman

10 Dec 2007

TL;DR: The experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

Abstract: EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

1,528 citations
