scispace - formally typeset
Search or ask a question
Author

Wei Zhu

Bio: Wei Zhu is an academic researcher from MedImmune. The author has contributed to research in topics: Genome & Cancer. The author has an hindex of 33, co-authored 67 publications receiving 6588 citations. Previous affiliations of Wei Zhu include Iowa State University & Research Medical Center.


Papers
More filters
Journal ArticleDOI
TL;DR: In this article, an automated eukaryotic gene structure annotation tool, EVM, is presented as a weighted consensus of all available evidence, combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein coding genes and alternatively spliced isoforms.
Abstract: EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

1,996 citations

10 Dec 2007
TL;DR: The experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.
Abstract: EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

1,528 citations

Journal ArticleDOI
TL;DR: Through incorporation of multiple transcript and proteomic expression data sets, the Institute for Genomic Research has been able to annotate 24 799 genes (31 739 gene models), representing ∼50% of the total gene models, as expressed in the rice genome.
Abstract: In The Institute for Genomic Research Rice Genome Annotation project (http://rice.tigr.org), we have continued to update the rice genome sequence with new data and improve the quality of the annotation. In our current release of annotation (Release 4.0; January 12, 2006), we have identified 42,653 non-transposable element-related genes encoding 49,472 gene models as a result of the detection of alternative splicing. We have refined our identification methods for transposable element-related genes resulting in 13,237 genes that are related to transposable elements. Through incorporation of multiple transcript and proteomic expression data sets, we have been able to annotate 24 799 genes (31,739 gene models), representing approximately 50% of the total gene models, as expressed in the rice genome. All structural and functional annotation is viewable through our Rice Genome Browser which currently supports 59 tracks. Enhanced data access is available through web interfaces, FTP downloads and a Data Extractor tool developed in order to support discrete dataset downloads.

1,117 citations

Journal ArticleDOI
TL;DR: The results indicate that the type I IFN pathway is activated in patient subsets of five rheumatic diseases and suggest that these subsets may benefit from anti-IFN therapy.
Abstract: Objective To characterise activation of the type I interferon (IFN) pathway in patients with systemic lupus erythematosus (SLE), dermatomyositis (DM), polymyositis (PM), rheumatoid arthritis (RA) and systemic scleroderma (SSc) and to evaluate the potential to develop a molecular diagnostic tool from the peripheral blood that reflects this activation in disease-affected tissues. Methods Overexpressed transcripts were identified in the whole blood (WB) of 262 patients with SLE, 44 with DM, 33 with PM, 28 with SSc and 89 with RA and compared with 24 healthy subjects using Affymetrix microarrays. A five gene type I IFN signature was assessed in these subjects to identify subpopulations showing both activation and concordance of the type I IFN pathway in the peripheral blood and disease-affected tissues of each disease and to correlate activation of this pathway in the WB with clinical measurements. Results A common set of 36 type I IFN inducible transcripts were identified among the most overexpressed in the WB of all subjects. Significant activation of the type I IFN pathway in subgroups of each of the five diseases studied was observed. Baseline disease activity measurements correlated with a type I IFN gene signature in the WB of subjects with SLE, PM and SSc, as did various serum autoantibody levels in subjects with SLE and DM. This signature was also well correlated between disease-affected tissue and WB in subjects with SLE, DM, PM and SSc. Conclusions The results indicate that the type I IFN pathway is activated in patient subsets of five rheumatic diseases and suggest that these subsets may benefit from anti-IFN therapy.

330 citations

Journal ArticleDOI
TL;DR: A series of functional data types has been annotated for the rice genome that includes alignment with genetic markers, assignment of gene ontologies, identification of flanking sequence tags, alignment with homologs from related species, and syntenic mapping with other cereal species.
Abstract: We have developed a rice (Oryza sativa) genome annotation database (Osa1) that provides structural and functional annotation for this emerging model species. Using the sequence of O. sativa subsp. japonica cv Nipponbare from the International Rice Genome Sequencing Project, pseudomolecules, or virtual contigs, of the 12 rice chromosomes were constructed. Our most recent release, version 3, represents our third build of the pseudomolecules and is composed of 98% finished sequence. Genes were identified using a series of computational methods developed for Arabidopsis (Arabidopsis thaliana) that were modified for use with the rice genome. In release 3 of our annotation, we identified 57,915 genes, of which 14,196 are related to transposable elements. Of these 43,719 nontransposable element-related genes, 18,545 (42.4%) were annotated with a putative function, 5,777 (13.2%) were annotated as encoding an expressed protein with no known function, and the remaining 19,397 (44.4%) were annotated as encoding a hypothetical protein. Multiple splice forms (5,873) were detected for 2,538 genes, resulting in a total of 61,250 gene models in the rice genome. We incorporated experimental evidence into 18,252 gene models to improve the quality of the structural annotation. A series of functional data types has been annotated for the rice genome that includes alignment with genetic markers, assignment of gene ontologies, identification of flanking sequence tags, alignment with homologs from related species, and syntenic mapping with other cereal species. All structural and functional annotation data are available through interactive search and display windows as well as through download of flat files. To integrate the data with other genome projects, the annotation data are available through a Distributed Annotation System and a Genome Browser. All data can be obtained through the project Web pages at http://rice.tigr.org.

208 citations


Cited by
More filters
01 Aug 2000
TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.
Abstract: BIOE 402. Medical Technology Assessment. 2 or 3 hours. Bioentrepreneur course. Assessment of medical technology in the context of commercialization. Objectives, competition, market share, funding, pricing, manufacturing, growth, and intellectual property; many issues unique to biomedical products. Course Information: 2 undergraduate hours. 3 graduate hours. Prerequisite(s): Junior standing or above and consent of the instructor.

4,833 citations

Journal ArticleDOI
TL;DR: Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number of complete plant genomes.
Abstract: The number of sequenced plant genomes and associated genomic resources is growing rapidly with the advent of both an increased focus on plant genomics from funding agencies, and the application of inexpensive next generation sequencing. To interact with this increasing body of data, we have developed Phytozome (http://www.phytozome.net), a comparative hub for plant genome and gene family data and analysis. Phytozome provides a view of the evolutionary history of every plant gene at the level of sequence, gene structure, gene family and genome organization, while at the same time providing access to the sequences and functional annotations of a growing number (currently 25) of complete plant genomes, including all the land plants and selected algae sequenced at the Joint Genome Institute, as well as selected species sequenced elsewhere. Through a comprehensive plant genome database and web portal, these data and analyses are available to the broader plant science research community, providing powerful comparative genomics tools that help to link model systems with other plants of economic and ecological importance.

3,728 citations

Journal ArticleDOI
29 Jan 2009-Nature
TL;DR: An initial analysis of the ∼730-megabase Sorghum bicolor (L.) Moench genome is presented, placing ∼98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information.
Abstract: Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approximately 730-megabase Sorghum bicolor (L.) Moench genome, placing approximately 98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approximately 75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approximately 70 million years ago, most duplicated gene sets lost one member before the sorghum-rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.

2,809 citations

Journal ArticleDOI
Sabeeha S. Merchant1, Simon E. Prochnik2, Olivier Vallon3, Elizabeth H. Harris4, Steven J. Karpowicz1, George B. Witman5, Astrid Terry2, Asaf Salamov2, Lillian K. Fritz-Laylin6, Laurence Maréchal-Drouard7, Wallace F. Marshall8, Liang-Hu Qu9, David R. Nelson10, Anton A. Sanderfoot11, Martin H. Spalding12, Vladimir V. Kapitonov13, Qinghu Ren, Patrick J. Ferris14, Erika Lindquist2, Harris Shapiro2, Susan Lucas2, Jane Grimwood15, Jeremy Schmutz15, Pierre Cardol16, Pierre Cardol3, Heriberto Cerutti17, Guillaume Chanfreau1, Chun-Long Chen9, Valérie Cognat7, Martin T. Croft18, Rachel M. Dent6, Susan K. Dutcher19, Emilio Fernández20, Hideya Fukuzawa21, David González-Ballester22, Diego González-Halphen23, Armin Hallmann, Marc Hanikenne16, Michael Hippler24, William Inwood6, Kamel Jabbari25, Ming Kalanon26, Richard Kuras3, Paul A. Lefebvre11, Stéphane D. Lemaire27, Alexey V. Lobanov17, Martin Lohr28, Andrea L Manuell29, Iris Meier30, Laurens Mets31, Maria Mittag32, Telsa M. Mittelmeier33, James V. Moroney34, Jeffrey L. Moseley22, Carolyn A. Napoli33, Aurora M. Nedelcu35, Krishna K. Niyogi6, Sergey V. Novoselov17, Ian T. Paulsen, Greg Pazour5, Saul Purton36, Jean-Philippe Ral7, Diego Mauricio Riaño-Pachón37, Wayne R. Riekhof, Linda A. Rymarquis38, Michael Schroda, David B. Stern39, James G. Umen14, Robert D. Willows40, Nedra F. Wilson41, Sara L. Zimmer39, Jens Allmer42, Janneke Balk18, Katerina Bisova43, Chong-Jian Chen9, Marek Eliáš44, Karla C Gendler33, Charles R. Hauser45, Mary Rose Lamb46, Heidi K. Ledford6, Joanne C. Long1, Jun Minagawa47, M. Dudley Page1, Junmin Pan48, Wirulda Pootakham22, Sanja Roje49, Annkatrin Rose50, Eric Stahlberg30, Aimee M. Terauchi1, Pinfen Yang51, Steven G. Ball7, Chris Bowler25, Carol L. Dieckmann33, Vadim N. Gladyshev17, Pamela J. Green38, Richard A. Jorgensen33, Stephen P. Mayfield29, Bernd Mueller-Roeber37, Sathish Rajamani30, Richard T. Sayre30, Peter Brokstein2, Inna Dubchak2, David Goodstein2, Leila Hornick2, Y. Wayne Huang2, Jinal Jhaveri2, Yigong Luo2, Diego Martinez2, Wing Chi Abby Ngau2, Bobby Otillar2, Alexander Poliakov2, Aaron Porter2, Lukasz Szajkowski2, Gregory Werner2, Kemin Zhou2, Igor V. Grigoriev2, Daniel S. Rokhsar6, Daniel S. Rokhsar2, Arthur R. Grossman22 
University of California, Los Angeles1, United States Department of Energy2, University of Paris3, Duke University4, University of Massachusetts Medical School5, University of California, Berkeley6, Centre national de la recherche scientifique7, University of California, San Francisco8, Sun Yat-sen University9, University of Tennessee Health Science Center10, University of Minnesota11, Iowa State University12, Genetic Information Research Institute13, Salk Institute for Biological Studies14, Stanford University15, University of Liège16, University of Nebraska–Lincoln17, University of Cambridge18, Washington University in St. Louis19, University of Córdoba (Spain)20, Kyoto University21, Carnegie Institution for Science22, National Autonomous University of Mexico23, University of Münster24, École Normale Supérieure25, University of Melbourne26, University of Paris-Sud27, University of Mainz28, Scripps Research Institute29, Ohio State University30, University of Chicago31, University of Jena32, University of Arizona33, Louisiana State University34, University of New Brunswick35, University College London36, University of Potsdam37, Delaware Biotechnology Institute38, Boyce Thompson Institute for Plant Research39, Macquarie University40, Oklahoma State University Center for Health Sciences41, İzmir University of Economics42, Academy of Sciences of the Czech Republic43, Charles University in Prague44, St. Edward's University45, University of Puget Sound46, Hokkaido University47, Tsinghua University48, Washington State University49, Appalachian State University50, Marquette University51
12 Oct 2007-Science
TL;DR: Analyses of the Chlamydomonas genome advance the understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella.
Abstract: Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the approximately 120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella.

2,554 citations

Journal ArticleDOI
Zhou Du1, Xin Zhou1, Yi Ling1, Zhenhai Zhang1, Zhen Su1 
TL;DR: AgriGO as discussed by the authors is an integrated web-based GO analysis toolkit for the agricultural community, using the advantages of EasyGO, to meet analysis demands from new technologies and research objectives.
Abstract: Gene Ontology (GO), the de facto standard in gene functionality description, is used widely in functional annotation and enrichment analysis. Here, we introduce agriGO, an integrated web-based GO analysis toolkit for the agricultural community, using the advantages of our previous GO enrichment tool (EasyGO), to meet analysis demands from new technologies and research objectives. EasyGO is valuable for its proficiency, and has proved useful in uncovering biological knowledge in massive data sets from high-throughput experiments. For agriGO, the system architecture and website interface were redesigned to improve performance and accessibility. The supported organisms and gene identifiers were substantially expanded (including 38 agricultural species composed of 274 data types). The requirement on user input is more flexible, in that user-defined reference and annotation are accepted. Moreover, a new analysis approach using Gene Set Enrichment Analysis strategy and customizable features is provided. Four tools, SEA (Singular enrichment analysis), PAGE (Parametric Analysis of Gene set Enrichment), BLAST4ID (Transfer IDs by BLAST) and SEACOMPARE (Cross comparison of SEA), are integrated as a toolkit to meet different demands. We also provide a cross-comparison service so that different data sets can be compared and explored in a visualized way. Lastly, agriGO functions as a GO data repository with search and download functions; agriGO is publicly accessible at http://bioinfo.cau.edu.cn/agriGO/.

2,274 citations