Home
/
Authors
/
Xin Zhou

Author

Xin Zhou

Other affiliations: Rice University, Washington University in St. Louis, University of Minnesota ...read more

Bio: Xin Zhou is an academic researcher from St. Jude Children's Research Hospital. The author has contributed to research in topics: Computer science & Medicine. The author has an hindex of 36, co-authored 71 publications receiving 13444 citations. Previous affiliations of Xin Zhou include Rice University & Washington University in St. Louis.

Topics: Computer science, Medicine, Biology, Artificial intelligence, Engineering ...read more

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Integrative analysis of 111 reference human epigenomes

[...]

Anshul Kundaje¹, Wouter Meuleman¹, Wouter Meuleman², Jason Ernst³, Misha Bilenky⁴, Angela Yen¹, Angela Yen², Alireza Heravi-Moussavi⁴, Pouya Kheradpour², Pouya Kheradpour¹, Zhizhuo Zhang², Zhizhuo Zhang¹, Jianrong Wang², Jianrong Wang¹, Michael J. Ziller², Viren Amin⁵, John W. Whitaker, Matthew D. Schultz⁶, Lucas D. Ward², Lucas D. Ward¹, Abhishek Sarkar¹, Abhishek Sarkar², Gerald Quon¹, Gerald Quon², Richard Sandstrom⁷, Matthew L. Eaton², Matthew L. Eaton¹, Yi-Chieh Wu², Yi-Chieh Wu¹, Andreas R. Pfenning¹, Andreas R. Pfenning², Xinchen Wang¹, Xinchen Wang², Melina Claussnitzer¹, Melina Claussnitzer², Yaping Liu¹, Yaping Liu², Cristian Coarfa⁵, R. Alan Harris⁵, Noam Shoresh², Charles B. Epstein², Elizabeta Gjoneska¹, Elizabeta Gjoneska², Danny Leung⁸, Wei Xie⁸, R. David Hawkins⁸, Ryan Lister⁶, Chibo Hong⁹, Philippe Gascard⁹, Andrew J. Mungall⁴, Richard A. Moore⁴, Eric Chuah⁴, Angela Tam⁴, Theresa K. Canfield⁷, R. Scott Hansen⁷, Rajinder Kaul⁷, Peter J. Sabo⁷, Mukul S. Bansal¹, Mukul S. Bansal², Mukul S. Bansal¹⁰, Annaick Carles⁴, Jesse R. Dixon⁸, Kai How Farh², Soheil Feizi¹, Soheil Feizi², Rosa Karlic¹¹, Ah Ram Kim², Ah Ram Kim¹, Ashwinikumar Kulkarni¹², Daofeng Li¹³, Rebecca F. Lowdon¹³, Ginell Elliott¹³, Tim R. Mercer¹⁴, Shane Neph⁷, Vitor Onuchic⁵, Paz Polak², Paz Polak¹⁵, Nisha Rajagopal⁸, Pradipta R. Ray¹², Richard C Sallari², Richard C Sallari¹, Kyle Siebenthall⁷, Nicholas A Sinnott-Armstrong¹, Nicholas A Sinnott-Armstrong², Michael Stevens¹³, Robert E. Thurman⁷, Jie Wu¹⁶, Bo Zhang¹³, Xin Zhou¹³, Arthur E. Beaudet⁵, Laurie A. Boyer¹, Philip L. De Jager², Philip L. De Jager¹⁵, Peggy J. Farnham¹⁷, Susan J. Fisher⁹, David Haussler¹⁸, Steven J.M. Jones⁴, Steven J.M. Jones¹⁹, Wei Li⁵, Marco A. Marra⁴, Michael T. McManus⁹, Shamil R. Sunyaev², Shamil R. Sunyaev¹⁵, James A. Thomson²⁰, Thea D. Tlsty⁹, Li-Huei Tsai¹, Li-Huei Tsai², Wei Wang, Robert A. Waterland⁵, Michael Q. Zhang²¹, Lisa Helbling Chadwick²², Bradley E. Bernstein², Bradley E. Bernstein¹⁵, Bradley E. Bernstein⁶, Joseph F. Costello⁹, Joseph R. Ecker¹¹, Martin Hirst⁴, Alexander Meissner², Aleksandar Milosavljevic⁵, Bing Ren⁸, John A. Stamatoyannopoulos⁷, Ting Wang¹³, Manolis Kellis¹, Manolis Kellis² - Show less +120 more•Institutions (22)

Massachusetts Institute of Technology¹, Broad Institute², University of California, Los Angeles³, University of British Columbia⁴, Baylor College of Medicine⁵, Howard Hughes Medical Institute⁶, University of Washington⁷, Ludwig Institute for Cancer Research⁸, University of California, San Francisco⁹, University of Connecticut¹⁰, University of Zagreb¹¹, University of Texas at Austin¹², Washington University in St. Louis¹³, University of Queensland¹⁴, Harvard University¹⁵, Cold Spring Harbor Laboratory¹⁶, University of Southern California¹⁷, University of California, Santa Cruz¹⁸, Simon Fraser University¹⁹, Morgridge Institute for Research²⁰, University of Texas at Dallas²¹, National Institutes of Health²²

19 Feb 2015-Nature

TL;DR: It is shown that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease.

...read moreread less

Abstract: The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

...read moreread less

5,037 citations

Journal Article•DOI•

agriGO: a GO analysis toolkit for the agricultural community

[...]

Zhou Du¹, Xin Zhou¹, Yi Ling¹, Zhenhai Zhang¹, Zhen Su¹ - Show less +1 more•Institutions (1)

University of Minnesota¹

01 Jul 2010-Nucleic Acids Research

TL;DR: AgriGO as discussed by the authors is an integrated web-based GO analysis toolkit for the agricultural community, using the advantages of EasyGO, to meet analysis demands from new technologies and research objectives.

...read moreread less

Abstract: Gene Ontology (GO), the de facto standard in gene functionality description, is used widely in functional annotation and enrichment analysis. Here, we introduce agriGO, an integrated web-based GO analysis toolkit for the agricultural community, using the advantages of our previous GO enrichment tool (EasyGO), to meet analysis demands from new technologies and research objectives. EasyGO is valuable for its proficiency, and has proved useful in uncovering biological knowledge in massive data sets from high-throughput experiments. For agriGO, the system architecture and website interface were redesigned to improve performance and accessibility. The supported organisms and gene identifiers were substantially expanded (including 38 agricultural species composed of 274 data types). The requirement on user input is more flexible, in that user-defined reference and annotation are accepted. Moreover, a new analysis approach using Gene Set Enrichment Analysis strategy and customizable features is provided. Four tools, SEA (Singular enrichment analysis), PAGE (Parametric Analysis of Gene set Enrichment), BLAST4ID (Transfer IDs by BLAST) and SEACOMPARE (Cross comparison of SEA), are integrated as a toolkit to meet different demands. We also provide a cross-comparison service so that different data sets can be compared and explored in a visualized way. Lastly, agriGO functions as a GO data repository with search and download functions; agriGO is publicly accessible at http://bioinfo.cau.edu.cn/agriGO/.

...read moreread less

2,274 citations

Journal Article•DOI•

The landscape of genomic alterations across childhood cancers

[...]

Susanne Gröbner¹, Barbara C. Worst, Joachim Weischenfeldt², Joachim Weischenfeldt³ +182 more•Institutions (23)

15 Mar 2018-Nature

TL;DR: The data suggest that 7–8% of the children in this cohort carry an unambiguous predisposing germline variant and that nearly 50% of paediatric neoplasms harbour a potentially druggable event, which is highly relevant for the design of future clinical trials.

...read moreread less

Abstract: Pan-cancer analyses that examine commonalities and differences among various cancer types have emerged as a powerful way to obtain novel insights into cancer biology. Here we present a comprehensive analysis of genetic alterations in a pan-cancer cohort including 961 tumours from children, adolescents, and young adults, comprising 24 distinct molecular types of cancer. Using a standardized workflow, we identified marked differences in terms of mutation frequency and significantly mutated genes in comparison to previously analysed adult cancers. Genetic alterations in 149 putative cancer driver genes separate the tumours into two classes: small mutation and structural/copy-number variant (correlating with germline variants). Structural variants, hyperdiploidy, and chromothripsis are linked to TP53 mutation status and mutational signatures. Our data suggest that 7-8% of the children in this cohort carry an unambiguous predisposing germline variant and that nearly 50% of paediatric neoplasms harbour a potentially druggable event, which is highly relevant for the design of future clinical trials.

...read moreread less

958 citations

Journal Article•DOI•

Germline Mutations in Predisposition Genes in Pediatric Cancer

[...]

Jinghui Zhang, Michael Walsh, Gang Wu, Michael N. Edmonson, Tanja A. Gruber, John Easton, Dale J. Hedges, Xiaotu Ma, Xin Zhou, Donald Yergeau, Mark R. Wilkinson, Bhavin Vadodaria, Xiang Chen, Rose B. McGee, Stacy Hines-Dowell, Regina Nuccio, Emily Quinn, Sheila A. Shurtleff, Michael Rusch, Aman Patel, Jared Becksfort, Shuoguo Wang, Meaghann S. Weaver, Li Ding¹, Elaine R. Mardis¹, Richard K. Wilson¹, Amar Gajjar, David W. Ellison, Alberto S. Pappo, Ching-Hon Pui, Kim E. Nichols, James R. Downing - Show less +28 more•Institutions (1)

Washington University in St. Louis¹

09 Dec 2015-The New England Journal of Medicine

TL;DR: Germline mutations in cancer-predisposing genes were identified in 8.5% of the children and adolescents with cancer, and family history did not predict the presence of an underlying predisposition syndrome in most patients.

...read moreread less

Abstract: BackgroundThe prevalence and spectrum of predisposing mutations among children and adolescents with cancer are largely unknown. Knowledge of such mutations may improve the understanding of tumorigenesis, direct patient care, and enable genetic counseling of patients and families. MethodsIn 1120 patients younger than 20 years of age, we sequenced the whole genomes (in 595 patients), whole exomes (in 456), or both (in 69). We analyzed the DNA sequences of 565 genes, including 60 that have been associated with autosomal dominant cancer-predisposition syndromes, for the presence of germline mutations. The pathogenicity of the mutations was determined by a panel of medical experts with the use of cancer-specific and locus-specific genetic databases, the medical literature, computational predictions, and second hits identified in the tumor genome. The same approach was used to analyze data from 966 persons who did not have known cancer in the 1000 Genomes Project, and a similar approach was used to analyze data...

...read moreread less

886 citations

Journal Article•DOI•

The whole-genome landscape of medulloblastoma subtypes

[...]

Paul A. Northcott¹, Paul A. Northcott², Ivo Buchhalter³, Ivo Buchhalter², A. Sorana Morrissy, Volker Hovestadt², Joachim Weischenfeldt⁴, Tobias Ehrenberger⁵, Susanne Gröbner², Maia Segura-Wang⁶, Thomas Zichner⁶, Vasilisa A. Rudneva, Hans-Jörg Warnatz⁷, Nikos Sidiropoulos⁴, Aaron H. Phillips¹, Steven E. Schumacher⁸, Kortine Kleinheinz², Sebastian M. Waszak⁶, Serap Erkek⁶, Serap Erkek², David T.W. Jones², Barbara C. Worst², Marcel Kool², Marc Zapatka², Natalie Jäger², Lukas Chavez², Barbara Hutter², Matthias Bieg², Nagarajan Paramasivam³, Nagarajan Paramasivam², Michael Heinold², Michael Heinold³, Zuguang Gu², Naveed Ishaque², Christina Jäger-Schmidt², Charles D. Imbusch², Alke Jugold², Daniel Hübschmann⁹, Daniel Hübschmann³, Daniel Hübschmann², Thomas Risch⁷, Vyacheslav Amstislavskiy⁷, Francisco German Rodriguez Gonzalez⁴, Ursula D. Weber², Stephan Wolf², Giles W. Robinson¹, Xin Zhou¹, Gang Wu¹, David Finkelstein¹, Yanling Liu¹, Florence M.G. Cavalli, Betty Luu, Vijay Ramaswamy, Xiaochong Wu, Jan Koster, Marina Ryzhova, Yoon Jae Cho¹⁰, Scott L. Pomeroy¹¹, Christel Herold-Mende³, Martin U. Schuhmann¹², Martin Ebinger, Linda M. Liau¹³, Jaume Mora¹⁴, Roger E. McLendon¹⁵, Nada Jabado¹⁶, Toshihiro Kumabe¹⁷, Eric Chuah¹⁸, Yussanne Ma¹⁸, Richard A. Moore¹⁸, Andrew J. Mungall¹⁸, Karen Mungall¹⁸, Nina Thiessen¹⁸, Kane Tse¹⁸, Tina Wong¹⁸, Steven J.M. Jones¹⁸, Olaf Witt⁹, Till Milde⁹, Andreas von Deimling⁹, David Capper⁹, Andrey Korshunov⁹, Marie-Laure Yaspo⁷, Richard W. Kriwacki¹, Amar Gajjar¹, Jinghui Zhang¹, Rameen Beroukhim⁸, Ernest Fraenkel⁵, Jan O. Korbel⁶, Benedikt Brors², Matthias Schlesner², Roland Eils³, Roland Eils², Marco A. Marra¹⁸, Stefan M. Pfister², Stefan M. Pfister⁹, Michael D. Taylor¹⁹, Peter Lichter² - Show less +92 more•Institutions (19)

St. Jude Children's Research Hospital¹, German Cancer Research Center², Heidelberg University³, University of Copenhagen⁴, Massachusetts Institute of Technology⁵, European Bioinformatics Institute⁶, Max Planck Society⁷, Broad Institute⁸, University Hospital Heidelberg⁹, Oregon Health & Science University¹⁰, Boston Children's Hospital¹¹, University of Tübingen¹², University of California, Los Angeles¹³, Hospital Sant Joan de Déu Barcelona¹⁴, Duke University¹⁵, McGill University¹⁶, Kitasato University¹⁷, BC Cancer Agency¹⁸, University of Toronto¹⁹

19 Jul 2017-Nature

TL;DR: The application of integrative genomics to an extensive cohort of clinical samples derived from a single childhood cancer entity revealed a series of cancer genes and biologically relevant subtype diversity that represent attractive therapeutic targets for the treatment of patients with medulloblastoma.

...read moreread less

Abstract: Current therapies for medulloblastoma, a highly malignant childhood brain tumour, impose debilitating effects on the developing child, and highlight the need for molecularly targeted treatments with reduced toxicity. Previous studies have been unable to identify the full spectrum of driver genes and molecular processes that operate in medulloblastoma subgroups. Here we analyse the somatic landscape across 491 sequenced medulloblastoma samples and the molecular heterogeneity among 1,256 epigenetically analysed cases, and identify subgroup-specific driver alterations that include previously undiscovered actionable targets. Driver mutations were confidently assigned to most patients belonging to Group 3 and Group 4 medulloblastoma subgroups, greatly enhancing previous knowledge. New molecular subtypes were differentially enriched for specific driver events, including hotspot in-frame insertions that target KBTBD4 and 'enhancer hijacking' events that activate PRDM6. Thus, the application of integrative genomics to an extensive cohort of clinical samples derived from a single childhood cancer entity revealed a series of cancer genes and biologically relevant subtype diversity that represent attractive therapeutic targets for the treatment of patients with medulloblastoma.

...read moreread less

706 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features

[...]

Yang Liao¹, Gordon K. Smyth¹, Wei Shi¹•Institutions (1)

Walter and Eliza Hall Institute of Medical Research¹

01 Apr 2014-Bioinformatics

TL;DR: FeatureCounts as discussed by the authors is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments, which implements highly efficient chromosome hashing and feature blocking techniques.

...read moreread less

Abstract: MOTIVATION: Next-generation sequencing technologies generate millions of short sequence reads, which are usually aligned to a reference genome. In many applications, the key information required for downstream analysis is the number of reads mapping to each genomic feature, for example to each exon or each gene. The process of counting reads is called read summarization. Read summarization is required for a great variety of genomic analyses but has so far received relatively little attention in the literature. RESULTS: We present featureCounts, a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments. featureCounts implements highly efficient chromosome hashing and feature blocking techniques. It is considerably faster than existing methods (by an order of magnitude for gene-level summarization) and requires far less computer memory. It works with either single or paired-end reads and provides a wide range of options appropriate for different sequencing applications. AVAILABILITY AND IMPLEMENTATION: featureCounts is available under GNU General Public License as part of the Subread (http://subread.sourceforge.net) or Rsubread (http://www.bioconductor.org) software packages.

...read moreread less

14,103 citations

Journal Article•DOI•

Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists

[...]

Da-Wei Huang¹, Brad T. Sherman¹, Richard A. Lempicki¹•Institutions (1)

Science Applications International Corporation¹

01 Jan 2009-Nucleic Acids Research

TL;DR: The survey will help tool designers/developers and experienced end users understand the underlying algorithms and pertinent details of particular tool categories/tools, enabling them to make the best choices for their particular research interests.

...read moreread less

Abstract: Functional analysis of large gene lists, derived in most cases from emerging high-throughput genomic, proteomic and bioinformatics scanning approaches, is still a challenging and daunting task. The gene-annotation enrichment analysis is a promising high-throughput strategy that increases the likelihood for investigators to identify biological processes most pertinent to their study. Approximately 68 bioinformatics enrichment tools that are currently available in the community are collected in this survey. Tools are uniquely categorized into three major classes, according to their underlying enrichment algorithms. The comprehensive collections, unique tool classifications and associated questions/issues will provide a more comprehensive and up-to-date view regarding the advantages, pitfalls and recent trends in a simpler tool-class level rather than by a tool-by-tool approach. Thus, the survey will help tool designers/developers and experienced end users understand the underlying algorithms and pertinent details of particular tool categories/tools, enabling them to make the best choices for their particular research interests.

...read moreread less

13,102 citations

Journal Article•DOI•

Integrative analysis of 111 reference human epigenomes

[...]

Anshul Kundaje¹, Wouter Meuleman², Wouter Meuleman¹, Jason Ernst³, Misha Bilenky⁴, Angela Yen², Angela Yen¹, Alireza Heravi-Moussavi⁴, Pouya Kheradpour¹, Pouya Kheradpour², Zhizhuo Zhang¹, Zhizhuo Zhang², Jianrong Wang², Jianrong Wang¹, Michael J. Ziller², Viren Amin⁵, John W. Whitaker, Matthew D. Schultz⁶, Lucas D. Ward², Lucas D. Ward¹, Abhishek Sarkar², Abhishek Sarkar¹, Gerald Quon¹, Gerald Quon², Richard Sandstrom⁷, Matthew L. Eaton², Matthew L. Eaton¹, Yi-Chieh Wu¹, Yi-Chieh Wu², Andreas R. Pfenning¹, Andreas R. Pfenning², Xinchen Wang², Xinchen Wang¹, Melina Claussnitzer², Melina Claussnitzer¹, Yaping Liu², Yaping Liu¹, Cristian Coarfa⁵, R. Alan Harris⁵, Noam Shoresh², Charles B. Epstein², Elizabeta Gjoneska¹, Elizabeta Gjoneska², Danny Leung⁸, Wei Xie⁸, R. David Hawkins⁸, Ryan Lister⁶, Chibo Hong⁹, Philippe Gascard⁹, Andrew J. Mungall⁴, Richard A. Moore⁴, Eric Chuah⁴, Angela Tam⁴, Theresa K. Canfield⁷, R. Scott Hansen⁷, Rajinder Kaul⁷, Peter J. Sabo⁷, Mukul S. Bansal¹⁰, Mukul S. Bansal², Mukul S. Bansal¹, Annaick Carles⁴, Jesse R. Dixon⁸, Kai How Farh², Soheil Feizi², Soheil Feizi¹, Rosa Karlic¹¹, Ah Ram Kim², Ah Ram Kim¹, Ashwinikumar Kulkarni¹², Daofeng Li¹³, Rebecca F. Lowdon¹³, Ginell Elliott¹³, Tim R. Mercer¹⁴, Shane Neph⁷, Vitor Onuchic⁵, Paz Polak¹⁵, Paz Polak², Nisha Rajagopal⁸, Pradipta R. Ray¹², Richard C Sallari², Richard C Sallari¹, Kyle Siebenthall⁷, Nicholas A Sinnott-Armstrong², Nicholas A Sinnott-Armstrong¹, Michael Stevens¹³, Robert E. Thurman⁷, Jie Wu¹⁶, Bo Zhang¹³, Xin Zhou¹³, Arthur E. Beaudet⁵, Laurie A. Boyer¹, Philip L. De Jager¹⁵, Philip L. De Jager², Peggy J. Farnham¹⁷, Susan J. Fisher⁹, David Haussler¹⁸, Steven J.M. Jones¹⁹, Steven J.M. Jones⁴, Wei Li⁵, Marco A. Marra⁴, Michael T. McManus⁹, Shamil R. Sunyaev², Shamil R. Sunyaev¹⁵, James A. Thomson²⁰, Thea D. Tlsty⁹, Li-Huei Tsai², Li-Huei Tsai¹, Wei Wang, Robert A. Waterland⁵, Michael Q. Zhang²¹, Lisa Helbling Chadwick²², Bradley E. Bernstein⁶, Bradley E. Bernstein¹⁵, Bradley E. Bernstein², Joseph F. Costello⁹, Joseph R. Ecker¹¹, Martin Hirst⁴, Alexander Meissner², Aleksandar Milosavljevic⁵, Bing Ren⁸, John A. Stamatoyannopoulos⁷, Ting Wang¹³, Manolis Kellis², Manolis Kellis¹ - Show less +120 more•Institutions (22)

19 Feb 2015-Nature

...read moreread less

5,037 citations

Journal Article•DOI•

Gene ontology analysis for RNA-seq: accounting for selection bias

[...]

Matthew D. Young¹, Matthew Wakefield¹, Gordon K. Smyth¹, Alicia Oshlack¹•Institutions (1)

Walter and Eliza Hall Institute of Medical Research¹

04 Feb 2010-Genome Biology

TL;DR: Application of GOseq to a prostate cancer data set shows that GOseq dramatically changes the results, highlighting categories more consistent with the known biology.

...read moreread less

Abstract: We present GOseq, an application for performing Gene Ontology (GO) analysis on RNA-seq data. GO analysis is widely used to reduce complexity and highlight biological processes in genome-wide expression studies, but standard methods give biased results on RNA-seq data due to over-detection of differential expression for long and highly expressed transcripts. Application of GOseq to a prostate cancer data set shows that GOseq dramatically changes the results, highlighting categories more consistent with the known biology.

...read moreread less

5,034 citations

Journal Article•DOI•

REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms

[...]

Fran Supek, Matko Bošnjak, Nives Škunca, Tomislav Šmuc

18 Jul 2011-PLOS ONE

TL;DR: REVIGO is a Web server that summarizes long, unintelligible lists of GO terms by finding a representative subset of the terms using a simple clustering algorithm that relies on semantic similarity measures.

...read moreread less

Abstract: Outcomes of high-throughput biological experiments are typically interpreted by statistical testing for enriched gene functional categories defined by the Gene Ontology (GO). The resulting lists of GO terms may be large and highly redundant, and thus difficult to interpret. REVIGO is a Web server that summarizes long, unintelligible lists of GO terms by finding a representative subset of the terms using a simple clustering algorithm that relies on semantic similarity measures. Furthermore, REVIGO visualizes this non-redundant GO term set in multiple ways to assist in interpretation: multidimensional scaling and graph-based visualizations accurately render the subdivisions and the semantic relationships in the data, while treemaps and tag clouds are also offered as alternative views. REVIGO is freely available at http://revigo.irb.hr/.

...read moreread less

4,919 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse