Author

Haizhou Wang

Bio: Haizhou Wang is an academic researcher from New Mexico State University. The author has contributed to research in the topics Inference and Heuristic (computer science). The author has an h-index of 3 and has co-authored 4 publications receiving 1,691 citations.

Papers

Wisdom of crowds for robust gene network inference

Daniel Marbach, James C. Costello, Robert Küffner, Nicole M. Vega, Robert J. Prill, Diogo M. Camacho, Kyle R. Allison, Andrej Aderhold, Richard Bonneau, Yukun Chen, James J. Collins, Francesca Cordero, Martin Crane, Frank Dondelinger, Mathias Drton, Roberto Esposito, Rina Foygel, Alberto de la Fuente, Jan Gertheiss, Pierre Geurts, Alex Greenfield, Marco Grzegorczyk, Anne-Claire Haury, Benjamin Holmes, Torsten Hothorn, Dirk Husmeier, Vân Anh Huynh-Thu, Alexandre Irrthum, Manolis Kellis, Guy Karlebach, Sophie Lèbre, Vincenzo De Leo, Aviv Madar, Subramani Mani, Fantine Mordelet, Harry Ostrer, Zhengyu Ouyang, Ravi Pandya, Tobias Petri, Andrea Pinna, Christopher S. Poultney, Serena Rezny, Heather J. Ruskin, Yvan Saeys, Ron Shamir, Alina Sîrbu, Mingzhou Song, Nicola Soranzo, Alexander Statnikov, Gustavo Stolovitzky, Nicci Vega, Paola Vera-Licona, Jean-Philippe Vert, Alessia Visconti, Haizhou Wang, Louis Wehenkel, Lukas Windhager, Yang Zhang, Ralf Zimmer

01 Jul 2012

TL;DR: A comprehensive blind assessment of over 30 network inference methods on Escherichia coli, Staphylococcus aureus, Saccharomyces cerevisiae and in silico microarray data defines the performance, data requirements and inherent biases of different inference approaches, and provides guidelines for algorithm application and development.

Abstract: Reconstructing gene regulatory networks from high-throughput data is a long-standing challenge. Through the Dialogue on Reverse Engineering Assessment and Methods (DREAM) project, we performed a comprehensive blind assessment of over 30 network inference methods on Escherichia coli, Staphylococcus aureus, Saccharomyces cerevisiae and in silico microarray data. We characterize the performance, data requirements and inherent biases of different inference approaches, and we provide guidelines for algorithm application and development. We observed that no single inference method performs optimally across all data sets. In contrast, integration of predictions from multiple inference methods shows robust and high performance across diverse data sets. We thereby constructed high-confidence networks for E. coli and S. aureus, each comprising ∼1,700 transcriptional interactions at a precision of ∼50%. We experimentally tested 53 previously unobserved regulatory interactions in E. coli, of which 23 (43%) were supported. Our results establish community-based methods as a powerful and robust tool for the inference of transcriptional gene regulatory networks.
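
The abstract reports that integrating predictions from multiple inference methods is more robust than any single method, but it does not say here how the integration is done. As a rough illustration only, the sketch below assumes one simple scheme, averaging each candidate edge's rank across methods; the function name, the tie-breaking rule, and the treatment of edges missing from a list are all hypothetical choices, not details taken from the paper.

```python
from typing import Hashable, List, Sequence


def average_rank_consensus(rankings: Sequence[Sequence[Hashable]]) -> List[Hashable]:
    """Merge several ranked edge lists into one list ordered by mean rank.

    Each input list is ordered from most to least confident. An edge that a
    method did not report is assigned the rank just past the end of that
    method's list (an assumption made for this sketch).
    """
    if not rankings:
        raise ValueError("at least one ranking is required")

    # Collect every edge mentioned by any method.
    edges = set()
    for ranking in rankings:
        edges.update(ranking)

    # Sum the rank of each edge over all methods.
    totals = {edge: 0.0 for edge in edges}
    for ranking in rankings:
        position = {edge: idx + 1 for idx, edge in enumerate(ranking)}
        missing_rank = len(ranking) + 1
        for edge in edges:
            totals[edge] += position.get(edge, missing_rank)

    # Lower mean rank means higher consensus confidence; ties are broken
    # by the string form of the edge so the output is deterministic.
    n_methods = len(rankings)
    return sorted(edges, key=lambda e: (totals[e] / n_methods, str(e)))


# Three hypothetical methods ranking regulator-target pairs.
method_a = [("A", "B"), ("B", "C"), ("A", "C")]
method_b = [("B", "C"), ("A", "B"), ("C", "D")]
method_c = [("A", "B"), ("C", "D"), ("B", "C")]
consensus = average_rank_consensus([method_a, method_b, method_c])
```

Averaging ranks rather than raw scores sidesteps the fact that different methods produce confidence values on incomparable scales, which is one reason such a scheme is a natural first choice for this kind of ensemble.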

1,355 citations

Journal Article

Ckmeans.1d.dp: Optimal k-means Clustering in One Dimension by Dynamic Programming

Haizhou Wang¹, Mingzhou Song¹

New Mexico State University¹

01 Dec 2011-R Journal

TL;DR: A dynamic programming algorithm for optimal one-dimensional k-means clustering is proposed and implemented as the R package Ckmeans.1d.dp.

Abstract: The heuristic k-means algorithm, widely used for cluster analysis, does not guarantee optimality. We developed a dynamic programming algorithm for optimal one-dimensional clustering. The algorithm is implemented as an R package called Ckmeans.1d.dp. We demonstrate its advantage in optimality and runtime over the standard iterative k-means algorithm.
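
To make the idea of optimal one-dimensional clustering by dynamic programming concrete, here is a minimal Python sketch. It is not the Ckmeans.1d.dp package code: it assumes the textbook formulation in which the points are sorted, every cluster is a contiguous run of the sorted points, and a table records the best cost of splitting the first i points into m clusters. The function name `optimal_kmeans_1d` is hypothetical, and the simple triple loop runs in O(k n²) time.

```python
from typing import List, Sequence, Tuple


def optimal_kmeans_1d(values: Sequence[float], k: int) -> Tuple[float, List[List[float]]]:
    """Return (minimum within-cluster sum of squares, clusters) for 1-D data."""
    xs = sorted(values)
    n = len(xs)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and the number of points")

    # Prefix sums give the cost of any contiguous block in O(1).
    pre = [0.0] * (n + 1)
    pre_sq = [0.0] * (n + 1)
    for i, x in enumerate(xs):
        pre[i + 1] = pre[i] + x
        pre_sq[i + 1] = pre_sq[i] + x * x

    def block_cost(j: int, i: int) -> float:
        """Sum of squared deviations from the mean of xs[j:i], with j < i."""
        s = pre[i] - pre[j]
        return (pre_sq[i] - pre_sq[j]) - s * s / (i - j)

    inf = float("inf")
    # cost[m][i]: best cost of splitting the first i sorted points into m clusters.
    # split[m][i]: start index of the last cluster in that best split.
    cost = [[inf] * (n + 1) for _ in range(k + 1)]
    split = [[0] * (n + 1) for _ in range(k + 1)]
    cost[0][0] = 0.0
    for m in range(1, k + 1):
        for i in range(m, n + 1):
            for j in range(m - 1, i):  # the last cluster is xs[j:i]
                candidate = cost[m - 1][j] + block_cost(j, i)
                if candidate < cost[m][i]:
                    cost[m][i] = candidate
                    split[m][i] = j

    # Walk the split table backwards to recover the clusters.
    clusters: List[List[float]] = []
    i = n
    for m in range(k, 0, -1):
        j = split[m][i]
        clusters.append(xs[j:i])
        i = j
    clusters.reverse()
    return cost[k][n], clusters


total_cost, groups = optimal_kmeans_1d([11, 1, 12, 2, 10, 3], k=2)
```

Unlike iterative k-means, this procedure has no random initialization, so repeated runs on the same input return the same clustering with the same minimal cost.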

328 citations

Journal Article

Inferring causal molecular networks: empirical assessment through a community-based effort

Steven M. Hill¹, Laura M. Heiser², Thomas Cokelaer³, Michael Unger⁴, Nicole K. Nesser², Daniel E. Carlin⁵, Yang Zhang⁶, Artem Sokolov⁵, Evan O. Paull⁵, Christopher K. Wong⁵, Kiley Graim⁵, Adrian Bivol⁵, Haizhou Wang⁶, Zhu Fan⁷, Bahman Afsari⁸, Ludmila Danilova⁸, Alexander V. Favorov⁸, Wai Shing Lee⁸, Dane Taylor⁹, Chenyue W. Hu¹⁰, Byron L. Long¹⁰, David P. Noren¹⁰, Alex Bisberg¹⁰, Gordon B. Mills¹¹, Joe W. Gray², Michael R. Kellen¹², Thea Norman¹², Stephen H. Friend¹², Amina A. Qutub¹⁰, Elana J. Fertig⁸, Yuanfang Guan⁷, Mingzhou Song⁶, Joshua M. Stuart⁵, Paul T. Spellman², Heinz Koeppl⁴, Gustavo Stolovitzky¹³, Julio Saez-Rodriguez³, Sach Mukherjee¹

University of Cambridge¹, Oregon Health & Science University², European Bioinformatics Institute³, ETH Zurich⁴, University of California, Santa Cruz⁵, New Mexico State University⁶, University of Michigan⁷, Johns Hopkins University⁸, Statistical and Applied Mathematical Sciences Institute⁹, Rice University¹⁰, University of Texas MD Anderson Cancer Center¹¹, Sage Bionetworks¹², IBM¹³

01 Apr 2016-Nature Methods

TL;DR: The HPN-DREAM network inference challenge, which focused on learning causal influences in signaling networks, used phosphoprotein data from cancer cell lines as well as in silico data from a nonlinear dynamical model to score networks.

Abstract: It remains unclear whether causal, rather than merely correlational, relationships in molecular networks can be inferred in complex biological settings. Here we describe the HPN-DREAM network inference challenge, which focused on learning causal influences in signaling networks. We used phosphoprotein data from cancer cell lines as well as in silico data from a nonlinear dynamical model. Using the phosphoprotein data, we scored more than 2,000 networks submitted by challenge participants. The networks spanned 32 biological contexts and were scored in terms of causal validity with respect to unseen interventional data. A number of approaches were effective, and incorporating known biology was generally advantageous. Additional sub-challenges considered time-course prediction and visualization. Our results suggest that learning causal relationships may be feasible in complex settings such as disease states. Furthermore, our scoring approach provides a practical way to empirically assess inferred molecular networks in a causal sense.

231 citations

Journal Article

Constrained inference of protein interaction networks for invadopodium formation in cancer.

Haizhou Wang¹, Ming Leung², Angela Wandinger-Ness³, Laurie G. Hudson³, Mingzhou Song¹

New Mexico State University¹, Duke University², University of New Mexico³

01 Apr 2016-IET Systems Biology

TL;DR: The CGLN method offers constrained network inference without requiring prior probabilities and thus can promote novel interactions, consistent with the discovery process of scientific facts that are not yet in common beliefs.

Abstract: Integrating prior molecular network knowledge into interpretation of new experimental data is routine practice in biology research. However, a dilemma for deciphering the interactome using Bayes’ rule is the demotion of novel interactions with low prior probabilities. Here the authors present constrained generalised logical network (CGLN) inference to predict novel interactions in dynamic networks, respecting previously known interactions and observed temporal coherence. It encodes prior interactions as probabilistic logic rules called local constraints, and forms global constraints using observed dynamic patterns. CGLN finds constraint-satisfying trajectories by solving a k-stops problem in the state space of dynamic networks and then reconstructs candidate networks. The authors benchmarked CGLN on randomly generated networks, and CGLN outperformed its alternatives when 50% or more interactions in a network are given as local constraints. CGLN is then applied to infer dynamic protein interaction networks regulating invadopodium formation in motile cancer cells. CGLN predicted 134 novel protein interactions for their involvement in invadopodium formation. The most frequently predicted interactions centre around focal adhesion kinase and tyrosine kinase substrate TKS4, and 14 interactions are supported by the literature in molecular contexts related to invadopodium formation. As an alternative to the Bayesian paradigm, the CGLN method offers constrained network inference without requiring prior probabilities and thus can promote novel interactions, consistent with the discovery process of scientific facts that are not yet in common beliefs.

1 citation

Cited by

Journal Article

SCENIC: single-cell regulatory network inference and clustering.

Sara Aibar¹, Carmen Bravo González-Blas¹, Thomas Moerman¹, Vân Anh Huynh-Thu², Hana Imrichova¹, Gert Hulselmans¹, Florian Rambow¹, Jean-Christophe Marine¹, Pierre Geurts², Jan Aerts¹, Joost van den Oord¹, Zeynep Kalender Atak¹, Jasper Wouters¹, Stein Aerts¹

Katholieke Universiteit Leuven¹, University of Liège²

09 Oct 2017-Nature Methods

TL;DR: On a compendium of single-cell data from tumors and brain, it is demonstrated that cis-regulatory analysis can be exploited to guide the identification of transcription factors and cell states.

Abstract: We present SCENIC, a computational method for simultaneous gene regulatory network reconstruction and cell-state identification from single-cell RNA-seq data (http://scenic.aertslab.org). On a compendium of single-cell data from tumors and brain, we demonstrate that cis-regulatory analysis can be exploited to guide the identification of transcription factors and cell states. SCENIC provides critical biological insights into the mechanisms driving cellular heterogeneity.

2,277 citations

Journal Article

Transcriptome-Based Network Analysis Reveals a Spectrum Model of Human Macrophage Activation

Jia Xue¹, Susanne Schmidt¹, Jil Sander¹, Astrid M. Draffehn¹, Wolfgang Krebs¹, Inga Quester¹, Dominic De Nardo¹, Trupti D. Gohel¹, Martina Emde¹, Lisa Schmidleithner¹, Hariharasudan Ganesan¹, Andrea Niño-Castro¹, Michael R. Mallmann¹, Larisa I. Labzin¹, Heidi Theis¹, Michael Kraut¹, Marc Beyer¹, Eicke Latz², Eicke Latz¹, Tom C. Freeman³, Thomas Ulas¹, Joachim L. Schultze¹

University of Bonn¹, University of Massachusetts Medical School², University of Edinburgh³

20 Feb 2014-Immunity

TL;DR: By integrating murine data from the ImmGen project, this work proposes a refined, activation-independent core signature for human and murine macrophages that serves as a framework for future research into regulation of macrophage activation in health and disease.

1,648 citations

Posted Content

SCENIC: Single-Cell Regulatory Network Inference And Clustering

Sara Aibar¹, Carmen Bravo González-Blas¹, Thomas Moerman¹, Jasper Wouters¹, Vân Anh Huynh-Thu², Hana Imrichova¹, Zeynep Kalender Atak¹, Gert Hulselmans¹, Michael Dewaele¹, Florian Rambow¹, Pierre Geurts², Jan Aerts¹, Jean-Christophe Marine¹, Joost van den Oord¹, Stein Aerts¹

Katholieke Universiteit Leuven¹, University of Liège²

31 May 2017-bioRxiv

TL;DR: SCENIC (Single Cell rEgulatory Network Inference and Clustering) is the first method to analyze scRNA-seq data using a network-centric, rather than cell-centric approach and allows for the simultaneous tracing of genomic regulatory programs and the mapping of cellular identities emerging from these programs.

Abstract: Single-cell RNA-seq allows building cell atlases of any given tissue and inferring the dynamics of cellular state transitions during developmental or disease trajectories. Both the maintenance and transitions of cell states are encoded by regulatory programs in the genome sequence. However, this regulatory code has not yet been exploited to guide the identification of cellular states from single-cell RNA-seq data. Here we describe a computational resource, called SCENIC (Single Cell rEgulatory Network Inference and Clustering), for the simultaneous reconstruction of gene regulatory networks (GRNs) and the identification of stable cell states, using single-cell RNA-seq data. SCENIC outperforms existing approaches at the level of cell clustering and transcription factor identification. Importantly, we show that cell state identification based on GRNs is robust towards batch effects and technical biases. We applied SCENIC to a compendium of single-cell data from the mouse and human brain and demonstrate that the proper combinations of transcription factors, target genes, enhancers, and cell types can be identified. Moreover, we used SCENIC to map the cell state landscape in melanoma and identified a gene regulatory network underlying a proliferative melanoma state driven by MITF and STAT and a contrasting network controlling an invasive state governed by NFATC2 and NFIB. We further validated these predictions by showing that two transcription factors are predominantly expressed in early metastatic sentinel lymph nodes. In summary, SCENIC is the first method to analyze scRNA-seq data using a network-centric, rather than cell-centric approach. SCENIC is generic, easy to use, and flexible, and allows for the simultaneous tracing of genomic regulatory programs and the mapping of cellular identities emerging from these programs. Availability: SCENIC is available as an R workflow based on three new R/Bioconductor packages: GENIE3, RcisTarget and AUCell. As a scalable alternative to GENIE3, we also provide GRNboost, paving the way towards network analysis across millions of single cells.

1,101 citations

Journal Article

A Validated Regulatory Network for Th17 Cell Specification

Maria Ciofani¹, Aviv Madar², Aviv Madar¹, Carolina Galan¹, MacLean Sellars¹, Kieran Mace¹, Florencia Pauli, Ashish Agarwal¹, Wendy Huang¹, Christopher N. Parkurst¹, Michael Muratet, Kim M. Newberry, Sarah Meadows, Alex Greenfield¹, Yi Yang¹, Preti Jain, Francis K. Kirigin¹, Carmen Birchmeier, Erwin F. Wagner, Kenneth M. Murphy³, Kenneth M. Murphy⁴, Richard M. Myers, Richard Bonneau¹, Richard Bonneau², Dan R. Littman³, Dan R. Littman¹

New York University¹, Courant Institute of Mathematical Sciences², Howard Hughes Medical Institute³, Washington University in St. Louis⁴

12 Oct 2012-Cell

TL;DR: It is found that cooperatively bound BATF and IRF4 contribute to initial chromatin accessibility and, with STAT3, initiate a transcriptional program that is then globally tuned by the lineage-specifying TF RORγt, which plays a focal deterministic role at key loci.

1,021 citations

Journal Article

Structure and dynamics of molecular networks: A novel paradigm of drug discovery: A comprehensive review

Peter Csermely¹, Tamas Korcsmaros¹, Tamas Korcsmaros², Huba Kiss¹, Gábor London³, Ruth Nussinov⁴, Ruth Nussinov⁵

Semmelweis University¹, Eötvös Loránd University², École Polytechnique Fédérale de Lausanne³, Tel Aviv University⁴, Science Applications International Corporation⁵

01 Jun 2013-Pharmacology & Therapeutics

TL;DR: It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates and an optimized protocol of network-aided drug development is suggested, and a list of systems-level hallmarks of drug quality is provided.

806 citations
