Home
/
Authors
/
Aalt D. J. van Dijk

Author

Aalt D. J. van Dijk

Wageningen University and Research Centre

Other affiliations: University of Florence, Utrecht University

Bio: Aalt D. J. van Dijk is an academic researcher from Wageningen University and Research Centre. The author has contributed to research in topics: Arabidopsis & Protein function prediction. The author has an hindex of 34, co-authored 88 publications receiving 5042 citations. Previous affiliations of Aalt D. J. van Dijk include University of Florence & Utrecht University.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005

Papers

PDF

Open Access

More filters

Journal Article•DOI•

A large-scale evaluation of computational protein function prediction

[...]

Predrag Radivojac¹, Wyatt T. Clark¹, Tal Ronnen Oron², Alexandra M. Schnoes³, Tobias Wittkop², Artem Sokolov⁴, Artem Sokolov⁵, Kiley Graim⁵, Christopher S. Funk⁶, Karin Verspoor⁶, Asa Ben-Hur⁵, Gaurav Pandey⁷, Gaurav Pandey⁸, Jeffrey M. Yunes⁸, Ameet Talwalkar⁸, Susanna Repo⁸, Susanna Repo⁹, Michael L Souza⁸, Damiano Piovesan¹⁰, Rita Casadio¹⁰, Zheng Wang¹¹, Jianlin Cheng¹¹, Hai Fang, Julian Gough¹², Patrik Koskinen¹³, Petri Törönen¹³, Jussi Nokso-Koivisto¹³, Liisa Holm¹³, Domenico Cozzetto¹⁴, Daniel W. A. Buchan¹⁴, Kevin Bryson¹⁴, David T. Jones¹⁴, Bhakti Limaye¹⁵, Harshal Inamdar¹⁵, Avik Datta¹⁵, Sunitha K Manjari¹⁵, Rajendra Joshi¹⁵, Meghana Chitale¹⁶, Daisuke Kihara¹⁶, Andreas Martin Lisewski¹⁷, Serkan Erdin¹⁷, Eric Venner¹⁷, Olivier Lichtarge¹⁷, Robert Rentzsch¹⁴, Haixuan Yang¹⁸, Alfonso E. Romero¹⁸, Prajwal Bhat¹⁸, Alberto Paccanaro¹⁸, Tobias Hamp¹⁹, Rebecca Kaßner¹⁹, Stefan Seemayer¹⁹, Esmeralda Vicedo¹⁹, Christian Schaefer¹⁹, Dominik Achten¹⁹, Florian Auer¹⁹, Ariane Boehm¹⁹, Tatjana Braun¹⁹, Maximilian Hecht¹⁹, Mark Heron¹⁹, Peter Hönigschmid¹⁹, Thomas A. Hopf¹⁹, Stefanie Kaufmann¹⁹, Michael Kiening¹⁹, Denis Krompass¹⁹, Cedric Landerer¹⁹, Yannick Mahlich¹⁹, Manfred Roos¹⁹, Jari Björne²⁰, Tapio Salakoski²⁰, Andrew Wong²¹, Hagit Shatkay²¹, Hagit Shatkay²², Fanny Gatzmann²³, Ingolf Sommer²³, Mark N. Wass²⁴, Michael J.E. Sternberg²⁴, Nives Škunca, Fran Supek, Matko Bošnjak, Panče Panov, Sašo Džeroski, Tomislav Šmuc, Yiannis A. I. Kourmpetis²⁵, Yiannis A. I. Kourmpetis²⁶, Aalt D. J. van Dijk²⁵, Cajo J. F. ter Braak²⁵, Yuanpeng Zhou²⁷, Qingtian Gong²⁷, Xinran Dong²⁷, Weidong Tian²⁷, Marco Falda²⁸, Paolo Fontana, Enrico Lavezzo²⁸, Barbara Di Camillo²⁸, Stefano Toppo²⁸, Liang Lan²⁹, Nemanja Djuric²⁹, Yuhong Guo²⁹, Slobodan Vucetic²⁹, Amos Marc Bairoch³⁰, Amos Marc Bairoch³¹, Michal Linial³², Patricia C. Babbitt³, Steven E. Brenner⁸, Christine A. Orengo¹⁴, Burkhard Rost¹⁹, Sean D. Mooney², Iddo Friedberg³³ - Show less +104 more•Institutions (33)

Indiana University¹, Buck Institute for Research on Aging², University of California, San Francisco³, University of California, Santa Cruz⁴, Colorado State University⁵, University of Colorado Denver⁶, Icahn School of Medicine at Mount Sinai⁷, University of California, Berkeley⁸, European Bioinformatics Institute⁹, University of Bologna¹⁰, University of Missouri¹¹, University of Bristol¹², University of Helsinki¹³, University College London¹⁴, Centre for Development of Advanced Computing¹⁵, Purdue University¹⁶, Baylor College of Medicine¹⁷, Royal Holloway, University of London¹⁸, Technische Universität München¹⁹, University of Turku²⁰, Queen's University²¹, University UCINF²², Max Planck Society²³, Imperial College London²⁴, Wageningen University and Research Centre²⁵, Nestlé²⁶, Fudan University²⁷, University of Padua²⁸, Temple University²⁹, University of Geneva³⁰, Swiss Institute of Bioinformatics³¹, Hebrew University of Jerusalem³², Miami University³³

01 Mar 2013-Nature Methods

TL;DR: Today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets, and there is considerable need for improvement of currently available tools.

...read moreread less

Abstract: Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be high. Here we report the results from the first large-scale community-based critical assessment of protein function annotation (CAFA) experiment. Fifty-four methods representing the state of the art for protein function prediction were evaluated on a target set of 866 proteins from 11 organisms. Two findings stand out: (i) today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is considerable need for improvement of currently available tools.

...read moreread less

859 citations

Journal Article•DOI•

HADDOCK versus HADDOCK: new features and performance of HADDOCK2.0 on the CAPRI targets.

[...]

Sjoerd J. de Vries¹, Aalt D. J. van Dijk¹, Mickaël Krzeminski¹, Marc van Dijk¹, Aurelien Thureau¹, Victor L. Hsu², Tsjerk A. Wassenaar¹, Alexandre M. J. J. Bonvin¹ - Show less +4 more•Institutions (2)

Utrecht University¹, Oregon State University²

01 Dec 2007-Proteins

TL;DR: HADDOCK2.0 as mentioned in this paper is the most recent version of HADDOCK, which incorporates considerable improvements and new features, such as random patch definition or center-of-mass restraints.

...read moreread less

Abstract: Here we present version 2.0 of HADDOCK, which incorporates considerable improvements and new features. HADDOCK is now able to model not only protein-protein complexes but also other kinds of biomolecular complexes and multi-component (N > 2) systems. In the absence of any experimental and/or predicted information to drive the docking, HADDOCK now offers two additional ab initio docking modes based on either random patch definition or center-of-mass restraints. The docking protocol has been considerably improved, supporting among other solvated docking, automatic definition of semi-flexible regions, and inclusion of a desolvation energy term in the scoring scheme. The performance of HADDOCK2.0 is evaluated on the targets of rounds 4-11, run in a semi-automated mode using the original information we used in our CAPRI submissions. This enables a direct assessment of the progress made since the previous versions. Although HADDOCK performed very well in CAPRI (65% and 71% success rates, overall and for unbound targets only, respectively), a substantial improvement was achieved with HADDOCK2.0.

...read moreread less

542 citations

Journal Article•DOI•

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

[...]

Yuxiang Jiang¹, Tal Ronnen Oron², Wyatt T. Clark³, Asma R. Bankapur⁴ +153 more•Institutions (59)

07 Sep 2016-Genome Biology

TL;DR: The second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function, was conducted by as mentioned in this paper. But the results of the CAFA2 assessment are limited.

...read moreread less

Abstract: BACKGROUND: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging. RESULTS: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2. CONCLUSIONS: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent.

...read moreread less

330 citations

Journal Article•DOI•

Rice cytochrome P450 MAX1 homologs catalyze distinct steps in strigolactone biosynthesis

[...]

Yanxia Zhang¹, Aalt D. J. van Dijk, Adrian Scaffidi², Gavin R. Flematti², Manuel Hofmann³, Tatsiana Charnikhova¹, Francel W.A. Verstappen¹, Jo Hepworth⁴, Sander van der Krol¹, Ottoline Leyser⁵, Steven M. Smith², Binne Zwanenburg⁶, Salim Al-Babili⁷, Carolien Ruyter-Spira¹, Harro J. Bouwmeester¹ - Show less +11 more•Institutions (7)

Wageningen University and Research Centre¹, University of Western Australia², University of Freiburg³, University of York⁴, University of Cambridge⁵, Radboud University Nijmegen⁶, King Abdullah University of Science and Technology⁷

01 Dec 2014-Nature Chemical Biology

TL;DR: It is reported that two members of CYP711 enzymes can catalyze two distinct steps in SL biosynthesis, identifying the first enzymes involved in B-C ring closure and a subsequent structural diversification step of SLs.

...read moreread less

Abstract: Strigolactones (SLs) are a class of phytohormones and rhizosphere signaling compounds with high structural diversity. Three enzymes, carotenoid isomerase DWARF27 and carotenoid cleavage dioxygenases CCD7 and CCD8, were previously shown to convert all-trans-β-carotene to carlactone (CL), the SL precursor. However, how CL is metabolized to SLs has remained elusive. Here, by reconstituting the SL biosynthetic pathway in Nicotiana benthamiana, we show that a rice homolog of Arabidopsis More Axillary Growth 1 (MAX1), encodes a cytochrome P450 CYP711 subfamily member that acts as a CL oxidase to stereoselectively convert CL into ent-2'-epi-5-deoxystrigol (B-C lactone ring formation), the presumed precursor of rice SLs. A protein encoded by a second rice MAX1 homolog then catalyzes the conversion of ent-2'-epi-5-deoxystrigol to orobanchol. We therefore report that two members of CYP711 enzymes can catalyze two distinct steps in SL biosynthesis, identifying the first enzymes involved in B-C ring closure and a subsequent structural diversification step of SLs.

...read moreread less

289 citations

Journal Article•DOI•

SEPALLATA3: the 'glue' for MADS box transcription factor complex formation

[...]

Richard G. H. Immink¹, Isabella A. Nougalli Tonaco¹, Stefan de Folter², Stefan de Folter¹, Anna V. Shchennikova¹, Aalt D. J. van Dijk¹, Jacqueline Busscher-Lange¹, Jan Willem Borst¹, Gerco C. Angenent¹ - Show less +5 more•Institutions (2)

Wageningen University and Research Centre¹, Instituto Politécnico Nacional²

25 Feb 2009-Genome Biology

TL;DR: Significant indications are provided that higher-order complex formation is a general and essential molecular mechanism for plant MADS box protein functioning and attribute a pivotal role to the SEP3 'glue' protein in mediating multimerization.

...read moreread less

Abstract: Plant MADS box proteins play important roles in a plethora of developmental processes. In order to regulate specific sets of target genes, MADS box proteins dimerize and are thought to assemble into multimeric complexes. In this study a large-scale yeast three-hybrid screen is utilized to provide insight into the higher-order complex formation capacity of the Arabidopsis MADS box family. SEPALLATA3 (SEP3) has been shown to mediate complex formation and, therefore, special attention is paid to this factor in this study. In total, 106 multimeric complexes were identified; in more than half of these at least one SEP protein was present. Besides the known complexes involved in determining floral organ identity, various complexes consisting of combinations of proteins known to play a role in floral organ identity specification, and flowering time determination were discovered. The capacity to form this latter type of complex suggests that homeotic factors play essential roles in down-regulation of the MADS box genes involved in floral timing in the flower via negative auto-regulatory loops. Furthermore, various novel complexes were identified that may be important for the direct regulation of the floral transition process. A subsequent detailed analysis of the APETALA3, PISTILLATA, and SEP3 proteins in living plant cells suggests the formation of a multimeric complex in vivo. Overall, these results provide strong indications that higher-order complex formation is a general and essential molecular mechanism for plant MADS box protein functioning and attribute a pivotal role to the SEP3 'glue' protein in mediating multimerization.

...read moreread less

261 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20

Collapse

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

node2vec: Scalable Feature Learning for Networks

[...]

Aditya Grover¹, Jure Leskovec¹•Institutions (1)

Stanford University¹

13 Aug 2016

TL;DR: Node2vec as mentioned in this paper learns a mapping of nodes to a low-dimensional space of features that maximizes the likelihood of preserving network neighborhoods of nodes by using a biased random walk procedure.

...read moreread less

Abstract: Prediction tasks over nodes and edges in networks require careful effort in engineering features used by learning algorithms. Recent research in the broader field of representation learning has led to significant progress in automating prediction by learning the features themselves. However, present feature learning approaches are not expressive enough to capture the diversity of connectivity patterns observed in networks. Here we propose node2vec, an algorithmic framework for learning continuous feature representations for nodes in networks. In node2vec, we learn a mapping of nodes to a low-dimensional space of features that maximizes the likelihood of preserving network neighborhoods of nodes. We define a flexible notion of a node's network neighborhood and design a biased random walk procedure, which efficiently explores diverse neighborhoods. Our algorithm generalizes prior work which is based on rigid notions of network neighborhoods, and we argue that the added flexibility in exploring neighborhoods is the key to learning richer representations. We demonstrate the efficacy of node2vec over existing state-of-the-art techniques on multi-label classification and link prediction in several real-world networks from diverse domains. Taken together, our work represents a new way for efficiently learning state-of-the-art task-independent representations in complex networks.

...read moreread less

7,072 citations

Journal Article•DOI•

NCBI prokaryotic genome annotation pipeline

[...]

Tatiana Tatusova¹, Michael DiCuccio¹, Azat Badretdin¹, Vyacheslav Chetvernin¹, Eric P. Nawrocki¹, Leonid Zaslavsky¹, Alexandre Lomsadze², Kim D. Pruitt¹, Mark Borodovsky², James Ostell¹ - Show less +6 more•Institutions (2)

National Institutes of Health¹, Georgia Institute of Technology²

19 Aug 2016-Nucleic Acids Research

TL;DR: The new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies less on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence.

...read moreread less

Abstract: Recent technological advances have opened unprecedented opportunities for large-scale sequencing and analysis of populations of pathogenic species in disease outbreaks, as well as for large-scale diversity studies aimed at expanding our knowledge across the whole domain of prokaryotes. To meet the challenge of timely interpretation of structure, function and meaning of this vast genetic information, a comprehensive approach to automatic genome annotation is critically needed. In collaboration with Georgia Tech, NCBI has developed a new approach to genome annotation that combines alignment based methods with methods of predicting protein-coding and RNA genes and other functional elements directly from sequence. A new gene finding tool, GeneMarkS+, uses the combined evidence of protein and RNA placement by homology as an initial map of annotation to generate and modify ab initio gene predictions across the whole genome. Thus, the new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies more on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence. The pipeline provides a framework for generation and analysis of annotation on the full breadth of prokaryotic taxonomy. For additional information on PGAP see https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ and the NCBI Handbook, https://www.ncbi.nlm.nih.gov/books/NBK174280/.

...read moreread less

3,902 citations

Integrative Genomics Viewer

[...]

James T. Robinson¹, Helga Thorvaldsdottir¹, Wendy Winckler¹, Mitchell Guttman¹, Eric S. Lander¹, Eric S. Lander², Gad Getz¹, Jill P. Mesirov¹ - Show less +4 more•Institutions (2)

Massachusetts Institute of Technology¹, Harvard University²

01 Jan 2011

TL;DR: The sheer volume and scope of data posed by this flood of data pose a significant challenge to the development of efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data.

...read moreread less

Abstract: Rapid improvements in sequencing and array-based platforms are resulting in a flood of diverse genome-wide data, including data from exome and whole-genome sequencing, epigenetic surveys, expression profiling of coding and noncoding RNAs, single nucleotide polymorphism (SNP) and copy number profiling, and functional assays. Analysis of these large, diverse data sets holds the promise of a more comprehensive understanding of the genome and its relation to human disease. Experienced and knowledgeable human review is an essential component of this process, complementing computational approaches. This calls for efficient and intuitive visualization tools able to scale to very large data sets and to flexibly integrate multiple data types, including clinical data. However, the sheer volume and scope of data pose a significant challenge to the development of such tools.

...read moreread less

2,187 citations

Posted Content•

node2vec: Scalable Feature Learning for Networks

[...]

Aditya Grover¹, Jure Leskovec¹•Institutions (1)

Stanford University¹

03 Jul 2016-arXiv: Social and Information Networks

TL;DR: In node2vec, an algorithmic framework for learning continuous feature representations for nodes in networks, a flexible notion of a node's network neighborhood is defined and a biased random walk procedure is designed, which efficiently explores diverse neighborhoods.

...read moreread less

2,174 citations

Journal Article•DOI•

Jasmonates: biosynthesis, perception, signal transduction and action in plant stress response, growth and development. An update to the 2007 review in Annals of Botany

[...]

Claus Wasternack¹, Bettina Hause¹•Institutions (1)

Leibniz Association¹

01 Jun 2013-Annals of Botany

TL;DR: Important new components of jasmonate signalling including its receptor were identified, providing deeper insight into the role ofJASMONATE signalling pathways in stress responses and development.

...read moreread less

1,868 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse