Author

Daniël de Kok

Other affiliations: University of Groningen

Bio: Daniël de Kok is an academic researcher at the University of Tübingen. The author has contributed to research on parsing and treebanks, has an h-index of 8, and has co-authored 24 publications that have received 430 citations. Previous affiliations of Daniël de Kok include the University of Groningen.

Topics: Parsing, Treebank, Dependency grammar, Sentence, Ranking (information retrieval)

Papers

Journal Article•

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies

Daniël de Kok, Barbara Plank, Gertjan van Noord

01 Jan 2011-The Association for Computational Linguistics

324 citations

Book Chapter•DOI•

Large Scale Syntactic Annotation of written Dutch (LASSY)

Gertjan van Noord¹, Gosse Bouma¹, Frank Van Eynde², Daniël de Kok¹, Jelmer van der Linde¹, Ineke Schuurman², Erik Tjong Kim Sang¹, Vincent Vandeghinste²

University of Groningen¹, Katholieke Universiteit Leuven²

01 Jan 2013

TL;DR: This chapter presents the Lassy Small and Lassy Large treebanks, together with browse and search tools for syntactically annotated corpora and case studies illustrating their applications.

Abstract: This chapter presents the Lassy Small and Lassy Large treebanks, as well as related tools and applications. Lassy Small is a corpus of written Dutch texts (1,000,000 words) which has been syntactically annotated with manual verification and correction. Lassy Large is a much larger corpus (over 500,000,000 words) which has been syntactically annotated fully automatically. In addition, various browse and search tools for syntactically annotated corpora have been developed and made available. Their potential for applications in corpus linguistics and information extraction has been illustrated and evaluated in a series of case studies.

94 citations

Proceedings Article•

Reversible Stochastic Attribute-Value Grammars

Daniël de Kok¹, Barbara Plank¹, Gertjan van Noord¹

University of Groningen¹

19 Jun 2011

TL;DR: This work proposes reversible stochastic attribute-value grammars, in which a single statistical model is employed both for parse selection and fluency ranking.

Abstract: An attractive property of attribute-value grammars is their reversibility. Attribute-value grammars are usually coupled with separate statistical components for parse selection and fluency ranking. We propose reversible stochastic attribute-value grammars, in which a single statistical model is employed both for parse selection and fluency ranking.

58 citations

Proceedings Article•DOI•

A generalized method for iterative error mining in parsing results

Daniël de Kok¹, Jianqiang Ma¹, Gertjan van Noord¹

University of Groningen¹

06 Aug 2009

TL;DR: This work extends the iterative method of Sagot and de la Clergerie (2006) to treat n-grams of an arbitrary length, and proposes a new evaluation metric for comparing different error miners.

Abstract: Error mining is a useful technique for identifying forms that cause incomplete parses of sentences. We extend the iterative method of Sagot and de la Clergerie (2006) to treat n-grams of an arbitrary length. An inherent problem of incorporating longer n-grams is data sparseness. Our new method takes sparseness into account, producing n-grams that are as long as necessary to identify problematic forms, but not longer. Not every cause for parsing errors can be captured effectively by looking at word n-grams. We report on an algorithm for building more general patterns for mining, consisting of words and part of speech tags. It is not easy to evaluate the various error mining techniques. We propose a new evaluation metric which will enable us to compare different error miners.

24 citations

Journal Article•

Essential Speech and Language Technology for Dutch

Gertjan van Noord, Gosse Bouma, F. Van Eynde, Daniël de Kok, J. van der Linde, Ineke Schuurman, E.F. Tjong Kim Sang, Vincent Vandeghinste

01 Jan 2012-Springer US

14 citations

Cited by

Journal Article•DOI•

The Face of Emotion

Bruce Bower

06 Jul 1985-Science News

682 citations

Proceedings Article•DOI•

Improving efficiency and accuracy in multilingual entity extraction

Joachim Daiber¹, Max Jakob, Chris Hokamp², Pablo N. Mendes³

University of Groningen¹, University of North Texas², Wright State University³

04 Sep 2013

TL;DR: This paper discusses some implementation and data processing challenges encountered while developing a new multilingual version of DBpedia Spotlight that is faster, more accurate and easier to configure, and compares the solution to the previous system.

Abstract: There has recently been an increased interest in named entity recognition and disambiguation systems at major conferences such as WWW, SIGIR, ACL, KDD, etc. However, most work has focused on algorithms and evaluations, leaving little space for implementation details. In this paper, we discuss some implementation and data processing challenges we encountered while developing a new multilingual version of DBpedia Spotlight that is faster, more accurate and easier to configure. We compare our solution to the previous system, considering time performance, space requirements and accuracy in the context of the Dutch and English languages. Additionally, we report results for 9 additional languages among the largest Wikipedias. Finally, we present challenges and experiences to foment the discussion with other developers interested in recognition and disambiguation of entities in natural language text.

529 citations

Journal Article•DOI•

The ACL anthology network corpus

Dragomir R. Radev¹, Pradeep Muthukrishnan¹, Vahed Qazvinian¹, Amjad Abu-Jbara¹

University of Michigan¹

01 Dec 2013

TL;DR: This paper introduces the ACL Anthology Network (AAN), a comprehensive, manually curated networked database of citations, collaborations, and summaries in the field of Computational Linguistics, and presents statistics about the network, including the most cited authors and the most central collaborators.

Abstract: We introduce the ACL Anthology Network (AAN), a comprehensive manually curated networked database of citations, collaborations, and summaries in the field of Computational Linguistics. We also present a number of statistics about the network including the most cited authors, the most central collaborators, as well as network statistics about the paper citation, author citation, and author collaboration networks.

332 citations

Journal Article•DOI•

Parsing Argumentation Structures in Persuasive Essays

Christian Stab¹, Iryna Gurevych¹

Technische Universität Darmstadt¹

15 Sep 2017-Computational Linguistics

TL;DR: The authors present a novel approach for parsing argumentation structures: they identify argument components using sequence labeling at the token level and apply a new joint model for detecting argumentation structures.

Abstract: In this article, we present a novel approach for parsing argumentation structures. We identify argument components using sequence labeling at the token level and apply a new joint model for detecti...

301 citations

Journal Article•DOI•

Computer Intensive Methods for Testing Hypotheses: An Introduction

K. J. Evans

01 Nov 1990-Journal of the Operational Research Society

258 citations
