
Shallow parsing

About: Shallow parsing is a research topic. Over its lifetime, 397 publications have been published within this topic, receiving 10,211 citations.


Papers
Book Chapter
03 Sep 2012
TL;DR: Three Machine Learning techniques are tested on the 1-million token manually annotated subcorpus of the National Corpus of Polish: Decision Tree induction, Memory-Based Learning and Conditional Random Fields.
Abstract: The published experiments with shallow parsing for Slavic languages are characterised by the small size of the corpora used. With the publication of the National Corpus of Polish (NCP), a new opportunity was opened: to test several chunking algorithms on the 1-million-token manually annotated subcorpus of the NCP. We test three Machine Learning techniques: Decision Tree induction, Memory-Based Learning and Conditional Random Fields. We also investigate the influence of tagging errors on the overall chunker performance, which turns out to be quite substantial.

14 citations
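As a rough illustration of the simplest of the three techniques compared above, the sketch below trains a decision-tree chunker on per-token features over POS-tagged text. The toy sentences, tag names and feature set are invented for illustration and are not the NCP annotation scheme or the authors' setup.

```python
# Minimal sketch: decision-tree induction for BIO chunking over POS-tagged
# tokens. Toy data and features only; not the NCP tagset or the paper's setup.
from sklearn.feature_extraction import DictVectorizer
from sklearn.tree import DecisionTreeClassifier

# Training data: sentences as (word, POS tag, chunk label) triples.
train = [
    [("Jan", "subst", "B-NP"), ("kupił", "fin", "O"),
     ("nowy", "adj", "B-NP"), ("samochód", "subst", "I-NP")],
    [("Ona", "ppron", "B-NP"), ("czyta", "fin", "O"),
     ("ciekawą", "adj", "B-NP"), ("książkę", "subst", "I-NP")],
]

def token_features(sent, i):
    """Per-token features: surface form, POS, and neighbouring POS tags."""
    word, pos, _ = sent[i]
    return {
        "word": word.lower(),
        "pos": pos,
        "prev_pos": sent[i - 1][1] if i > 0 else "BOS",
        "next_pos": sent[i + 1][1] if i < len(sent) - 1 else "EOS",
    }

X = [token_features(s, i) for s in train for i in range(len(s))]
y = [label for s in train for _, _, label in s]

vec = DictVectorizer(sparse=False)
clf = DecisionTreeClassifier(random_state=0).fit(vec.fit_transform(X), y)

# Chunk an unseen toy sentence (chunk labels unknown, hence None placeholders).
test = [("Piotr", "subst", None), ("pisze", "fin", None), ("list", "subst", None)]
feats = [token_features(test, i) for i in range(len(test))]
print(clf.predict(vec.transform(feats)))
```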

Proceedings Article
01 Jan 2001
TL;DR: This work introduces shapaqa, a shallow parsing approach to online, open-domain question answering on the World Wide Web, which uses a memory-based shallow parser to analyze web pages retrieved via normal keyword search on a search engine.
Abstract: We introduce shapaqa, a shallow parsing approach to online, open-domain question answering on the World Wide Web. Given a form-based natural language question as input, the system uses a memory-based shallow parser to analyze web pages retrieved using normal keyword search on a search engine. Two versions of the system are evaluated on a test set of 200 questions. In combination with two back-off methods, a mean reciprocal rank of .46 is achieved.

14 citations
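The abstract above reports a mean reciprocal rank (MRR) of .46. For reference, this is how MRR is computed over a question set; the ranks in the example below are invented and are not the shapaqa evaluation data.

```python
# The abstract reports MRR = .46; this is the metric's definition in code.
def mean_reciprocal_rank(ranks):
    """ranks[i] is the 1-based rank of the first correct answer for question i,
    or None if no correct answer was returned (counts as 0)."""
    return sum(1.0 / r for r in ranks if r is not None) / len(ranks)

# Hypothetical outcomes for five questions (not shapaqa's actual results).
print(mean_reciprocal_rank([1, 2, None, 1, 3]))  # 0.5666...
```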

01 Jan 2003
TL;DR: This article proposes to apply shallow parsing, implemented by means of cascades of finite-state transducers, to extract complex index terms based on an approximate grammar of Spanish, with the aim of improving the effectiveness of the extracted index terms.
Abstract: The extraction of the keywords that characterize each document in a given collection is one of the most important components of an Information Retrieval system. In this article, we propose to apply shallow parsing, implemented by means of cascades of finite-state transducers, to extract complex index terms based on an approximate grammar of Spanish. The effectiveness of the extracted index terms has been evaluated on the CLEF collection.

14 citations
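As a loose illustration of the idea in the abstract above, the sketch below runs a small cascade of finite-state (regular-expression) patterns over POS-tagged text to pull out multi-word index terms. The tagset, the pattern and the Spanish example are assumptions for illustration, not the approximate grammar used in the paper.

```python
# Rough sketch: cascaded regular-expression (finite-state) matching over a
# POS-tagged sentence to extract multi-word index terms. Tagset, pattern and
# example are invented; this is not the paper's approximate grammar of Spanish.
import re

# A POS-tagged sentence encoded as "word/TAG" tokens.
tagged = "el/DET sistema/N de/PREP recuperación/N de/PREP información/N es/V eficaz/ADJ"
tokens = [t.rsplit("/", 1) for t in tagged.split()]

# Stage 1: project the sentence onto its tag sequence.
tag_string = " ".join(tag for _, tag in tokens)

# Stage 2: a finite-state pattern for complex nominals, i.e. a noun followed
# by zero or more (preposition + noun) groups, such as "N PREP N PREP N".
pattern = re.compile(r"N(?: PREP N)*")

index_terms = []
for m in pattern.finditer(tag_string):
    start = tag_string[: m.start()].count(" ")   # token offset of the match
    length = m.group().count(" ") + 1            # number of matched tokens
    index_terms.append(" ".join(w for w, _ in tokens[start:start + length]))

print(index_terms)  # ['sistema de recuperación de información']
```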

Journal Article
Christoph Tillmann, Tong Zhang
TL;DR: A novel training method is presented for a localized phrase-based prediction model for statistical machine translation (SMT) that explicitly handles local phrase reordering, together with a novel stochastic gradient descent training algorithm that can easily handle millions of features.
Abstract: In this article, we present a novel training method for a localized phrase-based prediction model for statistical machine translation (SMT). The model predicts block neighbors to carry out a phrase-based translation that explicitly handles local phrase reordering. We use a maximum likelihood criterion to train a log-linear block bigram model which uses real-valued features (e.g., a language model score) as well as binary features based on the block identities themselves (e.g., block bigram features). The model training relies on an efficient enumeration of local block neighbors in parallel training data. A novel stochastic gradient descent (SGD) training algorithm is presented that can easily handle millions of features. Moreover, when viewing SMT as a block generation process, it becomes quite similar to sequential natural language annotation problems such as part-of-speech tagging, phrase chunking, or shallow parsing. Our novel approach is successfully tested on a standard Arabic-English translation task using two different phrase reordering models: a block orientation model and a phrase-distortion model.

14 citations
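The abstract above describes maximum-likelihood training of a log-linear model over millions of sparse features using stochastic gradient descent. The sketch below shows that kind of training loop in miniature for a binary log-linear classifier; the feature names, data and hyperparameters are illustrative assumptions, not the paper's block bigram model.

```python
# Sketch of SGD training for a log-linear (logistic) model over sparse
# features, mixing binary indicator features with a real-valued score.
# Data, feature names and hyperparameters are invented for illustration.
import math
import random
from collections import defaultdict

# Each example: (sparse feature dict, label in {0, 1}).
data = [
    ({"block_bigram=der|the": 1.0, "lm_score": 0.7}, 1),
    ({"block_bigram=der|a": 1.0, "lm_score": 0.2}, 0),
    ({"block_bigram=haus|house": 1.0, "lm_score": 0.9}, 1),
    ({"block_bigram=haus|mouse": 1.0, "lm_score": 0.1}, 0),
]

weights = defaultdict(float)   # sparse weight vector: only active features get entries
lr = 0.1                       # learning rate

def prob(feats):
    """P(label = 1 | feats) under the current log-linear model."""
    score = sum(weights[f] * v for f, v in feats.items())
    return 1.0 / (1.0 + math.exp(-score))

random.seed(0)
for epoch in range(50):
    random.shuffle(data)
    for feats, label in data:
        error = label - prob(feats)          # gradient of the log-likelihood
        for f, v in feats.items():
            weights[f] += lr * error * v     # update only the active features

for feats, label in data:
    print(label, round(prob(feats), 3))
```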

Journal Article
TL;DR: This paper proposes to integrate shallow parsing features and heuristic position information into the modeling of the training process without introducing a domain lexicon. It shows that after adding the proposed features, nearly all measures of both the conditional random fields model and the contrast model are improved, and that the results of the conditional random fields are more efficient than those of the contrast model.

13 citations
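As a minimal sketch of the kind of feature set the summary above describes, the function below builds per-token feature dicts that combine a shallow-parsing chunk tag with heuristic position information, in the shape a CRF toolkit would typically consume. The sentence, chunk tags and feature names are assumptions for illustration.

```python
# Sketch of per-token features combining a shallow-parsing chunk tag with
# heuristic position information, in the dict-per-token shape many CRF
# toolkits consume. Sentence, tags and feature names are invented.
def crf_features(sent, i):
    """sent is a list of (word, POS tag, chunk tag) triples; i is a token index."""
    word, pos, chunk = sent[i]
    n = len(sent)
    return {
        "word": word.lower(),
        "pos": pos,
        "chunk": chunk,                           # shallow-parsing feature
        "prev_chunk": sent[i - 1][2] if i > 0 else "BOS",
        "rel_pos": round(i / max(n - 1, 1), 2),   # heuristic position: relative offset
        "is_first": i == 0,
        "is_last": i == n - 1,
    }

sent = [("The", "DT", "B-NP"), ("gearbox", "NN", "I-NP"),
        ("failed", "VBD", "B-VP"), ("yesterday", "NN", "B-NP")]
print([crf_features(sent, i) for i in range(len(sent))])
```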


Network Information
Related Topics (5)
Machine translation
22.1K papers, 574.4K citations
81% related
Natural language
31.1K papers, 806.8K citations
79% related
Language model
17.5K papers, 545K citations
79% related
Parsing
21.5K papers, 545.4K citations
79% related
Query language
17.2K papers, 496.2K citations
74% related
Performance Metrics
No. of papers in the topic in previous years:

Year    Papers
2021    7
2020    12
2019    6
2018    5
2017    11
2016    11