scispace - formally typeset
Search or ask a question
Topic

Shallow parsing

About: Shallow parsing is a research topic. Over the lifetime, 397 publications have been published within this topic receiving 10211 citations.


Papers
More filters
Proceedings ArticleDOI
12 Apr 2003
TL;DR: To investigate the contributions of taggers or chunkers to the performance of a deep syntactic parser, Weighted Constraint Dependency Grammars have been extended to also take into consideration information from external sources.
Abstract: To investigate the contributions of taggers or chunkers to the performance of a deep syntactic parser, Weighted Constraint Dependency Grammars have been extended to also take into consideration information from external sources. Using a weak information fusion scheme based on constraint optimization techniques, a parsing accuracy has been achieved which is comparable to other (stochastic) parsers.

30 citations

Proceedings ArticleDOI
Lin Li1, Xia Hu1, Biyun Hu1, Jun Wang1, Yiming Zhou1 
12 Jul 2009
TL;DR: Experiments show that the proposed method makes the sentence similarity comparison more exactly and give out a more reasonable result, which is similar to the people's comprehension to the meanings of the sentences.
Abstract: The paper proposes to determine sentence similarities from different aspects. Based on the information people get from a sentence, Objects-Specified Similarity, Objects-Property Similarity, Objects-Behavior Similarity and Overall Similarity are defined to determine sentence similarities from four aspects. Experiments show that the proposed method makes the sentence similarity comparison more exactly and give out a more reasonable result, which is similar to the people's comprehension to the meanings of the sentences.

29 citations

Journal ArticleDOI
TL;DR: DiSeg is presented, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules, which means that discourse segmentation is possible using shallow parsing.
Abstract: Nowadays discourse parsing is a very prominent research topic. However, there is not a discourse parser for Spanish texts. The first stage in order to develop this tool is discourse segmentation. In this work, we present DiSeg, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules. We describe the system and we evaluate its performance against a gold standard corpus, divided in a medical and a terminological subcorpus. We obtain promising results, which means that discourse segmentation is possible using shallow parsing.

29 citations

01 Jan 2002
TL;DR: It is argued that a chunked syntactic representation can usefully be exploited as such for non trivial NLP applications which do not require full text understanding such as automatic lexical acquisition and information retrieval.
Abstract: This paper illustrates a technique of shallow parsing named “text chunking” whereby “parse incompleteness” is reinterpreted as “parse underspecification”. A text is chunked into structured units which can be identified with certainty on the basis of available knowledge. The chunking process stops at that level of granularity beyond which the analysis gets undecidable. We argue that a chunked syntactic representation can usefully be exploited as such for non trivial NLP applications which do not require full text understanding such as automatic lexical acquisition and information retrieval.

28 citations

Patent
07 Jun 2004
TL;DR: This article proposed a method for parsing Chinese sentences by employing lexical and syntactical information to extract more prominent entities in a Chinese sentence, and the sentence is then transformed into a Triple representation by employing the Triple rules referring to elemental Chinese syntax.
Abstract: A method for processing natural language Chinese sentences can transform a Chinese sentence into a Triple representation using shallow parsing techniques. The method is concerned with parsing Chinese sentences by employing lexical and syntactical information to extract more prominent entities in a Chinese sentence, and the sentence is then transformed into a Triple representation by employing the Triple rules referring to elemental Chinese syntax—SVO (subject, verb, and object in order). The lexical and syntactical information in our method is referring a lexicon possessed of part-of-speech (POS) information and phrase-level syntax in Chinese respectively. The Triple representation consists of three elements which are agent, predicate, and patient in a sentence.

27 citations


Network Information
Related Topics (5)
Machine translation
22.1K papers, 574.4K citations
81% related
Natural language
31.1K papers, 806.8K citations
79% related
Language model
17.5K papers, 545K citations
79% related
Parsing
21.5K papers, 545.4K citations
79% related
Query language
17.2K papers, 496.2K citations
74% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20217
202012
20196
20185
201711
201611