Topic
Shallow parsing
About: Shallow parsing is a research topic. Over the lifetime, 397 publications have been published within this topic receiving 10211 citations.
Papers published on a yearly basis
Papers
More filters
••
12 Apr 2003TL;DR: To investigate the contributions of taggers or chunkers to the performance of a deep syntactic parser, Weighted Constraint Dependency Grammars have been extended to also take into consideration information from external sources.
Abstract: To investigate the contributions of taggers or chunkers to the performance of a deep syntactic parser, Weighted Constraint Dependency Grammars have been extended to also take into consideration information from external sources. Using a weak information fusion scheme based on constraint optimization techniques, a parsing accuracy has been achieved which is comparable to other (stochastic) parsers.
30 citations
••
12 Jul 2009TL;DR: Experiments show that the proposed method makes the sentence similarity comparison more exactly and give out a more reasonable result, which is similar to the people's comprehension to the meanings of the sentences.
Abstract: The paper proposes to determine sentence similarities from different aspects. Based on the information people get from a sentence, Objects-Specified Similarity, Objects-Property Similarity, Objects-Behavior Similarity and Overall Similarity are defined to determine sentence similarities from four aspects. Experiments show that the proposed method makes the sentence similarity comparison more exactly and give out a more reasonable result, which is similar to the people's comprehension to the meanings of the sentences.
29 citations
••
TL;DR: DiSeg is presented, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules, which means that discourse segmentation is possible using shallow parsing.
Abstract: Nowadays discourse parsing is a very prominent research topic. However, there is not a discourse parser for Spanish texts. The first stage in order to develop this tool is discourse segmentation. In this work, we present DiSeg, the first discourse segmenter for Spanish, which uses the framework of Rhetorical Structure Theory and is based on lexical and syntactic rules. We describe the system and we evaluate its performance against a gold standard corpus, divided in a medical and a terminological subcorpus. We obtain promising results, which means that discourse segmentation is possible using shallow parsing.
29 citations
01 Jan 2002
TL;DR: It is argued that a chunked syntactic representation can usefully be exploited as such for non trivial NLP applications which do not require full text understanding such as automatic lexical acquisition and information retrieval.
Abstract: This paper illustrates a technique of shallow parsing named “text chunking” whereby “parse incompleteness” is reinterpreted as “parse underspecification”. A text is chunked into structured units which can be identified with certainty on the basis of available knowledge. The chunking process stops at that level of granularity beyond which the analysis gets undecidable. We argue that a chunked syntactic representation can usefully be exploited as such for non trivial NLP applications which do not require full text understanding such as automatic lexical acquisition and information retrieval.
28 citations
•
07 Jun 2004
TL;DR: This article proposed a method for parsing Chinese sentences by employing lexical and syntactical information to extract more prominent entities in a Chinese sentence, and the sentence is then transformed into a Triple representation by employing the Triple rules referring to elemental Chinese syntax.
Abstract: A method for processing natural language Chinese sentences can transform a Chinese sentence into a Triple representation using shallow parsing techniques. The method is concerned with parsing Chinese sentences by employing lexical and syntactical information to extract more prominent entities in a Chinese sentence, and the sentence is then transformed into a Triple representation by employing the Triple rules referring to elemental Chinese syntax—SVO (subject, verb, and object in order). The lexical and syntactical information in our method is referring a lexicon possessed of part-of-speech (POS) information and phrase-level syntax in Chinese respectively. The Triple representation consists of three elements which are agent, predicate, and patient in a sentence.
27 citations