scispace - formally typeset
Open AccessProceedings Article

Stochastic Parse-Tree Recognition by a Pushdown Automaton.

Frederic Tendeau
- pp 234-249
Reads0
Chats0
TLDR
The stochastic generalization of what is usually called correctness theorems is presented: the probabilities computed operationally by the parsing algorithms are the same as those defined denotationally on the trees and forests defined by the grammar.
Abstract
We present the stochastic generalization of what is usually called correctness theorems: we guarantee that the probabilities computed operationally by the parsing algorithms are the same as those defined denotationally on the trees and forests defined by the grammar The main idea of the paper is to precisely relate the parsing strategy with a parse-tree exploration strategy: a computational path of a parsing algorithm simply performs an exploration of a parse-tree for the input portion already parsed This approach is applied in particular to Earley and Left-Corner parsing algorithms Probability computations follow parsing operations: looping problems (in rule prediction and subtree recognition) are solved by introducing probability variables (which may not be immediately evaluated) Convergence is ensured by the syntactic construction that leads to stochastic equations systems, which are solved as soon as possible Our algorithms accept any (probabilistic) CF grammar No restrictions are made such as prescribing normal form, proscribing empty rules or cyclic grammars

read more

Citations
More filters
Proceedings Article

Probabilistic parsing strategies

TL;DR: The authors showed that preservation of probability distribution is possible under two conditions, viz. the correct-prefix property and the property of strong predictiveness, for purely symbolic context-free parsing strategies.
Journal ArticleDOI

Probabilistic parsing strategies

TL;DR: It is shown that preservation of probability distribution is possible under two conditions, viz. the correct-prefix property and the property of strong predictiveness in purely symbolic context-free parsing strategies.
Journal ArticleDOI

Computing abstract decorations of parse forests using dynamic programming and algebraic power series

TL;DR: The formalism of algebraic power series over non-commuting variables is used to show how to apply dynamic programming techniques to compute decorations in an abstract semiring without specializing for a particular interpretation such as booleans, forests, or any decoration domain with more practical purposes.
Proceedings ArticleDOI

Probabilistic Parsing Strategies

TL;DR: These results generalize existing results in the literature that were obtained by considering parsing strategies in isolation and provide a necessary condition and a sufficient condition for the probabilistic extension of parsing strategies.
Proceedings Article

An Earley Algorithm for Generic Attribute Augmented Grammars and Applications.

TL;DR: In this paper, an extension of Earley's algorithm which computes the decoration of a shared forest in a generic domain is described, where the computations are defined by a morphism from leftmost derivations to the generic domain.
References
More filters

An efficient context-free parsing algorithm

TL;DR: A parsing algorithm which seems to be the most efficient general context-free algorithm known is described and appears to be superior to the top-down and bottom-up algorithms studied by Griffiths and Petrick.
Posted Content

An Efficient Probabilistic Context-Free Parsing Algorithm that Computes Prefix Probabilities

TL;DR: In this paper, an extension of Earley's parser for stochastic context-free grammars is presented, which computes probabilities of successive prefixes being generated by the grammar, probabilities of substrings being produced by the nonterminals, including the entire string, most likely (Viterbi) parse of the string, posterior expected number of applications of each grammar production, as required for reestimating rule probabilities.
Journal ArticleDOI

Applying Probability Measures to Abstract Languages

TL;DR: The problem of assigning a probability to each word of a language is considered and two methods are discussed.
Book ChapterDOI

Towards a Uniform Formal Framework for Parsing

TL;DR: Many of the formalisms used to define the syntax of natural (and programming) languages may be located in a continuum that ranges from propositional Horn logic to full first order Horn logic, possibly with non-Herbrand interpretations.
Proceedings ArticleDOI

How to cover a grammar

TL;DR: A novel formalism is presented for Earley-like parsers that accommodates the simulation of non-deterministic pushdown automata and is applied to non-Deterministic LR-parsers for RTN grammars.