Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling
Citations
7,070 citations
Cites methods from "Incorporating Non-local Information..."
...This is implemented with a discriminative model using a CRF sequence tagger (Finkel et al., 2005)....
[...]
...With the default annotators, named entities are recognized using a combination of CRF sequence taggers trained on various corpora (Finkel et al., 2005), while numerical entities are recognized using two rule-based systems, one for money and numbers, and a separate state-of-the-art system for…...
[...]
...This is implemented with a discriminative model using a CRF sequence tagger (Finkel et al., 2005). pos Labels tokens with their part-of-speech (POS) tag, using a maximum entropy POS tagger (Toutanova et al., 2003). lemma Generates the lemmas (base forms) for all tokens in the annotation. gender…...
[...]
...annotators, named entities are recognized using a combination of CRF sequence taggers trained on various corpora (Finkel et al., 2005), while numerical entities are recognized using two rule-based systems, one for money and numbers, and a separate state-of-the-art system for processing temporal expressions (Chang and Manning, 2012)....
[...]
3,158 citations
Cites methods from "Incorporating Non-local Information..."
...These features are similar to the features extracted from Stanford NER tool (Finkel et al., 2005; Wang and Manning, 2013)....
[...]
2,965 citations
Cites methods from "Incorporating Non-local Information..."
...We perform named entity tagging using the Stanford four-class named entity tagger (Finkel et al., 2005)....
[...]
2,835 citations
Cites methods from "Incorporating Non-local Information..."
...They aligned Freebase relations with the New York Times corpus by tagging entities in text using Stanford NER (Finkel, Grenager, and Manning 2005) and linking them to Freebase IDs through string matching on names....
[...]
1,539 citations
Cites result from "Incorporating Non-local Information..."
...The results we obtained on the CoNLL03 test set were consistent with what was reported in (Finkel et al., 2005)....
[...]
References
41,772 citations
21,819 citations
18,761 citations
"Incorporating Non-local Information..." refers background or methods in this paper
...Geman and Geman (1984) show that it is easy to modify a Gibbs Markov chain to do annealing; at timet we replace the distribution in (1) with PA(s (t)|s(t−1)) = PM (s (t) i |s (t−1) −i ,o) 1/ct ∑ j PM (s (t) j |s (t−1) −j ,o) 1/ct (2) wherec = {c0, . . . , cT } defines acooling schedule....
[...]
...One such algorithm isGibbs sampling , a simple Monte Carlo algorithm that is appropriate for inference in any factored probabilistic model, including sequence models and probabilistic context free grammars (Geman and Geman, 1984)....
[...]
...Gibbs samplingprovides a clever solution (Geman and Geman, 1984)....
[...]
...One such algorithm isGibbs sampling, a simple Monte Carlo algorithm that is appropriate for inference in any factored probabilistic model, including sequence models and probabilistic context free grammars (Geman and Geman, 1984)....
[...]
...Geman and Geman (1984) show that it is easy to modify a Gibbs Markov chain to do annealing; at time t we replace the distribution in (1) with...
[...]
13,190 citations
"Incorporating Non-local Information..." refers background or methods in this paper
...Our basic CRF model follows that of Lafferty et al. (2001)....
[...]
...Statistical hidden state sequence models, such as Hidden Markov Models (HMMs) (Leek, 1997; Freitag and McCallum, 1999), Conditional Markov Models (CMMs) (Borthwick, 1999), and Conditional Random Fields (CRFs) (Lafferty et al., 2001) are a prominent recent approach to information extraction tasks....
[...]
...Our basic CRF model follows that of Lafferty et al. (2001). We choose a CRF because it represents the state of the art in sequence modeling, allowing both discriminative training and the bi-directional flow of probabilistic information across the sequence....
[...]