The Penn Chinese TreeBank: Phrase structure annotation of a large corpus
Citations
801 citations
Cites background or methods from "The Penn Chinese TreeBank: Phrase s..."
...The Chinese data are taken from the Penn Chinese Treebank (CTB), version 5.1 (Xue et al. 2005), and the texts are mostly from Xinhua newswire, Sinorama news magazine and Hong Kong News....
783 citations
773 citations
633 citations
531 citations
Cites methods from "The Penn Chinese TreeBank: Phrase s..."
...The Chinese data used in the shared task is based on Chinese Treebank 6.0 and the Chinese Proposition Bank 2.0, both of which are publicly available via the Linguistic Data Consortium....
...The version of the Chinese Treebank used in this shared task, CTB 6.0, includes newswire, magazine articles, and transcribed broadcast news....
...The Chinese Proposition Bank adds a layer of semantic annotation to the syntactic parses in the Chinese Treebank....
...The Chinese Treebank and the Chinese Proposition Bank were funded by DOD, NSF and DARPA....
...The data sources of the Chinese Treebank range from Xinhua newswire (mainland China), Hong Kong news, and Sinorama Magazine (Taiwan)....
References
8,377 citations
7,936 citations
"The Penn Chinese TreeBank: Phrase s..." refers to background in this paper
...While the influence of Government and Binding (GB) theory (Chomsky 1981) and X-bar theory (Jackendoff 1977) is obvious in our corpus, we do not adopt the whole package....
2,416 citations
1,709 citations
"The Penn Chinese TreeBank: Phrase s..." refers to background in this paper
...Most notably, the Penn English Treebank (Marcus, Santorini and Marcinkiewicz 1993) has proven to be a crucial resource in the recent success of English Part-Of-Speech (POS) taggers and parsers (Collins 1997, 2000; Charniak 2000), as it provides common training and testing material so that different algorithms can be compared and progress be gauged....