The planning and production of a corpus of Yilumbu: Progress of the building stage
126 citations
"The planning and production of a co..." refers background in this paper
...ill-formed sentences based on handwritten regular expressions (Eckart, Quasthoff and Goldhahn, 2012); e. language identification at sentence basis (Biemann and Teresniak, 2005); f. duplicate sentence removal; g. tokenisation and word co-occurrence calculation; and h. the corpora are stored as...
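Steps f and g of the excerpt above (duplicate sentence removal, tokenisation and word co-occurrence calculation) can be illustrated with a minimal Python sketch. This is not the cited authors' implementation: the function names `dedupe`, `tokenise` and `cooccurrences` are hypothetical, tokenisation is reduced to a simple regular expression, co-occurrence is assumed to mean "two word types appear in the same sentence", and the language identification step is left out.

```python
import re
from collections import Counter
from itertools import combinations


def dedupe(sentences):
    """Drop exact duplicate sentences, keeping the first occurrence of each."""
    seen = set()
    unique = []
    for sentence in sentences:
        key = sentence.strip()
        if key and key not in seen:
            seen.add(key)
            unique.append(key)
    return unique


def tokenise(sentence):
    """Assumed simplification: lowercase the text and keep runs of word characters."""
    return re.findall(r"\w+", sentence.lower())


def cooccurrences(sentences):
    """Count, per sentence, each unordered pair of distinct word types."""
    counts = Counter()
    for sentence in sentences:
        types = sorted(set(tokenise(sentence)))
        counts.update(combinations(types, 2))
    return counts


if __name__ == "__main__":
    # Placeholder sentences, not real corpus data.
    raw = ["a b c", "a b c", "b c d"]
    clean = dedupe(raw)
    print(clean)
    print(cooccurrences(clean).most_common(3))
```

Sorting the word types before pairing them means each pair is stored under one canonical key, so `("b", "c")` and `("c", "b")` are never counted separately.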
77 citations
"The planning and production of a co..." refers methods in this paper
...According to Prinsloo (2000), when it comes to corpus compilation, there are three steps to be considered, namely: 1) corpus design; 2) text collection; and 3) text encoding....
41 citations
24 citations
16 citations
Additional excerpts
...The second type of texts refers to pedagogical and scientific contributions (Blanchon, 1984; Emejulu and Pambou-Loueya, 1990; Saphou-Bivigat, 2000; 2010; Mavoungou, 2002a; 2002b; 2002c; 2005; 2006; 2008; 2009; 2010a; 2010b; 2010c; 2011; 2012; Mboumba, 2009; Mavoungou and Plumel, 2010;...