Showing papers in "Zeitschrift Fur Katalanistik in 2020"
•
TL;DR: The CORMA corpus as mentioned in this paper is a corpus of peninsular Spanish including spontaneous conversations recorded in Madrid between 2016 and 2019, which was compiled in order to remedy the scarce documentation of 21st century colloquial Spanish.
Abstract: CORMA is a corpus of peninsular Spanish including spontaneous conversations recorded in Madrid between 2016 and 2019. The corpus was compiled in order
to remedy the scarce documentation of 21st century colloquial Spanish. Indeed, a short
overview of the corpora of conversational Spanish shows that there is a sharp contrast
between the increasing interest in linguistic phenomena representative of this variety
and the availability of suitable empirical data. This paper discusses the different steps
that were taken to build the dataset, and presents the main characteristics of the final
CORMA corpus. It also includes an illustration of how the data can be successfully
exploited through a pilot study of the rise and spread of the intensification marker mazo
(de) in young speak since the beginning of the 21st century
3 citations