T
Thomas Proisl
Researcher at University of Erlangen-Nuremberg
Publications - 28
Citations - 341
Thomas Proisl is an academic researcher from University of Erlangen-Nuremberg. The author has contributed to research in topics: Collocation & Stylometry. The author has an hindex of 9, co-authored 27 publications receiving 280 citations.
Papers
More filters
Journal ArticleDOI
Understanding and explaining Delta measures for authorship attribution
Stefan Evert,Thomas Proisl,Fotis Jannidis,Isabella Reger,Steffen Pielström,Christof Schöch,Thorsten Vitt +6 more
TL;DR: It is shown that feature vector normalization, that is, the transformation of the feature vectors to a uniform length of 1 (implicit in the cosine measure), is the decisive factor for the improvement of Delta proposed recently.
Proceedings ArticleDOI
SoMaJo: State-of-the-art tokenization for German web and social media texts
Thomas Proisl,Peter Uhrig +1 more
TL;DR: SoMaJo, a rulebased tokenizer for German web and social media texts that was the best-performing system in the EmpiriST 2015 shared task with an average F1-score of 99.57 is described.
Proceedings Article
KLUE: Simple and robust methods for polarity classification
TL;DR: This paper uses simple bag-of-words models, a freely available sentiment dictionary automatically extended with distributionally similar terms, as well as lists of emoticons and internet slang abbreviations in conjunction with fast and robust machine learning algorithms to solve the SemEval-2013 sentiment analysis task.
Proceedings ArticleDOI
SemantiKLUE: Robust Semantic Similarity at Multiple Levels Using Maximum Weight Matching
TL;DR: The SemantiKLUE system is a word-to-word alignment of two texts using a maximum weight matching algorithm that combines unsupervised and supervised techniques into a robust system for measuring semantic similarity.
E-VIEW-alation – a Large-scale Evaluation Study of Association Measures for Collocation Identification
TL;DR: An interactive Web-based application is created that allows users to manipulate all evaluation parameters and dynamically updates evaluation graphs and summaries, and shows the results of a large-scale evaluation study against two different gold standards of lexical collocations.