Open Access
xTAS and ThemeStreams: Extendable Text Analysis Service and its Usage in a Topic Monitoring Tool
Ork de Rooij, Tom Kenter, Maarten de Rijke +2 more
pp. 58-59
TLDR
xTAS is an extendable multi-user text analysis service for large-scale multilingual document analysis, developed at the University of Amsterdam. It processes large numbers of documents in a timely manner through a web interface that multiple users can use at once.
Abstract:
xTAS is an extendable multi-user text analysis service for large-scale multilingual document analysis developed at the University of Amsterdam. It can process large numbers of documents in a timely manner through a web interface that multiple users can use at once. In this demonstration paper we present recent additions, which include semanticization, on-the-fly TF-IDF model generation, and on-the-fly co-occurrence metrics. Furthermore, we demonstrate ThemeStreams, a novel topic monitoring tool built on top of xTAS.
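To illustrate what generating a TF-IDF model over a document set involves, here is a minimal sketch in plain Python. It is not the xTAS API: the function name `tf_idf`, the sample documents, and the particular weighting variant (length-normalized term frequency times natural-log inverse document frequency) are assumptions chosen for illustration only.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Hypothetical helper, not part of xTAS. Assumes each document is a
    list of tokens and uses tf = count / len(doc), idf = ln(N / df).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        })
    return weights


# Illustrative input only; these sentences are made up.
docs = [
    "the election dominated the news".split(),
    "the budget debate dominated parliament".split(),
    "football news".split(),
]
scores = tf_idf(docs)
```

Under this weighting, a term that occurs in every document gets weight zero, and terms concentrated in few documents score highest. A service computing this on the fly would presumably rebuild the document-frequency table for whichever document set the user selects, but how xTAS does so is not described here.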