scispace - formally typeset
Open Access

xTAS and ThemeStreams : Extendable Text Analysis Service and its Usage in a Topic Monitoring Tool

Reads0
Chats0
TLDR
The xTAS system as mentioned in this paper is an extendable multi-user text analysis service for large scale multi-lingual document analysis developed at the University of Amsterdam, which can process large amounts of documents in a timely manner through a web interface that can be used by multiple users at once.
Abstract
xTAS is an extendable multi-user text analysis service for large scale multi-lingual document analysis developed at the University of Amsterdam. It can process large amounts of documents in a timely manner through a web interface that can be used by multiple users at once. In this demonstration paper we present recent additions which include semanticization, on the fly TF-IDF model generation and on the fly co-occurrence metrics. Furthermore, we demonstrate ThemeStreams, a novel topic monitoring tool built on top of xTAS.

read more

References
More filters

N-gram-based text categorization

TL;DR: An N-gram-based approach to text categorization that is tolerant of textual errors is described, which worked very well for language classification and worked reasonably well for classifying articles from a number of different computer-oriented newsgroups according to subject.
Journal ArticleDOI

Stacked Graphs – Geometry & Aesthetics

TL;DR: It is suggested that this type of complex layered graph is effective for displaying large data sets to a mass audience and the design decisions and algorithms behind these graphics are described.
Related Papers (5)