Book Chapter

Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer

Abstract
To put a Semantic Web layer that provides linguistic analysis and discourse information on top of digital content, we develop a platform for digital curation technologies. The platform offers language-, knowledge- and data-aware services as a flexible set of workflows and pipelines for the efficient processing of various types of digital content. It is intended to enable human experts (knowledge workers) to grasp and understand the contents of large document collections efficiently, so that they can curate, process and further analyse each collection according to their sector-specific needs.
