Book ChapterDOI
Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer
Peter Bourgonje,Julián Moreno-Schneider,Jan Nehring,Georg Rehm,Felix Sasaki,Ankit Srivastava +5 more
- pp 65-68
Reads0
Chats0
TLDR
The platform is intended to enable human experts (knowledge workers) to get a grasp and understand the contents of large document collections in an efficient way so that they can curate, process and further analyse the collection according to their sector-specific needs.Abstract:
In an attempt to put a Semantic Web-layer that provides linguistic analysis and discourse information on top of digital content, we develop a platform for digital curation technologies. The platform offers language-, knowledge- and data-aware services as a flexible set of workflows and pipelines for the efficient processing of various types of digital content. The platform is intended to enable human experts (knowledge workers) to get a grasp and understand the contents of large document collections in an efficient way so that they can curate, process and further analyse the collection according to their sector-specific needs.read more
Citations
More filters
Proceedings ArticleDOI
From Clickbait to Fake News Detection: An Approach based on Detecting the Stance of Headlines to Articles
TL;DR: This work wants to contribute to the debate on how to deal with fake news and related online phenomena with technological means, by providing means to separate related from unrelated headlines and further classifying the related headlines.
Book ChapterDOI
An Infrastructure for Empowering Internet Users to Handle Fake News and Other Online Media Phenomena
TL;DR: An infrastructure to address phenomena of modern online media production, circulation and manipulation is proposed by establishing a distributed architecture for automatic processing and human feedback.
Proceedings Article
A Dataset of German Legal Documents for Named Entity Recognition
TL;DR: A dataset developed for Named Entity Recognition in German federal court decisions that consists of approx.
Posted Content
QURATOR: Innovative Technologies for Content and Data Curation
Georg Rehm,Peter Bourgonje,Stefanie Hegele,Florian Kintzel,Julián Moreno Schneider,Malte Ostendorff,Karolina Zaczynska,Armin Berger,Stefan Grill,Sören Räuchle,Jens Rauenbusch,Lisa Rutenburg,André Schmidt,Mikka Wild,Henry Hoffmann,Julian Fink,Sarah Schulz,Jurica Ševa,Joachim Quantz,Joachim Böttger,Josefine Matthey,Rolf Fricke,Jan Thomsen,Adrian Paschke,Jamal Al Qundus,Thomas Hoppe,Naouel Karam,Frauke Weichhardt,Christian Fillies,Clemens Neudecker,Mike Gerber,Kai Labusch,Vahid Rezanezhad,Robin Schaefer,David Zellhöfer,Daniel Siewert,Patrick Bunk,Lydia Pintscher,Elena Aleynikova,Franziska Heine +39 more
TL;DR: The QURATOR project, funded by the German Federal Ministry of Education and Research, develops a sustainable and innovative technology platform that provides services to support knowledge workers in various industries to address the challenges they face when curating digital content.
Proceedings ArticleDOI
Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters
Georg Rehm,Julián Moreno Schneider,Peter Bourgonje,Ankit Srivastava,Jan Nehring,Armin Berger,Luca König,Sören Räuchle,Jens Gerth +8 more
TL;DR: An approach at identifying a specific class of events, movement action events (MAEs), in a data set that consists of ca.
References
More filters
Proceedings ArticleDOI
DBpedia spotlight: shedding light on the web of documents
TL;DR: DBpedia Spotlight, a system for automatically annotating text documents with DBpedia URIs, is developed, and results are evaluated in light of three baselines and six publicly available annotation systems, demonstrating the competitiveness of the system.
Journal ArticleDOI
Learning multilingual named entity recognition from Wikipedia
TL;DR: The approach outperforms other approaches to automatic ne annotation; competes with gold-standard training when tested on an evaluation corpus from a different source; and performs 10% better than newswire-trained models on manually-annotated Wikipedia text.
Proceedings Article
WikiWars: A New Corpus for Research on Temporal Expressions
Pawel Mazur,Robert Dale +1 more
TL;DR: A new corpus of temporally-rich documents sourced from English Wikipedia, which is annotated with TIMEX2 tags, is presented, thus comparing favourably in size to other existing corpora used in these areas.
Introducing FREME: Deploying Linguistic Linked Data.
Felix Sasaki,Tatiana Gornostay,Milan Dojchinovski,Michele Osella,Erik Mannens,Giannis Stoitsis,Phil Ritchie,Thierry Declerck,Kevin Koidl +8 more
TL;DR: The paper discusses how the FREME project deploys Linguistic Linked Data (LLD), especially existing LLD resources, LLD best practices and the LLD reference architecture.