Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer

doi:10.1007/978-3-319-47602-5_14

Book ChapterDOI

Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer

Peter Bourgonje, +5 more

- pp 65-68

Chats0

TLDR

The platform is intended to enable human experts (knowledge workers) to get a grasp and understand the contents of large document collections in an efficient way so that they can curate, process and further analyse the collection according to their sector-specific needs.

Abstract:

In an attempt to put a Semantic Web-layer that provides linguistic analysis and discourse information on top of digital content, we develop a platform for digital curation technologies. The platform offers language-, knowledge- and data-aware services as a flexible set of workflows and pipelines for the efficient processing of various types of digital content. The platform is intended to enable human experts (knowledge workers) to get a grasp and understand the contents of large document collections in an efficient way so that they can curate, process and further analyse the collection according to their sector-specific needs.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

From Clickbait to Fake News Detection: An Approach based on Detecting the Stance of Headlines to Articles

Peter Bourgonje, +2 more

TL;DR: This work wants to contribute to the debate on how to deal with fake news and related online phenomena with technological means, by providing means to separate related from unrelated headlines and further classifying the related headlines.

...read moreread less

Book ChapterDOI

An Infrastructure for Empowering Internet Users to Handle Fake News and Other Online Media Phenomena

Georg Rehm

TL;DR: An infrastructure to address phenomena of modern online media production, circulation and manipulation is proposed by establishing a distributed architecture for automatic processing and human feedback.

...read moreread less

Proceedings Article

A Dataset of German Legal Documents for Named Entity Recognition

Elena Leitner, +2 more

TL;DR: A dataset developed for Named Entity Recognition in German federal court decisions that consists of approx.

...read moreread less

Posted Content

QURATOR: Innovative Technologies for Content and Data Curation

Georg Rehm, +39 more

- 25 Apr 2020 -

arXiv: Digital Libraries

TL;DR: The QURATOR project, funded by the German Federal Ministry of Education and Research, develops a sustainable and innovative technology platform that provides services to support knowledge workers in various industries to address the challenges they face when curating digital content.

...read moreread less

Proceedings ArticleDOI

Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters

Georg Rehm, +8 more

TL;DR: An approach at identifying a specific class of events, movement action events (MAEs), in a data set that consists of ca.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

DBpedia spotlight: shedding light on the web of documents

Pablo N. Mendes, +3 more

TL;DR: DBpedia Spotlight, a system for automatically annotating text documents with DBpedia URIs, is developed, and results are evaluated in light of three baselines and six publicly available annotation systems, demonstrating the competitiveness of the system.

...read moreread less

Journal ArticleDOI

Learning multilingual named entity recognition from Wikipedia

Joel Nothman, +4 more

- 01 Jan 2013 -

Artificial Intelligence

TL;DR: The approach outperforms other approaches to automatic ne annotation; competes with gold-standard training when tested on an evaluation corpus from a different source; and performs 10% better than newswire-trained models on manually-annotated Wikipedia text.

...read moreread less

Proceedings Article

WikiWars: A New Corpus for Research on Temporal Expressions

Pawel Mazur, +1 more

TL;DR: A new corpus of temporally-rich documents sourced from English Wikipedia, which is annotated with TIMEX2 tags, is presented, thus comparing favourably in size to other existing corpora used in these areas.

...read moreread less

Dissertation

Hypertextsorten: Definition - Struktur - Klassifikation.

Georg Rehm

Introducing FREME: Deploying Linguistic Linked Data.

Felix Sasaki, +8 more

TL;DR: The paper discusses how the FREME project deploys Linguistic Linked Data (LLD), especially existing LLD resources, LLD best practices and the LLD reference architecture.

...read moreread less

Related Papers (5)

Processing Document Collections to Automatically Extract Linked Data: Semantic Storytelling Technologies for Smart Curation Workflows

Bourgonje Peter, +3 more

Towards a Platform for Curation Technologies: Enriching Text Collections with a Semantic-Web Layer

Citations

From Clickbait to Fake News Detection: An Approach based on Detecting the Stance of Headlines to Articles

An Infrastructure for Empowering Internet Users to Handle Fake News and Other Online Media Phenomena

A Dataset of German Legal Documents for Named Entity Recognition

QURATOR: Innovative Technologies for Content and Data Curation

Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters

References

DBpedia spotlight: shedding light on the web of documents

Learning multilingual named entity recognition from Wikipedia

WikiWars: A New Corpus for Research on Temporal Expressions

Hypertextsorten: Definition - Struktur - Klassifikation.

Introducing FREME: Deploying Linguistic Linked Data.

Related Papers (5)

Processing Document Collections to Automatically Extract Linked Data: Semantic Storytelling Technologies for Smart Curation Workflows

Towards User Interfaces for Semantic Storytelling

Designing User Interfaces for Curation Technologies

Event Detection and Semantic Storytelling: Generating a Travelogue from a large Collection of Personal Letters

Exploiting Ontology Lexica for Generating Natural Language Texts from RDF Data