Open Access, Proceedings Article

A general architecture to enhance wiki systems with natural language processing techniques

TLDR
This research work presents a general architecture that allows wiki systems to benefit from NLP services made available through the Semantic Assistants framework – a service-oriented architecture for brokering NLP pipelines as web services.
Abstract
Wikis are web-based software applications that allow users to collaboratively create and edit web page content through a web browser, using a simplified syntax. The ease of use and “open” philosophy of wikis have brought them to the attention of organizations and online communities, leading to widespread adoption as a simple and “quick” way of collaborative knowledge management. However, these characteristics of wiki systems can act as a double-edged sword: when wiki content is not properly structured, it can turn into a “tangle of links”, making navigation, organization and content retrieval difficult for end users. Since wiki content is mostly written in unstructured natural language, we believe that existing state-of-the-art techniques from the Natural Language Processing (NLP) and Semantic Computing domains can help mitigate these common problems and improve the experience of wiki users by introducing new features. The challenge, however, is to find a solution for integrating novel semantic analysis algorithms into the multitude of existing wiki systems without modifying their engines. In this research work, we present a general architecture that allows wiki systems to benefit from NLP services made available through the Semantic Assistants framework – a service-oriented architecture for brokering NLP pipelines as web services. Our main contributions in this thesis include an analysis of wiki engines, the development of collaboration patterns between wikis and NLP, and the design of a cohesive integration architecture. As a concrete application, we deployed our integration to MediaWiki – the powerful wiki engine behind Wikipedia – to prove its practicability.
Finally, we evaluate the usability and efficiency of our integration through a number of user studies we performed in real-world projects from various domains, including cultural heritage data management, software requirements engineering, and biomedical literature curation.
