scispace - formally typeset
Book ChapterDOI

GEO-NASS: A Semantic Tagging Experience from Geographical Data on the Media

Reads0
Chats0
TLDR
A method is proposed that combines geographic location techniques with Natural Language Processing and statistical and semantic disambiguation tools to perform an appropriate labeling in a general way and shows the potential of the proposal.
Abstract
From a documentary point of view, an important aspect when we are conducting a rigorous labeling is to consider the geographic locations related to each document. Although there exist tools and geographic databases, it is not easy to find an automated labeling system for multilingual texts specialized in this type of recognition and further adapted to a particular context. This paper proposes a method that combines geographic location techniques with Natural Language Processing and statistical and semantic disambiguation tools to perform an appropriate labeling in a general way. The method can be configured and fine-tuned for a given context in order to optimize the results. The paper also details an experience of using the proposed method over a content management system in a real organization a major Spanish newspaper. The experimental results obtained show an overall accuracy of around 80%, which shows the potential of the proposal.

read more

Citations
More filters
Proceedings ArticleDOI

The AIS Project: Boosting Information Extraction from Legal Documents by using Ontologies

TL;DR: This work presents an approach to obtain relevant information out from legal documents based on the use of ontologies to capture and take advantage of such structure and language conventions, and implements this approach in a framework that allows to address different types of documents with minimal effort.
Journal ArticleDOI

An Artificial Intelligence Driven Multi-Feature Extraction Scheme for Big Data Detection

TL;DR: A multi-feature fusion clustering algorithm is proposed based on user attention with two main stages and based on this, an artificial intelligence driven big data MFE scheme is designed, then an application example of the general scheme is expanded and detailed.
Proceedings Article

The GENIE Project - A Semantic Pipeline for Automatic Document Categorisation

TL;DR: A multi-language rule-based pipeline system for automatic document categorisation is proposed and the results of applying techniques that rely on statistics and supervised learning with the support of smarter tools based on language semantics and ontologies are compared.
Proceedings ArticleDOI

KGNR: A knowledge-based geographical news recommender

TL;DR: KGNR (Knowledge-based Geographical News Recommender), a new approach to develop a personalized news recommendation system as an application for mobile phones that takes into account the geolocation of the user and uses learned user profiles to generate personalized news recommendations.
Book ChapterDOI

Emerging Semantic-Based Applications

TL;DR: This chapter presents different semantic-based applications and projects that have been developed in the Distributed Information Systems (SID) research group of the University of Zaragoza and addresses different application fields benefiting from semantic technologies to broaden their capabilities.
References
More filters
Journal ArticleDOI

A translation approach to portable ontology specifications

TL;DR: This paper describes a mechanism for defining ontologies that are portable over representation systems, basing Ontolingua itself on an ontology of domain-independent, representational idioms.
Book

Introduction to Information Retrieval

TL;DR: In this article, the authors present an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections.
Journal ArticleDOI

Term Weighting Approaches in Automatic Text Retrieval

TL;DR: This paper summarizes the insights gained in automatic term weighting, and provides baseline single term indexing models with which other more elaborate content analysis procedures can be compared.
Book ChapterDOI

Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

TL;DR: This paper explores the use of Support Vector Machines for learning text classifiers from examples and analyzes the particular properties of learning with text data and identifies why SVMs are appropriate for this task.

OWL Web ontology language overview

TL;DR: This document provides an introduction to OWL by informally describing the features of each of the sublanguages of OWL, the Web Ontology Language by providing additional vocabulary along with a formal semantics.