scispace - formally typeset
Journal ArticleDOI

Terminological ontology learning and population using latent Dirichlet allocation

TLDR
An ontology learning and population system that combines both statistical and semantic methodologies is presented that achieves good performances on standard datasets.
Abstract
The success of Semantic Web will heavily rely on the availability of formal ontologies to structure machine understanding data. However, there is still a lack of general methodologies for ontology automatic learning and population, i.e. the generation of domain ontologies from various kinds of resources by applying natural language processing and machine learning techniques In this paper, the authors present an ontology learning and population system that combines both statistical and semantic methodologies. Several experiments have been carried out, demonstrating the effectiveness of the proposed approach. HighlightsA graph of terms can be effectively used for ontology building.Such a graph is extracted from documents thanks to a LDA based methodology.Ontology learning involves the use of annotated lexicons (WordNet).Proposed method achieves good performances on standard datasets.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Information extraction meets the semantic web: a survey

TL;DR: Millennium Institute for Foundational Research on Data (IMFD) Comision Nacional de Investigacion Cientifica y Tecnologica (CONICYT), CONICyT FONDECYT: 1181896
Journal ArticleDOI

Semi-automatic terminology ontology learning based on topic modeling

TL;DR: In this paper, two topic modeling algorithms are explored, namely LSI and SVD and Mr.LDA for learning topic ontology and the objective is to determine the statistical relationship between document and terms to build a topic ontologies and ontology graph with minimum human intervention.
Journal ArticleDOI

Automatic Concept Extraction Based on Semantic Graphs From Big Data in Smart City

TL;DR: A novel concept extraction method that can effectively use semantic information, and the results of the concept extraction are better from domain big data in smart cities is proposed.
Journal ArticleDOI

CHAT-Bot: A cultural heritage aware teller-bot for supporting touristic experiences

TL;DR: A recommender system capable of developing adaptive tourist routes according to both the profile of the tourist and contextual aspects is introduced and a prototype was developed to support the user in building a customized tourist route related to some of the most important cultural sites in Campania.
Journal ArticleDOI

Automatic Non-Taxonomic Relation Extraction from Big Data in Smart City

TL;DR: A multi-phase correlation search framework to automatically extract non-taxonomic relations from domain documents and a Semantic Graph-Based method to combine structure information of semantic graph and context information of terms together for non- taxonomic relationships identification is proposed.
References
More filters
Journal ArticleDOI

Latent dirichlet allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
Proceedings Article

Latent Dirichlet Allocation

TL;DR: This paper proposed a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hof-mann's aspect model, also known as probabilistic latent semantic indexing (pLSI).
Proceedings ArticleDOI

Verb semantics and lexical selection

Abstract: This paper will focus on the semantic representation of verbs in computer systems and its impact on lexical selection problems in machine translation (MT). Two groups of English and Chinese verbs are examined to show that lexical selection must be based on interpretation of the sentences as well as selection restrictions placed on the verb arguments. A novel representation scheme is suggested, and is compared to representations with selection restrictions used in transfer-based MT. We see our approach as closely aligned with knowledge-based MT approaches (KBMT), and as a separate component that could be incorporated into existing systems. Examples and experimental results will show that, using this scheme, inexact matches can achieve correct lexical selection.
Book

Ontology Learning for the Semantic Web

TL;DR: The authors present an ontology learning framework that extends typical ontology engineering environments by using semiautomatic ontology construction tools and encompasses ontology import, extraction, pruning, refinement and evaluation.
Proceedings ArticleDOI

WordNet::Similarity: measuring the relatedness of concepts

TL;DR: WordNet::Similarity as mentioned in this paper is a Perl package that makes it possible to measure the semantic similarity and relatedness between a pair of concepts (or synsets) using WordNet.
Related Papers (5)