scispace - formally typeset
Journal ArticleDOI

A New Similarity Measure for Automatic Construction of the Unknown Word Lexical Dictionary

Myunggwon Hwang, +1 more
- 01 Jan 2009 - 
- Vol. 5, Iss: 1, pp 48-64
Reads0
Chats0
TLDR
In this paper, the concept of unknown word (UW) is defined and a method to construct a lexical dictionary of unknown words through inputting various document collections scattered on the web is proposed.
Abstract
This article deals with research that automatically constructs a lexical dictionary of unknown words. The lexical dictionary has been usefully applied to various fields for semantic information processing. It has limitations in which it only processes terms defined in the dictionary. Under this circumstance, the concept of “Unknown Word (UW)†is defined. UW, in this research, is considered a word not defined in WordNet. Here is where a new method to construct UW lexical dictionary through inputting various document collections scattered on the web is proposed. We grasp related terms of UW and measure semantic relatedness (similarity) between an UW and a related term(s). The relatedness is obtained by calculating both probabilistic relationship and semantic relationship. This research can extend UW lexical dictionary with an abundant number of UW. It is also possible to prepare a foundation for semantic retrieval by simultaneously using the UW lexical dictionary and WordNet.

read more

Citations
More filters
Journal ArticleDOI

Automatic Enrichment of Semantic Relation Network and Its Application to Word Sense Disambiguation

TL;DR: A rule based method using WordNet's glossaries and an inference method using axioms for WordNet relations are applied for the enrichment and an enriched WordNet (E- wordNet) is built as the result, substantiating the usefulness of E-WordNet.
Journal ArticleDOI

A New Model to Compute the Information Content of Concepts from Taxonomic Knowledge

TL;DR: This paper proposes a new model to compute Information Content IC of a concept exploiting the taxonomic knowledge modeled in an ontology, and shows that the use of the authors' model produces, in most cases, more accurate similarity estimations than related works.
Journal ArticleDOI

Tools for the Automatic Generation of Ontology Documentation: A Task-Based Evaluation

TL;DR: Three tools are described, LODE, Parrot and the OWLDoc-based Ontology Browser, that can be used automatically to create documentation from a well-formed OWL ontology at any stage of its development.
Proceedings ArticleDOI

Information Retrieval Techniques to Grasp User Intention in Pervasive Computing Environment

TL;DR: An approach based on co-occurrence and statistical method, kinds of information retrieval technique, to grasp user intention based on diverse device sensors (context information), including both physical and logical objects is suggested.
Journal ArticleDOI

A term normalization method for efficient knowledge acquisition through text processing

TL;DR: A method of term normalization is proposed which finds a normalized form ( original and standard form defined in dictionaries) of variant terms to solve the problem of variations in terms.
References
More filters
Journal ArticleDOI

Introduction to WordNet: An On-line Lexical Database

TL;DR: Standard alphabetical procedures for organizing lexical information put together words that are spelled alike and scatter words with similar or related meanings haphazardly through the list.
Proceedings ArticleDOI

Feature-rich part-of-speech tagging with a cyclic dependency network

TL;DR: A new part-of-speech tagger is presented that demonstrates the following ideas: explicit use of both preceding and following tag contexts via a dependency network representation, broad use of lexical features, and effective use of priors in conditional loglinear models.
Proceedings ArticleDOI

Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger

TL;DR: This paper presents results for a maximum-entropy-based part of speech tagger, which achieves superior performance principally by enriching the information sources used for tagging by incorporating these features: more extensive treatment of capitalization for unknown words, and features for the disambiguation of the tense forms of verbs.
Journal ArticleDOI

Semantic annotation, indexing, and retrieval

TL;DR: This paper presents a semantically enhanced information extraction system, which provides automatic semantic annotation with references to classes in the ontology and to instances and argues that such large-scale, fully automatic methods are essential for the transformation of the current largely textual web into a Semantic Web.
Journal ArticleDOI

Automatic ontology-based knowledge extraction from Web documents

TL;DR: The Artequakt project is considered, which links a knowledge extraction tool with an ontology to achieve continuous knowledge support and guide information extraction and is further enhanced using a lexicon-based term expansion mechanism that provides extended ontology terminology.
Related Papers (5)