A New Similarity Measure for Automatic Construction of the Unknown Word Lexical Dictionary

doi:10.4018/JSWIS.2009010102

Journal Article

Myunggwon Hwang, +1 more

- 01 Jan 2009 -

International Journal on Semantic Web an...

- Vol. 5, Iss: 1, pp 48-64

TLDR

This paper defines the concept of an unknown word (UW) and proposes a method for constructing a lexical dictionary of unknown words from various document collections scattered on the web.

Abstract:

This article deals with research that automatically constructs a lexical dictionary of unknown words. Lexical dictionaries have been usefully applied to various fields for semantic information processing, but they are limited in that they can only process terms defined in the dictionary. To address this limitation, the concept of “Unknown Word (UW)” is defined: in this research, a UW is a word not defined in WordNet. A new method is proposed that constructs a UW lexical dictionary from various document collections scattered on the web. The method identifies terms related to a UW and measures the semantic relatedness (similarity) between the UW and each related term. The relatedness is obtained by calculating both a probabilistic relationship and a semantic relationship. This research can extend the UW lexical dictionary with an abundant number of UWs. It also prepares a foundation for semantic retrieval that uses the UW lexical dictionary and WordNet simultaneously.
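The abstract does not give the formulas, so the following is only a minimal sketch of how a probabilistic score and a semantic score might be blended into one relatedness value. Everything here is an assumption made for illustration: the document-level co-occurrence estimate, the linear mix controlled by `alpha`, the function names, and the `semantic_score` callable, which stands in for whatever WordNet-based measure the article actually uses.

```python
from collections import Counter


def cooccurrence_probability(documents, unknown_word):
    """Estimate P(term | unknown_word) from document-level co-occurrence."""
    # Keep only the documents that mention the unknown word.
    uw_docs = [set(doc) for doc in documents if unknown_word in doc]
    if not uw_docs:
        return {}
    counts = Counter()
    for doc in uw_docs:
        # Count each co-occurring term once per document.
        counts.update(doc - {unknown_word})
    return {term: n / len(uw_docs) for term, n in counts.items()}


def relatedness(documents, unknown_word, semantic_score, alpha=0.5):
    """Blend probabilistic and semantic scores (assumed linear mix)."""
    probs = cooccurrence_probability(documents, unknown_word)
    return {
        term: alpha * p + (1 - alpha) * semantic_score(term)
        for term, p in probs.items()
    }


# Toy corpus: each document is a list of tokens (hypothetical data).
docs = [
    ["ipod", "music", "player"],
    ["ipod", "music"],
    ["car", "engine"],
]


def toy_semantic(term):
    # Hypothetical stand-in for a WordNet-based semantic score in [0, 1].
    return 1.0 if term == "music" else 0.0


scores = relatedness(docs, "ipod", toy_semantic)
```

In this toy run, "music" appears in every document that contains "ipod" and also receives the maximum semantic score, so it ranks above "player". A real system would replace `toy_semantic` with a measure computed over WordNet.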

Citations

Journal Article

Automatic Enrichment of Semantic Relation Network and Its Application to Word Sense Disambiguation

Myunggwon Hwang, +2 more

- 01 Jun 2011 -

IEEE Transactions on Knowledge and Data ...

TL;DR: A rule-based method using WordNet's glossaries and an inference method using axioms for WordNet relations are applied for the enrichment, and an enriched WordNet (E-WordNet) is built as the result, substantiating the usefulness of E-WordNet.

Journal Article

A New Model to Compute the Information Content of Concepts from Taxonomic Knowledge

David Sánchez, +1 more

- 01 Apr 2012 -

International Journal on Semantic Web an...

TL;DR: This paper proposes a new model to compute the Information Content (IC) of a concept by exploiting the taxonomic knowledge modeled in an ontology, and shows that the use of the authors' model produces, in most cases, more accurate similarity estimations than related works.

Journal Article

Tools for the Automatic Generation of Ontology Documentation: A Task-Based Evaluation

Silvio Peroni, +2 more

- 01 Jan 2013 -

International Journal on Semantic Web an...

TL;DR: Three tools are described, LODE, Parrot and the OWLDoc-based Ontology Browser, that can be used automatically to create documentation from a well-formed OWL ontology at any stage of its development.

Proceedings Article

Information Retrieval Techniques to Grasp User Intention in Pervasive Computing Environment

Myunggwon Hwang, +2 more

TL;DR: An approach based on co-occurrence and statistical methods, both information retrieval techniques, is suggested for grasping user intention from diverse device sensors (context information), including both physical and logical objects.

Journal Article

A term normalization method for efficient knowledge acquisition through text processing

Myunggwon Hwang, +6 more

- 01 Jul 2013 -

Multimedia Tools and Applications

TL;DR: A method of term normalization is proposed that finds the normalized form (the original, standard form defined in dictionaries) of variant terms to solve the problem of variations in terms.

References

Journal Article

Introduction to WordNet: An On-line Lexical Database

George A. Miller, +4 more

- 01 Dec 1990 -

International Journal of Lexicography

TL;DR: Standard alphabetical procedures for organizing lexical information put together words that are spelled alike and scatter words with similar or related meanings haphazardly through the list.

Proceedings Article

Feature-rich part-of-speech tagging with a cyclic dependency network

Kristina Toutanova, +3 more

TL;DR: A new part-of-speech tagger is presented that demonstrates the following ideas: explicit use of both preceding and following tag contexts via a dependency network representation, broad use of lexical features, and effective use of priors in conditional loglinear models.

Proceedings Article

Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger

Kristina Toutanova, +1 more

TL;DR: This paper presents results for a maximum-entropy-based part-of-speech tagger, which achieves superior performance principally by enriching the information sources used for tagging, incorporating these features: more extensive treatment of capitalization for unknown words, and features for the disambiguation of the tense forms of verbs.

Journal Article

Semantic annotation, indexing, and retrieval

Atanas Kiryakov, +4 more

- 01 Dec 2004 -

Journal of Web Semantics

TL;DR: This paper presents a semantically enhanced information extraction system, which provides automatic semantic annotation with references to classes in the ontology and to instances and argues that such large-scale, fully automatic methods are essential for the transformation of the current largely textual web into a Semantic Web.

Journal Article

Automatic ontology-based knowledge extraction from Web documents

Harith Alani, +6 more

- 01 Jan 2003 -

IEEE Intelligent Systems

TL;DR: The Artequakt project is considered, which links a knowledge extraction tool with an ontology to achieve continuous knowledge support and guide information extraction and is further enhanced using a lexicon-based term expansion mechanism that provides extended ontology terminology.