scispace - formally typeset
Journal ArticleDOI

Corpus based part-of-speech tagging

Reads0
Chats0
TLDR
Several typical models of three kings of tagging are introduced in this article: rule-based tagging, statistical approaches and evolution algorithms, and the advantages and the pitfalls of each typical tagging are discussed and analyzed.
Abstract
In natural language processing, a crucial subsystem in a wide range of applications is a part-of-speech (POS) tagger, which labels (or classifies) unannotated words of natural language with POS labels corresponding to categories such as noun, verb or adjective. Mainstream approaches are generally corpus-based: a POS tagger learns from a corpus of pre-annotated data how to correctly tag unlabeled data. Presented here is a brief state-of-the-art account on POS tagging. POS tagging approaches make use of labeled corpus to train computational trained models. Several typical models of three kings of tagging are introduced in this article: rule-based tagging, statistical approaches and evolution algorithms. The advantages and the pitfalls of each typical tagging are discussed and analyzed. Some rule-based and stochastic methods have been successfully achieved accuracies of 93–96 %, while that of some evolution algorithms are about 96–97 %.

read more

Citations
More filters
Posted Content

Transformation-Based Learning in the Fast Lane

TL;DR: This paper presents a novel and realistic method for speeding up the training time of a transformation-based learner without sacrificing performance and shows that this system is able to achieve a significant improvement in training time while still achieving the same performance as a standard transformation- based learner.
Journal ArticleDOI

Part of speech tagging: a systematic review of deep learning and machine learning approaches

TL;DR: A comprehensive review of the latest POS tagging articles is provided by discussing the weakness and strengths of the proposed approaches as mentioned in this paper , which emphasized various research gaps and presented future recommendations for the research in advancing DL and ML-based POS tagging.

Query Translation Using Evolutionary Programming for Multi-Lingual Information Retrieval

TL;DR: Constructing a query from machine-readable, bilingual dictionaries and assigning term weights by the evolutionary optimization of a population of potential weighting schemes presents a solution to the difficulties of generating translated queries.
Journal ArticleDOI

Part of speech tagging: a systematic review of deep learning and machine learning approaches

TL;DR: A comprehensive review of the latest POS tagging articles is provided by discussing the weakness and strengths of the proposed approaches as discussed by the authors , which emphasized various research gaps and presented future recommendations for the research in advancing DL and ML-based POS tagging.
Journal ArticleDOI

Longitudinal Study of a Website for Assessing American Presidential Candidates and Decision Making of Potential Election Irregularities Detection

TL;DR: This article employed the concept of word sense disambiguation to determine the inherent meaning of voter intentions regarding possible political candidates from the 2016 U.S. presidential election using a website (www.presidentselect.com).
References
More filters
Book

Genetic algorithms in search, optimization, and machine learning

TL;DR: In this article, the authors present the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields, including computer programming and mathematics.

Genetic algorithms in search, optimization and machine learning

TL;DR: This book brings together the computer techniques, mathematical tools, and research results that will enable both students and practitioners to apply genetic algorithms to problems in many fields.
Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.
Book

An Introduction to Support Vector Machines and Other Kernel-based Learning Methods

TL;DR: This is the first comprehensive introduction to Support Vector Machines (SVMs), a new generation learning system based on recent advances in statistical learning theory, and will guide practitioners to updated literature, new applications, and on-line software.
Book

Foundations of Statistical Natural Language Processing

TL;DR: This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear and provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.