scispace - formally typeset
Journal ArticleDOI

Feature extraction based on information gain and sequential pattern for English question classification

Reads0
Chats0
TLDR
This study presents a hybrid approach to semantic feature extraction and lexical feature extraction, which achieves a coarse accuracy of 96% and fine accuracy of 90.4%, which is superior to existing methods.
Abstract
The purpose of question classification (QC) is to assign a question to an appropriate category from the set of predefined categories that constitute a question taxonomy. Selected question features are able to significantly improve the performance of QC. However, feature extraction, particularly syntax feature extraction, has a high computational cost. To maintain or enhance performance without syntax features, this study presents a hybrid approach to semantic feature extraction and lexical feature extraction. These features are generated by improved information gain and sequential pattern mining methods, respectively. Selected features are then fed into classifiers for questions classification. Benchmark testing is performed using the public UIUC data set. The results reveal that the proposed approach achieves a coarse accuracy of 96% and fine accuracy of 90.4%, which is superior to existing methods.

read more

Citations
More filters
Journal ArticleDOI

An Improved Ant Colony Optimization Algorithm Based on Hybrid Strategies for Scheduling Problem

TL;DR: The experiment results show that the proposed ICMPACO algorithm can effectively obtain the best optimization value in solving TSP and effectively solve the gate assignment problem, obtain better assignment result, and it takes on better optimization ability and stability.
Journal ArticleDOI

Fault Diagnosis Method Based on Principal Component Analysis and Broad Learning System

TL;DR: Experimental results show that the PCA method can effectively eliminate the feature correlation and realize the dimension reduction of the feature matrix, the BLS can take on better adaptability, faster computation speed, and higher classification accuracy, and the PABSFD method can efficiently and accurately obtain the fault diagnosis results.
Journal ArticleDOI

Daily Activity Feature Selection in Smart Homes Based on Pearson Correlation Coefficient

TL;DR: A daily activity feature selection strategy based on the Pearson Correlation Coefficient that yields higher recognition rates and achieves average improvement F-measures of 1.56% and 2.7%, respectively.
Journal ArticleDOI

Improving laser cutting quality of polymethylmethacrylate sheet: experimental investigation and optimization

TL;DR: In this article, an experimental investigation and optimization of laser cutting parameters including cutting speed, assisted gas pressure, laser beam power, and sheet thickness was conducted to enhance the quality of polymethyl methacrylate (PMMA) sheets using pulsed CO2 laser beam.
Journal ArticleDOI

Timely daily activity recognition from headmost sensor events.

TL;DR: Experimental findings show the effectiveness of this approach for timely daily activity recognition from an incomplete stream of sensor events in terms of precision, recall, average saved time, and saved time proportion.
References
More filters
Proceedings ArticleDOI

Verb semantics and lexical selection

Abstract: This paper will focus on the semantic representation of verbs in computer systems and its impact on lexical selection problems in machine translation (MT). Two groups of English and Chinese verbs are examined to show that lexical selection must be based on interpretation of the sentences as well as selection restrictions placed on the verb arguments. A novel representation scheme is suggested, and is compared to representations with selection restrictions used in transfer-based MT. We see our approach as closely aligned with knowledge-based MT approaches (KBMT), and as a separate component that could be incorporated into existing systems. Examples and experimental results will show that, using this scheme, inexact matches can achieve correct lexical selection.
Proceedings ArticleDOI

Learning question classifiers

TL;DR: A hierarchical classifier is learned that is guided by a layered semantic hierarchy of answer types, and eventually classifies questions into fine-grained classes.
Proceedings ArticleDOI

Question classification using support vector machines

TL;DR: This paper proposes to use a special kernel function called the tree kernel to enable the SVM to take advantage of the syntactic structures of questions, and describes how the tree Kernel can be computed efficiently by dynamic programming.
Journal ArticleDOI

Scaling question answering to the web

TL;DR: Mulder is introduced, which is believed to be the first general-purpose, fully-automated question-answering system available on the web, and its architecture is described, which relies on multiple search-engine queries, natural-language parsing, and a novel voting procedure to yield reliable answers coupled with high recall.
Journal ArticleDOI

Learning question classifiers: the role of semantic information

TL;DR: The authors developed a hierarchical classifier that classifies questions into fine-grained classes, guided by a layered semantic hierarchy of answer types, and performed a systematic study of the use of semantic information sources in natural language classification tasks.
Related Papers (5)