scispace - formally typeset
Search or ask a question

Showing papers by "Aly A. Fahmy published in 2012"


Journal ArticleDOI
TL;DR: This paper investigates constructing a comprehensive feature set to compensate the lack of parsing structural outcomes in Arabic Language and presents a leading research for the opinion holder extraction in Arabic news independent from any lexical parsers.
Abstract: Opinion mining aims at extracting useful subjective information from reliable amounts of text. Opinion mining holder recognition is a task that has not been considered yet in Arabic Language. This task essentially requires deep understanding of clauses structures. Unfortunately, the lack of a robust, publicly available, Arabic parser further complicates the research. This paper presents a leading research for the opinion holder extraction in Arabic news independent from any lexical parsers. We investigate constructing a comprehensive feature set to compensate the lack of parsing structural outcomes. The proposed feature set is tuned from English previous works coupled with our proposed semantic field and named entities features. Our feature analysis is based on Conditional Random Fields (CRF) and semi-supervised pattern recognition techniques. Different research models are evaluated via cross-validation experiments achieving 54.03 F-measure. We publicly release our own research outcome corpus and lexicon for opinion mining community to encourage further research.

97 citations


Journal ArticleDOI
TL;DR: This paper presents a different unsupervised approach which deals with students’ answers holistically using text to text similarity using Bag of Words (BOW) when compared to previous work.
Abstract: Most automatic scoring systems use pattern based that requires a lot of hard and tedious work. These systems work in a supervised manner where predefined patterns and scoring rules are generated. This paper presents a different unsupervised approach which deals with students’ answers holistically using text to text similarity. Different String-based and Corpus-based similarity measures were tested separately and then combined to achieve a maximum correlation value of 0.504. The achieved correlation is the best value achieved for unsupervised approach Bag of Words (BOW) when compared to previous work.

49 citations


Proceedings ArticleDOI
01 Nov 2012
TL;DR: The most popular objectives proposed over the past years are used and it is shown how those objective correlate with each other, and their performances when they are used in the single-objective Genetic Algorithm and the Multi-Objective genetic Al algorithm and the community structure properties they tend to produce.
Abstract: Community detection in complex networks has attracted a lot of attention in recent years. Community detection can be viewed as an optimization problem, in which an objective function that captures the intuition of a community as a group of nodes with better internal connectivity than external connectivity is chosen to be optimized. Many single-objective optimization techniques have been used to solve the problem however those approaches have its drawbacks since they try optimizing one objective function and this results to a solution with a particular community structure property. More recently researchers viewed the problem as a multi-objective optimization problem and many approaches have been proposed to solve it. However which objective functions could be used with each other is still under debated since many objective functions have been proposed over the past years and in somehow most of them are similar in definition. In this paper we use Genetic Algorithm (GA) as an effective optimization technique to solve the community detection problem as a single-objective and multi-objective problem, we use the most popular objectives proposed over the past years, and we show how those objective correlate with each other, and their performances when they are used in the single-objective Genetic Algorithm and the Multi-Objective Genetic Algorithm and the community structure properties they tend to produce.

30 citations


Journal ArticleDOI
TL;DR: In this paper, the authors presented a leading research for opinion holder extraction in Arabic news independent from any lexical parsers, which is a task that has not been considered yet in Arabic Language.
Abstract: Opinion mining aims at extracting useful subjective information from reliable amounts of text. Opinion mining holder recognition is a task that has not been considered yet in Arabic Language. This task essentially requires deep understanding of clauses structures. Unfortunately, the lack of a robust, publicly available, Arabic parser further complicates the research. This paper presents a leading research for the opinion holder extraction in Arabic news independent from any lexical parsers. We investigate constructing a comprehensive feature set to compensate the lack of parsing structural outcomes. The proposed feature set is tuned from English previous works coupled with our proposed semantic field and named entities features. Our feature analysis is based on Conditional Random Fields (CRF) and semi-supervised pattern recognition techniques. Different research models are evaluated via cross-validation experiments achieving 54.03 F-measure. We publicly release our own research outcome corpus and lexicon for opinion mining community to encourage further research.

16 citations


Book ChapterDOI
01 Jan 2012
TL;DR: This research addresses issues related to the morphological analysis of ill-formed Arabic verbs in order to identify the source of errors and provide an informative feedback to SLLs of Arabic.
Abstract: Arabic is a language of rich and complex morphology. The nature and peculiarity of Arabic make its morphological and phonological rules confusing for second language learners (SLLs). The conjugation of Arabic verbs is central to the formulation of an Arabic sentence because of its richness of form and meaning. In this research, we address issues related to the morphological analysis of ill-formed Arabic verbs in order to identify the source of errors and provide an informative feedback to SLLs of Arabic. The edit distance and constraint relaxation techniques are used to demonstrate the capability of the proposed system in generating all possible analyses of erroneous Arabic verbs written by SLLs. Filtering mechanisms are applied to exclude the irrelevant constructions and determine the target stem which is used as the base for constructing the feedback to the learner. The proposed system has been developed and effectively evaluated using real test data. It achieved satisfactory results in terms of the recall rate.

8 citations


Book ChapterDOI
28 Mar 2012
TL;DR: The ability of rough set methodology to successfully classify heart sound diseases without the need applying feature selection is introduced and the overall classification accuracy offered by the employed rough set approach is high compared with other machine learning techniques.
Abstract: Recently, heart sound signals have been used in the detection of the heart valve status and the identification of the heart valve disease Heart sound data sets represents real life data that contains continuous and a large number of features that could be hardly classified by most of classification techniques Feature reduction techniques should be applied prior applying data classifier to increase the classification accuracy results This paper introduces the ability of rough set methodology to successfully classify heart sound diseases without the need applying feature selection The capabilities of rough set in discrimination, feature reduction classification have proved their superior in classification of objects with very excellent accuracy results The experimental results obtained, show that the overall classification accuracy offered by the employed rough set approach is high compared with other machine learning techniques including Support Vector Machine (SVM), Hidden Naive Bayesian network (HNB), Bayesian network (BN), Naive Bayesian tree (NBT), Decision tree (DT), Sequential minimal optimization (SMO)

3 citations