Home
/
Authors
/
Wael Hassan Gomaa

Author

Wael Hassan Gomaa

Other affiliations: Modern Academy In Maadi

Bio: Wael Hassan Gomaa is an academic researcher from Beni-Suef University. The author has contributed to research in topics: Semantic similarity & Document clustering. The author has an hindex of 7, co-authored 11 publications receiving 727 citations. Previous affiliations of Wael Hassan Gomaa include Modern Academy In Maadi.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Arabic Short Answer Scoring with Effective Feedback for Students

[...]

Wael Hassan Gomaa, Aly A. Fahmy

16 Jan 2014-International Journal of Computer Applications

TL;DR: Overall, the obtained correlation and error rate results prove that the presented system performs well enough for deployment in a real scoring environment.

...read moreread less

Abstract: In this paper, we explore text similarity techniques for the task of automatic short answer scoring in Arabic language. We compare a number of string-based and corpus-based similarity measures, evaluate the effect of combining these measures, handle student’s answers holistically and partially, provide immediate useful feedback to student and also introduce a new benchmark Arabic data set that contains 50 questions and 600 student answers. Overall, the obtained correlation and error rate results prove that the presented system performs well enough for deployment in a real scoring environment. General Terms Natural Language Processing, Text Mining

...read moreread less

20 citations

Proceedings Article•DOI•

Supervised Learning Approach for Twitter Credibility Detection

[...]

Noha Y. Hassan¹, Wael Hassan Gomaa¹, Ghada Khoriba², Mohammed H. Haggag²•Institutions (2)

Beni-Suef University¹, Helwan University²

01 Dec 2018

TL;DR: A classification model based on supervised machine learning techniques is proposed to detect credibility on Twitter using both content-based and source-based features and achieves improvement of 22% when compared to CRF which applies the same approach in terms of F1-measure.

...read moreread less

Abstract: Twitter is the most popular micro-blogging medium that allows users to exchange short messages, provides a platform for public people to share the news. Nowadays, Twitter counts with an average of 328 million monthly active users and is growing rapidly. Detecting the credibility of shared information on Twitter becomes a necessity, especially during high impact events. In this paper a classification model based on supervised machine learning techniques is proposed to detect credibility. The proposed model uses an extensive set of features including both content-based and source-based features. The research compares the performance of five different machine learning classifiers using three feature sets: content based, source based and a combination of both sets. The best performance is achieved when using a combined set of features and applying Random Forests as a classifier with accuracy 78.4%, precision 79.6%, recall 91.6% and f1-measure 85.2%. Experiments also revealed that the proposed model achieves improvement of 22% when compared to CRF which applies the same approach in terms of F1-measure. Feature analysis is presented to highlight the importance of the source-based features compared with the content-based features as deciders for credibility.

...read moreread less

16 citations

Journal Article•DOI•

A Hybrid Model for Paraphrase Detection Combines pros of Text Similarity with Deep Learning

[...]

Mohamed I. El Desouki, Wael Hassan Gomaa, Hawaf Abdalhakim

18 Jun 2019-International Journal of Computer Applications

TL;DR: This paper proposes a hybrid model that combines the text similarity approach with deep learning approach in order to improve paraphrase detection and verified results with Microsoft Research Paraphrase Corpus dataset.

...read moreread less

Abstract: Paraphrase detection (PD) is a very essential and important task in Natural language processing. The goal of paraphrase detection is to check whether two statements written in natural language have the identical semantic or not. Its importance appears in many fields like plagiarism detection, question answering, document clustering and information retrieval, etc. This paper proposes a hybrid model that combines the text similarity approach with deep learning approach in order to improve paraphrase detection. This model verified results with Microsoft Research Paraphrase Corpus (MSPR) dataset, shows that accuracy measure is about 76.6% and F-measure is about 83.5%.

...read moreread less

7 citations

Proceedings Article•

Document Clustering using Word Sense Disambiguation.

[...]

M. S. Mostafa, Mohammed H. Haggag¹, Wael Hassan Gomaa²•Institutions (2)

Helwan University¹, Modern Academy In Maadi²

01 Jan 2008

TL;DR: The experimental results proved that the efficiency of document clustering using WSD increases linearly with the size of the documents dataset, and different part of speech taggers were tested to determine the best.

...read moreread less

Abstract: In computational linguistics, word sense disambiguation (WSD) is the problem of determining in which sense a word having a number of distinct senses is used in a given sentence . This paper handles text document clustering as one of the major tasks of text processing. Document clustering is the process of finding out groups of information from the text documents and cluster these documents into the most relevant groups. Large document corpus suffers from ambiguity problems like synonyms, polysemous and other semantic relations. For this reason we perform WSD task for all terms in all documents to get the best sense to be used as document features in the clustering process. Our experimental results proved that the efficiency of document clustering using WSD increases linearly with the size of the documents dataset. Different part of speech (POS) taggers were tested to determine the best; also the effect of different window sizes on WSD task was compared.

...read moreread less

5 citations

Journal Article•DOI•

The Impact of Deep Learning Techniques on SMS Spam Filtering

[...]

Wael Hassan Gomaa

01 Jan 2020-International Journal of Advanced Computer Science and Applications

TL;DR: This paper explores the impact of applying various deep learning techniques on SMS spam filtering; by comparing the results of seven different deep neural network architectures and six classifiers for classical machine learning.

...read moreread less

Abstract: Over the past decade, phone calls and bulk SMS have been fashionable. Although many advertisers assume that SMS has died, it is still alive. It is one of the simplest and most cost-effective marketing tools for companies to communicate on a personal level to their customers. The spread of SMS has led to the risk of spam. Most of the previous studies that attempted to detect spam were based on manually extracted features using classical machine learning classifiers. This paper explores the impact of applying various deep learning techniques on SMS spam filtering; by comparing the results of seven different deep neural network architectures and six classifiers for classical machine learning. Proposed methodologies are based on the automatic extraction of the required features. On a benchmark data set consisting of 5574 records, a fabulous accuracy of 99.26% has been resulted using Random Multimodel Deep Learning (RMDL) architecture.

...read moreread less

3 citations

Cited by

PDF

Open Access

More filters

Journal Article•

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies

[...]

Daniël de Kok, Barbara Plank, van Gerardus Noord

01 Jan 2011-The Association for Computational Linguistics

324 citations

Journal Article•DOI•

Authors. in profile

[...]

Marjorie V. Batey

01 Jan 1969-Nursing Forum

164 citations

Journal Article•DOI•

Network structure and influence of the climate change counter-movement

[...]

Justin Farrell¹•Institutions (1)

Yale University¹

01 Apr 2016-Nature Climate Change

TL;DR: In this article, an application of network science reveals the institutional and corporate structure of the climate change counter-movement in the United States, while computational text analysis shows its influence in the news media and within political circles.

...read moreread less

Abstract: An application of network science reveals the institutional and corporate structure of the climate change counter-movement in the United States, while computational text analysis shows its influence in the news media and within political circles.

...read moreread less

144 citations

Proceedings Article•DOI•

[...]

Alfirna Rizqi Lahitani¹, Adhistya Erna Permanasari¹, Noor Akhmad Setiawan¹•Institutions (1)

Gadjah Mada University¹

26 Apr 2016

TL;DR: This research implemented the weighting of Term Frequency - Inverse Document Frequency (TF-IDF) method and Cosine Similarity with the measuring degree concept of similarity terms in a document to rank the document weight that have closesness match level with expert's document.

...read moreread less

Abstract: Development of technology in educational field brings the easier ways through the variety of facilitation for learning process, sharing files, giving assignment and assessment. Automated Essay Scoring (AES) is one of the development systems for determining a score automatically from text document source to facilitate the correction and scoring by utilizing applications that run on the computer. AES process is used to help the lecturers to score efficiently and effectively. Besides it can reduce the subjectivity scoring problem. However, implementation of AES depends on many factors and cases, such as language and mechanism of scoring process especially for essay scoring. A number of methods implemented for weighting the terms from document and reaching the solutions for handling comparative level between documents answer and expert's document still defined. In this research, we implemented the weighting of Term Frequency — Inverse Document Frequency (TF-IDF) method and Cosine Similarity with the measuring degree concept of similarity terms in a document. Tests carried out on a number of Indonesian text-based documents that have gone through the stage of pre-processing for data extraction purposes. This process results is in a ranking of the document weight that have closesness match level with expert's document.

...read moreread less

137 citations

Journal Article•DOI•

Powergrading: a Clustering Approach to Amplify Human Effort for Short Answer Grading

[...]

Sumit Basu¹, Charles E. Jacobs¹, Lucy Vanderwende¹•Institutions (1)

Microsoft¹

31 Oct 2013

TL;DR: This paper used a similarity metric between student responses, and then used this metric to group responses into clusters and subclusters, which allowed teachers to grade multiple responses with a single action, provide rich feedback to groups of similar answers, and discover modalities of misunderstanding among students.

...read moreread less

Abstract: We introduce a new approach to the machine-assisted grading of short answer questions. We follow past work in automated grading by first training a similarity metric between student responses, but then go on to use this metric to group responses into clusters and subclusters. The resulting groupings allow teachers to grade multiple responses with a single action, provide rich feedback to groups of similar answers, and discover modalities of misunderstanding among students; we refer to this amplification of grader effort as “powergrading.” We develop the means to further reduce teacher effort by automatically performing actions when an answer key is available. We show results in terms of grading progress with a small “budget” of human actions, both from our method and an LDA-based approach, on a test corpus of 10 questions answered by 698 respondents.

...read moreread less

134 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174

Collapse