scispace - formally typeset
Search or ask a question
Author

Ashraf Elnagar

Other affiliations: Sultan Qaboos University
Bio: Ashraf Elnagar is an academic researcher from University of Sharjah. The author has contributed to research in topics: Motion planning & Mobile robot. The author has an hindex of 20, co-authored 99 publications receiving 1140 citations. Previous affiliations of Ashraf Elnagar include Sultan Qaboos University.


Papers
More filters
Journal ArticleDOI
TL;DR: This work introduces new rich and unbiased datasets for both the single-label (SANAD) as well as the multi- label (NADiA) Arabic text categorization tasks and presents an extensive comparison of several deep learning models for Arabic text classification.
Abstract: Text classification or categorization is the process of automatically tagging a textual document with most relevant labels or categories. When the number of labels is restricted to one, the task becomes single-label text categorization. However, the multi-label version is challenging. For Arabic language, both tasks (especially the latter one) become more challenging in the absence of large and free Arabic rich and rational datasets. Therefore, we introduce new rich and unbiased datasets for both the single-label (SANAD) as well as the multi-label (NADiA) Arabic text categorization tasks. Both corpora are made freely available to the research community on Arabic computational linguistics. Further, we present an extensive comparison of several deep learning (DL) models for Arabic text categorization in order to evaluate the effectiveness of such models on SANAD and NADiA. A unique characteristic of our proposed work, when compared to existing ones, is that it does not require a pre-processing phase and fully based on deep learning models. Besides, we studied the impact of utilizing word2vec embedding models to improve the performance of the classification tasks. Our experimental results showed solid performance of all models on SANAD corpus with a minimum accuracy of 91.18%, achieved by convolutional-GRU, and top performance of 96.94%, achieved by attention-GRU. As for NADiA, attention-GRU achieved the highest overall accuracy of 88.68% for a maximum subsets of 10 categories on “Masrawy” dataset.

152 citations

Book ChapterDOI
01 Jan 2018
TL;DR: This paper introduces HARD (Hotel Arabic-Reviewsdataset), the largest Book Reviews in Arabic Dataset for subjective sentiment analysis and machine language applications, and implements a polarity lexicon-based sentiment analyzer.
Abstract: Arabic language suffers from the lack of available large datasets for machine learning and sentiment analysis applications. This work adds to the recently reported large dataset BRAD, which is the largest Book Reviews in Arabic Dataset. In this paper, we introduce HARD (Hotel Arabic-Reviews Dataset), the largest Book Reviews in Arabic Dataset for subjective sentiment analysis and machine language applications. HARD comprises of 490587 hotel reviews collected from the Booking.com website. Each record contains the review text in the Arabic language, the reviewer’s rating on a scale of 1 to 10 stars, and other attributes about the hotel/reviewer. We make available the full unbalanced dataset as well as a balanced subset. To examine the datasets, we implement six popular classifiers using Modern Standard Arabic (MSA) as well as Dialectal Arabic (DA). We test the sentiment analyzers for polarity and rating classifications. Furthermore, we implement a polarity lexicon-based sentiment analyzer. The findings confirm the effectiveness of the classifiers and the datasets. Our core contribution is to make this benchmark-dataset available and accessible to the research community on Arabic language.

101 citations

Book ChapterDOI
08 Apr 2020
TL;DR: The datasets used in machine learning techniques, which are the primary tools for analyzing network traffic and detecting abnormalities, are highlighted and elaborate on the issues faced in using ML/DL for cybersecurity and offer recommendations for future studies.
Abstract: In this review, significant literature surveys on machine learning (ML) and deep learning (DL) techniques for network analysis of intrusion detection are explained. In addition, it presents a short tutorial explanation on every ML/DL method. Data holds a significant position in ML/DL methods; hence this paper highlights the datasets used in machine learning techniques, which are the primary tools for analyzing network traffic and detecting abnormalities. In addition, we elaborate on the issues faced in using ML/DL for cybersecurity and offer recommendations for future studies.

91 citations

Book ChapterDOI
08 Apr 2020
TL;DR: This review has the objective of examining the way data mining was handled by researchers in the past and the most recent trends on data mining in educational research, as well as to evaluate the likelihood of employing machine learning in the field of education.
Abstract: One of the developing fields of the present times is educational data mining that pertains to developing methods that help in examining various kinds of data obtained from the educational field. A vital part is played by data mining in the education field, particularly when behavior is being assessed in an online learning setting. This is because data mining is capable of analyzing and identifying the hidden information regarding the data itself, which is very difficult and takes up a lot of time if performed manually. This review has the objective of examining the way data mining was handled by researchers in the past and the most recent trends on data mining in educational research, as well as to evaluate the likelihood of employing machine learning in the field of education. The various limitations inherent in the current research are examined and recommendations are made for future research.

88 citations

Journal ArticleDOI
01 Nov 1998
TL;DR: A framework for predicting future positions and orientation of moving obstacles in a time-varying environment using autoregressive model (ARM) with conditional maximum likelihood estimate of the model parameters is described.
Abstract: In this paper, we describe a framework for predicting future positions and orientation of moving obstacles in a time-varying environment using autoregressive model (ARM) with conditional maximum likelihood estimate of the model parameters. No constraints are placed on the obstacles motion. The proposed algorithm can be used in a variety of applications, one of which is robot motion planning in time varying environments.

78 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Abstract: Machine Learning is the study of methods for programming computers to learn. Computers are applied to a wide range of tasks, and for most of these it is relatively easy for programmers to design and implement the necessary software. However, there are many tasks for which this is difficult or impossible. These can be divided into four general categories. First, there are problems for which there exist no human experts. For example, in modern automated manufacturing facilities, there is a need to predict machine failures before they occur by analyzing sensor readings. Because the machines are new, there are no human experts who can be interviewed by a programmer to provide the knowledge necessary to build a computer system. A machine learning system can study recorded data and subsequent machine failures and learn prediction rules. Second, there are problems where human experts exist, but where they are unable to explain their expertise. This is the case in many perceptual tasks, such as speech recognition, hand-writing recognition, and natural language understanding. Virtually all humans exhibit expert-level abilities on these tasks, but none of them can describe the detailed steps that they follow as they perform them. Fortunately, humans can provide machines with examples of the inputs and correct outputs for these tasks, so machine learning algorithms can learn to map the inputs to the outputs. Third, there are problems where phenomena are changing rapidly. In finance, for example, people would like to predict the future behavior of the stock market, of consumer purchases, or of exchange rates. These behaviors change frequently, so that even if a programmer could construct a good predictive computer program, it would need to be rewritten frequently. A learning program can relieve the programmer of this burden by constantly modifying and tuning a set of learned prediction rules. Fourth, there are applications that need to be customized for each computer user separately. Consider, for example, a program to filter unwanted electronic mail messages. Different users will need different filters. It is unreasonable to expect each user to program his or her own rules, and it is infeasible to provide every user with a software engineer to keep the rules up-to-date. A machine learning system can learn which mail messages the user rejects and maintain the filtering rules automatically. Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis. Statistics focuses on understanding the phenomena that have generated the data, often with the goal of testing different hypotheses about those phenomena. Data mining seeks to find patterns in the data that are understandable by people. Psychological studies of human learning aspire to understand the mechanisms underlying the various learning behaviors exhibited by people (concept learning, skill acquisition, strategy change, etc.).

13,246 citations

01 Jan 2002

9,314 citations

Journal ArticleDOI
TL;DR: The multiple traveling salesman problem (mTSP) as discussed by the authors is a generalization of the well-known TSP, where more than one salesman is allowed to be used in the solution.
Abstract: The multiple traveling salesman problem (mTSP) is a generalization of the well-known traveling salesman problem (TSP), where more than one salesman is allowed to be used in the solution. Moreover, the characteristics of the mTSP seem more appropriate for real-life applications, and it is also possible to extend the problem to a wide variety of vehicle routing problems (VRPs) by incorporating some additional side constraints. Although there exists a wide body of the literature for the TSP and the VRP, the mTSP has not received the same amount of attention. The purpose of this survey is to review the problem and its practical applications, to highlight some formulations and to describe exact and heuristic solution procedures proposed for this problem.

1,019 citations

01 Jan 2009
TL;DR: Wang et al. as discussed by the authors developed a fixed effect log-linear regression model to assess the influence of online reviews on the number of hotel room bookings, which indicated a significant relationship between online consumer reviews and business performance of hotels.
Abstract: Despite hospitality and tourism researchers’ recent attempts on examining different aspects of online word-of-mouth [WOM], its impact on hotel sales remains largely unknown in the existing literature. To fill this void, we conduct a study to empirically investigate the impact of online consumer-generated reviews on hotel room sales. Utilizing data collected from the largest travel website in China, we develop a fixed effect log-linear regression model to assess the influence of online reviews on the number of hotel room bookings. Our results indicate a significant relationship between online consumer reviews and business performance of hotels.

877 citations