scispace - formally typeset
Search or ask a question
Author

Emmanuel Bugingo

Bio: Emmanuel Bugingo is an academic researcher from Xiamen University. The author has contributed to research in topics: Cloud computing & Workflow. The author has an hindex of 4, co-authored 11 publications receiving 68 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: A variety of algorithms are proposed to help the users to schedule their big data processing workflow applications on clouds so that the cost can be minimized and the deadline constraints can be satisfied.

41 citations

Journal ArticleDOI
TL;DR: A hybrid model of two super classifiers: Convolutional Neural Network (CNN) as well as eXtreme Gradient Boosting (XGBoost) are proposed for classification, which gave better results than the traditional fully connected layer.
Abstract: Handwritten character recognition has been profoundly studied for many years in the field of pattern recognition. Due to its vast practical applications and financial implications, handwritten character recognition is still an important research area. In this research, the Handwritten Ethiopian Character Recognition (HECR) dataset has been prepared to train the model. The images in the HECR dataset were organized with more than one color pen RGB main spaces that have been size normalized to 28 × 28 pixels. The dataset is a combination of scripts (Fidel in Ethiopia), numerical representations, punctuations, tonal symbols, combining symbols, and special characters. These scripts have been used to write ancient histories, science, and arts of Ethiopia and Eritrea. In this study, a hybrid model of two super classifiers: Convolutional Neural Network (CNN) and eXtreme Gradient Boosting (XGBoost) is proposed for classification. In this integrated model, CNN works as a trainable automatic feature extractor from the raw images and XGBoost takes the extracted features as an input for recognition and classification. The output error rates of the hybrid model and CNN with a fully connected layer are compared. A 0.4630 and 0.1612 error rates are achieved in classifying the handwritten testing dataset images, respectively. Thus XGBoost as a classifier performs a better result than the traditional fully connected layer.

27 citations

Journal ArticleDOI
TL;DR: This paper proposed a multi-objective workflow-scheduling algorithm based on decomposition that is capable of finding optimal solutions with a single run and manages to obtain the Pareto Front solutions.
Abstract: A workflow is a group of tasks that are processed in a particular order to complete an application. Also, it is a popular paradigm used to model complex big-data applications. Executing complex applications in a distributed system such as cloud or cluster implicates optimization of several conflicting objectives such as monetary cost, energy consumption, total execution time of the application (makespan). Regardless of this trend, most of the workflow scheduling approaches focused on single or bi-objective optimization problem. In this paper, we considered the problem of scheduling workflows in a cloud environment as a multi-objective optimization problem, and hence proposed a multi-objective workflow-scheduling algorithm based on decomposition. The proposed algorithm is capable of finding optimal solutions with a single run. Our evaluation results show that, by a single run, the proposed approach manages to obtain the Pareto Front solutions which are at least as good as schedules produced by running a single-objective scheduling algorithm with constraints for multiple times.

13 citations

Journal ArticleDOI
TL;DR: For some common diseases, such as diabetes, hypertension and heart disease, the work is able to identify correctly the first two or three nutritional ingredients in food that can benefit the rehabilitation of those diseases.
Abstract: Suitable nutritional diets have been widely recognized as important measures to prevent and control non-communicable diseases (NCDs). However, there is little research on nutritional ingredients in food now, which are beneficial to the rehabilitation of NCDs. In this paper, we profoundly analyzed the relationship between nutritional ingredients and diseases by using data mining methods. First, more than 7000 diseases were obtained, and we collected the recommended food and taboo food for each disease. Then, referring to the China Food Nutrition , we used noise intensity and information entropy to find out which nutritional ingredients can exert positive effects on diseases. Finally, we proposed an improved algorithm named CVNDA_Red based on rough sets to select the corresponding core ingredients from the positive nutritional ingredients. To the best of our knowledge, this is the first study to discuss the relationship between nutritional ingredients in food and diseases through data mining based on rough set theory in China. The experiments on real-life data show that our method based on data mining improves the performance compared with the traditional statistical approach, with the precision of 1.682. In addition, for some common diseases, such as diabetes, hypertension and heart disease, our work is able to identify correctly the first two or three nutritional ingredients in food that can benefit the rehabilitation of those diseases. These experimental results demonstrate the effectiveness of applying data mining in selecting of nutritional ingredients in food for disease analysis.

8 citations

Journal ArticleDOI
TL;DR: A discrete-event based simulator with a novel benchmark approach is proposed to ease the analysis of the online deadline-constrained workflow scheduling problem of how to schedule a set of sequentially submitted workflows with deadline constraints to maximize the resource utilization as well as the success rate of meeting the deadlines.

4 citations


Cited by
More filters
01 Jan 2016
TL;DR: In this paper, the authors present computers and intractability, a guide to the theory of completeness, but end up in infectious downloads, rather than enjoying a good book with a cup of tea in the afternoon, instead they juggled with some harmful virus inside their desktop computer.
Abstract: Thank you very much for reading computers and intractability a guide to the theory of np completeness. As you may know, people have look hundreds times for their favorite novels like this computers and intractability a guide to the theory of np completeness, but end up in infectious downloads. Rather than enjoying a good book with a cup of tea in the afternoon, instead they juggled with some harmful virus inside their desktop computer.

185 citations

Journal ArticleDOI
TL;DR: Enhanced versions of the NSGA-III algorithm are proposed through introducing the concept of Stud and designing several improved crossover operators of SBX, UC, and SI, and experimental results indicate that the NS GA-III methods with UC and UC-Stud (UCS) outperform the other developed variants.

149 citations

Journal ArticleDOI
TL;DR: The algorithm introduced in this paper utilizes a load balancing routine to maximize resources’ efficiency at execution time and performs task scheduling with the least makespan and cost.
Abstract: Cloud infrastructures are suitable environments for processing large scientific workflows. Nowadays, new challenges are emerging in the field of optimizing workflows such that it can meet user’s service quality requirements. The key to workflow optimization is the scheduling of workflow tasks, which is a famous NP-hard problem. Although several methods have been proposed based on the genetic algorithm for task scheduling in clouds, our proposed method is more efficient than other proposed methods due to the use of new genetic operators as well as modified genetic operators and the use of load balancing routine. Moreover, a solution obtained from a heuristic used as one of the initial population chromosomes and an efficient routine also used for generating the rest of the primary population chromosomes. An adaptive fitness function is used that takes into account both cost and makespan. The algorithm introduced in this paper utilizes a load balancing routine to maximize resources’ efficiency at execution time. The performance of the proposed algorithm is evaluated by comparing the results with state of the art algorithms of this field, and the results indicate that the proposed algorithm has remarkable superiority in comparison to other algorithms and performs task scheduling with the least makespan and cost.

43 citations

Journal ArticleDOI
01 Nov 2020
TL;DR: In this paper, a reinforcement learning approach is exploited in a multi-agent system for task scheduling and resource provisioning, in order to reduce the makespan, minimize the required power, optimize the cost of using the resources, and maximize the utilization of the resources (considering their expiration time), simultaneously.
Abstract: Due to different heterogeneous cloud resources and diverse and complex applications of the users, an optimal task scheduling, which can satisfy users and cloud service providers with energy-saving and cost-effective use of resources, is a major issue in cloud computing. On the one hand, network users are demanding the quality assurance of their requested services, minimizing their costs, and their own data security, and on the other hand, the service providers consider less power consumption, more efficient use of resources, and optimal utilization. In dependent tasks dealing with massive data, resource scheduling is considered as an important challenge. Due to the time limitation of online scheduling process of dependent tasks, many existing methods of the literature are not able to guarantee the best resource utilization. In this paper, a reinforcement learning approach is exploited in a multi-agent system for task scheduling and resource provisioning, in order to reduce the makespan, minimize the required power, optimize the cost of using the resources, and maximize the utilization of the resources (considering their expiration time), simultaneously. The proposed algorithm has two phases. In the first phase, the tasks are scheduled using reinforcement learning techniques, and in the second one, considering the information obtained from the scheduling phase, resources are allocated in a multi-agent environment. The results of experiments show that this method improves the efficiency of the use of resources and reduces their costs. Moreover, the expiration time of the tasks is observed and the total execution time and energy consumption will be significantly reduced.

33 citations

Journal ArticleDOI
TL;DR: The proposed MSMC system proposes a new scheduling algorithm, called Ideal Distribution Algorithm (IDA), to schedule the workflow tasks to the virtual machines of the cloud considering both the deadline and cost constraints, and an enhanced version of the IDA, called Enhanced IDA (EIDA) is proposed to provide load balancing required by the cloud.
Abstract: The emergence of cloud computing has been growing rapidly in the last decades especially for workflow scheduling. Organizations with the same requirements and needs go to use the community cloud for saving costs. One of the important challenges of using the community cloud is resource allocation and task scheduling. In this paper, we propose a new Management System for servicing Multi-organizations in a Community cloud (MSMC) in a secure cloud environment. The MSMC employs a virtual machine allocation algorithm to organize the community cloud usage among the organizations, where it allocates the available virtual machines according to the use of each organization in an efficient and fair way to execute the submitted applications. Moreover, the MSMC system proposes a new scheduling algorithm, called Ideal Distribution Algorithm (IDA), to schedule the workflow tasks to the virtual machines of the cloud considering both the deadline and cost constraints. Additionally, an enhanced version of the IDA, called Enhanced IDA (EIDA) is proposed to provide load balancing required by the cloud. The simulation experiments show that the system can improve the system ability under deadline constraints and improve the monetary cost.

29 citations