scispace - formally typeset
Search or ask a question
Author

Waheed Iqbal

Bio: Waheed Iqbal is an academic researcher from University of the Punjab. The author has contributed to research in topics: Cloud computing & Web application. The author has an hindex of 12, co-authored 38 publications receiving 699 citations. Previous affiliations of Waheed Iqbal include Asian Institute of Technology & College of Information Technology.

Papers
More filters
Journal ArticleDOI
TL;DR: This paper proposes a methodology and presents a working prototype system for automatic detection and resolution of bottlenecks in a multi-tier Web application hosted on a cloud in order to satisfy specific maximum response time requirements.

291 citations

Book ChapterDOI
22 Nov 2009
TL;DR: This paper presents the design and implementation of a working prototype built on a EUCALYPTUS-based heterogeneous compute cloud that actively monitors the response time of each virtual machine assigned to the farm and adaptively scales up the application to satisfy a SLA promising a specific average response time.
Abstract: Current service-level agreements (SLAs) offered by cloud providers make guarantees about quality attributes such as availability. However, although one of the most important quality attributes from the perspective of the users of a cloud-based Web application is its response time, current SLAs do not guarantee response time. Satisfying a maximum average response time guarantee for Web applications is difficult due to unpredictable traffic patterns, but in this paper we show how it can be accomplished through dynamic resource allocation in a virtual Web farm. We present the design and implementation of a working prototype built on a EUCALYPTUS-based heterogeneous compute cloud that actively monitors the response time of each virtual machine assigned to the farm and adaptively scales up the application to satisfy a SLA promising a specific average response time. We demonstrate the feasibility of the approach in an experimental evaluation with a testbed cloud and a synthetic workload. Adaptive resource management has the potential to increase the usability of Web applications while maximizing resource utilization.

74 citations

Proceedings ArticleDOI
17 May 2010
TL;DR: The preliminary results shows that dynamic bottleneck detection and resolution for multi-tier Web application hosted on the cloud will help to offer SLAs that can offer response time guarantees.
Abstract: Current service-level agreements (SLAs) offered by cloud providers do not make guarantees about response time of Web applications hosted on the cloud. Satisfying a maximum average response time guarantee for Web applications is difficult due to unpredictable traffic patterns. The complex nature of multi-tier Web applications increases the difficulty of identifying bottlenecks and resolving them automatically. It may be possible to minimize the probability that tiers (hosted on virtual machines) become bottlenecks by optimizing the placement of the virtual machines in a cloud. This research focuses on enabling clouds to offer multi-tier Web application owners maximum response time guarantees while minimizing resource utilization. We present our basic approach, preliminary experiments, and results on a EUCALYPTUS-based testbed cloud. Our preliminary results shows that dynamic bottleneck detection and resolution for multi-tier Web application hosted on the cloud will help to offer SLAs that can offer response time guarantees.

68 citations

Journal ArticleDOI
TL;DR: A complete automated system to decompose an application into microservices, deploy the micro services using appropriate resources, and auto-scale the microservices to maintain the desired response time is proposed.

52 citations

Journal ArticleDOI
TL;DR: The results show that the proposed techniques would enable cloud infrastructure providers or application owners to build systems that automatically manage multitier Web applications, while meeting SLOs, without any prior knowledge of the applications' resource utilization or workload patterns.
Abstract: Dynamic resource provisioning for Web applications allows for low operational costs while meeting service-level objectives (SLOs). However, the complexity of multitier Web applications makes it difficult to automatically provision resources for each tier without human supervision. In this paper, we introduce unsupervised machine learning methods to dynamically provision multitier Web applications, while observing user-defined performance goals. The proposed technique operates in real time and uses learning techniques to identify workload patterns from access logs, reactively identifies bottlenecks for specific workload patterns, and dynamically builds resource allocation policies for each particular workload. We demonstrate the effectiveness of the proposed approach in several experiments using synthetic workloads on the Amazon Elastic Compute Cloud (EC2) and compare it with industry-standard rule-based autoscale strategies. Our results show that the proposed techniques would enable cloud infrastructure providers or application owners to build systems that automatically manage multitier Web applications, while meeting SLOs, without any prior knowledge of the applications' resource utilization or workload patterns.

48 citations


Cited by
More filters
Journal ArticleDOI
01 Dec 2014
TL;DR: This work proposes a classification of techniques for automating application scaling in the cloud into five main categories: static threshold-based rules, control theory, reinforcement learning, queuing theory and time series analysis, and uses this classification to carry out a literature review of proposals.
Abstract: Cloud computing environments allow customers to dynamically scale their applications. The key problem is how to lease the right amount of resources, on a pay-as-you-go basis. Application re-dimensioning can be implemented effortlessly, adapting the resources assigned to the application to the incoming user demand. However, the identification of the right amount of resources to lease in order to meet the required Service Level Agreement, while keeping the overall cost low, is not an easy task. Many techniques have been proposed for automating application scaling. We propose a classification of these techniques into five main categories: static threshold-based rules, control theory, reinforcement learning, queuing theory and time series analysis. Then we use this classification to carry out a literature review of proposals for auto-scaling in the cloud.

688 citations

Proceedings ArticleDOI
04 Jul 2011
TL;DR: A model-predictive algorithm for workload forecasting that is used for resource auto scaling is developed and empirical results are provided that demonstrate that resources can be allocated and deal located by the algorithm in a way that satisfies both the application QoS while keeping operational costs low.
Abstract: Large-scale component-based enterprise applications that leverage Cloud resources expect Quality of Service(QoS) guarantees in accordance with service level agreements between the customer and service providers. In the context of Cloud computing, auto scaling mechanisms hold the promise of assuring QoS properties to the applications while simultaneously making efficient use of resources and keeping operational costs low for the service providers. Despite the perceived advantages of auto scaling, realizing the full potential of auto scaling is hard due to multiple challenges stemming from the need to precisely estimate resource usage in the face of significant variability in client workload patterns. This paper makes three contributions to overcome the general lack of effective techniques for workload forecasting and optimal resource allocation. First, it discusses the challenges involved in auto scaling in the cloud. Second, it develops a model-predictive algorithm for workload forecasting that is used for resource auto scaling. Finally, empirical results are provided that demonstrate that resources can be allocated and deal located by our algorithm in a way that satisfies both the application QoS while keeping operational costs low.

605 citations

Journal ArticleDOI
TL;DR: This paper carefully analyzed and discussed the properties of a monitoring system for the Cloud, the issues arising from such properties and how such issues have been tackled in literature, and identifies open issues, main challenges and future directions in the field of Cloud monitoring.

543 citations

Journal ArticleDOI
TL;DR: This paper focuses on some of the important resource management techniques such as resource provisioning, resource allocation, resource mapping and resource adaptation for IaaS in cloud computing.

517 citations

Journal ArticleDOI
TL;DR: People's lay concepts of algorithmic versus human decisions in a management context are revealed and it is suggested that task characteristics matter in understanding people's experiences with algorithmic technologies.
Abstract: Algorithms increasingly make managerial decisions that people used to make. Perceptions of algorithms, regardless of the algorithms' actual performance, can significantly influence their adoption, ...

478 citations