scispace - formally typeset
Open AccessBook ChapterDOI

Performance management for cluster based Web services

Reads0
Chats0
TLDR
The average response time is used as the performance metric for a performance management system for cluster-based Web services that supports multiple classes of Web services traffic and allocates server resources dynamically so to maximize the expected value of a given cluster utility function in the face of fluctuating loads.
Abstract
We present an architecture and prototype implementation of a performance management system for cluster-based Web services. The system supports multiple classes of Web services traffic and allocates server resources dynamically so to maximize the expected value of a given cluster utility function in the face of fluctuating loads. The cluster utility is a function of the performance delivered to the various classes, and this leads to Differentiated Service. In this paper we use the average response time as the performance metric. The management system is transparent: it requires no changes in the client code, the server code, or the network interface between them. The system performs three performance management tasks: resource allocation, load balancing, and server overload protection. We use two nested levels of management mechanism. The inner level centers on queuing and scheduling of request messages. The outer level is a feedback control loop that periodically adjusts the scheduling weights and server allocations of the inner level. The feedback controller is based on an approximate first-principles model of the system, with parameters derived from continuous monitoring. We focus on SOAP-based Web services. We report experimental results that show the dynamic behavior of the system.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Dynamic Placement of Virtual Machines for Managing SLA Violations

TL;DR: A dynamic server migration and consolidation algorithm is introduced and is shown to provide substantial improvement over static server consolidation in reducing the amount of required capacity and the rate of service level agreement violations.
Journal ArticleDOI

Real Time Scheduling Theory: A Historical Perspective

TL;DR: This 25th year anniversary paper for the IEEE Real Time Systems Symposium reviews the key results in real-time scheduling theory and the historical events that led to the establishment of the current real- time computing infrastructure.
Proceedings ArticleDOI

An analytical model for multi-tier internet services and its applications

TL;DR: This paper presents a model based on a network of queues, where the queues represent different tiers of the application, sufficiently general to capture the behavior of tiers with significantly different performance characteristics and application idiosyncrasies such as session-based workloads, concurrency limits, and caching at intermediate tiers.
Patent

System and method of brokering cloud computing resources

Abstract: System, method, and tangible computer-readable storage media are disclosed for providing a brokering service for compute resources The method includes, at a brokering service, polling a group of separately administered compute environments to identify resource capabilities and information, each compute resource environment including the group of managed nodes for processing workload, receiving a request for compute resources at the brokering service system, the request for compute resources being associated with a service level agreement (SLA) and based on the resource capabilities across the group of compute resource environments, selecting compute resources in one or more of the group of compute resource environments The brokering service system receives workload associated with the request and communicates the workload to the selected resources for processing The brokering services system can aggregate resources for multiple cloud service providers and act as an advocate for or a guarantor of the SLA associated with the workload
Journal ArticleDOI

Agile dynamic provisioning of multi-tier Internet applications

TL;DR: A novel dynamic provisioning technique for multi-tier Internet applications that employs a flexible queuing model to determine how much of the resources to allocate to each tier of the application, and a combination of predictive and reactive methods that determine when to provision these resources, both at large and small time scales is proposed.
References
More filters
Book

Queueing Systems. Volume 1: Theory

TL;DR: The purpose of this document is to summarize the main points of the book written by Leonard Kleinrock, titled, ‘Queueing Systems’, which is about queueing systems.
Journal ArticleDOI

Optimization flow control—I: basic algorithm and convergence

TL;DR: An optimization approach to flow control where the objective is to maximize the aggregate source utility over their transmission rates to solve the dual problem using a gradient projection algorithm.
Proceedings ArticleDOI

Managing energy and server resources in hosting centers

TL;DR: Experimental results from a prototype confirm that the system adapts to offered load and resource availability, and can reduce server energy usage by 29% or more for a typical Web workload.
Proceedings ArticleDOI

Resource containers: a new facility for resource management in server systems

TL;DR: This work proposes and evaluates a new operating system abstraction called a resource container, which separates the notion of a protection domain from that of a resource principal, and enables fine-grained resource management in server systems and allow the development of robust servers, with simple and firm control over priority policies.
Journal ArticleDOI

Performance guarantees for Web server end-systems: a control-theoretical approach

TL;DR: This paper uses feedback control theory to achieve overload protection, performance guarantees, and service differentiation in the presence of load unpredictability, and shows that control-theoretic techniques offer a sound way of achieving desired performance in performance-critical Internet applications.
Related Papers (5)