Performance management for cluster based Web services

doi:10.1007/978-0-387-35674-7_29

Open AccessBook ChapterDOI

Performance management for cluster based Web services

R. Levy, +5 more

- pp 247-261

Chats0

TLDR

The average response time is used as the performance metric for a performance management system for cluster-based Web services that supports multiple classes of Web services traffic and allocates server resources dynamically so to maximize the expected value of a given cluster utility function in the face of fluctuating loads.

Abstract:

We present an architecture and prototype implementation of a performance management system for cluster-based Web services. The system supports multiple classes of Web services traffic and allocates server resources dynamically so to maximize the expected value of a given cluster utility function in the face of fluctuating loads. The cluster utility is a function of the performance delivered to the various classes, and this leads to Differentiated Service. In this paper we use the average response time as the performance metric. The management system is transparent: it requires no changes in the client code, the server code, or the network interface between them. The system performs three performance management tasks: resource allocation, load balancing, and server overload protection. We use two nested levels of management mechanism. The inner level centers on queuing and scheduling of request messages. The outer level is a feedback control loop that periodically adjusts the scheduling weights and server allocations of the inner level. The feedback controller is based on an approximate first-principles model of the system, with parameters derived from continuous monitoring. We focus on SOAP-based Web services. We report experimental results that show the dynamic behavior of the system.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Dynamic Placement of Virtual Machines for Managing SLA Violations

Norman Bobroff, +2 more

TL;DR: A dynamic server migration and consolidation algorithm is introduced and is shown to provide substantial improvement over static server consolidation in reducing the amount of required capacity and the rate of service level agreement violations.

...read moreread less

Journal ArticleDOI

Real Time Scheduling Theory: A Historical Perspective

Lui Sha, +9 more

- 01 Nov 2004 -

Real-time Systems

TL;DR: This 25th year anniversary paper for the IEEE Real Time Systems Symposium reviews the key results in real-time scheduling theory and the historical events that led to the establishment of the current real- time computing infrastructure.

...read moreread less

Proceedings ArticleDOI

An analytical model for multi-tier internet services and its applications

Bhuvan Urgaonkar, +4 more

TL;DR: This paper presents a model based on a network of queues, where the queues represent different tiers of the application, sufficiently general to capture the behavior of tiers with significantly different performance characteristics and application idiosyncrasies such as session-based workloads, concurrency limits, and caching at intermediate tiers.

...read moreread less

Patent

System and method of brokering cloud computing resources

David Brian Jackson

Abstract: System, method, and tangible computer-readable storage media are disclosed for providing a brokering service for compute resources The method includes, at a brokering service, polling a group of separately administered compute environments to identify resource capabilities and information, each compute resource environment including the group of managed nodes for processing workload, receiving a request for compute resources at the brokering service system, the request for compute resources being associated with a service level agreement (SLA) and based on the resource capabilities across the group of compute resource environments, selecting compute resources in one or more of the group of compute resource environments The brokering service system receives workload associated with the request and communicates the workload to the selected resources for processing The brokering services system can aggregate resources for multiple cloud service providers and act as an advocate for or a guarantor of the SLA associated with the workload

...read moreread less

Journal ArticleDOI

Agile dynamic provisioning of multi-tier Internet applications

Bhuvan Urgaonkar, +4 more

- 27 Mar 2008 -

ACM Transactions on Autonomous and Adapt...

TL;DR: A novel dynamic provisioning technique for multi-tier Internet applications that employs a flexible queuing model to determine how much of the resources to allocate to each tier of the application, and a combination of predictive and reactive methods that determine when to provision these resources, both at large and small time scales is proposed.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Queueing Systems. Volume 1: Theory

leonard Kleinrock

TL;DR: The purpose of this document is to summarize the main points of the book written by Leonard Kleinrock, titled, ‘Queueing Systems’, which is about queueing systems.

...read moreread less

Journal ArticleDOI

Optimization flow control—I: basic algorithm and convergence

Steven H. Low, +1 more

- 01 Dec 1999 -

IEEE ACM Transactions on Networking

TL;DR: An optimization approach to flow control where the objective is to maximize the aggregate source utility over their transmission rates to solve the dual problem using a gradient projection algorithm.

...read moreread less

Proceedings ArticleDOI

Managing energy and server resources in hosting centers

Jeffrey S. Chase, +4 more

TL;DR: Experimental results from a prototype confirm that the system adapts to offered load and resource availability, and can reduce server energy usage by 29% or more for a typical Web workload.

...read moreread less

Proceedings ArticleDOI

Resource containers: a new facility for resource management in server systems

Gaurav Banga, +2 more

TL;DR: This work proposes and evaluates a new operating system abstraction called a resource container, which separates the notion of a protection domain from that of a resource principal, and enables fine-grained resource management in server systems and allow the development of robust servers, with simple and firm control over priority policies.

...read moreread less

Journal ArticleDOI

Performance guarantees for Web server end-systems: a control-theoretical approach

Tarek Abdelzaher, +2 more

- 01 Jan 2002 -

IEEE Transactions on Parallel and Distri...

TL;DR: This paper uses feedback control theory to achieve overload protection, performance guarantees, and service differentiation in the presence of load unpredictability, and shows that control-theoretic techniques offer a sound way of achieving desired performance in performance-critical Internet applications.

...read moreread less

Related Papers (5)

Performance guarantees for Web server end-systems: a control-theoretical approach

Tarek Abdelzaher, +2 more

- 01 Jan 2002 -

IEEE Transactions on Parallel and Distri...

Performance management for cluster based Web services

Citations

Dynamic Placement of Virtual Machines for Managing SLA Violations

Real Time Scheduling Theory: A Historical Perspective

An analytical model for multi-tier internet services and its applications

System and method of brokering cloud computing resources

Agile dynamic provisioning of multi-tier Internet applications

References

Queueing Systems. Volume 1: Theory

Optimization flow control—I: basic algorithm and convergence

Managing energy and server resources in hosting centers

Resource containers: a new facility for resource management in server systems

Performance guarantees for Web server end-systems: a control-theoretical approach

Related Papers (5)

Performance guarantees for Web server end-systems: a control-theoretical approach

An analytical model for multi-tier internet services and its applications

Managing energy and server resources in hosting centers

Resource Allocation for Autonomic Data Centers using Analytic Performance Models

Resource overbooking and application profiling in shared hosting platforms