Showing papers by "Lars Lundberg published in 2003"

PDF

Open Access

Proceedings Article•DOI•

Using Golomb rulers for optimal recovery schemes in fault tolerant distributed computing

[...]

Kamilla Klonowska¹, Lars Lundberg¹, Håkan Lennerstad¹•Institutions (1)

22 Apr 2003

TL;DR: This paper defines recovery schemes, which are optimal for a number of important cases, and shows that the problem of finding optimal recovery schemes corresponds to the mathematical problem called Golomb rulers.

...read moreread less

Abstract: Clusters and distributed systems offer fault tolerance and high performance through load sharing. When all computers are up and running, we would like the load to be evenly distributed among the computers. When one or more computers break down the load on these computers must be redistributed to other computers in the cluster. The redistribution is determined by the recovery scheme. The recovery scheme should keep the load as evenly distributed as possible even when the most unfavorable combinations of computers break down, i.e. we want to optimize the worst-case behavior. In this paper we define recovery schemes, which are optimal for a number of important cases. We also show that the problem of finding optimal recovery schemes corresponds to the mathematical problem called Golomb rulers. These provide optimal recovery schemes for up to 373 computers in the cluster.

...read moreread less

12 citations

Journal Article•DOI•

Editorial: software architecture - Engineering quality attributes

[...]

Jan Bosch¹, Lars Lundberg²•Institutions (2)

University of Groningen¹, Blekinge Institute of Technology²

15 Jun 2003-Journal of Systems and Software

10 citations

Proceedings Article•DOI•

Recovery schemes for high availability and high performance distributed real-time computing

[...]

Lars Lundberg¹, D. Haggander¹, Kamilla Klonowska¹, Charlie Svahnberg¹•Institutions (1)

Blekinge Institute of Technology¹

22 Apr 2003

...read moreread less

Abstract: Clusters and distributed systems offer fault tolerance and high performance through load sharing, and are thus attractive in real-time applications. When all computers are up and running, we would like the load to be evenly distributed among the computers. When one or more computers-fail the must be redistributed. The redistribution is determined by the recovery scheme. The recovery scheme should keep the load as evenly distributed as possible even when the most unfavorable combinations of computers break down, i.e. we want to optimize the worst-case behavior. In this paper we define recovery schemes, which are optimal for a number of important cases. We also show that the problem of finding optimal recovery schemes corresponds to the mathematical problem of finding sequences of integers with minimal sum and for which all sums of subsequences are unique.

...read moreread less

8 citations

Journal Article•DOI•

Chapter 2: Previous scheduling research

[...]

Håkan Lennerstad, Lars Lundberg

01 May 2003-Electronic Notes in Discrete Mathematics

2 citations

End-User Development by Tailoring. Blurring the border between Use and Development

[...]

Yvonne Dittrich, Lars Lundberg, Olle Lindeberg

01 Jan 2003

2 citations

Journal Article•

Normal Versus Worst-case Performance in High Availability Cluster and Distributed Computing

[...]

Lars Lundberg, Charlie Svahnberg

01 Jan 2003-Applied Informatics

TL;DR: An optimal upper bound on the loss of normal case performance when optimizing for worst-case performance is put and a heuristic algorithm is provided for doing engineering trade-offs between worst- case andnormal case performance.

...read moreread less

Abstract: Clusters and distributed systems offer fault tolerance and high performance, When all computers are up and running, we would like the load to be evenly distributed among the computers. When a computer breaks down the load on this computer must be redistributed to the other computers in the cluster. Most cluster systems are designed to tolerate one single fault, and one can thus distinguish between two modes of operation: normal operation when all computers are up and running and worst-case operation when one computer is down. The performance during these two modes of operation is determined by the way work is allocated to the computers in the cluster or distributed system. It turns out that the same allocation can in general not achieve optimal normal and worst-case performance, i.e. there is a trade-off. In this paper we put an optimal upper bound on the loss of normal case performance when optimizing for worst-case performance, and an optimal upper bound on the loss of worst-case case performance when optimizing for normal case performance. We also provide a heuristic algorithm for doing engineering trade-offs between worst-case and normal case performance.

...read moreread less

1 citations

Journal Article•DOI•

Chapter 6: Parallel program scheduling using test executions

[...]

Håkan Lennerstad, Lars Lundberg

01 May 2003-Electronic Notes in Discrete Mathematics

1 citations

Journal Article•DOI•

Chapter 5: Parallel program scheduling with given parallel profile

[...]

Håkan Lennerstad, Lars Lundberg