Author

N.B. Beck

Bio: N.B. Beck is an academic researcher from Purdue University. The author has contributed to research on the topics of heuristics and data management. The author has an h-index of 5 and has co-authored 6 publications receiving 2,114 citations.

Papers

Journal Article•DOI•

A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems

Tracy D. Braun¹, Howard Jay Siegel¹, N.B. Beck¹, Ladislau Bölöni¹, Muthucumaru Maheswaran¹, Albert Reuther¹, James Patrick Robertson¹, Mitchell D. Theys¹, Bin Yao¹, Debra Hensgen¹, Richard F. Freund¹•Institutions (1)

Purdue University¹

01 Jun 2001-Journal of Parallel and Distributed Computing

TL;DR: It is shown that, for the cases studied here, the relatively simple Min-min heuristic performs well in comparison to the other techniques. The study provides one even basis for comparison and insights into circumstances where one technique will outperform another.

1,757 citations

Proceedings Article•DOI•

A comparison study of static mapping heuristics for a class of meta-tasks on heterogeneous computing systems

Tracy D. Braun¹, H.J. Siegel¹, N.B. Beck¹, Ladislau Bölöni¹, Muthucumaru Maheswaran², Albert Reuther¹, J.P. Robertson³, Mitchell D. Theys¹, Bin Yao¹, Debra Hensgen⁴, Richard F. Freund¹•Institutions (4)

Purdue University¹, University of Manitoba², Motorola³, Naval Postgraduate School⁴

12 Apr 1999

TL;DR: A collection of eleven heuristics from the literature has been selected, implemented, and analyzed under one set of common assumptions and provides one even basis for comparison and insights into circumstances where one technique will outperform another.

Abstract: Heterogeneous computing (HC) environments are well suited to meet the computational demands of large, diverse groups of tasks (i.e., a meta-task). The problem of mapping (defined as matching and scheduling) these tasks onto the machines of an HC environment has been shown, in general, to be NP-complete, requiring the development of heuristic techniques. Selecting the best heuristic to use in a given environment, however, remains a difficult problem, because comparisons are often clouded by different underlying assumptions in the original studies of each heuristic. Therefore, a collection of eleven heuristics from the literature has been selected, implemented, and analyzed under one set of common assumptions. The eleven heuristics examined are opportunistic load balancing, user-directed assignment, fast greedy, min-min, max-min, greedy, genetic algorithm, simulated annealing, genetic simulated annealing, tabu, and A*. This study provides one even basis for comparison and insights into circumstances where one technique will outperform another. The evaluation procedure is specified, the heuristics are defined, and then selected results are compared.
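As an illustration of the kind of static mapping heuristic the abstract compares, the following is a minimal sketch of min-min as it is commonly described: repeatedly pick the unmapped task whose best achievable completion time is smallest and assign it to the machine that gives that time. The function name `min_min`, the `etc` matrix layout (expected time to compute task `t` on machine `m`), and the sample numbers are assumptions made for this example and are not taken from the paper.

```python
def min_min(etc):
    """Map independent tasks to machines with a min-min style heuristic.

    etc[t][m] is the assumed expected time to compute task t on machine m.
    Returns (mapping, makespan), where mapping[t] is the chosen machine.
    """
    num_machines = len(etc[0])
    ready = [0.0] * num_machines          # time at which each machine becomes free
    unmapped = set(range(len(etc)))
    mapping = {}
    while unmapped:
        best = None                       # (completion_time, task, machine)
        for t in sorted(unmapped):
            for m in range(num_machines):
                completion = ready[m] + etc[t][m]
                if best is None or completion < best[0]:
                    best = (completion, t, m)
        completion, t, m = best
        mapping[t] = m                    # commit the task with the smallest completion time
        ready[m] = completion
        unmapped.remove(t)
    return mapping, max(ready)


# Hypothetical 3-task, 2-machine instance.
mapping, makespan = min_min([[3, 5], [2, 4], [6, 1]])
print(mapping, makespan)
```

Taking the global minimum over all (task, machine) pairs is equivalent to first finding each task's best machine and then choosing the task with the smallest of those values; ties here are broken by lowest task and machine index, which is an arbitrary choice.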

259 citations

Proceedings Article•DOI•

A taxonomy for describing matching and scheduling heuristics for mixed-machine heterogeneous computing systems

Tracy D. Braun¹, Howard Jay Siegel¹, N.B. Beck¹, Ladislau Bölöni¹, Muthucumaru Maheswaran¹, Albert Reuther¹, James Patrick Robertson¹, Mitchell D. Theys, Bin Yao¹•Institutions (1)

Purdue University¹

20 Oct 1998

TL;DR: A new taxonomy for classifying mapping heuristics for HC environments is proposed (Purdue HC Taxonomy), defined in three major parts: the models used for applications and communication requests; the models used for target hardware platforms; and the characteristics of mapping heuristics.

Abstract: The problem of mapping (defined as matching and scheduling) tasks and communications onto multiple machines and networks in a heterogeneous computing (HC) environment has been shown to be NP-complete, in general, requiring the development of heuristic techniques. Many different types of mapping heuristics have been developed in recent years. However, selecting the best heuristic to use in any given scenario remains a difficult problem. Factors making this selection difficult are discussed. Motivated by these difficulties, a new taxonomy for classifying mapping heuristics for HC environments is proposed (Purdue HC Taxonomy). The taxonomy is defined in three major parts: the models used for applications and communication requests; the models used for target hardware platforms; and the characteristics of mapping heuristics. Each part of the taxonomy is described, with examples given to help clarify the taxonomy. The benefits and uses of this taxonomy are also discussed.

102 citations

Journal Article•DOI•

A mathematical model and scheduling heuristics for satisfying prioritized data requests in an oversubscribed communication network

M.D. Theys¹, Min Tan, N.B. Beck, Howard Jay Siegel², Michael Jurczyk³•Institutions (3)

University of Illinois at Urbana–Champaign¹, Purdue University², University of Missouri³

01 Sep 2000-IEEE Transactions on Parallel and Distributed Systems

TL;DR: Three multiple-source shortest-path algorithm-based heuristics for finding a near-optimal schedule of the communication steps for staging the data are presented and are shown to perform well with respect to upper and lower bounds.

Abstract: Providing up-to-date input to users' applications is an important data management problem for a distributed computing environment, where each data storage location and intermediate node may have specific data available, storage limitations, and communication links available. Sites in the network request data items and each request has an associated deadline and priority. In a military situation, the data staging problem involves positioning data for facilitating a faster access time when it is needed by programs that will aid in decision making. This work concentrates on solving a basic version of the data staging problem in which all parameter values for the communication system and the data request information represent the best known information collected so far and stay fixed throughout the scheduling process. The network is assumed to be oversubscribed and not all requests for data items can be satisfied. A mathematical model for the basic data staging problem is introduced. Then, three multiple-source shortest-path algorithm-based heuristics for finding a near-optimal schedule of the communication steps for staging the data are presented. Each heuristic can be used with each of four cost criteria developed. Thus, 12 implementations are examined. In addition, two different weightings for the relative importance of different priority levels are considered. The performance of the proposed heuristics is evaluated and compared by simulations. The proposed heuristics are shown to perform well with respect to upper and lower bounds. Furthermore, the heuristics and a complex cost criterion allow more highest priority messages to be received than a simple-cost-based heuristic that schedules all highest priority messages first.

37 citations

Proceedings Article•DOI•

A mathematical model, heuristic, and simulation study for a basic data staging problem in a heterogeneous networking environment

Min Tan¹, Mitchell D. Theys, Howard Jay Siegel², N.B. Beck², Michael Jurczyk³•Institutions (3)

Cisco Systems, Inc.¹, Purdue University², University of Missouri³

30 Mar 1998

TL;DR: This research, based on the simplified static model, serves as a necessary step toward solving the more realistic and complicated version of the data staging problem involving dynamic scheduling, fault tolerance, and determining where to stage data.

Abstract: Data staging is an important data management problem for a distributed heterogeneous networking environment, where each data storage location and intermediate node may have specific data available, storage limitations, and communication links. Sites in the network request data items and each item is associated with a specific deadline and priority. It is assumed that not all requests can be satisfied by their deadline. The work concentrates on solving a basic version of the data staging problem in which all parameter values for the communication system and the data request information represent the best known information collected so far and stay fixed throughout the scheduling process. A mathematical model for the basic data staging problem is introduced. Then, a multiple-source shortest-path algorithm based heuristic for finding a suboptimal schedule of the communication steps for data staging is presented. A simulation study is provided, which evaluates the performance of the proposed heuristic. The results show the advantages of the proposed heuristic over two random based scheduling techniques. This research, based on the simplified static model, serves as a necessary step toward solving the more realistic and complicated version of the data staging problem involving dynamic scheduling, fault tolerance, and determining where to stage data.

31 citations

Cited by

Journal Article•DOI•

Performance-effective and low-complexity task scheduling for heterogeneous computing

Haluk Rahmi Topcuoglu¹, Salim Hariri², Min-You Wu³•Institutions (3)

Marmara University¹, University of Arizona², University of New Mexico³

01 Mar 2002-IEEE Transactions on Parallel and Distributed Systems

TL;DR: Two novel scheduling algorithms for a bounded number of heterogeneous processors with an objective to simultaneously meet high performance and fast scheduling time are presented, called the Heterogeneous Earliest-Finish-Time (HEFT) algorithm and the Critical-Path-on-a-Processor (CPOP) algorithm.

Abstract: Efficient application scheduling is critical for achieving high performance in heterogeneous computing environments. The application scheduling problem has been shown to be NP-complete in general cases as well as in several restricted cases. Because of its key importance, this problem has been extensively studied and various algorithms have been proposed in the literature which are mainly for systems with homogeneous processors. Although there are a few algorithms in the literature for heterogeneous processors, they usually require significantly high scheduling costs and they may not deliver good quality schedules with lower costs. In this paper, we present two novel scheduling algorithms for a bounded number of heterogeneous processors with an objective to simultaneously meet high performance and fast scheduling time, which are called the Heterogeneous Earliest-Finish-Time (HEFT) algorithm and the Critical-Path-on-a-Processor (CPOP) algorithm. The HEFT algorithm selects the task with the highest upward rank value at each step and assigns the selected task to the processor, which minimizes its earliest finish time with an insertion-based approach. On the other hand, the CPOP algorithm uses the summation of upward and downward rank values for prioritizing tasks. Another difference is in the processor selection phase, which schedules the critical tasks onto the processor that minimizes the total execution time of the critical tasks. In order to provide a robust and unbiased comparison with the related work, a parametric graph generator was designed to generate weighted directed acyclic graphs with various characteristics. The comparison study, based on both randomly generated graphs and the graphs of some real applications, shows that our scheduling algorithms significantly surpass previous approaches in terms of both quality and cost of schedules, which are mainly presented with schedule length ratio, speedup, frequency of best results, and average scheduling time metrics.
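The HEFT steps summarized in the abstract (rank tasks by upward rank, then place each task on the processor with the earliest finish time) can be sketched roughly as follows. This is a simplified illustration, not the authors' implementation: it appends each task after the last task on a processor instead of using the insertion-based slot search the abstract mentions, it uses a plain average of execution costs in the rank, and it assumes strictly positive execution costs so that rank order respects precedence. The function name `heft_sketch`, the input dictionaries, and the sample graph are all hypothetical.

```python
from functools import lru_cache


def heft_sketch(succ, cost, comm):
    """Simplified HEFT-style list scheduling (no insertion-based slot search).

    succ: {task: [successor tasks]}; every task must appear as a key.
    cost: {task: [execution time on each processor]}, assumed positive.
    comm: {(u, v): transfer time when u and v run on different processors}.
    Returns (placement, makespan).
    """
    num_procs = len(next(iter(cost.values())))
    pred = {t: [] for t in succ}
    for u, vs in succ.items():
        for v in vs:
            pred[v].append(u)

    @lru_cache(maxsize=None)
    def rank_u(t):
        # Upward rank: average cost plus the longest remaining path to an exit task.
        avg = sum(cost[t]) / num_procs
        return avg + max((comm[(t, s)] + rank_u(s) for s in succ[t]), default=0.0)

    avail = [0.0] * num_procs             # time at which each processor becomes free
    finish, placed = {}, {}
    for t in sorted(succ, key=rank_u, reverse=True):
        best = None                       # (earliest finish time, processor)
        for p in range(num_procs):
            data_ready = max(
                (finish[u] + (0.0 if placed[u] == p else comm[(u, t)]) for u in pred[t]),
                default=0.0,
            )
            eft = max(avail[p], data_ready) + cost[t][p]
            if best is None or eft < best[0]:
                best = (eft, p)
        finish[t], placed[t] = best
        avail[best[1]] = best[0]
    return placed, max(finish.values())


# Hypothetical diamond-shaped task graph on two processors.
succ = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": []}
cost = {"A": [2, 3], "B": [3, 2], "C": [4, 4], "D": [2, 2]}
comm = {("A", "B"): 1, ("A", "C"): 1, ("B", "D"): 1, ("C", "D"): 1}
print(heft_sketch(succ, cost, comm))
```

Dropping the insertion step keeps the sketch short, at the price of possibly leaving idle gaps on a processor that a full insertion-based policy could fill.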

2,961 citations

Journal Article•DOI•

A taxonomy and survey of grid resource management systems for distributed computing

Klaus Krauter¹, Rajkumar Buyya², Muthucumaru Maheswaran¹•Institutions (2)

University of Manitoba¹, Monash University²

01 Feb 2002-Software - Practice and Experience

TL;DR: In this article, an abstract model and a comprehensive taxonomy for describing resource management architectures is developed, which is used to identify approaches followed in the implementation of existing resource management systems for very large-scale network computing systems known as Grids.

Abstract: The resource management system is the central component of distributed network computing systems. There have been many projects focused on network computing that have designed and implemented resource management systems with a variety of architectures and services. In this paper, an abstract model and a comprehensive taxonomy for describing resource management architectures is developed. The taxonomy is used to identify approaches followed in the implementation of existing resource management systems for very large-scale network computing systems known as Grids. The taxonomy and the survey results are used to identify architectural approaches and issues that have not been fully explored in the research. Copyright © 2001 John Wiley & Sons, Ltd.

993 citations

Journal Article•DOI•

Workflows and e-Science: An overview of workflow system features and capabilities

Ewa Deelman¹, Dennis Gannon², Matthew Shields³, Ian Taylor³•Institutions (3)

University of Southern California¹, Indiana University², Cardiff University³

01 May 2009-Future Generation Computer Systems

TL;DR: The taxonomy provides end users with a mechanism by which they can assess the suitability of workflow in general and how they might use these features to make an informed choice about which workflow system would be a good choice for their particular application.

903 citations

Posted Content•

A Taxonomy of Workflow Management Systems for Grid Computing

Jia Yu¹, Rajkumar Buyya¹•Institutions (1)

University of Melbourne¹

11 Mar 2005-arXiv: Distributed, Parallel, and Cluster Computing

TL;DR: A taxonomy that characterizes and classifies various approaches for building and executing workflows on Grids is proposed; it highlights the design and engineering similarities and differences of the state of the art in Grid workflow systems and identifies the areas that need further research.

Abstract: With the advent of Grid and application technologies, scientists and engineers are building more and more complex applications to manage and process large data sets, and execute scientific experiments on distributed resources. Such application scenarios require means for composing and executing complex workflows. Therefore, many efforts have been made towards the development of workflow management systems for Grid computing. In this paper, we propose a taxonomy that characterizes and classifies various approaches for building and executing workflows on Grids. We also survey several representative Grid workflow systems developed by various projects world-wide to demonstrate the comprehensiveness of the taxonomy. The taxonomy not only highlights the design and engineering similarities and differences of state-of-the-art in Grid workflow systems, but also identifies the areas that need further research.

851 citations
