scispace - formally typeset
Search or ask a question
Conference

Parallel Processing and Applied Mathematics 

About: Parallel Processing and Applied Mathematics is an academic conference. The conference publishes majorly in the area(s): Parallel algorithm & Computer science. Over the lifetime, 738 publications have been published by the conference receiving 4803 citations.


Papers
More filters
Book ChapterDOI
09 Sep 2007
TL;DR: A concise survey of existing workflow technology from the business and scientific domain is presented and a number of key suggestions towards the future development of scientific workflow systems are made.
Abstract: Workflow technologies are emerging as the dominant approach to coordinate groups of distributed services. However with a space filled with competing specifications, standards and frameworks from multiple domains, choosing the right tool for the job is not always a straightforward task. Researchers are often unaware of the range of technology that already exists and focus on implementing yet another proprietary workflow system. As an antidote to this common problem, this paper presents a concise survey of existing workflow technology from the business and scientific domain and makes a number of key suggestions towards the future development of scientific workflow systems.

268 citations

Book ChapterDOI
11 Sep 2005
TL;DR: Open MPI as discussed by the authors is a production-quality MPI-2 implementation that is fundamentally centered around component concepts, and it provides a unique combination of novel features previously unavailable in an open-source, production quality implementation of MPI. Open MPI provides a stable platform for third-party research as well as enabling the run-time composition of independent software add-ons.
Abstract: A large number of MPI implementations are currently available, each of which emphasize different aspects of high-performance computing or are intended to solve a specific research problem. The result is a myriad of incompatible MPI implementations, all of which require separate installation, and the combination of which present significant logistical challenges for end users. Building upon prior research, and influenced by experience gained from the code bases of the LAM/MPI, LA-MPI, FT-MPI, and PACX-MPI projects, Open MPI is an all-new, production-quality MPI-2 implementation that is fundamentally centered around component concepts. Open MPI provides a unique combination of novel features previously unavailable in an open-source, production-quality implementation of MPI. Its component architecture provides both a stable platform for third-party research as well as enabling the run-time composition of independent software add-ons. This paper presents a high-level overview the goals, design, and implementation of Open MPI, as well as performance results for it's point-to-point implementation.

111 citations

Book ChapterDOI
09 Sep 2007
TL;DR: A distributed algorithm due to Hoepman is analysed and it is shown how this can be turned into a parallel algorithm that scales well using up to 32 processors.
Abstract: We consider the problem of computing a weighted edge matching in a large graph using a parallel algorithm. This problem has application in several areas of combinatorial scientific computing. Since an exact algorithm for the weighted matching problem is both fairly expensive to compute and hard to parallelise we instead consider fast approximation algorithms. We analyse a distributed algorithm due to Hoepman [8] and show how this can be turned into a parallel algorithm. Through experiments using both complete as well as sparse graphs we show that our new parallel algorithm scales well using up to 32 processors.

74 citations

Book ChapterDOI
13 Sep 2009
TL;DR: A diagnostic performance model for bandwidth-limited loop kernels which is founded on the analysis of modern cache based microarchitectures and provides an in-depth understanding of how performance for different memory hierarchy levels is made up.
Abstract: We present a diagnostic performance model for bandwidth-limited loop kernels which is founded on the analysis of modern cache based microarchitectures. This model allows an accurate performance prediction and evaluation for existing instruction codes. It provides an in-depth understanding of how performance for different memory hierarchy levels is made up. The performance of raw memory load, store and copy operations and a stream vector triad are analyzed and benchmarked on three modern x86-type quad-core architectures in order to demonstrate the capabilities of the model.

67 citations

Book ChapterDOI
09 Sep 2007
TL;DR: Comparison of FCFS, local search and dispatching rules is presented for both cases and it is demonstrated that the new local search based algorithm provides the best schedule while keeping the running time acceptable.
Abstract: This work concentrates on the design of a system intended for study of advanced scheduling techniques for planning various types of jobs in a Grid environment. The solution is able to deal with common problems of the job scheduling in Grids like heterogeneity of jobs and resources, and dynamic runtime changes such as arrivals of new jobs. Our new simulator called Alea is based on the GridSim simulation toolkit which we extended to provide a simulation environment that supports simulation of varying Grid scheduling problems. To demonstrate the features of the GridSim environment, we implemented an experimental centralised Grid scheduler which uses advanced scheduling techniques for schedule generation. By now local search based algorithms and some dispatching rules were tested. The scheduler is capable to handle both static and dynamic situation. In the static case, all jobs are known in advance while the dynamic situation means that jobs appear in the system during simulation. In this case generated schedule is changing through time as some jobs are already finished while the new ones are arriving. Comparison of FCFS, local search and dispatching rules is presented for both cases and we demonstrate that the new local search based algorithm provides the best schedule while keeping the running time acceptable.

66 citations

Performance
Metrics
No. of papers from the Conference in previous years
YearPapers
20233
202272
20203
20162
20144
20131