Showing papers in "Journal of Parallel and Distributed Computing in 2013"

PDF

Open Access

Journal Article•DOI•

Combinatorial auction-based allocation of virtual machine instances in clouds

[...]

Sharrukh Zaman¹, Daniel Grosu¹•Institutions (1)

01 Apr 2013-Journal of Parallel and Distributed Computing

TL;DR: This work formulate the problem of virtual machine allocation in clouds as a combinatorial auction problem and proposes two mechanisms to solve it, and performs extensive simulation experiments to reveal that the combinatorially auction-based mechanisms can significantly improve the allocation efficiency while generating higher revenue for the cloud providers.

...read moreread less

254 citations

Journal Article•DOI•

Graphics processing unit (GPU) programming strategies and trends in GPU computing

[...]

André R. Brodtkorb¹, Trond Runar Hagen², Martin L. Sætra²•Institutions (2)

SINTEF¹, University of Oslo²

01 Jan 2013-Journal of Parallel and Distributed Computing

TL;DR: In this paper, the authors present an overview of current GPU programming strategies, profile-driven development, and an outlook to future trends, as well as a discussion of the challenges of getting started with GPU programming.

...read moreread less

203 citations

Journal Article•DOI•

Detecting Sybil attacks in VANETs

[...]

Bo Yu¹, Cheng-Zhong Xu¹, Bin Xiao²•Institutions (2)

Wayne State University¹, Hong Kong Polytechnic University²

01 Jun 2013-Journal of Parallel and Distributed Computing

TL;DR: This paper proposes a cooperative method to verify the positions of potential Sybil nodes and introduces a statistical method and design a system which is able to verify where a vehicle comes from, termed the Presence Evidence System (PES).

...read moreread less

173 citations

Journal Article•DOI•

Enhancing data parallelism for Ant Colony Optimization on GPUs

[...]

José M. Cecilia¹, José M. García¹, Andy Nisbet², Martyn Amos², Manuel Ujaldón³ - Show less +1 more•Institutions (3)

University of Murcia¹, Manchester Metropolitan University², University of Málaga³

01 Jan 2013-Journal of Parallel and Distributed Computing

TL;DR: This paper deals with a GPU implementation of Ant Colony Optimization (ACO), a population-based optimization method which comprises two major stages: tour construction and pheromone update, and proposes a new mechanism called I-Roulette to replicate the classic roulette wheel while improving GPU parallelism.

...read moreread less

122 citations

Journal Article•DOI•

PHAST: Hardware-accelerated shortest path trees

[...]

Daniel Delling¹, Andrew V. Goldberg¹, Andreas G. Nowatzyk¹, Renato F. Werneck¹•Institutions (1)

Microsoft¹

01 Jul 2013-Journal of Parallel and Distributed Computing

TL;DR: A novel algorithm to solve the non-negative single-source shortest path problem on road networks and graphs with low highway dimension that needs fewer operations, has better locality, and is better able to exploit parallelism at multi-core and instruction levels.

...read moreread less

115 citations

Journal Article•DOI•

Parallel differential evolution with self-adapting control parameters and generalized opposition-based learning for solving high-dimensional optimization problems

[...]

Hui Wang¹, Shahryar Rahnamayan², Zhijian Wu³•Institutions (3)

Nanchang Institute of Technology¹, University of Ontario Institute of Technology², Wuhan University³

01 Jan 2013-Journal of Parallel and Distributed Computing

TL;DR: Simulation results demonstrate that GjODE is better than, or at least comparable to, six other algorithms, and employing GPU can effectively reduce computational time.

...read moreread less

105 citations

Journal Article•DOI•

Parallel Ant Colony Optimization on Graphics Processing Units

[...]

Audrey Delevacq¹, Pierre Delisle¹, Marc Gravel², Michaël Krajecki¹•Institutions (2)

University of Reims Champagne-Ardenne¹, Université du Québec²

01 Jan 2013-Journal of Parallel and Distributed Computing

TL;DR: A comparative experimental study highlights the performance impact of ACO parameters, GPU technical configuration, memory structures and parallelization granularity on a state-of-the-art Fermi GPU architecture.

...read moreread less

105 citations

Journal Article•DOI•

A DAG scheduling scheme on heterogeneous computing systems using double molecular structure-based chemical reaction optimization

[...]

Yuming Xu¹, Kenli Li¹, Ligang He², Tung Khac Truong¹•Institutions (2)

Hunan University¹, University of Warwick²

01 Sep 2013-Journal of Parallel and Distributed Computing

TL;DR: The CRO scheme is used to formulate the scheduling of Directed Acyclic Graph (DAG) jobs in heterogeneous computing systems, and a Double Molecular Structure-based Chemical Reaction Optimization (DMSCRO) method is developed.

...read moreread less

97 citations

Journal Article•DOI•

The Failure Trace Archive: Enabling the comparison of failure measurements and models of distributed systems

[...]

Bahman Javadi¹, Derrick Kondo², Alexandru Iosup³, Dick Epema³•Institutions (3)

University of Western Sydney¹, French Institute for Research in Computer Science and Automation², Delft University of Technology³

01 Aug 2013-Journal of Parallel and Distributed Computing

TL;DR: The design of the archive, in particular of the standard FTA data format, and the design of a toolbox that facilitates automated analysis of trace data sets are described, and how different interpretations of the meaning of failure data can result in different conclusions for failure modeling and job scheduling in distributed systems are shown.

...read moreread less

90 citations

Journal Article•DOI•

A decentralized approach for mining event correlations in distributed system monitoring

[...]

Gang Wu¹, Huxing Zhang¹, Meikang Qiu², Zhong Ming³, Jiayin Li², Xiao Qin⁴ - Show less +2 more•Institutions (4)

Shanghai Jiao Tong University¹, University of Kentucky², Shenzhen University³, Auburn University⁴

01 Mar 2013-Journal of Parallel and Distributed Computing

TL;DR: A MapReduce-based algorithm is proposed to data mining event association rules, which utilizes the computational resource of multiple dedicated nodes of the system, and achieves nearly ideal speedup compared to centralized mining approaches.

...read moreread less

86 citations

Journal Article•DOI•

Parallel approaches to machine learning-A comprehensive survey

[...]

Sujatha R. Upadhyaya¹•Institutions (1)

Infosys¹

01 Mar 2013-Journal of Parallel and Distributed Computing

TL;DR: Map reduce is another important technique that has evolved during this period and as the literature has it, it has been proved to be an important aid in delivering performance of machine learning algorithms on GPUs.

...read moreread less

Journal Article•DOI•

Oblivious algorithms for multicores and networks of processors

[...]

Rezaul Chowdhury¹, Vijaya Ramachandran², Francesco Silvestri³, Brandon Blakeley⁴•Institutions (4)

Stony Brook University¹, University of Texas at Austin², University of Padua³, University of Washington⁴

01 Jul 2013-Journal of Parallel and Distributed Computing

TL;DR: This work introduces a multicore-oblivious (MO) approach to algorithms and schedulers for HM, and presents efficient MO algorithms for several fundamental problems including matrix transposition, FFT, sorting, the Gaussian Elimination Paradigm, list ranking, and connected components.

...read moreread less

Journal Article•DOI•

Distributed anomaly detection for industrial wireless sensor networks based on fuzzy data modelling

[...]

Heshan Kumarage¹, Ibrahim Khalil¹, Zahir Tari¹, Albert Y. Zomaya²•Institutions (2)

RMIT University¹, University of Sydney²

01 Jun 2013-Journal of Parallel and Distributed Computing

TL;DR: This paper proposes a robust and scalable mechanism that aims to detect malicious anomalies accurately and efficiently using distributed in-network processing in a hierarchical framework with more than 96% less communication overheads opposed to a centralized approach.

...read moreread less

Journal Article•DOI•

Foundations of distributed multiscale computing: Formalization, specification, and analysis

[...]

Joris Borgdorff¹, Jean-Luc Falcone², Eric Lorenz¹, Carles Bona-Casas¹, Bastien Chopard², Alfons G. Hoekstra¹ - Show less +2 more•Institutions (2)

University of Amsterdam¹, University of Geneva²

01 Apr 2013-Journal of Parallel and Distributed Computing

TL;DR: A high-level and well-defined Multiscale Modeling Language (MML) is enhanced that describes and specifies multiscale models and their computational architecture in a modular way and is applied to two selected applications in nanotechnology and biophysics, showing its capabilities.

...read moreread less

Journal Article•DOI•

Towards accelerating smoothed particle hydrodynamics simulations for free-surface flows on multi-GPU clusters

[...]

Daniel Valdez-Balderas¹, José M. Domínguez², Benedict D. Rogers¹, Alejandro J. C. Crespo²•Institutions (2)

University of Manchester¹, University of Vigo²

01 Nov 2013-Journal of Parallel and Distributed Computing

TL;DR: In this paper, a multi-GPU SPH program is developed for free-surface flows based on a spatial decomposition technique, whereby different portions (subdomains) of the physical system under study are assigned to different GPUs.

...read moreread less

Journal Article•DOI•

KNEM: A generic and scalable kernel-assisted intra-node MPI communication framework

[...]

Brice Goglin, Stéphanie Moreaud

01 Feb 2013-Journal of Parallel and Distributed Computing

TL;DR: The KNEM module for the Linux kernel is presented that provides MPI implementations with a flexible and scalable interface for performing kernel-assisted single-copy data transfers between local processes and brings significant application performance improvements thanks to more efficient point-to-point and collective operations.

...read moreread less

Journal Article•DOI•

An investigation of the performance portability of OpenCL

[...]

Simon J. Pennycook¹, Simon D. Hammond², Steven A. Wright¹, J. A. Herdman³, I. Miller³, Stephen A. Jarvis¹ - Show less +2 more•Institutions (3)

University of Warwick¹, Sandia National Laboratories², Atomic Weapons Establishment³

01 Nov 2013-Journal of Parallel and Distributed Computing

TL;DR: The development of an MPI/OpenCL implementation of LU, an application-level benchmark from the NAS Parallel Benchmark Suite, is reported, demonstrating the importance of memory arrangement and work-item/work-group distribution strategies when applications are deployed on different device types.

...read moreread less

Journal Article•DOI•

MapReduce with communication overlap (MaRCO)

[...]

Faraz Ahmad¹, Seyong Lee², Mithuna Thottethodi¹, T. N. Vijaykumar¹•Institutions (2)

Purdue University¹, Oak Ridge National Laboratory²

01 May 2013-Journal of Parallel and Distributed Computing

TL;DR: MaRCO is implemented in Hadoop's MapReduce and it is shown that on a 128-node Amazon EC2 cluster, MaRCO achieves 23% average speed-up overHadoop for shuffle-heavy MapReductions.

...read moreread less

Journal Article•DOI•

Fault tolerant decentralised K-Means clustering for asynchronous large-scale networks

[...]

Giuseppe Di Fatta¹, Francesco Blasa², Simone Cafiero², Giancarlo Fortino²•Institutions (2)

University of Reading¹, University of Calabria²

01 Mar 2013-Journal of Parallel and Distributed Computing

TL;DR: The experimental analysis confirms that the proposed K-Means algorithm is very accurate and fault tolerant under unreliable network conditions (message loss and node failures) and is suitable for asynchronous networks of very large and extreme scale.

...read moreread less

Journal Article•DOI•

Inexact subgraph isomorphism in MapReduce

[...]

Todd Plantenga¹•Institutions (1)

Sandia National Laboratories¹

01 Feb 2013-Journal of Parallel and Distributed Computing

TL;DR: The mapReduce computing framework is designed for distributed computing on massive data sets, and the new algorithm leverages MapReduce techniques to enable processing of graphs with billions of vertices, including graphs that follow a power law degree distribution.

...read moreread less

Journal Article•DOI•

Solving very large instances of the scheduling of independent tasks problem on the GPU

[...]

Frédéric Pinel¹, Bernabé Dorronsoro¹, Pascal Bouvry¹•Institutions (1)

University of Luxembourg¹

01 Jan 2013-Journal of Parallel and Distributed Computing

TL;DR: GraphCell improves state-of-the-art solutions, especially for larger problems, and it provides an alternative to the GPU Min-min heuristic when more accurate solutions are needed, at the expense of an increased runtime.

...read moreread less

Journal Article•DOI•

An effective Parallel Multistart Tabu Search for Quadratic Assignment Problem on CUDA platform

[...]

Michał Czapiński¹•Institutions (1)

Cranfield University¹

01 Nov 2013-Journal of Parallel and Distributed Computing

TL;DR: Detailed analysis of parallelisation possibilities, memory organisation and access patterns, enables the implementation of fast and effective heuristics for QAP on the GPU - the Parallel Multistart Tabu Search (PMTS).

...read moreread less

Journal Article•DOI•

p-PIC: Parallel power iteration clustering for big data

[...]

Weizhong Yan¹, Brahmakshatriya Umang Gopalbhai¹, Ya Xue¹, Mark Richard Gilder¹, G. Bowden Wise¹ - Show less +1 more•Institutions (1)

General Electric¹

01 Mar 2013-Journal of Parallel and Distributed Computing

TL;DR: This paper attempts to expand PIC's data scalability by implementing a parallel power iteration clustering (p-PIC) algorithm that works well on low-end commodity computers (COTS-based clusters and general purpose servers found at most commercial cloud providers).

...read moreread less

Journal Article•DOI•

Stochastic DAG scheduling using a Monte Carlo approach

[...]

Wei Zheng¹, Rizos Sakellariou²•Institutions (2)

Xiamen University¹, University of Manchester²

01 Dec 2013-Journal of Parallel and Distributed Computing

TL;DR: A novel DAG scheduling approach is proposed to solve this stochastic scheduling problem, based on a Monte Carlo method, and empirical results show that a significant improvement of average application performance can be achieved by the proposed approach at a reasonable execution time cost.

...read moreread less

Journal Article•DOI•

Parallel multitask cross validation for Support Vector Machine using GPU

[...]

Qi Li¹, Raied Salman¹, Erik Test¹, Robert Strack¹, Vojislav Kecman¹ - Show less +1 more•Institutions (1)

Virginia Commonwealth University¹

01 Mar 2013-Journal of Parallel and Distributed Computing

TL;DR: A novel parallel SVM training implementation is proposed to accelerate the cross validation procedure by running multiple training tasks simultaneously on a Graphics Processing Unit (GPU) to reduce redundant computations of kernel values across different training tasks.

...read moreread less

Journal Article•DOI•

Energy-efficient clustering in lossy wireless sensor networks

[...]

Dawei Gong¹, Yuanyuan Yang¹, Zhexi Pan¹•Institutions (1)

Stony Brook University¹

01 Sep 2013-Journal of Parallel and Distributed Computing

TL;DR: The results demonstrate that the proposed clustering algorithms can significantly improve the data reception ratio, reduce the total energy consumption in the network and prolong network lifetime compared to a typical distributed clustering algorithm, HEED, that does not consider lossy links.

...read moreread less

Journal Article•DOI•

Accelerated parallel genetic programming tree evaluation with OpenCL

[...]

Douglas A. Augusto, Helio J. C. Barbosa¹•Institutions (1)

Universidade Federal de Juiz de Fora¹

01 Jan 2013-Journal of Parallel and Distributed Computing

TL;DR: This work proposes both a transcription of existing GP parallelization strategies into the OpenCL programming platform and a freely available implementation to evaluate its suitability for GP, by assessing the performance of parallel strategies on the CPU and GPU processors from different vendors.

...read moreread less

Journal Article•DOI•

Accelerating wildfire susceptibility mapping through GPGPU

[...]

Salvatore Di Gregorio¹, Giuseppe Filippone¹, William Spataro¹, Giuseppe A. Trunfio²•Institutions (2)

University of Calabria¹, University of Sassari²

01 Aug 2013-Journal of Parallel and Distributed Computing

TL;DR: General-Purpose Computation with Graphics Processing Units (GPGPU) is applied, in conjunction with a wildfire simulation model based on the Cellular Automata approach, to the process of BPM building.

...read moreread less

Journal Article•DOI•

G-MSA - A GPU-based, fast and accurate algorithm for multiple sequence alignment

[...]

Jacek Blazewicz¹, Wojciech Frohmberg, Michal Kierzynka, Paweł T. Wojciechowski•Institutions (1)

Polish Academy of Sciences¹

01 Jan 2013-Journal of Parallel and Distributed Computing

TL;DR: The main idea was to design and implement an MSA method which can take advantage of modern graphics cards, based on T-Coffee-well known for its high accuracy MSA algorithm, and is highly efficient achieving up to 193-fold speedup on a single GPU.

...read moreread less

Journal Article•DOI•

Parallel multi-dimensional range query processing with R-trees on GPU

[...]

Jinwoong Kim¹, Sul-Gi Kim¹, Beomseok Nam¹•Institutions (1)

Ulsan National Institute of Science and Technology¹

01 Aug 2013-Journal of Parallel and Distributed Computing

TL;DR: An extensive experimental study shows that MPTS R-tree traversal algorithm on NVIDIA Tesla M2090 GPU consistently outperforms traditional recursive R-trees search algorithm on Intel Xeon E5506 processors.

...read moreread less