Performance Evaluation of Priority Queues for Fine-Grained Parallel Tasks on GPUs
Citations
74 citations
52 citations
13 citations
Cites methods from "Performance Evaluation of Priority ..."
...The implementation based on ring buffers and the synchronisation based on atomic operations closely resembles GPU-based discrete-event simulations, which have been shown to achieve high speedup over a CPU-based execution [35], [36]....
[...]
10 citations
2 citations
Cites background from "Performance Evaluation of Priority ..."
...However, many classical algorithms and data structures that were developed for sequential machines generate workloads that are not well-suited for highly parallel GPUs....
[...]
...In 2007, Harish et al. [22] demonstrated that a Bellman-Ford approach is well-suited to GPUs, achieving performance gains over sequential approaches for scale-free graphs....
[...]
...[13] more recently demonstrated that, for small queues of up to 500 items, simple circular buffers out-perform tree-based queues for a range of applications....
[...]
...In the past decade, graphics processing units (GPUs) have emerged as the most effective hardware architectures for solving computationally intensive problems....
[...]
...While various priority queue structures have been developed that are parallel, cache-aware, or cache-oblivious, none has been shown to be efficient on GPUs....
[...]
References
17,939 citations
10,366 citations
"Performance Evaluation of Priority ..." refers methods in this paper
...2) A* Path Search: The A* algorithm [18] is an extension to Dijkstra’s algorithm for finding shortest paths in a graph....
[...]
1,321 citations
"Performance Evaluation of Priority ..." refers background in this paper
...At larger item counts, while the conclusions vary, heaps, splay trees [23] or more complex proposals such as the calendar queue [24] or ladder queue [25] achieved highest performance....
[...]
...Splay Tree: Splay trees, proposed by Sleator and Tarjan [23], are binary search trees that heuristically adjust to the access patterns to the tree’s items....
[...]
541 citations
"Performance Evaluation of Priority ..." refers background in this paper
...presented a parallelization of the breadth-first search problem achieving asymptotically optimal work complexity [12]....
[...]