Topic

Degree of parallelism

About: Degree of parallelism is a research topic. Over its lifetime, 1,515 publications have been published within this topic, receiving 25,546 citations.


Papers
Journal ArticleDOI
TL;DR: A parallel algorithm for ADI preconditioning is proposed in which several tridiagonal systems, traditionally solved one after another, are solved concurrently; the algorithm can be implemented on a multiprocessor architecture.

7 citations
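
To illustrate the idea of this entry, the sketch below solves several independent tridiagonal systems concurrently, each with the standard Thomas algorithm. This is not the paper's implementation; the function names and the use of a Python thread pool are illustrative assumptions.

```python
# Minimal sketch (not the paper's code): solve several independent
# tridiagonal systems concurrently, as they arise in ADI preconditioning.
from concurrent.futures import ThreadPoolExecutor
import numpy as np

def thomas(a, b, c, d):
    """Solve one tridiagonal system with sub-, main- and super-diagonals a, b, c."""
    n = len(b)
    cp, dp = np.empty(n), np.empty(n)
    cp[0], dp[0] = c[0] / b[0], d[0] / b[0]
    for i in range(1, n):                      # forward elimination sweep
        m = b[i] - a[i] * cp[i - 1]
        cp[i] = c[i] / m
        dp[i] = (d[i] - a[i] * dp[i - 1]) / m
    x = np.empty(n)
    x[-1] = dp[-1]
    for i in range(n - 2, -1, -1):             # back substitution
        x[i] = dp[i] - cp[i] * x[i + 1]
    return x

def solve_all(systems, workers=4):
    # The systems are independent, so they can be handed to a pool of workers.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda s: thomas(*s), systems))
```

Because of CPython's interpreter lock, a process pool or a compiled kernel would be needed for genuine CPU parallelism; the thread pool here only marks which solves are mutually independent.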

Journal ArticleDOI
TL;DR: PARJ is presented, an in-memory RDF store which takes into consideration ontological hierarchies during join processing with very low performance overhead, avoiding expensive preprocessing and materialization of implications, and is also amenable to straightforward parallelization.
Abstract: The Resource Description Framework (RDF) data model enables the construction of knowledge graphs over various domains, using ontologies to encode information about the domain and simple subject-predicate-object triples for data representation, facilitating the interlinking and exchange of Web data. However, this simplicity comes at the cost of having to execute a large number of joins to obtain the desired query results, while large ontological hierarchies further complicate query answering for systems that provide complete answers with respect to such ontological axioms. In this work we present PARJ, an in-memory RDF store that takes ontological hierarchies into consideration during join processing with very low performance overhead, avoiding expensive preprocessing and materialization of implications, and is also amenable to straightforward parallelization. Specifically, we present a join implementation that achieves any desired degree of parallelism on arbitrary join queries and RDF graphs stored in memory using compact vertical partitioning. We use an adaptive join processing approach that takes advantage of complete or even partial ordering of the RDF data, which is stored compactly to increase spatial locality and keep memory consumption low, coupled with an ID-to-Position vector index used when the ordering does not allow efficient scanning of the input relation. Finally, we experimentally show the efficiency and scalability of our proposal.

7 citations
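
As a rough illustration of two ingredients the abstract mentions, vertical partitioning by predicate and a join whose probe phase is split across workers, here is a minimal sketch. It is not PARJ's code: the data layout and function names are assumptions, and a hash join stands in for PARJ's adaptive, ordering-aware join.

```python
# Illustrative sketch only, not PARJ's code: vertical partitioning of triples
# by predicate, plus a subject-subject join whose probe phase is split across
# worker threads to reach a chosen degree of parallelism.
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

def vertical_partition(triples):
    """Group (subject, predicate, object) triples into one table per predicate."""
    tables = defaultdict(list)
    for s, p, o in triples:
        tables[p].append((s, o))
    return tables

def parallel_subject_join(left, right, workers=4):
    """Join two predicate tables on subject; build one index, probe in parallel."""
    index = defaultdict(list)                              # build side: subject -> objects
    for s, o in left:
        index[s].append(o)
    chunks = [right[i::workers] for i in range(workers)]   # split the probe side
    def probe(chunk):
        return [(s, lo, ro) for s, ro in chunk for lo in index.get(s, [])]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return [row for part in pool.map(probe, chunks) for row in part]

# Usage: tables = vertical_partition(triples)
#        parallel_subject_join(tables["p1"], tables["p2"], workers=8)
```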

Book ChapterDOI
26 Aug 1996
TL;DR: An exact algorithm is presented for finding a computation mapping and data distributions that minimize, for a given degree of parallelism, the number of remote data accesses in a distributed memory parallel computer (DMPC).
Abstract: We describe an exact algorithm for finding a computation mapping and data distributions that minimize, for a given degree of parallelism, the number of remote data accesses in a distributed memory parallel computer (DMPC). This problem is shown to be NP-hard.

7 citations
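
The optimization problem this paper solves exactly can be stated in a few lines. The toy sketch below enumerates all mappings for a given degree of parallelism and counts remote accesses; it only illustrates the objective (and why exhaustive search is infeasible at scale), not the authors' algorithm, and all names are assumptions.

```python
# Toy statement of the optimization problem (not the paper's exact algorithm):
# enumerate every computation mapping that uses all p processors and every
# data distribution, and keep the pair with the fewest remote accesses.
from itertools import product

def best_mapping(accesses, n_comp, n_data, p):
    """accesses: (computation, data_item) pairs; p: required degree of parallelism."""
    best = (float("inf"), None)
    for comp_map in product(range(p), repeat=n_comp):
        if len(set(comp_map)) < p:            # must actually use all p processors
            continue
        for data_map in product(range(p), repeat=n_data):
            remote = sum(comp_map[c] != data_map[d] for c, d in accesses)
            best = min(best, (remote, (comp_map, data_map)))
    return best

# Tiny example: 3 computations, 2 data items, degree of parallelism 2.
print(best_mapping([(0, 0), (1, 0), (2, 1)], n_comp=3, n_data=2, p=2))
```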

Book ChapterDOI
16 Dec 2008
TL;DR: This work pursues the scalable parallel implementation of the Cholesky factorization of band matrices with medium to large bandwidth targeting SMP and multi-core architectures by decomposing the computation into a large number of fine-grained operations exposing a higher degree of parallelism.
Abstract: We pursue the scalable parallel implementation of the factorization of band matrices with medium to large bandwidth, targeting SMP and multi-core architectures. Our approach decomposes the computation into a large number of fine-grained operations, exposing a higher degree of parallelism. The SuperMatrix run-time system allows an out-of-order scheduling of operations that is transparent to the programmer. Experimental results for the Cholesky factorization of band matrices on two parallel platforms with sixteen processors demonstrate the scalability of the solution.

7 citations
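
The abstract's key point, that breaking the factorization into many fine-grained block operations exposes more parallelism, can be sketched with a right-looking blocked Cholesky. The sketch below is not the paper's SuperMatrix code: it works on a dense rather than a band matrix and runs the tasks sequentially, with comments marking which tasks are mutually independent and could be scheduled out of order by a task-based run-time.

```python
# Sketch only: right-looking blocked Cholesky on a dense SPD matrix.
# A smaller block size nb yields more, finer-grained tasks; the TRSM and
# trailing-update tasks inside one iteration are mutually independent and
# could be dispatched concurrently by a task-based run-time system.
import numpy as np

def blocked_cholesky(A, nb):
    """Return the lower Cholesky factor of SPD matrix A using block size nb."""
    A = A.copy()
    n = A.shape[0]
    for k in range(0, n, nb):
        ke = min(k + nb, n)
        # POTRF task: factor the diagonal block (must precede the tasks below).
        A[k:ke, k:ke] = np.linalg.cholesky(A[k:ke, k:ke])
        # TRSM tasks: one independent task per block of the panel below.
        for i in range(ke, n, nb):
            ie = min(i + nb, n)
            A[i:ie, k:ke] = np.linalg.solve(A[k:ke, k:ke], A[i:ie, k:ke].T).T
        # SYRK/GEMM tasks: one independent update per trailing block.
        for i in range(ke, n, nb):
            ie = min(i + nb, n)
            for j in range(ke, i + 1, nb):
                je = min(j + nb, n)
                A[i:ie, j:je] -= A[i:ie, k:ke] @ A[j:je, k:ke].T
    return np.tril(A)

# Quick check against NumPy's factorization:
M = np.random.rand(8, 8); M = M @ M.T + 8 * np.eye(8)
assert np.allclose(blocked_cholesky(M, nb=3), np.linalg.cholesky(M))
```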

Posted Content
TL;DR: A novel parallel tip-decomposition algorithm -- REfine CoarsE-grained Independent Tasks (RECEIPT) that relaxes the peeling order restrictions by partitioning the vertices into multiple independent subsets that can be concurrently peeled to simultaneously achieve a high degree of parallelism and dramatic reduction in synchronizations.
Abstract: Tip decomposition is a crucial kernel for mining dense subgraphs in bipartite networks, with applications in spam detection, analysis of affiliation networks, etc. It creates a hierarchy of vertex-induced subgraphs with varying densities determined by the participation of vertices in butterflies (2,2-bicliques). To build the hierarchy, existing algorithms iteratively follow a delete-update (peeling) process: deleting vertices with the minimum number of butterflies and correspondingly updating the butterfly count of their 2-hop neighbors. The need to explore the 2-hop neighborhood renders tip decomposition computationally very expensive. Furthermore, the inherent sequentiality in peeling only minimum-butterfly vertices makes derived parallel algorithms prone to heavy synchronization. In this paper, we propose a novel parallel tip-decomposition algorithm, REfine CoarsE-grained Independent Tasks (RECEIPT), that relaxes the peeling order restrictions by partitioning the vertices into multiple independent subsets that can be concurrently peeled. This enables RECEIPT to simultaneously achieve a high degree of parallelism and a dramatic reduction in synchronizations. Further, RECEIPT employs a hybrid peeling strategy along with other optimizations that drastically reduce the amount of wedge exploration and execution time. We perform a detailed experimental evaluation of RECEIPT on a shared-memory multicore server. It can process some of the largest publicly available bipartite datasets orders of magnitude faster than the state-of-the-art algorithms, achieving up to 1100x and 64x reductions in the number of thread synchronizations and traversed wedges, respectively. Using 36 threads, RECEIPT can provide up to 17.1x self-relative speedup. Our implementation of RECEIPT is available at this https URL.

7 citations
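
As context for the peeling process described above, the sketch below computes per-vertex butterfly counts for one side of a bipartite graph, the quantity that peeling repeatedly updates, and distributes the vertices over a thread pool. It is only a toy illustration under assumed names and data structures; RECEIPT's partitioned, synchronization-light peeling is far more involved.

```python
# Toy illustration of the butterfly-counting kernel that peeling updates;
# not RECEIPT's code.
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from math import comb

def butterfly_counts(adj_u, adj_v, workers=4):
    """Per-vertex butterfly counts for the U side of a bipartite graph.
    adj_u: u -> set of v-neighbours, adj_v: v -> set of u-neighbours."""
    def count(u):
        wedges = Counter()
        for v in adj_u[u]:
            for u2 in adj_v[v]:
                if u2 != u:
                    wedges[u2] += 1           # wedge u - v - u2
        # every pair of wedges sharing the same endpoint u2 closes one butterfly
        return u, sum(comb(w, 2) for w in wedges.values())
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(count, adj_u))
```

A peeling pass would then repeatedly delete the U-vertex with the smallest count and update the counts of its 2-hop neighbours; RECEIPT's contribution is peeling whole vertex subsets concurrently instead of one minimum vertex at a time.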


Network Information
Related Topics (5)

Topic                     Papers    Citations   Related
Server                    79.5K     1.4M        85%
Scheduling (computing)    78.6K     1.3M        83%
Network packet            159.7K    2.2M        80%
Web service               57.6K     989K        80%
Quality of service        77.1K     996.6K      79%
Performance Metrics
No. of papers in the topic in previous years

Year    Papers
2022    1
2021    47
2020    48
2019    52
2018    70
2017    75