scispace - formally typeset
Proceedings ArticleDOI

The 48-core SCC Processor: the Programmer's View

TLDR
The programmer's view of this chip is described and RCCE is described: the native message passing model created for the SCC processor, an intermediate case, sharing traits of message passing and shared memory architectures.
Abstract
The number of cores integrated onto a single die is expected to climb steadily in the foreseeable future. This move to many-core chips is driven by a need to optimize performance per watt. How best to connect these cores and how to program the resulting many-core processor, however, is an open research question. Designs vary from GPUs to cache-coherent shared memory multiprocessors to pure distributed memory chips. The 48-core SCC processor reported in this paper is an intermediate case, sharing traits of message passing and shared memory architectures. The hardware has been described elsewhere. In this paper, we describe the programmer's view of this chip. In particular we describe RCCE: the native message passing model created for the SCC processor.

read more

Citations
More filters
Journal ArticleDOI

Light-weight communications on Intel's single-chip cloud computer processor

TL;DR: This paper describes two communication libraries available on the Single-Chip Cloud Computer: RCCE and Rckmb, a light-weight, minimal library for writing message passing parallel applications and SCC's non-cache-coherent shared memory for transferring data between cores without needing to go off-chip.
Journal ArticleDOI

A Reliable Routing Architecture and Algorithm for NoCs

TL;DR: Vicis is a fault-tolerant architecture and companion routing protocol that is robust to a large number of permanent failures, allowing communication to continue in the face of permanent transistor failures.
Journal ArticleDOI

A framework for metamorphic malware analysis and real-time detection

TL;DR: A new framework called MARD is presented, to protect the end points that are often the last defense, against metamorphic malware, and provides automation, platform independence, optimizations for real-time performance and modularity.
Proceedings ArticleDOI

K2: a mobile operating system for heterogeneous coherence domains

TL;DR: This work identifies a shared-most OS model for multiple coherence domains: creating per-domain instances of core OS services with no shared state, while enabling other extended OS services to share state across domains.
References
More filters
Book

MPI: The Complete Reference

TL;DR: MPI: The Complete Reference is an annotated manual for the latest 1.1 version of the standard that illuminates the more advanced and subtle features of MPI and covers such advanced issues in parallel computing and programming as true portability, deadlock, high-performance message passing, and libraries for distributed and parallel computing.
Journal ArticleDOI

The Nas Parallel Benchmarks

TL;DR: A new set of benchmarks has been developed for the performance evaluation of highly parallel supercom puters that mimic the computation and data move ment characteristics of large-scale computational fluid dynamics applications.
Journal Article

The NAS Parallel Benchmarks

TL;DR: The original NAS Parallel Benchmarks consisted of eight individual bench- mark problems, each of which focused on some aspect of scientific computing, although most of these benchmarks have much broader relevance, since in a much larger sense they are typical of many real-world computing applications.
Journal ArticleDOI

An 80-Tile Sub-100-W TeraFLOPS Processor in 65-nm CMOS

TL;DR: In this paper, an integrated network-on-chip architecture containing 80 tiles arranged as an 8x10 2D array of floating-point cores and packet-switched routers, both designed to operate at 4 GHz.
Related Papers (5)