Flit Synchronous Aelite Network on Chip

Flit Synchronous Aelite Network on Chip

01 Jan 2008-

TL;DR: The Aelite NoC offering guaranteed services exploits the complexities of System-on-Chip design with real time requirements and implements flit synchronous communication using mesochronous and asynchronous links.

read less

Abstract: The deep sub micron process technology and application convergence increases the design challenges in System-on-Chip (SoC). The traditional bus based on chip communication are not scalable and fails to deliver the performance requirements of the complex SoC. The Network on Chip (NoC) has been emerged as a solution to address these complexities of a efficient, high performance, scalable SoC design. The Aethereal NoC provides the latency and throughput bounds by pipelined timedivision multiplexed (TDM) circuit switching architecture. A global synchronous clock defines the timing for TDM, which is not beneficial for decreasing process geometry and increasing clock frequency. This thesis work focuses on the Aelite NoC architecture. The Aelite NoC offering guaranteed services exploits the complexities of System-on-Chip design with real time requirements. The Aelite NoC implements flit synchronous communication using mesochronous and asynchronous links.

...read moreread less

Content maybe subject to copyright Report

References

PDF

Open Access

More filters

Proceedings Article•DOI•

Task-level timing models for guaranteed performance in multiprocessor networks-on-chip

[...]

Peter Poplavko¹, Twan Basten¹, Marco J. G. Bekooij², J. van Meerbergen¹, Bart Mesman¹ - Show less +1 more•Institutions (2)

Eindhoven University of Technology¹, Philips²

30 Oct 2003

TL;DR: This work proposes exact timing models that effectively co-model both the computation and communication of a job, including buffer models, based on interprocessor communication (IPC) graphs.

...read moreread less

Abstract: We consider a dynamic application running on a multiprocessor network-on-chip as a set of independent jobs, each job possibly running on multiple processors. To provide guaranteed quality and performance, the scheduling of jobs, jobs themselves and the hardware must be amenable to timing analysis. For a certain class of applications and multiprocessor architectures, we propose exact timing models that effectively co-model both the computation and communication of a job. The models are based on interprocessor communication (IPC) graphs [4]. Our main contribution is a precise model of network-on-chip communication, including buffer models. We use a JPEG-decoder job as an example to demonstrate that our models can be used in practice to derive upper bounds on the job execution time and to reason about optimal buffer sizes.

...read moreread less

109 citations

Journal Article•DOI•

A unified approach to mapping and routing on a network-on-chip for both best-effort and guaranteed service traffic

[...]

MA Andreas Hansson, Kgw Kees Goossens, Andrei Radulescu

04 Jun 2007-Vlsi Design

TL;DR: This paper presents a unified single-objective algorithm, called Unified MApping, Routing, and Slot allocation (UMARS+), which shows how to couple path selection, mapping of cores, and channel time-slot allocation to minimize the network required to meet the constraints of the application.

...read moreread less

Abstract: One of the key steps in Network-on-Chip-based design is spatial mapping of cores and routing of the communication between those cores. Known solutions to the mapping and routing problems first map cores onto a topology and then route communication, using separate and possibly conflicting objective functions. In this paper, we present a unified single-objective algorithm, called Unified MApping, Routing, and Slot allocation (UMARS+). As the main contribution, we show how to couple path selection, mapping of cores, and channel time-slot allocation to minimize the network required to meet the constraints of the application. The time-complexity of UMARS+ is low and experimental results indicate a run-time only 20% higher than that of path selection alone. We apply the algorithm to an MPEG decoder System-on-Chip, reducing area by 33%, power dissipation by 35%, and worst-case latency by a factor four over a traditional waterfall approach.

...read moreread less

88 citations

Book Chapter•DOI•

Predictable embedded multiprocessor system design

[...]

Marco J. G. Bekooij¹, Orlando Moreira¹, Peter Poplavko¹, Peter Poplavko², Bart Mesman², Bart Mesman¹, M. Pastrnak², M. Pastrnak¹, Jef van Meerbergen², Jef van Meerbergen¹ - Show less +6 more•Institutions (2)

Philips¹, Eindhoven University of Technology²

02 Sep 2004

TL;DR: In this article, the authors present a model for predicting heterogeneous application domain specific multiprocessor systems, which can meet demanding performance, flexibility and power-efficiency requirements as well as stringent timing requirements.

...read moreread less

Abstract: Consumers have high expectations about the video and audio quality delivered by media processing devices like TV-sets, DVD-players and digital radios. Predictable heterogenous application domain specific multiprocessor systems, which are designed around a networks-on-chip, can meet demanding performance, flexibility and power-efficiency requirements as well as stringent timing requirements. The timing requirements can be guaranteed by making use of resource management techniques and the analytical techniques that are described in this paper.

...read moreread less

73 citations

Proceedings Article•DOI•

Undisrupted quality-of-service during reconfiguration of multiple applications in networks on chip

[...]

Andreas Hansson¹, Martijn Coenen², Kees Goossens²•Institutions (2)

Eindhoven University of Technology¹, NXP Semiconductors²

16 Apr 2007

TL;DR: A model that enables partial reconfiguration of NoCs and a mapping algorithm that uses the model to map multiple applications onto a NoC with undisrupted quality-of-service during reconfigurations are presented.

...read moreread less

Abstract: Networks on chip (NoC) have emerged as the design paradigm for scalable system on chip (SoC) communication infrastructure. Due to convergence, a growing number of applications are integrated on the same chip. When combined, these applications result in use-cases with different communication requirements. The NoC is configured per use-case and traditionally all running applications are disrupted during use-case transitions, even those continuing operation. In this paper we present a model that enables partial reconfiguration of NoCs and a mapping algorithm that uses the model to map multiple applications onto a NoC with undisrupted quality-of-service during reconfiguration. The performance of the methodology is verified by comparison with existing solutions for several SoC designs. We apply the algorithm to a mobile phone SoC with telecom, multimedia and gaming applications, reducing NoC area by more than 17% and power consumption by 50% compared to a state-of-the-art approach

...read moreread less

70 citations

Proceedings Article•DOI•

Liveness and Boundedness of Synchronous Data Flow Graphs

[...]

Amir Hossein Ghamarian¹, Marc Geilen¹, Twan Basten¹, Bart Theelen¹, Mohammad Reza Mousavi¹, Sander Stuijk¹ - Show less +2 more•Institutions (1)

Eindhoven University of Technology¹

12 Nov 2006

TL;DR: An algorithm to compute the maximal achievable throughput of an SDFG that relaxes the requirement of strong connectedness in earlier work on throughput analysis and introduces a third and new definition of self-timed boundedness, very important to SDFGs.

...read moreread less

Abstract: Synchronous Data Flow Graphs (SDFGs) have proven to be suitable for specifying and analyzing streaming applications that run on single- or multi-processor platforms. Streaming applications essentially continue their execution indefinitely. Therefore, one of the key properties of an SDFG is liveness, i.e., whether all parts of the SDFG can run infinitely often. Another elementary requirement is whether an implementation of an SDFG is feasible using a limited amount of memory. In this paper, we study two interpretations of this property, called boundedness and strict boundedness, that were either already introduced in the SDFG literature or studied for other models. A third and new definition is introduced, namely self-timed boundedness, which is very important to SDFGs, because self-timed execution results in the maximal throughput of an SDFG. Necessary and sufficient conditions for liveness in combination with all variants of boundedness are given, as well as algorithms for checking those conditions. As a by-product, we obtain an algorithm to compute the maximal achievable throughput of an SDFG that relaxes the requirement of strong connectedness in earlier work on throughput analysis.

...read moreread less

66 citations

Flit Synchronous Aelite Network on Chip

References

Related Papers (5)