Area and power efficient network on chip router architecture

doi:10.1109/CICT.2013.6558214

Home
/
Papers
/
Area and power efficient network on chip router architecture

Proceedings Article•DOI•

Area and power efficient network on chip router architecture

Sweta Sahu¹, Harish M. Kittur¹•Institutions (1)

VIT University¹

11 Apr 2013-pp 855-859

TL;DR: This paper has implemented a 5-port 32-bit and 64-bit router architecture using wormhole routing technique for 2D-mesh network by using simple deterministic algorithm, flow control and decoding mechanism and demonstrated that multiplexer router design is more efficient than a matrix router design.

read less

Abstract: In System on Chip, buses and point to point links are used as a communication infrastructure between one IP to another, but these cannot provide efficient interconnect from performance point of view. So NoC architecture was proposed to provide communication in multiprocessor SoC and overcome the limitations. This paper has implemented a 5-port 32-bit and 64-bit router architecture using wormhole routing technique for 2D-mesh network by using simple deterministic algorithm, flow control and decoding mechanism. In this router, 2 types of crossbar are used named as multiplexer and tri-state buffer matrix for efficient design. Comparisons of area and power are done for these router designs using ASIC tool flow in Cadence using TSMC 90nm and 180nm process technologies. Simulation results are performed in Cadence NC simulator. It is demonstrated that multiplexer router design is more efficient than a matrix router design as area and power increases for matrix design while using same port-width.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Comparative analysis of network-on-chip simulation tools

[...]

Sarzamin Khan, Sheraz Anjum, Usman Ali Gulzari, Frank Sill Torres

01 Jan 2018-Iet Computers and Digital Techniques

TL;DR: The exemplary real world application, video object plane decoder is mapped on a 2D mesh NoC using different mapping algorithms under NOCMAP and NoCTweak simulators for comparative analysis of the NoC simulators and their embedded mapping algorithms.

...read moreread less

Abstract: Network-on-chip (NoC) is a reliable and scalable communication paradigm deemed as an alternative to classic bus systems in modern systems-on-chip designs. Consequently, one can observe extensive multidimensional research related to the design and implementation of NoC-based systems. A basic requirement for most of these activities is the availability of NoC simulators that enable the study and comparison of different technologies. This study targets the analysis of different NoC simulators and highlights its contributions towards NoC research. Various NoC tools such as NoCTweak, Noxim, Nirgam, Nostrum, BookSim, WormSim, NOCMAP and ORION are evaluated and their strengths and weaknesses are highlighted. The comparative analysis includes methods for estimation of latency, throughput and energy consumption. Further, the exemplary real world application, video object plane decoder is mapped on a 2D mesh NoC using different mapping algorithms under NOCMAP and NoCTweak simulators for comparative analysis of the NoC simulators and their embedded mapping algorithms.

...read moreread less

24 citations

Proceedings Article•DOI•

Low-power multiplexer designs using three-independent-gate field effect transistors

[...]

Edouard Giacomin¹, Jorge Romero Gonzalez¹, Pierre-Emmanuel Gaillardon¹•Institutions (1)

University of Utah¹

25 Jul 2017

TL;DR: Novel two-level and tree-like multiplexer structures intended for low-power applications based on Silicon Nanowire TIGFETs (TIG SiNWFets) are presented and compared with conventional CMOS FinFET Low-STandby Power (LSTP) structures.

...read moreread less

Abstract: Three-Independent-Gate Field Effect Transistors (TIGFETs) are capable of different modes of operation thanks to their additional gate terminals. By electrically controlling their side gates, TIGFETs can act either as a p-type or an n-type transistor and can also implement multi-threshold logic. This versatility can be used to create compact logic gates intended for high-performance or low-leakage applications. In today's Integrated Circuits (ICs), multiplexers are used in a broad range of applications such as encoding-decoding, routing signals or Look-Up Tables (LUTs). In particular, conventional Complementary Metal-Oxide-Semiconductor (CMOS)-based multiplexers with a large number of inputs require several stages, leading to significant area and energy. Due to their three-gate terminals, TIGFETs can implement the equivalent multiplexers using less transistors, arranged in a compact way, reducing the area and energy. In this paper, we present novel two-level and tree-like multiplexer structures intended for low-power applications based on Silicon Nanowire TIGFETs (TIG SiNWFETs) and compare it with conventional CMOS FinFET Low-STandby Power (LSTP) structures. Using the 22nm technology node, electrical simulations show that our SiNWFET-based multiplexer can improve the energy by operation by 2 5× and Energy-Area Product by up to 2.6× compared to the best CMOS FinFET structures.

...read moreread less

20 citations

Proceedings Article•DOI•

FPGA based design of low power reconfigurable router for Network on Chip (NoC)

[...]

Amit Bhanwala¹, Mayank Kumar¹, Yogendera Kumar¹•Institutions (1)

Galgotias University¹

15 May 2015

TL;DR: Results show that the proposed design consumes less power compared to the previously designed reconfigurable routers, and power dissipation of the proposed reconfigured router is reduced using Power gating technique.

...read moreread less

Abstract: FPGA based design of reconfigurable router for NoC applications is proposed in the present work. Design entry of the proposed router is done using Verilog Hardware Description Language (Verilog HDL). The router designed in the present work has four channels (namely, east, west, north and south) and a crossbar switch. Each channel consists of First in First out (FIFO) buffers and multiplexers. FIFO buffers are used to store the data and the input and output of the data are controlled using multiplexers. Firstly, south channel is designed which includes the design of FIFO and multiplexers. After that, the crossbar switch and other three channels are designed. All these designed channels, FIFO buffers, multiplexers and crossbar switches are integrated to form the complete router architecture. The proposed design is simulated using Modelsim and the RTL view is obtained using Xilinx ISE 13.4. Xilinx SPARTAN-6 FPGAs are used for synthesis of proposed design. Power dissipation of the proposed reconfigurable router is reduced using Power gating technique. Total power is calculated by the use of XPower Analyzer tool. Obtained results show that the proposed design consumes less power compared to the previously designed reconfigurable routers.

...read moreread less

13 citations

Cites background or methods from "Area and power efficient network on..."

...A router having eight input ports and one output port for Network on Chip using IP core with latest verification methodologies are presented in [12]....
[...]
...Multiplexer based crossbar switch for 5 ports [12]....
[...]

Journal Article•DOI•

An optimized hybrid algorithm in term of energy and performance for mapping real time workloads on 2d based on-chip networks

[...]

Sarzamin Khan¹, Sheraz Anjum¹, Usman Ali Gulzari², Farruh Ishmanov³, Maurizio Palesi⁴, Muhammad Khalil Afzal¹ - Show less +2 more•Institutions (4)

COMSATS Institute of Information Technology¹, University of Lahore², Kwangwoon University³, University of Catania⁴

01 Dec 2018-Applied Intelligence

TL;DR: The mapping results of the exemplary real world applications indicate that the proposed ONMAP algorithm is more efficient than its competitors for most of the performance parameters of the on-chip network designs.

...read moreread less

Abstract: In this paper, we propose an optimized, search based near-optimal mapping heuristic, named as ONMAP for mapping real time embedded application workloads on 2D based on-chip interconnection network platforms. ONMAP exploits NMAP, a well-known and fast nearest neighbor heuristic algorithm by using the modular exact optimization method. The proposed hybrid algorithm minimizes the on-chip inter-processor communication energy consumption and optimizes the interconnection network performance parameters. The algorithm inherits the constructive search based heuristic nature of the NMAP algorithm, as well as the property of exact optimization for mapping embedded applications on the target communication architecture. To verify the efficiency and effectiveness of the algorithm, we have compared the proposed algorithm with NMAP and random mapping algorithm under similar simulation environments and traffic conditions. The mapping results of the exemplary real world applications such as VOPD, PIP, MPEG4, MWD, MMS and WiFi-80211arx indicate that ONMAP algorithm is more efficient than its competitors for most of the performance parameters of the on-chip network designs. The algorithm successfully optimized the energy consumption, up to 20 % and 26% in comparison to NMAP and random algorithms, respectively. Similarly, the cost is optimized up to 10% and 60% as compared to NMAP and random mapping algorithms, respectively.

...read moreread less

9 citations

Proceedings Article•DOI•

MLNoC: A Machine Learning Based Approach to NoC Design

[...]

Nishant S. Rao, Akshay Ramachandran, Amish M. Shah

01 Sep 2018

TL;DR: There is a strong correlation between SoC characteristics and good NoC design practices, however this correlation is highly non-linear and multidimensional, with dimensions indicative of the features of the SoC, design goals and properties of the NoC.

...read moreread less

Abstract: Modern System on Chips (SoCs) are becoming increasingly complex with a growing number of CPUs, caches, accelerators, memory and I/O subsystems. For such designs, a packet based distributed networks-on-chip (NoCs) interconnect can provide scalability, performance and efficiency. However, the design of such a NoC involves optimizing a large number of variables such as topology, routing choices, arbitration and quality of service (QoS) policies, buffer sizes, and deadlock avoidance policies. Widely varying die sizes, power, floorplan and performance constraints across a variety of different market segments, ranging from high-end servers to low-end IoT devices, impose additional design challenges. In this paper we demonstrate that there is a strong correlation between SoC characteristics and good NoC design practices. However this correlation is highly non-linear and multidimensional, with dimensions indicative of the features of the SoC, design goals and properties of the NoC. This results in a high-dimensional NoC design space and complex search process which is inefficient to solve with classic algorithms. Using a variety of real SoCs and training data sets, we demonstrate that a machine learning (ML) based approach yields near-optimal NoC designs quickly. We determine a number of SoC and NoC features, describe reduction methods, and also show that a multi-model approach yields better designs. We demonstrate that for a wide variety of SoCs, ML based NoC designs are far superior to those designed and optimized manually over years on almost all quality metrics.

...read moreread less

6 citations

Cites background from "Area and power efficient network on..."

...consumption and efficient power management schemes [22][23][24][25] such as clock gating are an important aspect of interconnect construction....
[...]

References

PDF

Open Access

More filters

Book•

Interconnection Networks: An Engineering Approach

[...]

José Duato, Sudhakar Yalamanchili, Ni Lionel

01 Oct 1997

TL;DR: The book's engineering approach considers the issues that designers need to deal with and presents a broad set of practical solutions that address the challenges and details the basic underlying concepts of interconnection networks.

...read moreread less

Abstract: From the Publisher: Addresses the challenges and details the basic underlying concepts of interconnection networks. The book's engineering approach considers the issues that designers need to deal with and presents a broad set of practical solutions. Considerable effort is made to establish new and more.

...read moreread less

2,021 citations

"Area and power efficient network on..." refers background in this paper

...circuit switching, packet switching, and wormhole switching [8]....
[...]

Proceedings Article•DOI•

A generic architecture for on-chip packet-switched interconnections

[...]

Pierre Guerrier¹, Alain Greiner¹•Institutions (1)

Pierre-and-Marie-Curie University¹

01 Jan 2000

TL;DR: This paper presents an architectural study of a scalable system-level interconnection template, and discusses the necessity and the ways to provide high-level services on top of the bare network packet protocol, such as dataflow and address-space communication services.

...read moreread less

Abstract: This paper presents an architectural study of a scalable system-level interconnection template. We explain why the shared bus, which is today's dominant template, will not meet the performance requirements of tomorrow's systems. We present an alternative interconnection in the form of switching networks. This technology originates in parallel computing, but is also well suited for heterogeneous communication between embedded processors and addresses many of the deep submicron integration issues. We discuss the necessity and the ways to provide high-level services on top of the bare network packet protocol, such as dataflow and address-space communication services. Eventually we present our first results on the cost/performance assessment of an integrated switching network.

...read moreread less

995 citations

"Area and power efficient network on..." refers methods in this paper

...Here are most popular ones: SPIN [3], CLICHÉ [4], TORUS [5], OCTAGON [6], and BFT [7]....
[...]

Journal Article•DOI•

An interconnect architecture for networking systems on chips

[...]

Faraydon Karim¹, Anh Nguyen¹, Sujit Dey²•Institutions (2)

STMicroelectronics¹, University of California, Berkeley²

01 Sep 2002-IEEE Micro

TL;DR: The octagon on-chip communication architecture, with its cost, performance, and scalability advantages, supports network processor SOCs of next-generation internet routers.

...read moreread less

Abstract: Network processor systems on chips meet the speed and flexibility requirements of next-generation internet routers. The octagon on-chip communication architecture, with its cost, performance, and scalability advantages, supports these network processor SOCs.

...read moreread less

293 citations

"Area and power efficient network on..." refers methods in this paper

...Here are most popular ones: SPIN [3], CLICHÉ [4], TORUS [5], OCTAGON [6], and BFT [7]....
[...]

Proceedings Article•DOI•

Silicon-photonic clos networks for global on-chip communication

[...]

Ajay Joshi¹, Christopher Batten¹, Yong-Jin Kwon², Scott Beamer², Imran Shamim¹, Krste Asanovic², Vladimir Stojanovic¹ - Show less +3 more•Institutions (2)

Massachusetts Institute of Technology¹, University of California, Berkeley²

10 May 2009

TL;DR: In this paper, the authors explore using photonics to implement low-diameter non-blocking crossbar and Clos networks, and show that a 64-tile photonic Clos network consumes significantly less optical power, thermal tuning power, and area compared to global photonic crossbars over a range of photonic device parameters.

...read moreread less

Abstract: Future manycore processors will require energy-efficient, high-throughput on-chip networks. Silicon-photonics is a promising new interconnect technology which offers lower power, higher bandwidth density, and shorter latencies than electrical interconnects. In this paper we explore using photonics to implement low-diameter non-blocking crossbar and Clos networks. We use analytical modeling to show that a 64-tile photonic Clos network consumes significantly less optical power, thermal tuning power, and area compared to global photonic crossbars over a range of photonic device parameters. Compared to various electrical on-chip networks, our simulation results indicate that a photonic Clos network can provide more uniform latency and throughput across a range of traffic patterns while consuming less power. These properties will help simplify parallel programming by allowing the programmer to ignore network topology during optimization.

...read moreread less

290 citations

Proceedings Article•DOI•

Design of a switch for network on chip applications

[...]

Partha Pratim Pande¹, C. Grecu¹, Andre Ivanov¹, Resve A. Saleh¹•Institutions (1)

University of British Columbia¹

25 May 2003

TL;DR: A switch-based network-centric architecture to interconnect IP blocks is proposed with a butterfly fat tree architecture as an overall interconnect template and wormhole routing is adopted to reduce overall latency and hardware overhead.

...read moreread less

Abstract: System on Chip (SoC) design in the forthcoming billion transistor era will involve the integration of numerous heterogeneous semiconductor intellectual property (IP) blocks Some of the main problems in the ultra deep sub micron technologies characterized by gate lengths in the range of 50-100 nm arise from non-scalable global wire delays, failure to achieve global synchronization, errors due to signal integrity issues, and difficulties associated with non-scalable bus-based functional interconnect These problems are addressed in this paper by introducing a new design methodology A switch-based network-centric architecture to interconnect IP blocks is proposed We introduce a butterfly fat tree architecture as an overall interconnect template In this new interconnect architecture, switches are used to transfer data between IP blocks To reduce overall latency and hardware overhead, wormhole routing is adopted The proposed switch architecture supports this routing method Initial implementation of the switch reveals that the total switch area is expected to amount to less than 2% of a large SoC

...read moreread less

223 citations