Showing papers in &quot;Microprocessors and Microsystems in 2011&quot;

A novel discrete particle swarm optimization algorithm for meta-task assignment in heterogeneous computing systems

TL;DR: This paper presents a simple and efficient multiplier with the possibility to achieve an arbitrary accuracy through an iterative procedure, prior to achieving the exact result.

...read moreread less

71 citations

Journal Article•DOI•

[...]

Qinma Kang¹, Hong He²•Institutions (2)

Tongji University¹, Shandong University²

Open-hardware e-puck Linux extension board for experimental swarm robotics research

TL;DR: To make particle swarm optimization algorithm more suitable for solving task assignment problems, particles are represented as integer vectors and a new position update method is developed based on discrete domain.

...read moreread less

68 citations

Journal Article•DOI•

[...]

Wenguo Liu¹, Alan F. T. Winfield¹•Institutions (1)

University of the West of England¹

Low-error digital hardware implementation of artificial neuron activation functions and their derivative

TL;DR: The extended e-puck robot platform requires minimal effort to integrate the well-known open-source robot control framework Player and provides a powerful and flexible platform for experimental swarm robotics research.

...read moreread less

62 citations

Journal Article•DOI•

[...]

Antonio Armato, Luca Fanucci, Enzo Pasquale Scilingo, Danilo De Rossi

01 Aug 2011-Microprocessors and Microsystems

TL;DR: A low-error approximation of the sigmoid function and hyperbolic tangent, which are mainly used to activate the artificial neuron, are proposed based on the piecewise linear method, showing better results than the state-of-the-art.

...read moreread less

53 citations

Journal Article•DOI•

Scalable network-on-chip architecture for configurable neural networks

[...]

Dmitri Vainbrand¹, Ran Ginosar¹•Institutions (1)

Technion – Israel Institute of Technology¹

Design and implementation of an operating system for composable processor sharing

TL;DR: It is shown that multicast mesh NoC provides the highest performance/cost ratio and consequently it is the most suitable interconnect architecture for configurable neural network implementation.

...read moreread less

50 citations

Journal Article•DOI•

[...]

Andreas Hansson¹, Marcus Ekerhult², Anca Molnos³, Aleksandar Milutinovic¹, Andrew Nelson³, Jude Angelo Ambrose³, Kees Goossens⁴ - Show less +3 more•Institutions (4)

University of Twente¹, Lund University², Delft University of Technology³, Eindhoven University of Technology⁴

Design of a real time automatic speech recognition system using Modified One Against All SVM classifier

TL;DR: This work presents the design and implementation of CompOSe, a light-weight (only 1500 lines of code) composable operating system for MPSoCs, and experimentally demonstrates the ability to provide temporal composability, even in the presence of dynamic application behaviour and multiple use cases.

...read moreread less

46 citations

Journal Article•DOI•

[...]

J. Manikandan¹, B. Venkataramani¹•Institutions (1)

National Institute of Technology, Tiruchirappalli¹

01 Aug 2011-Microprocessors and Microsystems

TL;DR: The proposed SVM classifier is found to reduce the number of support vectors by a factor of 1.73 when applied to speaker identification and isolated letter recognition problems and can be adapted for various other SVM based pattern recognition systems.

...read moreread less

36 citations

Journal Article•DOI•

A study of 3D Network-on-Chip design for data parallel H.264 coding

[...]

Thomas Canhao Xu¹, Alexander Wei Yin¹, Pasi Liljeberg¹, Hannu Tenhunen¹•Institutions (1)

Information Technology University¹

01 Oct 2011-Microprocessors and Microsystems

TL;DR: It is shown in this study that the inter-thread data dependency of shared reads and writes are performance bottlenecks and provides a guideline to design efficient 3D NoCs for data parallel H.264 coding applications.

...read moreread less

36 citations

Journal Article•DOI•

Energy consumption and execution time estimation of embedded system applications

[...]

Gustavo Callou¹, Paulo Maciel¹, Eduardo Tavares¹, Ermeson Andrade¹, Bruno Nogueira¹, Carlos Araújo¹, Paulo Roberto Freire Cunha¹ - Show less +3 more•Institutions (1)

Federal University of Pernambuco¹

A hybrid flash translation layer design for SLC-MLC flash memory based multibank solid state disk

TL;DR: This work proposes a mechanism for supporting design decisions on energy consumption and performance of embedded system applications and the estimates obtained are 93% close to the respective measures obtained from the real hardware platform.

...read moreread less

35 citations

Journal Article•DOI•

[...]

Jung-Wook Park¹, Seung-Ho Park¹, Charles C. Weems², Shin-Dug Kim¹•Institutions (2)

Yonsei University¹, University of Massachusetts Amherst²

On an efficient NoC multicasting scheme in support of multiple applications running on irregular sub-networks

TL;DR: Experimental results show that the proposed HFTL (hybrid flash translation layer) which makes use of chained-blocks, combining SLC NAND and MLC Nand flash memories in parallel can improve performance compared to other solid state disk configurations, composed of either SLCNAND or M LC NAND flash memory alone.

...read moreread less

30 citations

Journal Article•DOI•

[...]

Xiaohang Wang¹, Mei Yang², Yingtao Jiang², Peng Liu³•Institutions (3)

University of Nevada, Reno¹, University of Nevada, Las Vegas², Zhejiang University³

A TDM slot allocation flow based on multipath routing in NoCs

TL;DR: The experiment results show that the proposed multicast AL+RPM algorithm can consume, on average, 14% and 20% less power than bLBDR (a broadcasting-based routing algorithm) and the multiple unicast scheme, respectively and has much lower network latency than the above two approaches.

...read moreread less

Journal Article•DOI•

[...]

Radu Stefan¹, Kees Goossens•Institutions (1)

Delft University of Technology¹

Supporting OpenMP on a multi-cluster embedded MPSoC

TL;DR: This study focuses on a multi-path slot allocation method in networks with static resource reservations, in particular TDM NoCs, which provides significant overall gains in terms of increased bandwidth or reduced working frequency or area.

...read moreread less

Journal Article•DOI•

[...]

Andrea Marongiu¹, Paolo Burgio¹, Luca Benini¹•Institutions (1)

University of Bologna¹

Design and coverage-driven verification of a novel network-interface IP macrocell for network-on-chip interconnects

TL;DR: This paper considers a representative template of a modern multi-cluster embedded MPSoC and presents an extensive evaluation of the cost associated with supporting OpenMP on such a machine, investigating several implementation variants that are aware of the memory hierarchy and of the heterogeneous interconnection.

...read moreread less

Journal Article•DOI•

[...]

Sergio Saponara¹, Luca Fanucci¹, Marcello Coppola²•Institutions (2)

University of Pisa¹, STMicroelectronics²

01 Aug 2011-Microprocessors and Microsystems

TL;DR: A constrained-random coverage-driven approach is presented and customized to be applied to the novel NI as design under test (DUT), and full code and functional coverage is achieved.

...read moreread less

Journal Article•DOI•

Virtualizing network-on-chip resources in chip-multiprocessors

[...]

Francisco Triviño¹, José L. Sánchez¹, Francisco J. Alfaro¹, Jose Flich²•Institutions (2)

University of Castilla–La Mancha¹, Polytechnic University of Valencia²

Real-time fault injection using enhanced on-chip debug infrastructures

TL;DR: This proposal is based on the virtualization concept and allows us to reduce execution time and network latency in a significant percentage and improves the individual application performance when several applications are simultaneously running.

...read moreread less

Journal Article•DOI•

[...]

Andre Fidalgo¹, Manuel Gericota¹, Gustavo R. Alves¹, José M.F. Ferreira²•Institutions (2)

Instituto Superior de Engenharia do Porto¹, Faculdade de Engenharia da Universidade do Porto²

Finding the optimal tradeoff between area and delay in multiple constant multiplications

TL;DR: This paper proposes a fault injection environment and a scalable methodology to assist the execution of real-time fault injection campaigns, providing enhanced performance and capabilities in microprocessor memory elements with minimum delay and intrusiveness.

...read moreread less

Journal Article•DOI•

[...]

Levent Aksoy¹, Eduardo Costa², Paulo Flores¹, José Monteiro¹•Institutions (2)

INESC-ID¹, Universidade Católica de Pelotas²

A compact AES core with on-line error-detection for FPGA applications with modest hardware resources

TL;DR: Two approximate algorithms are introduced that aim to optimize the area of the MCM operation by taking into account the gate-level implementation of each addition and subtraction operation which realizes a constant multiplication.

...read moreread less

Journal Article•DOI•

[...]

Uros Legat¹, Anton Biasizzo¹, Franc Novak¹•Institutions (1)

Jožef Stefan Institute¹

A NoC-based hybrid message-passing/shared-memory approach to CMP design

TL;DR: This paper presents a compact, low-cost, on-line error-detection architecture for a 32-bit hardware implementation of the AES, which has been upgraded to an efficient BIST with a high fault coverage and a low hardware overhead.

...read moreread less

Journal Article•DOI•

[...]

Mario R. Casu¹, Massimo Ruo Roch¹, Sergio V. Tota², Maurizio Zamboni¹•Institutions (2)

Polytechnic University of Turin¹, University of Hertfordshire²

3D floorplanning of low-power and area-efficient Network-on-Chip architecture

TL;DR: A hybrid approach that combines shared-memory and message passing in a single general-purpose CMP architecture that allows efficient execution of applications developed with both parallel programming approaches is proposed.

...read moreread less

Journal Article•DOI•

[...]

Licheng Xue¹, Feng Shi¹, Weixing Ji¹, Haroon-Ur-Rashid Khan²•Institutions (2)

Beijing Institute of Technology¹, Pakistan Institute of Engineering and Applied Sciences²

01 Jul 2011-Microprocessors and Microsystems

TL;DR: This paper proposes three 3D floorplanning methods for a Triplet-based Hierarchical Interconnection Network (THIN) which is a new high performance NoC which is not only a feasible but also a low-power and area-efficient NoC at physical level.

...read moreread less

Journal Article•DOI•

On chip interconnects for multiprocessor turbo decoding architectures

[...]

Maurizio Martina¹, Guido Masera¹, Hazem Moussa², Amer Baghdadi²•Institutions (2)

Polytechnic University of Turin¹, École nationale supérieure des télécommunications de Bretagne²

Multi-objective efficient design space exploration and architectural synthesis of an application specific processor (ASP)

TL;DR: On chip interconnects for multiprocessor turbo decoding are investigated and experimental results show that a Network-on-Chip based decoder made of 16 processing elements can achieve a throughput of several hundreds of Mbps.

...read moreread less

Journal Article•DOI•

[...]

Anirban Sengupta¹, Reza Sedaghat¹, Zhipeng Zeng¹•Institutions (1)

Ryerson University¹

An embedded multi-core biometric identification system

TL;DR: A design methodology of a multi-objective application specific processor is proposed by integrating an efficient multi- objective exploration approach with the architecture synthesis process, useful for portable devices and many high end applications.

...read moreread less

Journal Article•DOI•

[...]

Giovanni Danese¹, M. Giachero¹, Francesco Leporati¹, Nelson Nazzicari¹•Institutions (1)

University of Pavia¹

01 Jul 2011-Microprocessors and Microsystems

TL;DR: A parallel architecture that efficiently implements the high computationally demanding core of a matching algorithm based on Band-Limited Phase Only spatial Correlation (BLPOC), performed by two concurrent computational units implemented onto a Stratix II Altera family FPGA is proposed.

...read moreread less

Journal Article•DOI•

A deadlock-free routing algorithm for dynamically reconfigurable Networks-on-Chip

[...]

Christopher L. Jackson¹, Simon J. Hollis¹•Institutions (1)

University of Bristol¹

A fuzzy predictive redundancy system for fault-tolerance of x-by-wire systems

TL;DR: A novel routing algorithm that can cope with irregular mesh topologies with Long-Range Links and adapt to run-time LRL insertion and topology reconfiguration is presented and a selection function that uses local topology data to adaptively select optimal paths is presented.

...read moreread less

Journal Article•DOI•

[...]

Man Ho Kim¹, Suk Lee¹, Kyung Chang Lee²•Institutions (2)

Pusan National University¹, Pukyong National University²

01 Jul 2011-Microprocessors and Microsystems

TL;DR: A fuzzy predictive redundancy system that can remove most erroneous faults with a fault-detection algorithm is presented that outperforms well-known average and median voters and shows that it can be an appropriate choice for fault-tolerance in the x-by-wire systems.

...read moreread less

Journal Article•DOI•

A common operator for FFT and FEC decoding

[...]

Malek Naoues, Dominique Noguet, Laurent Alaus, Yves Louet¹•Institutions (1)

Supélec¹

Wormhole cut-through switching: Flit-level messages interleaving for virtual-channelless network-on-chip

TL;DR: This paper capitalizes on the common operator technique to present new common structures for the FFT and FEC decoding algorithms to make the architecture open to future function mapping and adapted to accommodated silicon technology variability through dependable design.

...read moreread less

Journal Article•DOI•

[...]

Faizal Arya Samman¹, Thomas Hollstein, Manfred Glesner¹•Institutions (1)

Technische Universität Darmstadt¹

01 May 2011-Microprocessors and Microsystems

TL;DR: A VLSI microrchitecture of a network-on-chip (NoC) router with a wormhole cut-through switching method is presented in this paper and the concept, on-chip microarchitecture, performance characteristics and interesting transient behaviors of the proposed NoC router that uses the wormhole Cut-Through switching method are presented.

...read moreread less

Journal Article•DOI•

A low-latency modular switch for CMP systems

[...]

Antoni Roca¹, Jose Flich¹, Federico Silla¹, José Duato¹•Institutions (1)

Polytechnic University of Valencia¹

Design of a performance enhanced and power reduced dual-crossbar Network-on-Chip (NoC) architecture

TL;DR: This paper identifies the switch components that limit the switch frequency: the arbiter, and proposes new pipelined switch designs focused in reducing the switch latency.

...read moreread less

Journal Article•DOI•

[...]

Yixuan Zhang¹, Randy Morris¹, Avinash Kodi¹•Institutions (1)

Ohio University¹

System design of full HD MVC decoding on mesh-based multicore NoCs

TL;DR: The proposed DXbar can outperform current bufferless networks with deflecting and dropping protocols while consuming at most half of the power, and achieves at least 20% performance improvement in terms of throughput and latency, and at least20% power saving over buffered networks with virtual channels.

...read moreread less

Journal Article•DOI•

[...]

Ning Ma¹, Zhonghai Lu¹, Li-Rong Zheng¹•Institutions (1)

Royal Institute of Technology¹