scispace - formally typeset
Search or ask a question

Showing papers by "Ulrich Rückert published in 2007"


Proceedings ArticleDOI
26 Mar 2007
TL;DR: This paper presents an architecture that is composed of multiple FPGAs that are connected to an embedded processor that is referred to as a multi-FPGA clustered architecture (MFCA), which can be partially and dynamically reconfigured to integrate user-defined IP-cores into the system at run-time.
Abstract: Dynamically reconfigurable hardware allows for implementing systems that can be adapted at run-time according to the needs of the user. This paper presents an architecture that is composed of multiple FPGAs that are connected to an embedded processor. Thus, the architecture is referred to as a multi-FPGA clustered architecture (MFCA). All FPGAs can be partially and dynamically reconfigured to integrate user-defined IP-cores into the system at run-time. For the resource management and communication management we have implemented a Linux operating system on the embedded processor that can be used to control the reconfiguration of the FPGAs by means of simple function calls. Furthermore, the Linux OS completely hides the physical infrastructure of the MFCA from user applications, offering a consistent interface to utilize partial reconfiguration.

40 citations


Proceedings ArticleDOI
29 Aug 2007
TL;DR: The GigaNoC is presented, a hierarchical Network-on-Chip that is especially suitable for scalable Chip-Multiprocessor architectures and features a packet-switched wormhole routing on-chip network that provides the backbone of the authors' multiprocesser architecture.
Abstract: Due to the technological progress in the semiconductor industry, more and more components can be integrated on a single die forming a complex System-on-Chip. For enabling an efficient interaction between the various building blocks of today's SoCs, efficient communication structures become more and more essential. In this paper, we present the GigaNoC, a hierarchical Network-on-Chip that is especially suitable for scalable Chip-Multiprocessor architectures. The GigaNoC approach features a packet-switched wormhole routing on-chip network that provides the backbone of our multiprocessor architecture. In order to meet bandwidth requirements of different application domains, our Network-on-Chip is easily scalable and parameterizable in various aspects. This work highlights the communication protocol and shows a performance evaluation for different congestion scenarios. Furthermore, we present an FPGA-based prototypical realization and introduce a debugging and verification environment. Finally, implementation results for a standard cell technology are discussed.

33 citations


Journal ArticleDOI
TL;DR: To achieve robuster neural network architectures fundamental methods are introduced to identify sensitive parameters and neurons and upper bounds on the mean square error under noise contaminated parameters and inputs are determined if the network parameters are constrained.

27 citations


01 Jan 2007
TL;DR: A stereoscopic vision system for the mini-robot Khepera enhances the robot’s visual perception ability by grabbing stereo images and analysis 3D objects, while the robot doesn’t need to move.
Abstract: This paper presents a stereoscopic vision system for the mini-robot Khepera The vision system performs objects detection by using the stereo disparity and stereo correspondence The stereoscopic vision system enhances robot’s visual perception ability by grabbing stereo images and analysis 3D objects, while the robot doesn’t need to move The simple principle of our stereo vision is the less displacement of correspondence pixels shows that the pixels object is far away To realize the stereo vision and its calculation algorithms, the mini robot needs a powerful FPGA and micro-controller module as well as 2D color cameras An application of Khepera equipped with the stereoscopic camera is robot soccer in the KheperaSot league In the match, the robot has to be able to detect its environment, ie the ball, walls, goals and its opponent

21 citations


Journal ArticleDOI
TL;DR: This article uses an FPGA-based rapid prototyping system to verify the functionality of the scalable GigaNetIC chip multiprocessor architecture before fabricating the ASIC in a modern CMOS standard cell technology.

16 citations


01 Jan 2007
TL;DR: A unified hardware-software approach to design a reconfigurable multiprocessor system called QuadroCore is presented and first results indicate suitability especially in audio and video processing applications.
Abstract: Multiprocessors enable parallel execution of a single large application to achieve a performance improvement. An application is split at instruction, data or task level (based on the granularity), such that the overhead of partitioning is minimal. Parallelization for multiprocessors is mostly restricted to a fixed granularity. Reconfiguration enables architectural variations to allow multiple granularities of operation within a multiprocessor. This adaptability optimizes resource utilization over a fixed organization. Here, a unified hardware-software approach to design a reconfigurable multiprocessor system called QuadroCore is presented. In our holistic methodology, compiler-driven reconfiguration selects from a fixed set of modes. Each mode relies on matching program analysis to exploit the architecture efficiently. For instance, a multiprocessor may adapt to different parallelization paradigms. The compiler can determine the best execution mode for each piece of code by analyzing the parallelism in a program. A fast, singlecycle, run-time reconfiguration between these predetermined modes is enabled by executing special instructions which switch coarse-grained components like instruction decoders, ALUs and register banks. Performance is evaluated in terms of execution cycles and achieved clock frequency. First results indicate suitability especially in audio and video processing applications.

10 citations


Book ChapterDOI
01 Jan 2007
TL;DR: In this paper, the authors propose a mechanism for task relocation that includes saving and restoring of state information of the task, and present defragmentation algorithms that minimize different types of costs.
Abstract: Dynamic reconfiguration is a promising approach for resource efficient utilization of microelectronic systems. Standard platforms for partial dynamic reconfiguration are field-programmable gate arrays (FPGAs). Multiple hardware tasks can share the same FPGA resources over time, which increases the device utilization in comparison to non-reconfigurable systems. Although, similar resource management is already known in the area of operating systems, there is a requirement to adapt these concepts to the special needs of dynamically reconfigurable systems. Additionally, there is a lack of underlying mechanisms, e.g., to suspend hardware tasks and restart them at a different position within the FPGA. In this article we introduce a mechanism for task relocation that includes saving and restoring of state information of the task. Based on this approach we address the problem of defragmentation. We present defragmentation algorithms that minimize different types of costs. With the help of a detailed simulation model and a benchmark, we finally provide realistic simulation results and compare the different algorithms.

8 citations


Proceedings Article
01 Jan 2007
TL;DR: By calculating the distance of surrounding patches, the implemented algorithm will minimize the distance traveled, in turn less energy and time, and the advantage of using the Teleworkbench for performing experiments using real robots is shown.
Abstract: This paper presents some results in the implementation of a local navigation strategy for environment exploration using real robot. As an experiment platform we used mini-robot Khepera II running in the Teleworkbench. The complete environment is divided into small quadratic patches with some objects placed in it. With on-board infrared sensors and wheel encoder, the robot can successfully explore the unknown environment. Moreover, by calculating the distance of surrounding patches, the implemented algorithm will minimize the distance traveled, in turn less energy and time. This paper also shows the advantage of using the Teleworkbench for performing experiments using real robots.

5 citations


Book ChapterDOI
12 Mar 2007
TL;DR: This paper presents an advanced multiprocessor cache architecture for chip multip rocessors (CMPs) designed for the scalable GigaNetIC CMP, which is based on massively parallel on-chip computing clusters.
Abstract: In this paper, we present an advanced multiprocessor cache architecture for chip multiprocessors (CMPs) It is designed for the scalable GigaNetIC CMP, which is based on massively parallel on-chip computing clusters Our write-through multiprocessor cache is configurable in respect to the most relevant design options It is supposed to be used in universal co-processors as well as in network processing units For an early verification of the software and an early exploration of various hardware configurations, we have developed a SystemC-based simulation model for the complete chip multiprocessor For detailed hardware-software co-verification, we use our FPGA-based rapid prototyping system RAPTOR2000 to emulate our architecture with near-ASIC performance Finally, we demonstrate the performance gains for different application scenarios enabled by the usage of our multiprocessor cache

3 citations


Proceedings ArticleDOI
17 Sep 2007
TL;DR: This publication presents a digital framework for building up pulse coded neural networks with leaky integrate-and-fire neurons and static synapses as well as dynamic synapses, mainly focused on ASIC synthesis but also shows a small footprint on Virtex2(Pro) FPGAs.
Abstract: This publication presents a digital framework for building up pulse coded neural networks with leaky integrate-and-fire neurons and static synapses as well as dynamic synapses. The system, including a novel communication infrastructure, is mainly focused on ASIC synthesis but also shows a small footprint on Virtex2(Pro) FPGAs. Its bit-serial operation has been verified by simulations.

3 citations


Proceedings Article
01 Jan 2007
TL;DR: This paper introduces the Real-time Multiprocessor SoC intended for low power wireless application as mobile ad hoc networks and is based on eight of the S-CORE RISC processors.
Abstract: This paper introduces our Real-time Multiprocessor SoC intended for low power wireless application as mobile ad hoc networks. The multiprocessor is based on eight of our 32bit S-CORE RISC processors.

Proceedings Article
01 Jan 2007
TL;DR: A Bluetooth scatternet using Bluetooth communication sticks developed in the research group is presented, using bridge nodes carrying two of such Bluetooth sticks to interconnect piconets.
Abstract: Summary. Radio-based communication plays a vital role in multi-robot systems. Bluetooth is an energy-ecient communication technology suited for resourcelimited mini-robots such as the Khepera. However, the maximum number of nodes in a Bluetooth piconet is limited, while scatternets - networks of piconets - have not been fully specified. In this paper we present a Bluetooth scatternet using Bluetooth communication sticks developed in our research group. In our solution, bridge nodes carrying two of such Bluetooth sticks are used to interconnect piconets. Beside the developed hardware, issues such as routing as well as topology control are addressed. Finally, data rate and latency measurements are presented for the implemented solution.

Book ChapterDOI
20 Jun 2007
TL;DR: A brief overview about actual trends and problems in the semiconductor industry and how the upcoming tasks can be solved by the designers and researchers is given.
Abstract: Extremely down-scaled field effect transistor, innovative manufacturing of semiconductors, novel material and computing devices have led to rapid changes in the semiconductor industry which now allows more complex systems and more computing power per chip area than several years ago Albeit these significant improvements novel technology nodes also offer unsolved problems to researchers and challenges to the designers In this paper, we give a brief overview about actual trends and problems in the semiconductor industry and how the upcoming tasks can be solved by the designers and researchers


Proceedings Article
01 Jan 2007
TL;DR: In this work, a measuring technique is proposed which can control the model complexity and is based on the correlation coefficient between two basis functions and can be integrated in the RBF training procedure.
Abstract: Using radial basis function networks for function approxima- tion tasks suffers from unavailable knowledge about an adequate network size. In this work, a measuring technique is proposed which can control the model complexity and is based on the correlation coefficient between two basis functions. Simulation results show good performance and, therefore, this technique can be integrated in the RBF training procedure.

Proceedings Article
01 Jan 2007
TL;DR: A teleoperated robotic-laboratory named Teleworkbench is presented and its benefit for educational purpose and three issues that need to take into consideration to make such a laboratory more effective are presented.
Abstract: This paper presents a teleoperated robotic-laboratory named Teleworkbench and its benefit for educational purpose. It also presents briefly three issues that we need to take into consideration to make such a laboratory more effective.

Proceedings ArticleDOI
28 Mar 2007
TL;DR: The intensive measurements made in an indoor environment show that this test-bed can closely approach the theoretical prediction on SNR gain and AF (amount of fading) made in [4] and [9] previously.
Abstract: In this paper a new diversity selection combining scheme called SSB (simplified switched beam) is given, as well as its test-bed implementation. SSB has apparent advantage in terms of size, power and complexity since it only requires one RF front-end and very few additional components. Furthermore, an interference suppression technique is designed, which can effectively mitigate the disturbance on correct diversity selection caused by interference. The intensive measurements made in an indoor environment show that this test-bed can closely approach the theoretical prediction on SNR gain and AF (amount of fading) made in [4] and [9] previously.

Book ChapterDOI
20 Jun 2007
TL;DR: This publication covers the properties of IAF neurons from calculations on an ideal electrical circuit modeling the soma of an IAF neuron and compares the theoretical results with simulation results from an extracted layout of the implemented neuron.
Abstract: In this paper, the implementation results of an integrate and fire neuron implemented in a 130 nm process are presented. This publication covers the properties of IAF neurons from calculations on an ideal electrical circuit modeling the soma of an IAF neuron and compares the theoretical results with simulation results from an extracted layout of the implemented neuron.

Book ChapterDOI
09 Sep 2007
TL;DR: The impact of shrinking device sizes on the activation function of neurons is investigated with respect to area demands, power consumption and the maximum resolution in their information processing.
Abstract: Artificial neural networks are able to solve a great variety of different applications, e.g. classification or approximation tasks. To utilize their advantages in technical systems various hardware realizations do exist. In this work, the impact of shrinking device sizes on the activation function of neurons is investigated with respect to area demands, power consumption and the maximum resolution in their information processing. Furthermore, analog and digital implementations are compared in emerging silicon technologies beyond 100 nm feature size.

Book ChapterDOI
09 Sep 2007
TL;DR: The aim of this work is to provide a neuron model that is capable of describing the main features of biological neurons such as maintaining an equilibrium potential using the NaK-ATPase and the generation of action potentials as well as to provide an estimation of the energy consumption of a single cell in a) quiescent mode or equilibrium state.
Abstract: In this paper we present a neuron model based on the description of biophysical mechanisms combined with a regulatory mechanism from control theory. The aim of this work is to provide a neuron model that is capable of describing the main features of biological neurons such as maintaining an equilibrium potential using the NaK-ATPase and the generation of action potentials as well as to provide an estimation of the energy consumption of a single cell in a) quiescent mode (or equilibrium state) and b) firing state, when excited by other neurons. The same mechanism has also been used to model the synaptic excitation used in the simulated system.

Proceedings Article
01 Jan 2007

Proceedings ArticleDOI
14 May 2007
TL;DR: Intensive measurements show that SSB can help to suppress the effect of fading, enhance the SNR and finally result in the reduced BER.
Abstract: SSB (Simplified Switched Beam) was originally motivated by a simple beamforming technique. It is expected to increase the performance of Mobile Ad Hoc Networks (MANETs) by associating with the dedicated MAC protocol [12]. However, we further find that it is a variation of the antenna diversity scheme, which is called GSC in [6] and H-S/MRC in [11]. SSB only requires one RF front-end and a simple additional processing unit, hence it has apparent advantage in terms of size, power, expense and complexity. Its 2.4 GHz test-bed for reception has been implemented recently. Intensive measurements show that SSB can help to suppress the effect of fading, enhance the SNR and finally result in the reduced BER.