Topic

Control reconfiguration

About: Control reconfiguration is a research topic. Over the lifetime, 22423 publications have been published within this topic receiving 334217 citations.


Papers
Journal ArticleDOI
Yunxuan Yu, Chen Wu, Tiandong Zhao, Kun Wang, Lei He
TL;DR: A domain-specific FPGA overlay processor, named OPU, is proposed to accelerate CNN networks. It offers software-like programmability for CNN end users: CNN algorithms are automatically compiled into executable code that is loaded and executed by OPU, with no FPGA reconfiguration needed to switch or update CNN networks.
Abstract: Field-programmable gate arrays (FPGAs) provide rich parallel computing resources with high energy efficiency, making them ideal for deep convolutional neural network (CNN) acceleration. In recent years, automatic compilers have been developed to generate network-specific FPGA accelerators. However, as more cascaded deep CNN algorithms are adopted for various complicated tasks, reconfiguration of FPGA devices at runtime becomes unavoidable when network-specific accelerators are employed. Such reconfiguration can be difficult for edge devices. Moreover, a network-specific accelerator requires regeneration of RTL code and physical implementation whenever the network is updated, which is not easy for CNN end users. In this article, we propose a domain-specific FPGA overlay processor, named OPU, to accelerate CNN networks. It offers software-like programmability for CNN end users: CNN algorithms are automatically compiled into executable code that is loaded and executed by OPU without reconfiguring the FPGA to switch or update CNN networks. OPU instructions have complicated functions with variable runtimes but a uniform length. The instruction granularity is optimized to provide good performance and sufficient flexibility while reducing the complexity of developing the microarchitecture and compiler. Experiments show that OPU achieves an average of 91% runtime multiply-accumulate unit (MAC) efficiency (RME) across nine different networks. Moreover, for VGG and YOLO networks, OPU outperforms automatically compiled network-specific accelerators in the literature. In addition, OPU shows 5.35× better power efficiency than a Titan Xp. For a real-time cascaded CNN scenario, OPU is 2.9× faster than the edge-computing GPU Jetson TX2, which has a similar amount of computing resources.

74 citations

Journal ArticleDOI
01 Jan 1996
TL;DR: The Run-Time Reconfiguration Artificial Neural Network (RRANN), a proof-of-concept system that demonstrates the effectiveness of RTR for implementing neural networks, is tested and shown to increase the functional density of a network by up to 500% compared to FPGA-based implementations that do not use RTR.
Abstract: One way to further exploit the reconfigurable resources of SRAM FPGAs and increase functional density is to reconfigure them during system operation. This process is referred to as Run-Time Reconfiguration (RTR). RTR is an approach to system implementation that divides an application or algorithm into time-exclusive operations that are implemented as separate configurations. The Run-Time Reconfiguration Artificial Neural Network (RRANN) is a proof-of-concept system that demonstrates the effectiveness of RTR for implementing neural networks. It implements the popular backpropagation training algorithm as three distinct time-exclusive FPGA configurations: feed-forward, backpropagation, and update. System operation consists of sequencing through these three configurations at run-time, one configuration at a time. RRANN has been fully implemented with Xilinx FPGAs, tested, and shown to increase the functional density of a network by up to 500% compared to FPGA-based implementations that do not use RTR.
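
The phase-sequencing idea in the abstract above can be sketched in software. This is an illustrative control-flow sketch only; the `Phase` class, the toy update rules, and the learning rate are invented for illustration and are not taken from RRANN:

```python
# Illustrative sketch (invented toy, not the RRANN hardware): RTR splits an
# algorithm into time-exclusive phases; each Phase below stands in for one
# FPGA configuration, and only one is "loaded" (running) at a time.

class Phase:
    """One time-exclusive configuration: a name plus the work it performs."""
    def __init__(self, name, fn):
        self.name = name
        self.fn = fn

def run_rtr(phases, state, epochs):
    """Sequence through the configurations at run time, one at a time."""
    for _ in range(epochs):
        for phase in phases:
            # On real hardware this switch is the (slow) FPGA
            # reconfiguration; here it is just a function call.
            state = phase.fn(state)
    return state

# Toy scalar "backpropagation training" split into RRANN's three phases.
phases = [
    Phase("feed-forward",  lambda s: {**s, "out": s["w"] * s["x"]}),
    Phase("backpropagate", lambda s: {**s, "grad": (s["out"] - s["y"]) * s["x"]}),
    Phase("update",        lambda s: {**s, "w": s["w"] - 0.1 * s["grad"]}),
]

state = run_rtr(phases, {"w": 0.0, "x": 1.0, "y": 2.0}, epochs=20)
print(state["w"])  # converges toward the target y / x = 2.0
```

The point of the sketch is that only one "configuration" is active at any moment, mirroring how RTR time-multiplexes a single FPGA across the feed-forward, backpropagation, and update stages.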

74 citations

Journal ArticleDOI
TL;DR: A new interpretation of control signals as a composite of the residual and reference signals is revealed, which leads to the development of two kinds of schemes: extracting residual signals from an existing control loop and configuring control loops with an integrated residual access.
Abstract: Driven by the increasing needs for the integration of model-based fault diagnosis into the electronic control units (ECUs) with limited computation capacity and motivated by the recent study on the fault tolerant controller architecture, we investigate feedback controller structures aiming at accessing the residuals embedded in the control loops. For this purpose, we first develop an observer-based realization of the Youla parameterization. This result reveals a new interpretation of control signals as a composite of the residual and reference signals. From this viewpoint, different control schemes are studied and useful relationships between the controller structures and embedded residual signals are established. It leads to the development of two kinds of schemes: 1) extracting residual signals from an existing control loop and 2) configuring control loops with an integrated residual access. The achieved results are demonstrated by two examples of the feedback control loops in engine management systems.
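
The structural point of the abstract above, that an observer-based feedback loop already contains a residual generator, can be sketched numerically. All matrices and gains below are invented toy values, not from the paper, and the sketch omits the Youla parameter for brevity:

```python
import numpy as np

# Toy numerical sketch (assumed example): a Luenberger observer inside a
# state-feedback loop generates a residual r = y - C @ xhat as a byproduct,
# so the control signal can be read as a composite of reference and
# residual information, with no separate diagnosis block needed.

A = np.array([[1.0, 0.1], [0.0, 1.0]])  # toy plant dynamics
B = np.array([[0.0], [0.1]])
C = np.array([[1.0, 0.0]])
F = np.array([[2.0, 3.0]])              # state-feedback gain (chosen stabilizing)
L = np.array([[0.5], [0.5]])            # observer gain (chosen stabilizing)

def step(x, xhat, ref, fault=0.0):
    y = C @ x + fault                   # measured output (a fault enters additively)
    r = y - C @ xhat                    # embedded residual = observer innovation
    u = -F @ xhat + ref                 # control signal built from xhat (hence from r)
    x_next = A @ x + B @ u
    xhat_next = A @ xhat + B @ u + L @ r  # observer correction driven by the residual
    return x_next, xhat_next, r

x, xhat = np.array([[1.0], [0.0]]), np.zeros((2, 1))
ref = np.array([[0.0]])
for _ in range(50):
    x, xhat, r = step(x, xhat, ref)
print(float(r[0, 0]))  # fault-free: the residual decays toward zero
```

A sensor fault injected via the `fault` argument would show up directly in `r`, which is the sense in which the residual can be "extracted from an existing control loop".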

74 citations

Proceedings ArticleDOI
04 Mar 1998
TL;DR: An architecture-based approach to runtime software reconfiguration is presented, highlighting the role of architectural styles and software connectors in facilitating runtime change.
Abstract: Society's increasing dependence on software-intensive systems is driving the need for dependable, robust, continuously available systems. Runtime system reconfiguration is one aspect of achieving continuous availability. We present an architecture-based approach to runtime software reconfiguration, highlighting the role of architectural styles and software connectors in facilitating runtime change. Finally, we describe the implementation of our tool suite, called ArchStudio, that supports runtime reconfiguration using our architecture-based approach.

73 citations

Proceedings ArticleDOI
17 Nov 2019
TL;DR: This work proposes PruneTrain, a cost-efficient mechanism that gradually reduces the training cost during training by using a structured group-lasso regularization approach that drives the training optimization toward both high accuracy and small weight values.
Abstract: State-of-the-art convolutional neural networks (CNNs) used in vision applications have large models with numerous weights. Training these models is very compute- and memory-resource intensive. Much research has been done on pruning or compressing these models to reduce the cost of inference, but little work has addressed the costs of training. We focus precisely on accelerating training. We propose PruneTrain, a cost-efficient mechanism that gradually reduces the training cost during training. PruneTrain uses a structured group-lasso regularization approach that drives the training optimization toward both high accuracy and small weight values. Small weights can then be periodically removed by reconfiguring the network model to a smaller one. By using a structured-pruning approach and additional reconfiguration techniques we introduce, the pruned model can still be efficiently processed on a GPU accelerator. Overall, PruneTrain achieves a reduction of 39% in the end-to-end training time of ResNet50 for ImageNet by reducing computation cost by 40% in FLOPs, memory accesses by 37% for memory bandwidth bound layers, and the inter-accelerator communication by 55%.
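
The group-lasso-plus-reconfiguration recipe described above can be illustrated with a toy NumPy sketch. Everything here (sizes, the proximal update, the stand-in gradient, the pruning interval) is an invented minimal example, not PruneTrain's actual GPU implementation:

```python
import numpy as np

# Toy sketch (assumed example, not PruneTrain itself): group-lasso
# regularization shrinks each output channel's whole weight group toward
# zero; channels that reach zero are periodically pruned, physically
# shrinking the model during training.

W = np.ones((8, 16))      # 8 output channels, 16 weights per channel
W[4:] *= 0.05             # four channels start small: candidates for pruning
lam, lr = 0.05, 0.1       # group-lasso weight and learning rate

for step in range(1, 501):
    grad_data = 0.01 * W                  # stand-in for the task-loss gradient
    W = W - lr * grad_data
    # Proximal step for lam * sum_g ||W_g||_2: shrinks each channel's
    # group norm and clips it at exactly zero.
    norms = np.linalg.norm(W, axis=1, keepdims=True)
    W = W * np.maximum(0.0, 1.0 - lr * lam / np.maximum(norms, 1e-12))
    if step % 100 == 0:                   # periodic structured pruning / reconfiguration
        W = W[np.linalg.norm(W, axis=1) > 0.0]

print(W.shape)            # the four small channels have been pruned away
```

Because pruning removes whole rows (channels) rather than scattered individual weights, the surviving matrix stays dense, which is what lets a structurally pruned model remain efficient on a GPU.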

73 citations


Network Information
Related Topics (5)
Control theory
299.6K papers, 3.1M citations
85% related
Software
130.5K papers, 2M citations
85% related
Wireless sensor network
142K papers, 2.4M citations
84% related
Network packet
159.7K papers, 2.2M citations
83% related
Optimization problem
96.4K papers, 2.1M citations
83% related
Performance Metrics
No. of papers in the topic in previous years

Year	Papers
2023	784
2022	1,765
2021	778
2020	958
2019	976
2018	1,060