Horácio C. Neto

Other affiliations: INESC-ID, University of Lisbon, Massachusetts Institute of Technology

Bio: Horácio C. Neto is an academic researcher from Instituto Superior Técnico. The author has contributed to research on topics including field-programmable gate arrays and reconfigurable computing. The author has an h-index of 16 and has co-authored 101 publications receiving 981 citations. Previous affiliations of Horácio C. Neto include INESC-ID and University of Lisbon.

Papers published on a yearly basis (chart omitted; years shown: 1990, 1992, 1994, 1996, 1998, 1999, and 2001 to 2023)

Papers

Proceedings Article

Macro-based hardware compilation of Java™ bytecodes into a dynamic reconfigurable computing system

João M. P. Cardoso, Horácio C. Neto

21 Apr 1999

TL;DR: This paper presents a new approach to synthesizing user-specified regions of a program to reconfigurable hardware (HW), under the assumption of "virtual HW" support. The HW compiler exploits temporal partitions at the behavior level, resolves memory access conflicts, and generates register-transfer-level VHDL descriptions that will be mapped onto the reconfigurable HW devices.

Abstract: This paper presents a new approach to synthesize to reconfigurable hardware (HW) user-specified regions of a program, under the assumption of "virtual HW" support. The automation of this approach is supported by a compiler front-end and by an HW compiler under development. The front-end starts from the Java bytecodes and, therefore, supports any language that can be compiled to the JVM (Java Virtual Machine) model. It extracts from the bytecodes all the dependencies inside and between basic blocks. This information is stored in representation graphs more suitable to efficiently exploit the existent parallelism in the program than those typically used in high-level synthesis. From the intermediate representations the HW compiler exploits the temporal partitions at the behavior level, resolves memory access conflicts, and generates the VHDL descriptions at register-transfer level that will be mapped into the reconfigurable HW devices.

96 citations

Proceedings Article

Trends of CPU, GPU and FPGA for high-performance computing

Mário P. Véstias¹, Horácio C. Neto²

INESC-ID¹, Instituto Superior Técnico²

20 Oct 2014

TL;DR: In this article, the authors compare the trends of CPU, GPU, and FPGA architectures for high-performance computing and survey these platforms in the execution of algorithms from different scientific application domains, showing that the gap between FPGAs and both GPUs and many-core CPUs is widening, which moves FPGAs away from high-performance computing with intensive floating-point calculations.

Abstract: Floating-point computing with more than one TFLOP of peak performance is already a reality in recent Field-Programmable Gate Arrays (FPGA). General-Purpose Graphics Processing Units (GPGPU) and recent many-core CPUs have also taken advantage of the recent technological innovations in integrated circuit (IC) design and have also dramatically improved their peak performances. In this paper, we compare the trends of these computing architectures for high-performance computing and survey these platforms in the execution of algorithms belonging to different scientific application domains. Trends in peak performance, power consumption and sustained performances, for particular applications, show that FPGAs are increasing the gap to GPUs and many-core CPUs, moving them away from high-performance computing with intensive floating-point calculations. FPGAs become competitive for custom floating-point or fixed-point representations, for smaller input sizes of certain algorithms, for combinational logic problems and parallel map-reduce problems.

77 citations

Proceedings Article

Assignment and reordering of incompletely specified pattern sequences targetting minimum power dissipation

Paulo Flores, José Carlos Costa, Horácio C. Neto, José Monteiro, Joao Marques-Silva

10 Jan 1999

TL;DR: This paper develops an optimization model and describes an efficient algorithm for reordering pattern sequences in the presence of don't cares. Preliminary experimental results confirm that the resulting power savings due to pattern sequence reordering using don't cares can be significant.

Abstract: For a significant number of electronic systems used in safety-critical applications, circuit testing is performed periodically. For these systems, power dissipation due to Built-in Self Test (BIST) can represent a significant percentage of the overall power dissipation. One approach to minimize power consumption in these systems consists of test pattern sequence reordering. Moreover, a key observation is that test patterns are in general expected to exhibit don't cares, which can naturally be exploited during test pattern sequence reordering. In this paper we develop an optimization model and describe an efficient algorithm for reordering pattern sequences in the presence of don't cares. Preliminary experimental results amply confirm that the resulting power savings due to pattern sequence reordering using don't cares can be significant.

51 citations

Journal Article

Compilation for FPGA-based reconfigurable hardware

João M. P. Cardoso¹, Horácio C. Neto²

University of the Algarve¹, Technical University of Lisbon²

01 Mar 2003-IEEE Design & Test of Computers

TL;DR: This paper provides techniques for compiling software programs into reconfigurable hardware which offer faster and more efficient performance than the complex resource-sharing approaches typical of high-level synthesis systems.

Abstract: This paper provides techniques for compiling software programs into reconfigurable hardware which offer faster and more efficient performance than the complex resource-sharing approaches typical of high-level synthesis systems. The Java-based compiler presented in this paper uses intermediate graph representations to embody parallelism at various levels.

51 citations

Book Chapter

Sorting Units for FPGA-Based Embedded Systems

Rui Marcelino¹, Horácio C. Neto², João M. P. Cardoso²

University of the Algarve¹, INESC-ID²

07 Sep 2008

TL;DR: It is shown that a hybrid between an insertion sorting unit and a merge FIFO sorting unit provides a speed-up between 1.6 and 25 compared to a quicksort software implementation.

Abstract: Sorting is an important operation for a number of embedded applications. As sorting large datasets may impose undesired performance degradation, acceleration units coupled to the embedded processor can be an interesting solution for speeding up the computations. This paper presents and evaluates three hardware sorting units, bearing in mind embedded computing systems implemented with FPGAs. The proposed architectures take advantage of specific FPGA hardware resources to increase efficiency. Experimental results show the differences in resources and performances among the three proposed sorting units and also between the sorting units and pure software implementations for sorting. We show that a hybrid between an insertion sorting unit and a merge FIFO sorting unit provides a speed-up between 1.6 and 25 compared to a quicksort software implementation.

42 citations

Cited by

Journal Article

Reconfigurable computing: a survey of systems and software

Katherine Compton¹, Scott Hauck²

Northwestern University¹, University of Washington²

01 Jun 2002-ACM Computing Surveys

TL;DR: This survey explores the hardware aspects of reconfigurable computing machines, from single-chip architectures to multi-chip systems, including internal structures and external coupling, and examines the software that targets these machines.

Abstract: Due to its potential to greatly accelerate a wide variety of applications, reconfigurable computing has become a subject of a great deal of research. Its key feature is the ability to perform computations in hardware to increase performance, while retaining much of the flexibility of a software solution. In this survey, we explore the hardware aspects of reconfigurable computing machines, from single chip architectures to multi-chip systems, including internal structures and external coupling. We also focus on the software that targets these machines, such as compilation tools that map high-level algorithms directly to the reconfigurable substrate. Finally, we consider the issues involved in run-time reconfigurable systems, which reuse the configurable hardware during program execution.

1,666 citations

Posted Content

A Survey of Neuromorphic Computing and Neural Networks in Hardware

Catherine D. Schuman, Thomas E. Potok, Robert M. Patton, J. Douglas Birdwell, Mark Edward Dean, Garrett S. Rose, James S. Plank

19 May 2017-arXiv: Neural and Evolutionary Computing

TL;DR: An exhaustive review of the research conducted in neuromorphic computing since the inception of the term is provided to motivate further work by illuminating gaps in the field where new research is needed.

Abstract: Neuromorphic computing has come to refer to a variety of brain-inspired computers, devices, and models that contrast the pervasive von Neumann computer architecture. This biologically inspired approach has created highly connected synthetic neurons and synapses that can be used to model neuroscience theories as well as solve challenging machine learning problems. The promise of the technology is to create a brain-like ability to learn and adapt, but the technical challenges are significant, starting with an accurate neuroscience model of how the brain works, to finding materials and engineering breakthroughs to build devices to support these models, to creating a programming framework so the systems can learn, to creating applications with brain-like capabilities. In this work, we provide a comprehensive survey of the research and motivations for neuromorphic computing over its history. We begin with a 35-year review of the motivations and drivers of neuromorphic computing, then look at the major research areas of the field, which we define as neuro-inspired models, algorithms and learning approaches, hardware and devices, supporting systems, and finally applications. We conclude with a broad discussion on the major research topics that need to be addressed in the coming years to see the promise of neuromorphic computing fulfilled. The goals of this work are to provide an exhaustive review of the research conducted in neuromorphic computing since the inception of the term, and to motivate further work by illuminating gaps in the field where new research is needed.

570 citations

Journal Article

The MOLEN polymorphic processor

Stamatis Vassiliadis¹, Stephan Wong¹, Georgi Gaydadjiev¹, Koen Bertels¹, Georgi Kuzmanov¹, Elena Moscu Panainte¹

Delft University of Technology¹

01 Nov 2004-IEEE Transactions on Computers

TL;DR: This paper proposes a microarchitecture based on reconfigurable hardware emulation to allow high-speed reconfiguration and execution. To prove the viability of the proposal, the authors experimented with the MPEG-2 encoder and decoder and a Xilinx Virtex II Pro FPGA.

Abstract: In this paper, we present a polymorphic processor paradigm incorporating both general-purpose and custom computing processing. The proposal incorporates an arbitrary number of programmable units, exposes the hardware to the programmers/designers, and allows them to modify and extend the processor functionality at will. To achieve the previously stated attributes, we present a new programming paradigm, a new instruction set architecture, a microcode-based microarchitecture, and a compiler methodology. The programming paradigm, in contrast with the conventional programming paradigms, allows general-purpose conventional code and hardware descriptions to coexist in a program. In our proposal, for a given instruction set architecture, a one-time instruction set extension of eight instructions is sufficient to implement the reconfigurable functionality of the processor. We propose a microarchitecture based on reconfigurable hardware emulation to allow high-speed reconfiguration and execution. To prove the viability of the proposal, we experimented with the MPEG-2 encoder and decoder and a Xilinx Virtex II Pro FPGA. We have implemented three operations, SAD, DCT, and IDCT. The overall attainable application speedup for the MPEG-2 encoder and decoder is between 2.64-3.18 and between 1.56-1.94, respectively, representing between 93 percent and 98 percent of the theoretically obtainable speedups.

436 citations

Proceedings Article

A reconfigurable design-for-debug infrastructure for SoCs

Miron Abramovici, Paul Bradley, K.N. Dwarakanath, Peter L. Levin, Gerard Memmi, Dave Miller

24 Jul 2006

TL;DR: A distributed reconfigurable fabric inserted at RTL provides a debug platform that can be configured and operated post-silicon via the JTAG port and can be repeatedly reused to configure many debug structures such as assertions checkers, transaction identifiers, triggers, and event counters.

Abstract: In this paper we present a design-for-debug (DFD) reconfigurable infrastructure for SoCs to support at-speed in-system functional debug. A distributed reconfigurable fabric inserted at RTL provides a debug platform that can be configured and operated post-silicon via the JTAG port. The platform can be repeatedly reused to configure many debug structures such as assertions checkers, transaction identifiers, triggers, and event counters.

351 citations

Journal Article

Fault-tolerant computer system design

Niraj K. Jha¹

Princeton University¹

24 Jan 1996-IEEE Parallel & Distributed Technology: Systems & Applications

TL;DR: This entry covers the book Fault-Tolerant Computer System Design by Dhiraj K. Pradhan (Prentice Hall, 1996).

Abstract: Fault-Tolerant Computer System Design, by Dhiraj K. Pradhan. 550 pp., $72. Prentice Hall, Upper Saddle River, N.J., 1996. ISBN 0-13-057887-8.

222 citations
