Home
/
Authors
/
Fernando Magno Quintão Pereira

Author

Fernando Magno Quintão Pereira

Other affiliations: University of California, Los Angeles, University of Porto, National Council for Scientific and Technological Development ...read more

Bio: Fernando Magno Quintão Pereira is an academic researcher from Universidade Federal de Minas Gerais. The author has contributed to research in topics: Compiler & Register allocation. The author has an hindex of 18, co-authored 115 publications receiving 1337 citations. Previous affiliations of Fernando Magno Quintão Pereira include University of California, Los Angeles & University of Porto.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
1993

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Qubit allocation

[...]

Marcos Yukio Siraichi¹, Vinícius Fernandes dos Santos¹, Caroline Collange², Fernando Magno Quintão Pereira¹•Institutions (2)

Universidade Federal de Minas Gerais¹, University of Rennes²

24 Feb 2018

TL;DR: This paper formally introduces the qubit allocation problem and provides an exact solution to it, and provides a heuristic solution to qu bit allocation, which is faster than the current solutions already implemented to deal with this problem.

...read moreread less

Abstract: In May of 2016, IBM Research has made a quantum processor available in the cloud to the general public. The possibility of programming an actual quantum device has elicited much enthusiasm. Yet, quantum programming still lacks the compiler support that modern programming languages enjoy today. To use universal quantum computers like IBM's, programmers must design low-level circuits. In particular, they must map logical qubits into physical qubits that need to obey connectivity constraints. This task resembles the early days of programming, in which software was built in machine languages. In this paper, we formally introduce the qubit allocation problem and provide an exact solution to it. This optimal algorithm deals with the simple quantum machinery available today; however, it cannot scale up to the more complex architectures scheduled to appear. Thus, we also provide a heuristic solution to qubit allocation, which is faster than the current solutions already implemented to deal with this problem.

...read moreread less

213 citations

Proceedings Article•DOI•

Divergence Analysis and Optimizations

[...]

Bruno Coutinho, Diogo Sampaio, Fernando Magno Quintão Pereira, Wagner Meira

10 Oct 2011

TL;DR: This paper introduces divergence analysis, a static analysis that determines which program variables will have the same values for every PE, and introduces branch fusion, a new compiler optimization that identifies chains of similarities between divergent program paths, and weaves these paths together as much as possible.

...read moreread less

Abstract: The growing interest in GPU programming has brought renewed attention to the Single Instruction Multiple Data (SIMD) execution model. SIMD machines give application developers a tremendous computational power, however, the model also brings restrictions. In particular, processing elements (PEs) execute in lock-step, and may lose performance due to divergences caused by conditional branches. In face of divergences, some PEs execute, while others wait, this alternation ending when they reach a synchronization point. In this paper we introduce divergence analysis, a static analysis that determines which program variables will have the same values for every PE. This analysis is useful in three different ways: it improves the translation of SIMD code to non-SIMD CPUs, it helps developers to manually improve their SIMD applications, and it also guides the compiler in the optimization of SIMD programs. We demonstrate this last point by introducing branch fusion, a new compiler optimization that identifies, via a gene sequencing algorithm, chains of similarities between divergent program paths, and weaves these paths together as much as possible. Our implementation has been accepted in the Ocelot open-source CUDA compiler, and is publicly available. We have tested it on many industrial-strength GPU benchmarks, including Rodinia and the Nvidia's SDK. Our divergence analysis has a 34% false-positive rate, compared to the results of a dynamic profiler. Our automatic optimization adds a 3% speed-up onto parallel quick sort, a heavily optimized benchmark. Our manual optimizations extend this number to over 10%.

...read moreread less

100 citations

Proceedings Article•DOI•

[...]

Fernando Magno Quintão Pereira¹, Jens Palsberg¹•Institutions (1)

University of California, Los Angeles¹

07 Jun 2008

TL;DR: It is shown that register allocation can be viewed as solving a collection of puzzles, and the register file is model as a puzzle board and the program variables as puzzle pieces; pre-coloring and register aliasing fit in naturally.

...read moreread less

Abstract: We show that register allocation can be viewed as solving a collection of puzzles. We model the register file as a puzzle board and the program variables as puzzle pieces; pre-coloring and register aliasing fit in naturally. For architectures such as PowerPC, x86, and StrongARM, we can solve the puzzles in polynomial time, and we have augmented the puzzle solver with a simple heuristic for spilling. For SPEC CPU2000, the compilation time of our implementation is as fast as that of the extended version of linear scan used by LLVM, which is the JIT compiler in the openGL stack of Mac OS 10.5. Our implementation produces x86 code that is of similar quality to the code produced by the slower, state-of-the-art iterated register coalescing of George and Appel with the extensions proposed by Smith, Ramsey, and Holloway in 2004.

...read moreread less

77 citations

Proceedings Article•DOI•

Sparse representation of implicit flows with applications to side-channel detection

[...]

Bruno Augusto Rodrigues¹, Fernando Magno Quintão Pereira¹, Diego F. Aranha²•Institutions (2)

Universidade Federal de Minas Gerais¹, State University of Campinas²

17 Mar 2016

TL;DR: FlowTracker, a tool that uncovers side-channel vulnerabilities in cryptographic algorithms, is built, which handles programs with over one-million assembly instructions in less than 200 seconds, and creates 24% less implicit flow edges than Ferrante et al.'s technique.

...read moreread less

Abstract: Information flow analyses traditionally use the Program Dependence Graph (PDG) as a supporting data-structure. This graph relies on Ferrante et al.'s notion of control dependences to represent implicit flows of information. A limitation of this approach is that it may create O(|I| x |E|) implicit flow edges in the PDG, where I are the instructions in a program, and E are the edges in its control flow graph. This paper shows that it is possible to compute information flow analyses using a different notion of implicit dependence, which yields a number of edges linear on the number of definitions plus uses of variables. Our algorithm computes these dependences in a single traversal of the program's dominance tree. This efficiency is possible due to a key property of programs in Static Single Assignment form: the definition of a variable dominates all its uses. Our algorithm correctly implements Hunt and Sands system of security types. Contrary to their original formulation, which required O(IxI) space and time for structured programs, we require only O(I). We have used our ideas to build FlowTracker, a tool that uncovers side-channel vulnerabilities in cryptographic algorithms. FlowTracker handles programs with over one-million assembly instructions in less than 200 seconds, and creates 24% less implicit flow edges than Ferrante et al.'s technique. FlowTracker has detected an issue in a constant-time implementation of Elliptic Curve Cryptography; it has found several time-variant constructions in OpenSSL, one issue in TrueCrypt and it has validated the isochronous behavior of the NaCl library.

...read moreread less

76 citations

Book Chapter•DOI•

[...]

Fernando Magno Quintão Pereira¹, Jens Palsberg¹•Institutions (1)

University of California, Los Angeles¹

02 Nov 2005

TL;DR: A simple algorithm for register allocation which is competitive with the iterated register coalescing algorithm of George and Appel and can optimally color a chordal graph in time linear in the number of edges.

...read moreread less

Abstract: We present a simple algorithm for register allocation which is competitive with the iterated register coalescing algorithm of George and Appel. We base our algorithm on the observation that 95% of the methods in the Java 1.5 library have chordal interference graphs when compiled with the JoeQ compiler. A greedy algorithm can optimally color a chordal graph in time linear in the number of edges, and we can easily add powerful heuristics for spilling and coalescing. Our experiments show that the new algorithm produces better results than iterated register coalescing for settings with few registers and comparable results for settings with many registers.

...read moreread less

74 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26

Collapse

Cited by

PDF

Open Access

More filters

Data Mining - Concepts and Techniques.

[...]

Petra Perner

01 Jan 2002

9,314 citations

Proceedings Article•DOI•

Random graphs

[...]

Alan Frieze¹•Institutions (1)

Carnegie Mellon University¹

22 Jan 2006

TL;DR: Some of the major results in random graphs and some of the more challenging open problems are reviewed, including those related to the WWW.

...read moreread less

Abstract: We will review some of the major results in random graphs and some of the more challenging open problems. We will cover algorithmic and structural questions. We will touch on newer models, including those related to the WWW.

...read moreread less

7,116 citations

Journal Article•

An overview of AspectJ

[...]

Gregor Kiczales¹, Erik Hilsdale², Jim Hugunin², Mik Kersten², Jeffrey Palm², William G. Griswold³ - Show less +2 more•Institutions (3)

University of British Columbia¹, PARC², University of California³

01 Jan 2001-Lecture Notes in Computer Science

TL;DR: AspectJ as mentioned in this paper is a simple and practical aspect-oriented extension to Java with just a few new constructs, AspectJ provides support for modular implementation of a range of crosscutting concerns.

...read moreread less

Abstract: Aspect] is a simple and practical aspect-oriented extension to Java With just a few new constructs, AspectJ provides support for modular implementation of a range of crosscutting concerns. In AspectJ's dynamic join point model, join points are well-defined points in the execution of the program; pointcuts are collections of join points; advice are special method-like constructs that can be attached to pointcuts; and aspects are modular units of crosscutting implementation, comprising pointcuts, advice, and ordinary Java member declarations. AspectJ code is compiled into standard Java bytecode. Simple extensions to existing Java development environments make it possible to browse the crosscutting structure of aspects in the same kind of way as one browses the inheritance structure of classes. Several examples show that AspectJ is powerful, and that programs written using it are easy to understand.

...read moreread less

2,947 citations

[서평]「Computer Organization and Design, The Hardware/Software Interface」

[...]

장훈

01 Nov 1997

TL;DR: Recognizing the mannerism ways to get this books computer organization and design the hardware software interface 4th fourth edition by patterson hennessy is additionally useful.

...read moreread less

Abstract: Recognizing the mannerism ways to get this books computer organization and design the hardware software interface 4th fourth edition by patterson hennessy is additionally useful. You have remained in right site to begin getting this info. acquire the computer organization and design the hardware software interface 4th fourth edition by patterson hennessy join that we manage to pay for here and check out the link.

...read moreread less

832 citations

Book•

Modular elliptic curves and fermat's last theorem

[...]

Kenneth A. Ribet

01 Jan 1993

434 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse