Author

Rui Castro

Bio: Rui Castro is an academic researcher from the University of Lisbon. The author has contributed to research on topics including Wind power and Renewable energy, has an h-index of 31, and has co-authored 173 publications receiving 4035 citations. Previous affiliations of Rui Castro include the Technical University of Lisbon and Columbia University.


Papers
Journal ArticleDOI
TL;DR: This article introduces network tomography, a new field expected to benefit greatly from the wealth of statistical theory and algorithms, focusing on recent developments including pseudo-likelihood methods and tree estimation formulations.
Abstract: Today's Internet is a massive, distributed network which continues to explode in size as e-commerce and related activities grow. The heterogeneous and largely unregulated structure of the Internet renders tasks such as dynamic routing, optimized service provision, service level verification and detection of anomalous/malicious behavior extremely challenging. The problem is compounded by the fact that one cannot rely on the cooperation of individual servers and routers to aid in the collection of network traffic measurements vital for these tasks. In many ways, network monitoring and inference problems bear a strong resemblance to other "inverse problems" in which key aspects of a system are not directly observable. Familiar signal processing or statistical problems such as tomographic image reconstruction and phylogenetic tree identification have interesting connections to those arising in networking. This article introduces network tomography, a new field which we believe will benefit greatly from the wealth of statistical theory and algorithms. It focuses especially on recent developments in the field including the application of pseudo-likelihood methods and tree estimation formulations.
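As a schematic illustration (this is the canonical formulation used across the network tomography literature, not an equation quoted from the article), many of these inverse problems reduce to a linear model,

    y = A\theta + \varepsilon,

where y \in \mathbb{R}^m collects the end-to-end measurements (packet delays or loss counts), A is the m x n routing matrix with A_{ij} = 1 if path i traverses link j and 0 otherwise, \theta \in \mathbb{R}^n holds the unobserved link-level parameters, and \varepsilon is measurement noise. Because typically m << n, A is not invertible, which is why the statistical machinery the article surveys replaces direct inversion.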

483 citations

Journal ArticleDOI
TL;DR: Using minimax analysis techniques, this paper studies achievable rates of classification error convergence for broad classes of distributions characterized by decision boundary regularity and noise conditions, and indicates the conditions under which significant gains from active learning can be expected.
Abstract: This paper analyzes the potential advantages and theoretical challenges of "active learning" algorithms. Active learning involves sequential sampling procedures that use information gleaned from previous samples in order to focus the sampling and accelerate the learning process relative to "passive learning" algorithms, which are based on nonadaptive (usually random) samples. There are a number of empirical and theoretical results suggesting that in certain situations active learning can be significantly more effective than passive learning. However, the fact that active learning algorithms are feedback systems makes their theoretical analysis very challenging. This paper aims to shed light on achievable limits in active learning. Using minimax analysis techniques, we study the achievable rates of classification error convergence for broad classes of distributions characterized by decision boundary regularity and noise conditions. The results clearly indicate the conditions under which one can expect significant gains through active learning. Furthermore, we show that the learning rates derived are tight for "boundary fragment" classes in d-dimensional feature spaces when the feature marginal density is bounded from above and below.
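As a toy numerical illustration of why adaptivity helps (a minimal sketch in Python, not code from the paper): for a noiseless one-dimensional threshold classifier on [0, 1], n passive uniform label queries localize the decision boundary only to about 1/n, while n actively chosen bisection queries localize it to 2^(-n).

import random

def passive_estimate(threshold, n):
    # Labels at n uniformly random points localize the boundary between
    # the largest point labeled 0 and the smallest point labeled 1.
    xs = sorted(random.random() for _ in range(n))
    lo, hi = 0.0, 1.0
    for x in xs:
        if x < threshold:
            lo = x
        else:
            hi = x
            break
    return (lo + hi) / 2

def active_estimate(threshold, n):
    # Bisection: every query halves the interval containing the boundary.
    lo, hi = 0.0, 1.0
    for _ in range(n):
        mid = (lo + hi) / 2
        if mid < threshold:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

t, n = 0.613, 20
print("passive error:", abs(passive_estimate(t, n) - t))
print("active error: ", abs(active_estimate(t, n) - t))

In the noisy setting the paper analyzes, feedback is less reliable and the gap narrows; quantifying exactly how much of the gap survives is what the minimax rates are for.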

242 citations

Proceedings ArticleDOI
01 Jun 2002
TL;DR: This paper introduces a novel delay-based measurement scheme that does not require clock synchronization, making it more practical than previous proposals, and develops a novel Markov Chain Monte Carlo procedure for rapid determination of the most likely topologies.
Abstract: Network tomography is a process for inferring "internal" link-level delay and loss performance information based on end-to-end (edge) network measurements. These methods require knowledge of the network topology; therefore a first crucial step in the tomography process is topology identification. This paper considers the problem of discovering network topology solely from host-based, unicast measurements, without internal network cooperation. First, we introduce a novel delay-based measurement scheme that does not require clock synchronization, making it more practical than previous proposals. In contrast to methods that rely on network cooperation, our methodology has the potential to identify layer two elements (provided they are logical topology branching points and induce some measurable delay). Second, we propose a maximum penalized likelihood criterion for topology identification. This is a global optimality criterion, in contrast to other recent proposals for topology identification that employ suboptimal, pair-merging strategies. We develop a novel Markov Chain Monte Carlo (MCMC) procedure for rapid determination of the most likely topologies. The performance of our new probing scheme and identification algorithm is explored through simulation and Internet experiments.
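Schematically, a penalized likelihood criterion of this kind scores each candidate tree by data fit minus a complexity penalty (the form below is a generic illustration, not necessarily the paper's exact criterion):

    \hat{T} = \arg\max_{T \in \mathcal{T}} \left\{ \log p(x \mid T, \hat{\theta}(T)) - \lambda \, N(T) \right\},

where x denotes the end-to-end delay measurements, \hat{\theta}(T) the maximum-likelihood delay parameters for candidate topology T, N(T) a complexity measure such as the number of internal nodes, and \lambda > 0 the penalty weight. The MCMC procedure can then be viewed as a random walk over the tree space \mathcal{T}, proposing local moves (for instance, inserting or deleting a branching node) and accepting them via a Metropolis-Hastings rule.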

205 citations

Proceedings Article
05 Dec 2005
TL;DR: In this article, the authors present a rigorous statistical analysis characterizing regimes in which active learning significantly outperforms classical passive learning, and explore fundamental performance limits of active and passive learning in two illustrative nonparametric function classes.
Abstract: This paper presents a rigorous statistical analysis characterizing regimes in which active learning significantly outperforms classical passive learning. Active learning algorithms are able to make queries or select sample locations in an online fashion, depending on the results of the previous queries. In some regimes, this extra flexibility leads to significantly faster rates of error decay than those possible in classical passive learning settings. The nature of these regimes is explored by studying fundamental performance limits of active and passive learning in two illustrative nonparametric function classes. In addition to examining the theoretical potential of active learning, this paper describes a practical algorithm capable of exploiting the extra flexibility of the active setting and provably improving upon the classical passive techniques. Our active learning theory and methods show promise in a number of applications, including field estimation using wireless sensor networks and fault line detection.
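As a rough sketch of how an active, two-stage design can concentrate samples near a fault line (an illustrative toy in Python; the field, grid size, and refinement rule are assumptions, not the paper's algorithm):

import numpy as np

def field(x, y):
    # Hypothetical binary field with a fault line at y = 0.5 + 0.2*sin(4x).
    return 1.0 if y > 0.5 + 0.2 * np.sin(4 * x) else 0.0

def two_stage_survey(n_coarse=16, refine=4):
    # Stage 1: a coarse uniform grid, as a passive design would use.
    xs = np.linspace(0, 1, n_coarse)
    ys = np.linspace(0, 1, n_coarse)
    coarse = np.array([[field(x, y) for x in xs] for y in ys])
    # Stage 2: spend the remaining budget only in cells whose corner
    # labels disagree, i.e. cells the boundary passes through.
    extra = []
    for i in range(n_coarse - 1):
        for j in range(n_coarse - 1):
            block = coarse[i:i + 2, j:j + 2]
            if block.min() != block.max():
                for u in np.linspace(xs[j], xs[j + 1], refine):
                    for v in np.linspace(ys[i], ys[i + 1], refine):
                        extra.append((u, v, field(u, v)))
    return coarse, extra

coarse, extra = two_stage_survey()
print(f"coarse samples: {coarse.size}, refined samples near the boundary: {len(extra)}")

Because a smooth boundary crosses only on the order of n_coarse of the roughly n_coarse^2 cells, the second stage spends its budget on a vanishing fraction of the domain; this concentration of effort is the source of the faster error decay.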

160 citations

Journal ArticleDOI
TL;DR: In this article, a sequential adaptive sampling-and-refinement procedure called distilled sensing (DS) is proposed and analyzed; thanks to its adaptivity, DS can detect and localize far weaker signals than non-adaptive measurements allow.
Abstract: Adaptive sampling results in significant improvements in the recovery of sparse signals in white Gaussian noise. A sequential adaptive sampling-and-refinement procedure called Distilled Sensing (DS) is proposed and analyzed. DS is a form of multistage experimental design and testing. Because of the adaptive nature of the data collection, DS can detect and localize far weaker signals than possible from non-adaptive measurements. In particular, reliable detection and localization (support estimation) using non-adaptive samples is possible only if the signal amplitudes grow logarithmically with the problem dimension. Here it is shown that using adaptive sampling, reliable detection is possible provided the amplitude exceeds a constant, and localization is possible when the amplitude exceeds any arbitrarily slowly growing function of the dimension.
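A minimal simulation sketch of the distillation idea in Python (the stage schedule and the final threshold are illustrative choices, not the paper's exact constants):

import numpy as np

rng = np.random.default_rng(0)

def distilled_sensing(x, n_stages=5):
    # Each stage spends a measurement budget of n units, refocused on the
    # coordinates that survived the previous stage.
    n = x.size
    alive = np.arange(n)
    for stage in range(n_stages):
        if alive.size == 0:
            break
        precision = n / alive.size          # energy per surviving coordinate
        y = np.sqrt(precision) * x[alive] + rng.standard_normal(alive.size)
        if stage < n_stages - 1:
            alive = alive[y > 0]            # distillation: drop nonpositives
        else:
            # Final stage: a universal threshold on the boosted observations.
            alive = alive[y > np.sqrt(2 * np.log(alive.size))]
    return alive

n = 10_000
support = np.arange(20)                     # 20 active coordinates, amplitude 3
x = np.zeros(n)
x[support] = 3.0
found = distilled_sensing(x)
print("detected:", found.size, "true positives:", np.intersect1d(found, support).size)

Each distillation stage removes about half of the null coordinates while retaining nearly all signal coordinates, so the per-coordinate measurement energy roughly doubles from stage to stage; this is how weak signals become detectable.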

143 citations


Cited by
Book
24 Aug 2012
TL;DR: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach, and is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.
Abstract: Today's Web-enabled deluge of electronic data calls for automated methods of data analysis. Machine learning provides these, developing methods that can automatically detect patterns in data and then use the uncovered patterns to predict future data. This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach. The coverage combines breadth and depth, offering necessary background material on such topics as probability, optimization, and linear algebra as well as discussion of recent developments in the field, including conditional random fields, L1 regularization, and deep learning. The book is written in an informal, accessible style, complete with pseudo-code for the most important algorithms. All topics are copiously illustrated with color images and worked examples drawn from such application domains as biology, text processing, computer vision, and robotics. Rather than providing a cookbook of different heuristic methods, the book stresses a principled model-based approach, often using the language of graphical models to specify models in a concise and intuitive way. Almost all the models described have been implemented in a MATLAB software package--PMTK (probabilistic modeling toolkit)--that is freely available online. The book is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.

8,059 citations

Journal ArticleDOI
TL;DR: In this article, the authors proposed a method of modeling and simulation of photovoltaic arrays by adjusting the curve at three points: open circuit, maximum power, and short circuit.
Abstract: This paper proposes a method of modeling and simulation of photovoltaic arrays. The main objective is to find the parameters of the nonlinear I-V equation by adjusting the curve at three points: open circuit, maximum power, and short circuit. Given these three points, which are provided by all commercial array data sheets, the method finds the best I-V equation for the single-diode photovoltaic (PV) model including the effect of the series and parallel resistances, and guarantees that the maximum power of the model matches the maximum power of the real array. With the parameters of the adjusted I-V equation, one can build a PV circuit model with any circuit simulator by using basic math blocks. The modeling method and the proposed circuit model are useful for power electronics designers who need a simple, fast, accurate, and easy-to-use modeling method for use in simulations of PV systems. In the first pages, the reader will find a tutorial on PV devices and will understand the parameters that compose the single-diode PV model. The modeling method is then introduced and presented in detail. The model is validated with experimental data of commercial PV arrays.
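For readers who want to experiment, the single-diode equation is implicit in the current I and is easily solved numerically. A minimal sketch in Python follows; the module parameters are hypothetical placeholders, not values from the paper.

import math

# Single-diode model: I = Ipv - I0*(exp((V + I*Rs)/(a*Vt)) - 1) - (V + I*Rs)/Rp.
# I appears on both sides, so solve for I at each V with Newton's method.

k, q = 1.380649e-23, 1.602176634e-19        # Boltzmann constant, electron charge

def pv_current(V, Ipv, I0, Rs, Rp, a, Ns, T=298.15, iters=50):
    Vt = Ns * k * T / q                     # thermal voltage of Ns series cells
    I = Ipv                                 # initial guess: the photocurrent
    for _ in range(iters):
        e = math.exp((V + I * Rs) / (a * Vt))
        f = Ipv - I0 * (e - 1) - (V + I * Rs) / Rp - I
        df = -I0 * e * Rs / (a * Vt) - Rs / Rp - 1
        I -= f / df                         # Newton step
    return I

# Hypothetical module parameters, for illustration only.
params = dict(Ipv=8.2, I0=1e-7, Rs=0.2, Rp=300.0, a=1.3, Ns=54)
for V in (0.0, 10.0, 20.0, 25.0):
    print(f"V = {V:5.1f} V  ->  I = {pv_current(V, **params):6.3f} A")

The paper's contribution is the reverse direction: choosing Ipv, I0, Rs, Rp, and a so that this curve passes through the data-sheet short-circuit, maximum-power, and open-circuit points.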

3,811 citations

Journal ArticleDOI
01 Jan 2016
TL;DR: This review paper introduces Bayesian optimization, highlights some of its methodological aspects, and showcases a wide range of applications.
Abstract: Big Data applications are typically associated with systems involving large numbers of users, massive complex software systems, and large-scale heterogeneous computing and storage architectures. The construction of such systems involves many distributed design choices. The end products (e.g., recommendation systems, medical analysis tools, real-time game engines, speech recognizers) thus involve many tunable configuration parameters. These parameters are often specified and hard-coded into the software by various developers or teams. If optimized jointly, these parameters can result in significant improvements. Bayesian optimization is a powerful tool for the joint optimization of design choices that is gaining great popularity in recent years. It promises greater automation so as to increase both product quality and human productivity. This review paper introduces Bayesian optimization, highlights some of its methodological aspects, and showcases a wide range of applications.
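As a compact illustration of the loop the review describes (a sketch in Python under simplifying assumptions: a one-dimensional configuration space, a Gaussian-process surrogate with an RBF kernel, and the expected-improvement acquisition; this is not code from the paper):

import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(1)

def rbf(A, B, length=0.15):
    return np.exp(-0.5 * ((A[:, None] - B[None, :]) / length) ** 2)

def gp_posterior(X, y, Xq, noise=1e-3):
    # Standard Gaussian-process regression with an RBF kernel.
    K = rbf(X, X) + noise * np.eye(len(X))
    Ks = rbf(X, Xq)
    mu = Ks.T @ np.linalg.solve(K, y)
    var = 1.0 - np.sum(Ks * np.linalg.solve(K, Ks), axis=0)
    return mu, np.sqrt(np.maximum(var, 1e-12))

def expected_improvement(mu, sigma, best):
    z = (mu - best) / sigma
    return sigma * (z * norm.cdf(z) + norm.pdf(z))

def objective(x):
    # Hypothetical stand-in for an expensive, noisy system evaluation.
    return np.sin(6 * x) * x + 0.05 * rng.standard_normal()

X = rng.random(3)                            # a few random initial configurations
y = np.array([objective(x) for x in X])
grid = np.linspace(0, 1, 400)
for _ in range(15):
    mu, sigma = gp_posterior(X, y, grid)     # fit the surrogate
    x_next = grid[np.argmax(expected_improvement(mu, sigma, y.max()))]
    X = np.append(X, x_next)                 # evaluate where EI is largest
    y = np.append(y, objective(x_next))
print(f"best configuration found: x = {X[np.argmax(y)]:.3f}, value = {y.max():.3f}")

The essential pattern, whatever the application, is the same: a cheap probabilistic surrogate decides where the next expensive evaluation is most informative.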

3,703 citations

Journal ArticleDOI
TL;DR: A new camera architecture is presented that combines a digital micromirror device with the new mathematical theory and algorithms of compressive sampling, and can operate efficiently across a broader spectral range than conventional silicon-based cameras.
Abstract: In this article, the authors present a new approach to building simpler, smaller, and cheaper digital cameras that can operate efficiently across a broader spectral range than conventional silicon-based cameras. The approach fuses a new camera architecture based on a digital micromirror device with the new mathematical theory and algorithms of compressive sampling.
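A minimal reconstruction sketch of the principle in Python (illustrative; the article's hardware and solvers differ): each photodetector reading is an inner product of the scene with a pseudorandom mirror pattern, and a sparse scene is recovered from m << n readings by l1 minimization, here via plain iterative soft-thresholding (ISTA).

import numpy as np

rng = np.random.default_rng(2)

def ista(y, Phi, lam=0.05, n_iter=500):
    # Solves min_x 0.5*||y - Phi @ x||^2 + lam*||x||_1 by proximal gradient.
    L = np.linalg.norm(Phi, 2) ** 2          # Lipschitz constant of the gradient
    x = np.zeros(Phi.shape[1])
    for _ in range(n_iter):
        z = x - Phi.T @ (Phi @ x - y) / L    # gradient step
        x = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)  # soft threshold
    return x

n, m, k = 1024, 256, 15                      # scene size, measurements, sparsity
x_true = np.zeros(n)
x_true[rng.choice(n, k, replace=False)] = 3 * rng.standard_normal(k)
Phi = rng.standard_normal((m, n)) / np.sqrt(m)   # pseudorandom mirror patterns
y = Phi @ x_true                             # one reading per pattern
x_hat = ista(y, Phi)
print("relative error:", np.linalg.norm(x_hat - x_true) / np.linalg.norm(x_true))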

3,316 citations

01 Jan 2003

3,093 citations