Home
/
Authors
/
Daniela Pucci de Farias

Author

Daniela Pucci de Farias

Other affiliations: Stanford University, State University of Campinas, IBM

Bio: Daniela Pucci de Farias is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research in topics: Linear programming & Linear-fractional programming. The author has an hindex of 17, co-authored 33 publications receiving 2153 citations. Previous affiliations of Daniela Pucci de Farias include Stanford University & State University of Campinas.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The Linear Programming Approach to Approximate Dynamic Programming

[...]

Daniela Pucci de Farias¹, B. Van Roy²•Institutions (2)

Massachusetts Institute of Technology¹, Stanford University²

01 Nov 2003-Operations Research

TL;DR: In this article, an efficient method based on linear programming for approximating solutions to large-scale stochastic control problems is proposed. But the approach is not suitable for large scale queueing networks.

...read moreread less

Abstract: The curse of dimensionality gives rise to prohibitive computational requirements that render infeasible the exact solution of large-scale stochastic control problems. We study an efficient method based on linear programming for approximating solutions to such problems. The approach "fits" a linear combination of pre-selected basis functions to the dynamic programming cost-to-go function. We develop error bounds that offer performance guarantees and also guide the selection of both basis functions and "state-relevance weights" that influence quality of the approximation. Experimental results in the domain of queueing network control provide empirical support for the methodology.

...read moreread less

643 citations

Journal Article•DOI•

Output feedback control of Markov jump linear systems in continuous-time

[...]

Daniela Pucci de Farias¹, Jose C. Geromel¹, J.B.R. do Val¹, Oswaldo Luiz do Valle Costa²•Institutions (2)

State University of Campinas¹, University of São Paulo²

01 May 2000-IEEE Transactions on Automatic Control

TL;DR: This paper addresses the dynamic output feedback control problem of continuous-time Markovian jump linear systems with an LMI characterization, comprising all dynamical compensators that stabilize the closed-loop system in the mean square sense.

...read moreread less

Abstract: This paper addresses the dynamic output feedback control problem of continuous-time Markovian jump linear systems. The fundamental point in the analysis is an LMI characterization, comprising all dynamical compensators that stabilize the closed-loop system in the mean square sense. The H/sub 2/ and H/sub /spl infin//-norm control problems are studied, and the H/sub 2/ and H/sub /spl infin// filtering problems are solved as a by product.

...read moreread less

433 citations

Journal Article•DOI•

On Constraint Sampling in the Linear Programming Approach to Approximate Dynamic Programming

[...]

Daniela Pucci de Farias, Benjamin Van Roy

01 Aug 2004-Mathematics of Operations Research

TL;DR: A scheme that samples and imposes a subset of constraints on a linear program that has a relatively small number of variables but an intractable number of constraints is studied.

...read moreread less

Abstract: In the linear programming approach to approximate dynamic programming, one tries to solve a certain linear program--the ALP--that has a relatively small numberK of variables but an intractable numberM of constraints. In this paper, we study a scheme that samples and imposes a subset ofm <

...read moreread less

415 citations

Journal Article•DOI•

Decentralized Resource Allocation in Dynamic Networks of Agents

[...]

Hariharan Lakshmanan, Daniela Pucci de Farias

11 Sep 2008-Siam Journal on Optimization

TL;DR: This work proposes a decentralized, asynchronous gradient-descent method that is suitable for implementation in the case where the communication between agents is described in terms of a dynamic network and shows how to accommodate nonnegativity constraints on the resources using the results derived.

...read moreread less

Abstract: We consider the problem of n agents that share m common resources. The objective is to derive an optimal allocation that maximizes a global objective expressed as a separable concave objective function. We propose a decentralized, asynchronous gradient-descent method that is suitable for implementation in the case where the communication between agents is described in terms of a dynamic network. This communication model accommodates situations such as mobile agents and communication failures. The method is shown to converge provided that the objective function has Lipschitz-continuous gradients. We further consider a randomized version of the same algorithm for the case where the objective function is nondifferentiable but has bounded subgradients. We show that both algorithms converge to near-optimal solutions and derive convergence rates in terms of the magnitude of the gradient of the objective function. We show how to accommodate nonnegativity constraints on the resources using the results derived. Ex...

...read moreread less

148 citations

Proceedings Article•DOI•

Embedding Health Management into Mission Tasking for UAV Teams

[...]

Mario Valenti¹, Brett Bethke¹, Jonathan P. How¹, Daniela Pucci de Farias¹, John Vian² - Show less +1 more•Institutions (2)

Massachusetts Institute of Technology¹, Boeing Phantom Works²

09 Jul 2007

TL;DR: The traditional definition of health management is extended to the context of multiple vehicle operations and autonomous multi-agent teams and health management information about each mission system component is used to improve the mission system's self-awareness and adapt vehicle, guidance, task and mission plans.

...read moreread less

Abstract: Coordinated multi-vehicle autonomous systems can provide incredible functionality, but off-nominal conditions and degraded system components can render this capability ineffective. This paper presents techniques to improve mission-level functional reliability through better system self-awareness and adaptive mission planning. In particular, we extend the traditional definition of health management, which has historically referred to the process of actively monitoring and managing vehicle sub-systems (e.g., avionics) in the event of component failures, to the context of multiple vehicle operations and autonomous multi-agent teams. In this case, health management information about each mission system component is used to improve the mission system's self-awareness and adapt vehicle, guidance, task and mission plans. This paper presents the theoretical foundations of our approach and recent experimental results on a new UAV testbed.

...read moreread less

95 citations

1
2
3
4
…
5
6
7

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

A Survey of Recent Results in Networked Control Systems

[...]

Joao P. Hespanha¹, Payam Naghshtabrizi¹, Yonggang Xu¹•Institutions (1)

University of California, Santa Barbara¹

05 Mar 2007

TL;DR: This work reviews several recent results on estimation, analysis, and controller synthesis for NCSs, and addresses channel limitations in terms of packet-rates, sampling, network delay, and packet dropouts.

...read moreread less

Abstract: Networked control systems (NCSs) are spatially distributed systems for which the communication between sensors, actuators, and controllers is supported by a shared communication network. We review several recent results on estimation, analysis, and controller synthesis for NCSs. The results surveyed address channel limitations in terms of packet-rates, sampling, network delay, and packet dropouts. The results are presented in a tutorial fashion, comparing alternative methodologies

...read moreread less

3,748 citations

Book•

Prediction, learning, and games

[...]

Nicolò Cesa-Bianchi¹, Gábor Lugosi²•Institutions (2)

University of Milan¹, Pompeu Fabra University²

01 Jan 2006

TL;DR: In this paper, the authors provide a comprehensive treatment of the problem of predicting individual sequences using expert advice, a general framework within which many related problems can be cast and discussed, such as repeated game playing, adaptive data compression, sequential investment in the stock market, sequential pattern analysis, and several other problems.

...read moreread less

Abstract: This important text and reference for researchers and students in machine learning, game theory, statistics and information theory offers a comprehensive treatment of the problem of predicting individual sequences. Unlike standard statistical approaches to forecasting, prediction of individual sequences does not impose any probabilistic assumption on the data-generating mechanism. Yet, prediction algorithms can be constructed that work well for all possible sequences, in the sense that their performance is always nearly as good as the best forecasting strategy in a given reference class. The central theme is the model of prediction using expert advice, a general framework within which many related problems can be cast and discussed. Repeated game playing, adaptive data compression, sequential investment in the stock market, sequential pattern analysis, and several other problems are viewed as instances of the experts' framework and analyzed from a common nonstochastic standpoint that often reveals new and intriguing connections.

...read moreread less

3,615 citations

Book•

Regret Analysis of Stochastic and Nonstochastic Multi-Armed Bandit Problems

[...]

Sébastien Bubeck¹, Nicolò Cesa-Bianchi²•Institutions (2)

Princeton University¹, University of Milan²

12 Dec 2012

TL;DR: In this article, the authors focus on regret analysis in the context of multi-armed bandit problems, where regret is defined as the balance between staying with the option that gave highest payoff in the past and exploring new options that might give higher payoffs in the future.

...read moreread less

Abstract: A multi-armed bandit problem - or, simply, a bandit problem - is a sequential allocation problem defined by a set of actions. At each time step, a unit resource is allocated to an action and some observable payoff is obtained. The goal is to maximize the total payoff obtained in a sequence of allocations. The name bandit refers to the colloquial term for a slot machine (a "one-armed bandit" in American slang). In a casino, a sequential allocation problem is obtained when the player is facing many slot machines at once (a "multi-armed bandit"), and must repeatedly choose where to insert the next coin. Multi-armed bandit problems are the most basic examples of sequential decision problems with an exploration-exploitation trade-off. This is the balance between staying with the option that gave highest payoffs in the past and exploring new options that might give higher payoffs in the future. Although the study of bandit problems dates back to the 1930s, exploration-exploitation trade-offs arise in several modern applications, such as ad placement, website optimization, and packet routing. Mathematically, a multi-armed bandit is defined by the payoff process associated with each option. In this book, the focus is on two extreme cases in which the analysis of regret is particularly simple and elegant: independent and identically distributed payoffs and adversarial payoffs. Besides the basic setting of finitely many actions, it also analyzes some of the most important variants and extensions, such as the contextual bandit model. This monograph is an ideal reference for students and researchers with an interest in bandit problems.

...read moreread less

2,427 citations

Posted Content•

Coverage control for mobile sensing networks

[...]

Jorge E. Cortes, Sonia Martinez, Timur Karatas¹, Francesco Bullo¹•Institutions (1)

University of Illinois at Urbana–Champaign¹

16 Dec 2002-arXiv: Optimization and Control

TL;DR: This paper proposes gradient descent algorithms for a class of utility functions which encode optimal coverage and sensing policies which are adaptive, distributed, asynchronous, and verifiably correct.

...read moreread less

Abstract: This paper presents control and coordination algorithms for groups of vehicles. The focus is on autonomous vehicle networks performing distributed sensing tasks where each vehicle plays the role of a mobile tunable sensor. The paper proposes gradient descent algorithms for a class of utility functions which encode optimal coverage and sensing policies. The resulting closed-loop behavior is adaptive, distributed, asynchronous, and verifiably correct.

...read moreread less

2,198 citations

Journal Article•DOI•

Distributionally Robust Optimization Under Moment Uncertainty with Application to Data-Driven Problems

[...]

Erick Delage¹, Yinyu Ye²•Institutions (2)

HEC Montréal¹, Stanford University²

01 May 2010-Operations Research

TL;DR: This paper proposes a model that describes uncertainty in both the distribution form (discrete, Gaussian, exponential, etc.) and moments (mean and covariance matrix) and demonstrates that for a wide range of cost functions the associated distributionally robust stochastic program can be solved efficiently.

...read moreread less

Abstract: Stochastic programming can effectively describe many decision-making problems in uncertain environments. Unfortunately, such programs are often computationally demanding to solve. In addition, their solution can be misleading when there is ambiguity in the choice of a distribution for the random parameters. In this paper, we propose a model that describes uncertainty in both the distribution form (discrete, Gaussian, exponential, etc.) and moments (mean and covariance matrix). We demonstrate that for a wide range of cost functions the associated distributionally robust (or min-max) stochastic program can be solved efficiently. Furthermore, by deriving a new confidence region for the mean and the covariance matrix of a random vector, we provide probabilistic arguments for using our model in problems that rely heavily on historical data. These arguments are confirmed in a practical example of portfolio selection, where our framework leads to better-performing policies on the “true” distribution underlying the daily returns of financial assets.

...read moreread less

1,569 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse