Showing papers on "Multi-agent system published in 2010"

PDF

Open Access

Journal Article•DOI•

Constrained Consensus and Optimization in Multi-Agent Networks

[...]

Angelia Nedic¹, Asuman Ozdaglar², Pablo A. Parrilo²•Institutions (2)

University of Illinois at Urbana–Champaign¹, Massachusetts Institute of Technology²

02 Feb 2010-IEEE Transactions on Automatic Control

TL;DR: In this article, the authors present a distributed algorithm that can be used by multiple agents to align their estimates with a particular value over a network with time-varying connectivity.

...read moreread less

Abstract: We present distributed algorithms that can be used by multiple agents to align their estimates with a particular value over a network with time-varying connectivity. Our framework is general in that this value can represent a consensus value among multiple agents or an optimal solution of an optimization problem, where the global objective function is a combination of local agent objective functions. Our main focus is on constrained problems where the estimates of each agent are restricted to lie in different convex sets. To highlight the effects of constraints, we first consider a constrained consensus problem and present a distributed "projected consensus algorithm" in which agents combine their local averaging operation with projection on their individual constraint sets. This algorithm can be viewed as a version of an alternating projection method with weights that are varying over time and across agents. We establish convergence and convergence rate results for the projected consensus algorithm. We next study a constrained optimization problem for optimizing the sum of local objective functions of the agents subject to the intersection of their local constraint sets. We present a distributed "projected subgradient algorithm" which involves each agent performing a local averaging operation, taking a subgradient step to minimize its own objective function, and projecting on its constraint set. We show that, with an appropriately selected stepsize rule, the agent estimates generated by this algorithm converge to the same optimal solution for the cases when the weights are constant and equal, and when the weights are time-varying but all agents have the same constraint set.

...read moreread less

1,773 citations

Book•

Population Games and Evolutionary Dynamics

[...]

William H. Sandholm

17 Dec 2010

TL;DR: The text first considers population games, which provide a simple, powerful model for studying strategic interactions among large numbers of anonymous agents, and studies the dynamics of behavior in these games, providing foundations for two distinct approaches to aggregate behavior dynamics.

...read moreread less

Abstract: This text offers a systematic, rigorous, and unified presentation of evolutionary game theory, covering the core developments of the theory from its inception in biology in the 1970s through recent advances. Evolutionary game theory, which studies the behavior of large populations of strategically interacting agents, is used by economists to make predictions in settings where traditional assumptions about agents' rationality and knowledge may not be justified. Recently, computer scientists, transportation scientists, engineers, and control theorists have also turned to evolutionary game theory, seeking tools for modeling dynamics in multiagent systems. Population Games and Evolutionary Dynamics provides a point of entry into the field for researchers and students in all of these disciplines. The text first considers population games, which provide a simple, powerful model for studying strategic interactions among large numbers of anonymous agents. It then studies the dynamics of behavior in these games. By introducing a general model of myopic strategy revision by individual agents, the text provides foundations for two distinct approaches to aggregate behavior dynamics: the deterministic approach, based on differential equations, and the stochastic approach, based on Markov processes. Key results on local stability, global convergence, stochastic stability, and nonconvergence are developed in detail. Ten substantial appendixes present the mathematical tools needed to work in evolutionary game theory, offering a practical introduction to the methods of dynamic modeling. Accompanying the text are more than 200 color illustrations of the mathematics and theoretical results; many were created using the Dynamo software suite, which is freely available on the author's Web site. Readers are encouraged to use Dynamo to run quick numerical experiments and to create publishable figures for their own research.

...read moreread less

1,296 citations

Journal Article•DOI•

Leader-following consensus of multi-agent systems under fixed and switching topologies ☆

[...]

Wei Ni¹, Daizhan Cheng¹•Institutions (1)

Chinese Academy of Sciences¹

01 Mar 2010-Systems & Control Letters

TL;DR: The control of each agent using local information is designed and detailed analysis of the leader-following consensus is presented for both fixed and switching interaction topologies, which describe the information exchange between the multi-agent systems.

...read moreread less

1,252 citations

Journal Article•DOI•

Finite-Time Consensus Problems for Networks of Dynamic Agents

[...]

Long Wang¹, Feng Xiao²•Institutions (2)

Peking University¹, Beijing Institute of Technology²

02 Feb 2010-IEEE Transactions on Automatic Control

TL;DR: It is proved that if the sum of time intervals, in which the interaction topology is connected, is sufficiently large, the proposed protocols will solve the finite-time consensus problems.

...read moreread less

Abstract: In this note, we discuss finite-time state consensus problems for multi-agent systems and present one framework for constructing effective distributed protocols, which are continuous state feedbacks. By employing the theory of finite-time stability, we investigate both the bidirectional interaction case and the unidirectional interaction case, and prove that if the sum of time intervals, in which the interaction topology is connected, is sufficiently large, the proposed protocols will solve the finite-time consensus problems.

...read moreread less

907 citations

Journal Article•DOI•

A Review of the Applications of Agent Technology in Traffic and Transportation Systems

[...]

Bo Chen¹, Harry H. Cheng²•Institutions (2)

Michigan Technological University¹, University of California, Davis²

01 Jun 2010-IEEE Transactions on Intelligent Transportation Systems

TL;DR: This paper examines an agent- based approach and its applications in different modes of transportation, including roadway, railway, and air transportation, and addresses some critical issues in developing agent-based traffic control and management systems, such as interoperability, flexibility, and extendibility.

...read moreread less

Abstract: The agent computing paradigm is rapidly emerging as one of the powerful technologies for the development of large-scale distributed systems to deal with the uncertainty in a dynamic environment. The domain of traffic and transportation systems is well suited for an agent-based approach because transportation systems are usually geographically distributed in dynamic changing environments. Our literature survey shows that the techniques and methods resulting from the field of agent and multiagent systems have been applied to many aspects of traffic and transportation systems, including modeling and simulation, dynamic routing and congestion management, and intelligent traffic control. This paper examines an agent-based approach and its applications in different modes of transportation, including roadway, railway, and air transportation. This paper also addresses some critical issues in developing agent-based traffic control and management systems, such as interoperability, flexibility, and extendibility. Finally, several future research directions toward the successful deployment of agent technology in traffic and transportation systems are discussed.

...read moreread less

590 citations

Journal Article•DOI•

Second-order leader-following consensus of nonlinear multi-agent systems via pinning control

[...]

Qiang Song¹, Qiang Song², Jinde Cao², Wenwu Yu³, Wenwu Yu² - Show less +1 more•Institutions (3)

Hangzhou Dianzi University¹, Southeast University², City University of Hong Kong³

01 Sep 2010-Systems & Control Letters

TL;DR: This paper addresses what kind of agents and how many agents should be pinned, and establishes some sufficient conditions to guarantee that all agents asymptotically follow the virtual leader.

...read moreread less

552 citations

Book Chapter•DOI•

An Introduction to Multi-Agent Systems

[...]

P.G. Balaji¹, Dipti Srinivasan¹•Institutions (1)

National University of Singapore¹

01 Jan 2010

TL;DR: The goal of this chapter is to provide a quick reference to assist in the design of multi-agent systems and to highlight the merit and demerits of the existing methods.

...read moreread less

Abstract: Multi-agent systems is a subfield of Distributed Artificial Intelligence that has experienced rapid growth because of the flexibility and the intelligence available solve distributed problems. In this chapter, a brief survey of multi-agent systems has been presented. These encompass different attributes such as architecture, communication, coordination strategies, decision making and learning abilities. The goal of this chapter is to provide a quick reference to assist in the design of multi-agent systems and to highlight the merit and demerits of the existing methods.

...read moreread less

540 citations

Journal Article•DOI•

Reinforcement learning-based multi-agent system for network traffic signal control

[...]

I. Arel¹, C. Liu¹, T. Urbanik¹, Airton G Kohls¹•Institutions (1)

University of Tennessee¹

03 Jun 2010-Iet Intelligent Transport Systems

TL;DR: Experimental results clearly demonstrate the advantages of multi-agent RL-based control over LQF governed isolated single-intersection control, thus paving the way for efficient distributed traffic signal control in complex settings.

...read moreread less

Abstract: A challenging application of artificial intelligence systems involves the scheduling of traffic signals in multi-intersection vehicular networks. This paper introduces a novel use of a multi-agent system and reinforcement learning (RL) framework to obtain an efficient traffic signal control policy. The latter is aimed at minimising the average delay, congestion and likelihood of intersection cross-blocking. A five-intersection traffic network has been studied in which each intersection is governed by an autonomous intelligent agent. Two types of agents, a central agent and an outbound agent, were employed. The outbound agents schedule traffic signals by following the longest-queue-first (LQF) algorithm, which has been proved to guarantee stability and fairness, and collaborate with the central agent by providing it local traffic statistics. The central agent learns a value function driven by its local and neighbours' traffic conditions. The novel methodology proposed here utilises the Q-Learning algorithm with a feedforward neural network for value function approximation. Experimental results clearly demonstrate the advantages of multi-agent RL-based control over LQF governed isolated single-intersection control, thus paving the way for efficient distributed traffic signal control in complex settings.

...read moreread less

463 citations

Journal Article•DOI•

A Distributed Control Approach to A Robust Output Regulation Problem for Multi-Agent Linear Systems

[...]

Xiaoli Wang¹, Yiguang Hong², Jie Huang³, Zhong-Ping Jiang⁴•Institutions (4)

Harbin Institute of Technology¹, Chinese Academy of Sciences², The Chinese University of Hong Kong³, New York University⁴

13 Sep 2010-IEEE Transactions on Automatic Control

TL;DR: In this note, the robust output regulation problem of a multi-agent system is considered and an internal model based distributed control scheme is adopted to achieve the objectives of asymptotic tracking and disturbance rejection in an uncertain multi- agent system.

...read moreread less

Abstract: In this note, the robust output regulation problem of a multi-agent system is considered. An internal model based distributed control scheme is adopted to achieve the objectives of asymptotic tracking and disturbance rejection in an uncertain multi-agent system where both the reference inputs and disturbances are generated by an exosystem.

...read moreread less

450 citations

Proceedings Article•

Ad hoc autonomous agent teams: collaboration without pre-coordination

[...]

Peter Stone¹, Gal A. Kaminka², Sarit Kraus², Jeffrey S. Rosenschein•Institutions (2)

University of Texas at Austin¹, Bar-Ilan University²

11 Jul 2010

TL;DR: This paper defines the concept of ad hoc team agents, specifies an evaluation paradigm, and provides examples of possible theoretical and empirical approaches to challenge to encourage progress towards this ambitious, newly realistic, and increasingly important research goal.

...read moreread less

Abstract: As autonomous agents proliferate in the real world, both in software and robotic settings, they will increasingly need to band together for cooperative activities with previously unfamiliar teammates. In such ad hoc team settings, team strategies cannot be developed a priori. Rather, an agent must be prepared to cooperate with many types of teammates: it must collaborate without pre-coordination. This paper challenges the AI community to develop theory and to implement prototypes of ad hoc team agents. It defines the concept of ad hoc team agents, specifies an evaluation paradigm, and provides examples of possible theoretical and empirical approaches to challenge. The goal is to encourage progress towards this ambitious, newly realistic, and increasingly important research goal.

...read moreread less

350 citations

Journal Article•DOI•

Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access

[...]

Keqin Liu¹, Qing Zhao¹•Institutions (1)

University of California, Davis¹

01 Nov 2010-IEEE Transactions on Information Theory

TL;DR: This work establishes the indexability and obviates the need to know the Markov transition probabilities in Whittle index policy, and develops efficient algorithms for computing a performance upper bound given by Lagrangian relaxation.

...read moreread less

Abstract: In this paper, we consider a class of restless multiarmed bandit processes (RMABs) that arises in dynamic multichannel access, user/server scheduling, and optimal activation in multiagent systems. For this class of RMABs, we establish the indexability and obtain Whittle index in closed form for both discounted and average reward criteria. These results lead to a direct implementation of Whittle index policy with remarkably low complexity. When arms are stochastically identical, we show that Whittle index policy is optimal under certain conditions. Furthermore, it has a semiuniversal structure that obviates the need to know the Markov transition probabilities. The optimality and the semiuniversal structure result from the equivalence between Whittle index policy and the myopic policy established in this work. For nonidentical arms, we develop efficient algorithms for computing a performance upper bound given by Lagrangian relaxation. The tightness of the upper bound and the near-optimal performance of Whittle index policy are illustrated with simulation examples.

...read moreread less

Journal Article•DOI•

Neural-Network-Based Adaptive Leader-Following Control for Multiagent Systems With Uncertainties

[...]

Long Cheng¹, Zeng-Guang Hou¹, Min Tan¹, Yingzi Lin², Wenjun Zhang³ - Show less +1 more•Institutions (3)

Chinese Academy of Sciences¹, Northeastern University², University of Saskatchewan³

01 Aug 2010-IEEE Transactions on Neural Networks

TL;DR: A neural-network-based adaptive approach is proposed for the leader-following control of multiagent systems that takes uncertainty in the agent's dynamics into account; the leader's state could be time-varying; and the proposed algorithm for each following agent is only dependent on the information of its neighbor agents.

...read moreread less

Abstract: A neural-network-based adaptive approach is proposed for the leader-following control of multiagent systems. The neural network is used to approximate the agent's uncertain dynamics, and the approximation error and external disturbances are counteracted by employing the robust signal. When there is no control input constraint, it can be proved that all the following agents can track the leader's time-varying state with the tracking error as small as desired. Compared with the related work in the literature, the uncertainty in the agent's dynamics is taken into account; the leader's state could be time-varying; and the proposed algorithm for each following agent is only dependent on the information of its neighbor agents. Finally, the satisfactory performance of the proposed method is illustrated by simulation examples.

...read moreread less

Journal Article•DOI•

Distributed Coordination of Networked Fractional-Order Systems

[...]

Yongcan Cao¹, Yan Li¹, Wei Ren¹, YangQuan Chen¹•Institutions (1)

Utah State University¹

01 Apr 2010

TL;DR: This paper shows sufficient conditions on the interaction graph and the fractional order such that coordination can be achieved using the general model, and compares the convergence speed of coordination for fractional- order systems with that for integer-order systems.

...read moreread less

Abstract: This paper studies the distributed coordination of networked fractional-order systems over a directed interaction graph. A general fractional-order coordination model is introduced by summarizing three different cases: 1) fractional-order agent dynamics with integer-order coordination algorithms; 2) fractional-order agent dynamics with fractional-order coordination algorithms; and 3) integer-order agent dynamics with fractional-order coordination algorithms. We show sufficient conditions on the interaction graph and the fractional order such that coordination can be achieved using the general model. The coordination equilibrium is also explicitly given. In addition, we characterize the relationship between the number of agents and the fractional order to ensure coordination. Furthermore, we compare the convergence speed of coordination for fractional-order systems with that for integer-order systems. It is shown that the convergence speed of the fractional-order coordination algorithms can be improved by varying the fractional orders with time. Finally, simulation results are presented as a proof of concept.

...read moreread less

Journal Article•DOI•

An agent-based approach to model land-use change at a regional scale

[...]

Diego Valbuena¹, Peter H. Verburg¹, Arnold K. Bregt¹, Arend Ligtenberg¹•Institutions (1)

Wageningen University and Research Centre¹

01 Feb 2010-Landscape Ecology

TL;DR: A conceptual framework for ABM to analyse and explore regional LUCC processes is described, represented by combining different concepts including agent typologies, farm trajectories and probabilistic decision-making processes.

...read moreread less

Abstract: Land-use/cover change (LUCC) is a complex process that includes actors and factors at different social and spatial levels. A common approach to analyse and simulate LUCC as the result of individual decisions is agent-based modelling (ABM). However, ABM is often applied to simulate processes at local scales, while its application in regional studies is limited. This paper describes first a conceptual framework for ABM to analyse and explore regional LUCC processes. Second, the conceptual framework is represented by combining different concepts including agent typologies, farm trajectories and probabilistic decision-making processes. Finally, the framework is illustrated through a case study in the Netherlands, where processes of farm cessation, farm expansion and farm diversification are shaping the structure of the landscape. The framework is a generic, straightforward approach to analyse and explore regional LUCC with an explicit link to empirical approaches for parameterization of ABM.

...read moreread less

Journal Article•DOI•

Brief paper: Stability analysis for multi-agent systems using the incidence matrix: Quantized communication and formation control

[...]

Dimos V. Dimarogonas¹, Karl Henrik Johansson²•Institutions (2)

Massachusetts Institute of Technology¹, Royal Institute of Technology²

01 Apr 2010-Automatica

TL;DR: It is shown how the relation between tree graphs and the null space of the corresponding incidence matrix encode fundamental properties for these two multi-agent control problems.

...read moreread less

Journal Article•DOI•

Agent-based modeling in ecological economics.

[...]

Scott Heckbert¹, Tim Baynes¹, Andrew Reeson¹•Institutions (1)

Commonwealth Scientific and Industrial Research Organisation¹

01 Jan 2010-Annals of the New York Academy of Sciences

TL;DR: Frontiers for ABM research in ecological economics involve advancing the empirical calibration and validation of models through mixed methods, including surveys, interviews, participatory modeling, and, notably, experimental economics to test specific decision‐making hypotheses.

...read moreread less

Abstract: Interconnected social and environmental systems are the domain of ecological economics, and models can be used to explore feedbacks and adaptations inherent in these systems. Agent-based modeling (ABM) represents autonomous entities, each with dynamic behavior and heterogeneous characteristics. Agents interact with each other and their environment, resulting in emergent outcomes at the macroscale that can be used to quantitatively analyze complex systems. ABM is contributing to research questions in ecological economics in the areas of natural resource management and land-use change, urban systems modeling, market dynamics, changes in consumer attitudes, innovation, and diffusion of technology and management practices, commons dilemmas and self-governance, and psychological aspects to human decision making and behavior change. Frontiers for ABM research in ecological economics involve advancing the empirical calibration and validation of models through mixed methods, including surveys, interviews, participatory modeling, and, notably, experimental economics to test specific decision-making hypotheses. Linking ABM with other modeling techniques at the level of emergent properties will further advance efforts to understand dynamics of social-environmental systems.

...read moreread less

Journal Article•DOI•

A multi-agent system for energy management of distributed power sources

[...]

Jeremy Lagorse¹, Damien Paire¹, Abdellatif Miraoui¹•Institutions (1)

Universite de technologie de Belfort-Montbeliard¹

01 Jan 2010-Renewable Energy

TL;DR: In this article, a distributed management solution based on the paradigm of multi-agent systems (MASs) is proposed in order to ensure better system reliability in a hybrid power source.

...read moreread less

Journal Article•DOI•

Distributed Q-Learning for Aggregated Interference Control in Cognitive Radio Networks

[...]

Ana Galindo-Serrano, Lorenza Giupponi

17 Feb 2010-IEEE Transactions on Vehicular Technology

TL;DR: A form of real-time multiagent reinforcement learning, which is known as decentralized Q-learning, is proposed to manage the aggregated interference generated by multiple WRAN systems.

...read moreread less

Abstract: This paper deals with the problem of aggregated interference generated by multiple cognitive radios (CRs) at the receivers of primary (licensed) users. In particular, we consider a secondary CR system based on the IEEE 802.22 standard for wireless regional area networks (WRANs), and we model it as a multiagent system where the multiple agents are the different secondary base stations in charge of controlling the secondary cells. We propose a form of real-time multiagent reinforcement learning, which is known as decentralized Q-learning, to manage the aggregated interference generated by multiple WRAN systems. We consider both situations of complete and partial information about the environment. By directly interacting with the surrounding environment in a distributed fashion, the multiagent system is able to learn, in the first case, an efficient policy to solve the problem and, in the second case, a reasonably good suboptimal policy. Computational and memory requirement considerations are also presented, discussing two different options for uploading and processing the learning information. Simulation results, which are presented for both the upstream and downstream cases, reveal that the proposed approach is able to fulfill the primary-user interference constraints, without introducing signaling overhead in the system.

...read moreread less

Journal Article•DOI•

ASPECS: an agent-oriented software process for engineering complex systems

[...]

Massimo Cossentino¹, Nicolas Gaud¹, Vincent Hilaire¹, Stéphane Galland¹, A. Koukam¹ - Show less +1 more•Institutions (1)

Universite de technologie de Belfort-Montbeliard¹

01 Mar 2010-Autonomous Agents and Multi-Agent Systems

TL;DR: This paper introduces an agent-oriented software process for engineering complex systems called ASPECS, based on a holonic organisational metamodel and provides a step-by-step guide from requirements to code allowing the modelling of a system at different levels of details using a set of refinement methods.

...read moreread less

Abstract: Holonic multiagent systems (HMAS) offer a promising software engineering approach for developing complex open software systems. However the process of building Multi-Agent Systems (MAS) and HMAS is mostly different from the process of building more traditional software systems as it introduces new design and development challenges. This paper introduces an agent-oriented software process for engineering complex systems called ASPECS. ASPECS is based on a holonic organisational metamodel and provides a step-by-step guide from requirements to code allowing the modelling of a system at different levels of details using a set of refinement methods. This paper details the entire ASPECS development process and provides a set of methodological guidelines for each process activity. A complete case study is also used to illustrate the design process and the associated notations. ASPECS uses UML as a modelling language. Because of the specific needs of agents and holonic organisational design, the UML semantics and notation are used as reference points, but they have been extended by introducing new specific profiles.

...read moreread less

Journal Article•DOI•

Agents applied in health care: A review

[...]

David Isern, David Sánchez, Antonio Moreno

01 Mar 2010-International Journal of Medical Informatics

TL;DR: A review of the literature (2002-2008) of applications of agents in healthcare collected from medical databases as well as international conferences finds a huge number of contributions in this area, showing a growing interest of researchers.

...read moreread less

Journal Article•DOI•

A Class of Bounded Distributed Control Strategies for Connectivity Preservation in Multi-Agent Systems

[...]

Amir Ajorlou¹, Ahmadreza Momeni¹, Amir G. Aghdam¹•Institutions (1)

Concordia University¹

02 Sep 2010-IEEE Transactions on Automatic Control

TL;DR: This technical note proposes a general class of distributed potential-based control laws with the connectivity preserving property for single-integrator agents designed in such a way that when an edge in the information flow graph is about to lose connectivity, the gradient of the potential function lies in the direction of that edge, aiming to shrink it.

...read moreread less

Abstract: This technical note proposes a general class of distributed potential-based control laws with the connectivity preserving property for single-integrator agents. The potential functions are designed in such a way that when an edge in the information flow graph is about to lose connectivity, the gradient of the potential function lies in the direction of that edge, aiming to shrink it. The results are developed for a static information flow graph first, and then are extended to the case of dynamic edge addition. Connectivity preservation for problems involving static leaders is covered as well. The potential functions are chosen to be smooth, resulting in bounded control inputs. Other constraints may also be imposed on the potential functions to satisfy various design criteria such as consensus, containment, and formation convergence. The effectiveness of the proposed control strategy is illustrated by simulation for examples of consensus and containment.

...read moreread less

Journal Article•DOI•

Instrumenting multi-agent organisations with organisational artifacts and agents

[...]

Jomi Fred Hübner, Olivier Boissier, Rosine Kitio, Alessandro Ricci¹•Institutions (1)

University of Bologna¹

01 May 2010-Autonomous Agents and Multi-Agent Systems

TL;DR: ORA4MAS (Organisational Artifacts for Multi-Agent Systems), a proposed approach aiming at these issues, introduces organisational artifacts as first class entities to instrument the organisation for supporting agents activities within it.

...read moreread less

Abstract: The social and organisational aspects of agency have led to a good amount of theoretical work in terms of formal models and theories However, the conception and engineering of proper organisational infrastructures embodying such models and theories are still an open issue The introduction of normative concerns with requirements of openness and adaptation stresses this issue The corresponding mechanisms for the current infrastructures appear to be not appropriate for managing distributed and open normative organisations There is still the need of proper abstractions and tools to facilitate application agents taking part in the monitoring of the organisation on one hand, and in the adaptation and definition of the organisation in which they are situated on the other hand In this paper we present and discuss ORA4MAS (Organisational Artifacts for Multi-Agent Systems), a proposed approach aiming at these issues Based on the Agents and Artifacts meta-model (A&A), it introduces organisational artifacts as first class entities to instrument the organisation for supporting agents activities within it

...read moreread less

Journal Article•DOI•

Urban traffic signal control using reinforcement learning agents

[...]

P.G. Balaji¹, X. German¹, Dipti Srinivasan¹•Institutions (1)

National University of Singapore¹

30 Aug 2010-Iet Intelligent Transport Systems

TL;DR: The proposed multi- agent reinforcement learning (RLA) signal control showed significant improvement in mean time delay and speed in comparison to other traffic control system like hierarchical multi-agent system (HMS), cooperative ensemble (CE) and actuated control.

...read moreread less

Abstract: This study presents a distributed multi-agent-based traffic signal control for optimising green timing in an urban arterial road network to reduce the total travel time and delay experienced by vehicles. The proposed multi-agent architecture uses traffic data collected by sensors at each intersection, stored historical traffic patterns and data communicated from agents in adjacent intersections to compute green time for a phase. The parameters like weights, threshold values used in computing the green time is fine tuned by online reinforcement learning with an objective to reduce overall delay. PARAMICS software was used as a platform to simulate 29 signalised intersection at Central Business District of Singapore and test the performance of proposed multi-agent traffic signal control for different traffic scenarios. The proposed multi-agent reinforcement learning (RLA) signal control showed significant improvement in mean time delay and speed in comparison to other traffic control system like hierarchical multi-agent system (HMS), cooperative ensemble (CE) and actuated control.

...read moreread less

Journal Article•DOI•

Stochastic Consensus Seeking With Noisy and Directed Inter-Agent Communication: Fixed and Randomly Varying Topologies

[...]

Minyi Huang¹, Jonathan H. Manton²•Institutions (2)

Carleton University¹, University of Melbourne²

01 Jan 2010-IEEE Transactions on Automatic Control

TL;DR: This work considers consensus seeking of networked agents on directed graphs where each agent has only noisy measurements of its neighbors' states and uses Stochastic approximation type algorithms to generalize the algorithm to networks with random link failures and prove convergence results.

...read moreread less

Abstract: We consider consensus seeking of networked agents on directed graphs where each agent has only noisy measurements of its neighbors' states. Stochastic approximation type algorithms are employed so that the individual states converge both in mean square and almost surely to the same limit. We further generalize the algorithm to networks with random link failures and prove convergence results.

...read moreread less

Journal Article•DOI•

Continuous-Time Average-Preserving Opinion Dynamics with Opinion-Dependent Communications

[...]

Vincent D. Blondel¹, Julien M. Hendrickx, John N. Tsitsiklis²•Institutions (2)

Université catholique de Louvain¹, Massachusetts Institute of Technology²

01 Sep 2010-Siam Journal on Control and Optimization

TL;DR: It is established that the continuum model accurately represents the asymptotic behavior of a system with a finite but large number of agents.

...read moreread less

Abstract: We study a simple continuous-time multiagent system related to Krause's model of opinion dynamics: each agent holds a real value, and this value is continuously attracted by every other value differing from it by less than 1, with an intensity proportional to the difference. We prove convergence to a set of clusters, with the agents in each cluster sharing a common value, and provide a lower bound on the distance between clusters at a stable equilibrium, under a suitable notion of multiagent system stability. To better understand the behavior of the system for a large number of agents, we introduce a variant involving a continuum of agents. We prove, under some conditions, the existence of a solution to the system dynamics, convergence to clusters, and a nontrivial lower bound on the distance between clusters. Finally, we establish that the continuum model accurately represents the asymptotic behavior of a system with a finite but large number of agents.

...read moreread less

Journal Article•DOI•

Decentralized Coordination in RoboCup Rescue

[...]

Sarvapali D. Ramchurn¹, Alessandro Farinelli¹, Alessandro Farinelli², Kathryn Macarthur¹, Nicholas R. Jennings¹ - Show less +1 more•Institutions (2)

University of Southampton¹, University of Verona²

01 Nov 2010-The Computer Journal

TL;DR: A novel decentralized solution to the coalition formation process that pervades disaster management is provided using the state-of-the-art Max-Sum algorithm that provides a completely decentralized message-passing solution and a novel algorithm (F-Max-Sum) that avoids sending redundant messages and efficiently adapts to changes in the environment.

...read moreread less

Abstract: Emergency responders are faced with a number of significant challenges when managing major disasters. First, the number of rescue tasks posed is usually larger than the number of responders (or agents) and the resources available to them. Second, each task is likely to require a different level of effort in order to be completed by its deadline. Third, new tasks may continually appear or disappear from the environment, thus requiring the responders to quickly recompute their allocation of resources. Fourth, forming teams or coalitions of multiple agents from different agencies is vital since no single agency will have all the resources needed to save victims, unblock roads and extinguish the fires which might erupt in the disaster space. Given this, coalitions have to be efficiently selected and scheduled to work across the disaster space so as to maximize the number of lives and the portion of the infrastructure saved. In particular, it is important that the selection of such coalitions should be performed in a decentralized fashion in order to avoid a single point of failure in the system. Moreover, it is critical that responders communicate only locally given they are likely to have limited battery power or minimal access to long-range communication devices. Against this background, we provide a novel decentralized solution to the coalition formation process that pervades disaster management. More specifically, we model the emergency management scenario defined in the RoboCup Rescue disaster simulation platform as a coalition formation with spatial and temporal constraints (CFST) problem where agents form coalitions to complete tasks, each with different demands. To design a decentralized algorithm for CFST, we formulate it as a distributed constraint optimization problem and show how to solve it using the state-of-the-art Max-Sum algorithm that provides a completely decentralized message-passing solution. We then provide a novel algorithm (F-Max-Sum) that avoids sending redundant messages and efficiently adapts to changes in the environment. In empirical evaluations, our algorithm is shown to generate better solutions than other decentralized algorithms used for this problem.

...read moreread less

Book Chapter•DOI•

An agent-based approach to care in independent living

[...]

Boštjan Kaluža¹, Violeta Mirchevska, Erik Dovgan¹, Mitja Luštrek¹, Matjaž Gams¹ - Show less +1 more•Institutions (1)

Jožef Stefan Institute¹

10 Nov 2010

TL;DR: In this paper, a multi-agent system for the care of elderly people living at home on their own, with the aim to prolong their independence, is presented, composed of seven groups of agents providing a reliable, robust and flexible monitoring by sensing the user in the environment, reconstructing the position and posture to create the physical awareness of the user.

...read moreread less

Abstract: This paper presents a multi-agent system for the care of elderly people living at home on their own, with the aim to prolong their independence. The system is composed of seven groups of agents providing a reliable, robust and flexible monitoring by sensing the user in the environment, reconstructing the position and posture to create the physical awareness of the user in the environment, reacting to critical situations, calling for help in the case of an emergency, and issuing warnings if unusual behavior is detected. The system has been tested during several on-line demonstrations.

...read moreread less

Journal Article•DOI•

Controllability analysis of multi-agent systems using relaxed equitable partitions

[...]

Simone Martini, Magnus Egerstedt¹, Antonio Bicchi•Institutions (1)

Georgia Institute of Technology¹

01 Jan 2010-International Journal of Systems, Control and Communications

TL;DR: In this paper, a graph theoretic interpretation of the controllability properties through the relaxed equitable partition is given, and the main result is a necessary and sufficient condition for the control of such systems in terms of the graph topology.

...read moreread less

Abstract: This paper investigates how to make decentralised networks, amenable to external control, i.e., how to ensure that they are appropriately organised so that they can be effectively 'reprogrammed'. In particular, we study networked systems whose interaction dynamics are given by a nearest-neighbour averaging rule, with one leader node providing the control input to the entire system. The main result is a necessary and sufficient condition for the controllability of such systems in terms of the graph topology. In particular, we give a graph theoretic interpretation of the controllability properties through the so-called relaxed equitable partition.

...read moreread less

Journal Article•DOI•

Asynchronous Distributed Optimization With Event-Driven Communication

[...]

Minyi Zhong¹, Christos G. Cassandras¹•Institutions (1)

Boston University¹

03 May 2010-IEEE Transactions on Automatic Control

TL;DR: An asynchronous (event-driven) optimization scheme is proposed that limits communication to instants when some state estimation error function at a node exceeds a threshold and it is proved that, under certain conditions, such convergence is guaranteed when communication delays are negligible.

...read moreread less

Abstract: We consider problems where multiple agents cooperate to control their individual state so as to optimize a common objective while communicating with each other to exchange state information. Since communication costs can be significant, especially when the agents are wireless devices with limited energy, we seek conditions under which communication of state information among nodes can be restricted while still ensuring that the optimization process converges. We propose an asynchronous (event-driven) optimization scheme that limits communication to instants when some state estimation error function at a node exceeds a threshold and prove that, under certain conditions, such convergence is guaranteed when communication delays are negligible. We subsequently extend the analysis to include communication delays as long as they are bounded. We apply this approach to a sensor network coverage control problem where the objective is to maximize the probability of detecting events occurring in a given region and show that the proposed asynchronous approach may significantly reduce communication costs, hence also prolonging the system's lifetime, without any performance degradation.

...read moreread less

Journal Article•DOI•

Integrated process planning and scheduling by an agent-based ant colony optimization

[...]

Chi Wah Leung¹, T. N. Wong¹, Kai-Ling Mak¹, Richard Y. K. Fung²•Institutions (2)

University of Hong Kong¹, City University of Hong Kong²

01 Aug 2010-Computers & Industrial Engineering

TL;DR: The experimental results indicate that the ACO algorithm can effectively solve the IPPS problems and the agent-based implementation can provide a distributive computation of the algorithm.

...read moreread less

Collapse