Showing papers in "arXiv: Computer Science and Game Theory in 2019"

PDF

Open Access

Posted Content•

A Motivational Game-Theoretic Approach for Peer-to-Peer Energy Trading in the Smart Grid

[...]

Wayes Tushar¹, Tapan Kumar Saha¹, Chau Yuen², Thomas Morstyn³, Malcolm McCulloch³, H. Vincent Poor⁴, Kristin L. Wood² - Show less +3 more•Institutions (4)

University of Queensland¹, Singapore University of Technology and Design², University of Oxford³, Princeton University⁴

10 Mar 2019-arXiv: Computer Science and Game Theory

TL;DR: In this paper, a game-theoretic peer-to-peer energy trading scheme is developed, its relevant properties are studied, and it is shown that the coalition among different prosumers is a stable coalition.

...read moreread less

Abstract: Peer-to-peer trading in energy networks is expected to be exclusively conducted by the prosumers of the network with negligible influence from the grid. This raises the critical question: how can enough prosumers be encouraged to participate in peer-to-peer trading so as to make its operation sustainable and beneficial to the overall electricity network? To this end, this paper proposes how a motivational psychology framework can be used effectively to design peer-to-peer energy trading to increase user participation. To do so, first, the state-of-the-art of peer-to-peer energy trading literature is discussed by following a systematic classification, and gaps in existing studies are identified. Second, a motivation psychology framework is introduced, which consists of a number of motivational models that a prosumer needs to satisfy before being convinced to participate in energy trading. Third, a game-theoretic peer-to-peer energy trading scheme is developed, its relevant properties are studied, and it is shown that the coalition among different prosumers is a stable coalition. Fourth, through numerical case studies, it is shown that the proposed model can reduce carbon emissions by 18.38% and 9.82% in a single day in Summer and Winter respectively compared to a feed-in-tariff scheme. The proposed scheme is also shown to reduce the cost of energy up to 118 cents and 87 cents per day in Summer and Winter respectively. Finally, how the outcomes of the scheme satisfy all the motivational psychology models is discussed, which subsequently shows its potential to attract users to participate in energy trading.

...read moreread less

88 citations

Posted Content•

Convergence of Learning Dynamics in Stackelberg Games.

[...]

Tanner Fiez¹, Benjamin Chasnov¹, Lillian J. Ratliff¹•Institutions (1)

University of Washington¹

04 Jun 2019-arXiv: Computer Science and Game Theory

TL;DR: This paper develops a gradient-based update for the leader while the follower employs a best response strategy for which each stable critical point is guaranteed to be a Stackelberg equilibrium in zero-sum games and proposes a two-timescale algorithm with similar asymptotic convergence guarantees.

...read moreread less

Abstract: This paper investigates the convergence of learning dynamics in Stackelberg games. In the class of games we consider, there is a hierarchical game being played between a leader and a follower with continuous action spaces. We establish a number of connections between the Nash and Stackelberg equilibrium concepts and characterize conditions under which attracting critical points of simultaneous gradient descent are Stackelberg equilibria in zero-sum games. Moreover, we show that the only stable critical points of the Stackelberg gradient dynamics are Stackelberg equilibria in zero-sum games. Using this insight, we develop a gradient-based update for the leader while the follower employs a best response strategy for which each stable critical point is guaranteed to be a Stackelberg equilibrium in zero-sum games. As a result, the learning rule provably converges to a Stackelberg equilibria given an initialization in the region of attraction of a stable critical point. We then consider a follower employing a gradient-play update rule instead of a best response strategy and propose a two-timescale algorithm with similar asymptotic convergence guarantees. For this algorithm, we also provide finite-time high probability bounds for local convergence to a neighborhood of a stable Stackelberg equilibrium in general-sum games. Finally, we present extensive numerical results that validate our theory, provide insights into the optimization landscape of generative adversarial networks, and demonstrate that the learning dynamics we propose can effectively train generative adversarial networks.

...read moreread less

71 citations

Posted Content•

A Survey on Applications of Game Theory in Blockchain

[...]

Ziyao Liu¹, Nguyen Cong Luong¹, Wenbo Wang¹, Dusit Niyato¹, Ping Wang², Ying-Chang Liang, Dong In Kim³ - Show less +3 more•Institutions (3)

Nanyang Technological University¹, York University², Sungkyunkwan University³

28 Feb 2019-arXiv: Computer Science and Game Theory

TL;DR: This survey aims to fill the gap between the large number of studies on blockchain network, where game theory emerges as an analytical tool, and the lack of a comprehensive survey on the game theoretical approaches applied in blockchain related issues.

...read moreread less

Abstract: In the past decades, the blockchain technology has attracted tremendous attention from both academia and industry. The popularity of blockchain networks was originated from a crypto-currency to serve as a decentralized and tamperproof transaction data ledger. Nowadays, blockchain as the key framework in the decentralized public data-ledger, has been applied to a wide range of scenarios far beyond crypto-currencies, such as Internet of Things (IoT), healthcare, and insurance. This survey aims to fill the gap between the large number of studies on blockchain network, where game theory emerges as an analytical tool, and the lack of a comprehensive survey on the game theoretical approaches applied in blockchain related issues. In this paper, we review game models proposed to address common issues in the blockchain network. The issues include security issues, e.g., selfish mining, majority attack and Denial of Service (DoS) attack, issues regard mining management, e.g., computational power allocation, reward allocation, and pool selection, as well as issues regarding blockchain economic and energy trading. Additionally, we discuss advantages and disadvantages of these selected game models and solutions. Finally, we highlight important challenges and future research directions of applying game theoretical approaches to incentive mechanism design, and the combination of blockchain with other technologies.

...read moreread less

64 citations

Posted Content•

Toward an Automated Auction Framework for Wireless Federated Learning Services Market

[...]

Yutao Jiao¹, Ping Wang², Dusit Niyato³, Bin Lin⁴, Dong In Kim⁵ - Show less +1 more•Institutions (5)

Penn State College of Communications¹, York University², Nanyang Technological University³, Dalian Maritime University⁴, Sungkyunkwan University⁵

13 Dec 2019-arXiv: Computer Science and Game Theory

TL;DR: This paper proposes an auction based market model for incentivizing data owners to participate in federated learning and designs an approximate strategy-proof mechanism which guarantees the truthfulness, individual rationality, and computational efficiency.

...read moreread less

Abstract: In traditional machine learning, the central server first collects the data owners' private data together and then trains the model. However, people's concerns about data privacy protection are dramatically increasing. The emerging paradigm of federated learning efficiently builds machine learning models while allowing the private data to be kept at local devices. The success of federated learning requires sufficient data owners to jointly utilize their data, computing and communication resources for model training. In this paper, we propose an auction based market model for incentivizing data owners to participate in federated learning. We design two auction mechanisms for the federated learning platform to maximize the social welfare of the federated learning services market. Specifically, we first design an approximate strategy-proof mechanism which guarantees the truthfulness, individual rationality, and computational efficiency. To improve the social welfare, we develop an automated strategy-proof mechanism based on deep reinforcement learning and graph neural networks. The communication traffic congestion and the unique characteristics of federated learning are particularly considered in the proposed model. Extensive experimental results demonstrate that our proposed auction mechanisms can efficiently maximize the social welfare and provide effective insights and strategies for the platform to organize the federated training.

...read moreread less

62 citations

Posted Content•

The Shapley Taylor Interaction Index

[...]

Kedar Dhamdhere, Ashish Agarwal, Mukund Sundararajan

14 Feb 2019-arXiv: Computer Science and Game Theory

TL;DR: The Shapley-Taylor index as mentioned in this paper is a generalization of the Shapley value that attributes the model's prediction to interactions of subsets of features up to some size k. The method is analogous to how the truncated Taylor Series decomposes the function value at a certain point using its derivatives at a different point.

...read moreread less

Abstract: The attribution problem, that is the problem of attributing a model's prediction to its base features, is well-studied. We extend the notion of attribution to also apply to feature interactions. The Shapley value is a commonly used method to attribute a model's prediction to its base features. We propose a generalization of the Shapley value called Shapley-Taylor index that attributes the model's prediction to interactions of subsets of features up to some size k. The method is analogous to how the truncated Taylor Series decomposes the function value at a certain point using its derivatives at a different point. In fact, we show that the Shapley Taylor index is equal to the Taylor Series of the multilinear extension of the set-theoretic behavior of the model. We axiomatize this method using the standard Shapley axioms -- linearity, dummy, symmetry and efficiency -- and an additional axiom that we call the interaction distribution axiom. This new axiom explicitly characterizes how interactions are distributed for a class of functions that model pure interaction. We contrast the Shapley-Taylor index against the previously proposed Shapley Interaction index (cf. [9]) from the cooperative game theory literature. We also apply the Shapley Taylor index to three models and identify interesting qualitative insights.

...read moreread less

57 citations

Posted Content•

The Economics of Social Data

[...]

Dirk Bergemann¹, Alessandro Bonatti², Tan Gan¹•Institutions (2)

Yale University¹, Massachusetts Institute of Technology²

01 Jan 2019-arXiv: Computer Science and Game Theory

TL;DR: The intermediary's optimal data policy is derived and it is shown that it preserves the privacy of the consumers' identities while providing precise information about market demand to the firms.

...read moreread less

Abstract: A data intermediary pays consumers for information about their preferences and sells the information so acquired to firms that use it to tailor their products and prices. The social dimension of the individual data---whereby an individual's data are predictive of the behavior of others---generates a data externality that reduces the intermediary's cost of acquiring information. We derive the intermediary's optimal data policy and show that it preserves the privacy of the consumers' identities while providing precise information about market demand to the firms. This enables the intermediary to capture the entire value of information as the number of consumers grows large.

...read moreread less

54 citations

Posted Content•

An Algorithmic Framework for Approximating Maximin Share Allocation of Chores

[...]

Xin Huang¹, Pinyan Lu²•Institutions (2)

Technion – Israel Institute of Technology¹, Shanghai University of Finance and Economics²

10 Jul 2019-arXiv: Computer Science and Game Theory

TL;DR: With this algorithm, the problem of fairly dividing m indivisible chores among n agents is considered and it is shown that there always exists a 4/3-approximation maximin share allocation for any instance.

...read moreread less

Abstract: In this paper, we consider the problem of how to fairly dividing $m$ indivisible chores among $n$ agents. The fairness measure we considered here is the maximin share. The previous best known result is that there always exists a $\frac{4}{3}$ approximation maximin share allocation. With a novel algorithm, we can always find a $\frac{11}{9}$ approximation maximin share allocation for any instances. We also discuss how to improve the efficiency of the algorithm and its connection to the job scheduling problem.

...read moreread less

49 citations

Posted Content•

What Do Multiwinner Voting Rules Do? An Experiment Over the Two-Dimensional Euclidean Domain

[...]

Edith Elkind¹, Piotr Faliszewski, Jean-François Laslier², Piotr Skowron¹, Arkadii Slinko³, Nimrod Talmon⁴ - Show less +2 more•Institutions (4)

University of Oxford¹, Paris School of Economics², University of Auckland³, Weizmann Institute of Science⁴

26 Jan 2019-arXiv: Computer Science and Game Theory

TL;DR: In this article, the authors visualize aggregate outputs of popular multiwinner voting rules (SNTV, STV, Bloc, k-Borda, Monroe, Chamberlin--Courant, and HarmonicBordas) for elections generated according to the two-dimensional Euclidean model.

...read moreread less

Abstract: We visualize aggregate outputs of popular multiwinner voting rules--SNTV, STV, Bloc, k-Borda, Monroe, Chamberlin--Courant, and HarmonicBorda--for elections generated according to the two-dimensional Euclidean model. We consider three applications of multiwinner voting, namely, parliamentary elections, portfolio/movie selection, and shortlisting, and use our results to understand which of our rules seem to be best suited for each application. In particular, we show that STV (one of the few nontrivial rules used in real high-stake elections) exhibits excellent performance, whereas the Bloc rule (also often used in practice) performs poorly.

...read moreread less

48 citations

Posted Content•

A polynomial-time algorithm for computing a Pareto optimal and almost proportional allocation

[...]

Haris Aziz¹, Hervé Moulin², Fedor Sandomirskiy³, Fedor Sandomirskiy⁴•Institutions (4)

University of New South Wales¹, University of Glasgow², National Research University – Higher School of Economics³, Technion – Israel Institute of Technology⁴

02 Sep 2019-arXiv: Computer Science and Game Theory

TL;DR: It is shown that there exists a strongly polynomial-time algorithm that always computes an allocation satisfying Pareto optimality and proportionality up to one item even if the utilities are mixed and the agents have asymmetric weights.

...read moreread less

Abstract: We consider fair allocation of indivisible items under additive utilities. When the utilities can be negative, the existence and complexity of an allocation that satisfies Pareto optimality and proportionality up to one item (PROP1) is an open problem. We show that there exists a strongly polynomial-time algorithm that always computes an allocation satisfying Pareto optimality and proportionality up to one item even if the utilities are mixed and the agents have asymmetric weights. We point out that the result does not hold if either of Pareto optimality or PROP1 is replaced with slightly stronger concepts.

...read moreread less

41 citations

Posted Content•

Proportionality and the Limits of Welfarism

[...]

Dominik Peters¹, Piotr Skowron²•Institutions (2)

Carnegie Mellon University¹, University of Warsaw²

26 Nov 2019-arXiv: Computer Science and Game Theory

TL;DR: An attractive committee rule is introduced which satisfies a property intermediate between the core and extended justified representation (EJR), and is computable in polynomial time, and it is shown that the new rule provides a logarithmic approximation to the core.

...read moreread less

Abstract: We study two influential voting rules proposed in the 1890s by Phragmen and Thiele, which elect a committee or parliament of k candidates which proportionally represents the voters. Voters provide their preferences by approving an arbitrary number of candidates. Previous work has proposed proportionality axioms satisfied by Thiele's rule (now known as Proportional Approval Voting, PAV) but not by Phragmen's rule. By proposing two new proportionality axioms (laminar proportionality and priceability) satisfied by Phragmen but not Thiele, we show that the two rules achieve two distinct forms of proportional representation. Phragmen's rule ensures that all voters have a similar amount of influence on the committee, and Thiele's rule ensures a fair utility distribution. Thiele's rule is a welfarist voting rule (one that maximizes a function of voter utilities). We show that no welfarist rule can satisfy our new axioms, and we prove that no such rule can satisfy the core. Conversely, some welfarist fairness properties cannot be guaranteed by Phragmen-type rules. This formalizes the difference between the two types of proportionality. We then introduce an attractive committee rule which satisfies a property intermediate between the core and extended justified representation (EJR). It satisfies laminar proportionality, priceability, and is computable in polynomial time. We show that our new rule provides a logarithmic approximation to the core. On the other hand, PAV provides a factor-2 approximation to the core, and this factor is optimal for rules that are fair in the sense of the Pigou--Dalton principle.

...read moreread less

37 citations

Book Chapter•DOI•

Dynamic Bayesian Games for Adversarial and Defensive Cyber Deception

[...]

Linan Huang¹, Quanyan Zhu¹•Institutions (1)

New York University¹

01 Jan 2019-arXiv: Computer Science and Game Theory

TL;DR: It is essential to design up-to-date security mechanisms that can mitigate the risks despite the successful infiltration and the strategic response of sophisticated attackers.

...read moreread less

Abstract: Security challenges accompany the efficiency The pervasive integration of information and communications technologies (ICTs) makes cyber-physical systems vulnerable to targeted attacks that are deceptive, persistent, adaptive, and strategic Attack instances such as Stuxnet, Dyn, and WannaCry ransomware have shown the insufficiency of off-the-shelf defensive methods including the firewall and intrusion detection systems Hence, it is essential to design up-to-date security mechanisms that can mitigate the risks despite the successful infiltration and the strategic response of sophisticated attackers

...read moreread less

Posted Content•

Evaluating Reputation Management Schemes of Internet of Vehicles based on Evolutionary Game Theory

[...]

Zhihong Tian¹, Xiangsong Gao, Shen Su¹, Jing Qiu¹, Xiaojiang Du², Mohsen Guizani³ - Show less +2 more•Institutions (3)

Guangzhou University¹, Temple University², Qatar University³

12 Feb 2019-arXiv: Computer Science and Game Theory

TL;DR: Zhang et al. as discussed by the authors applied evolutionary game theory to model the evolution process of malicious users' attacking strategies, and discussed the methodology of the evaluation simulations, and further applied their evaluation method to a reputation management scheme with multiple utility functions, and discuss the evaluation results.

...read moreread less

Abstract: Conducting reputation management is very important for Internet of vehicles. However, most of the existing researches evaluate the effectiveness of their schemes with settled attacking behaviors in their simulation which cannot represent the scenarios in reality. In this paper, we propose to consider dynamical and diversity attacking strategies in the simulation of reputation management scheme evaluation. To that end, we apply evolutionary game theory to model the evolution process of malicious users' attacking strategies, and discuss the methodology of the evaluation simulations. We further apply our evaluation method to a reputation management scheme with multiple utility functions, and discuss the evaluation results. The results indicate that our evaluation method is able to depict the evolving process of the dynamic attacking strategies in a vehicular network, and the final state of the simulation could be used to quantify the protection effectiveness of the reputation management scheme.

...read moreread less

Journal Article•DOI•

Incomplete Preferences in Single-Peaked Electorates

[...]

Zack Fitzsimmons¹, Martin Lackner²•Institutions (2)

College of the Holy Cross¹, Vienna University of Technology²

01 Jul 2019-arXiv: Computer Science and Game Theory

TL;DR: It is proved that the problem of recognizing single-peakedness is NP-complete for incomplete profiles consisting of partial orders, and several polynomial-time algorithms for reasonably restricted settings are found.

...read moreread less

Abstract: Incomplete preferences are likely to arise in real-world preference aggregation scenarios. This paper deals with determining whether an incomplete preference profile is single-peaked. This is valuable information since many intractable voting problems become tractable given single-peaked preferences. We prove that the problem of recognizing single-peakedness is \NP-complete for incomplete profiles consisting of partial orders. Despite this intractability result, we find several polynomial-time algorithms for reasonably restricted settings. In particular, we give polynomial-time recognition algorithms for weak orders, which can be viewed as preferences with indifference.

...read moreread less

Posted Content•

The Disparate Equilibria of Algorithmic Decision Making when Individuals Invest Rationally

[...]

Lydia T. Liu¹, Ashia C. Wilson, Nika Haghtalab, Adam Tauman Kalai, Christian Borgs, Jennifer Chayes - Show less +2 more•Institutions (1)

University of California, Berkeley¹

04 Oct 2019-arXiv: Computer Science and Game Theory

TL;DR: In this article, the authors study a dynamic learning setting where individuals invest in a positive outcome based on their group's expected gain and the decision rule is updated to maximize institutional benefit.

...read moreread less

Abstract: The long-term impact of algorithmic decision making is shaped by the dynamics between the deployed decision rule and individuals' response. Focusing on settings where each individual desires a positive classification---including many important applications such as hiring and school admissions, we study a dynamic learning setting where individuals invest in a positive outcome based on their group's expected gain and the decision rule is updated to maximize institutional benefit. By characterizing the equilibria of these dynamics, we show that natural challenges to desirable long-term outcomes arise due to heterogeneity across groups and the lack of realizability. We consider two interventions, decoupling the decision rule by group and subsidizing the cost of investment. We show that decoupling achieves optimal outcomes in the realizable case but has discrepant effects that may depend on the initial conditions otherwise. In contrast, subsidizing the cost of investment is shown to create better equilibria for the disadvantaged group even in the absence of realizability.

...read moreread less

Posted Content•

Probabilistic Smart Contracts: Secure Randomness on the Blockchain

[...]

Krishnendu Chatterjee¹, Amir Kafshdar Goharshady¹, Arash Pourdamghani²•Institutions (2)

Institute of Science and Technology Austria¹, Sharif University of Technology²

21 Feb 2019-arXiv: Computer Science and Game Theory

TL;DR: In this article, a game-theoretic approach for generating provably unmanipulatable pseudorandom numbers on the blockchain is proposed, which allows smart contracts to access a trustworthy source of randomness that does not rely on potentially compromised miners or oracles.

...read moreread less

Abstract: In today's programmable blockchains, smart contracts are limited to being deterministic and non-probabilistic. This lack of randomness is a consequential limitation, given that a wide variety of real-world financial contracts, such as casino games and lotteries, depend entirely on randomness. As a result, several ad-hoc random number generation approaches have been developed to be used in smart contracts. These include ideas such as using an oracle or relying on the block hash. However, these approaches are manipulatable, i.e. their output can be tampered with by parties who might not be neutral, such as the owner of the oracle or the miners. We propose a novel game-theoretic approach for generating provably unmanipulatable pseudorandom numbers on the blockchain. Our approach allows smart contracts to access a trustworthy source of randomness that does not rely on potentially compromised miners or oracles, hence enabling the creation of a new generation of smart contracts that are not limited to being non-probabilistic and can be drawn from the much more general class of probabilistic programs.

...read moreread less

Posted Content•

Single Deep Counterfactual Regret Minimization

[...]

Eric Steinberger

25 Sep 2019-arXiv: Computer Science and Game Theory

TL;DR: Single Deep CFR is introduced, a simplified variant of Deep CFR that has a lower overall approximation error by avoiding the training of an average strategy network and is more attractive from a theoretical perspective and empirically outperforms Deep CFR with respect to exploitability and one-on-one play in poker.

...read moreread less

Abstract: Counterfactual Regret Minimization (CFR) is the most successful algorithm for finding approximate Nash equilibria in imperfect information games. However, CFR's reliance on full game-tree traversals limits its scalability. For this reason, the game's state- and action-space is often abstracted (i.e. simplified) for CFR, and the resulting strategy is then translated back to the full game, which requires extensive expert-knowledge and often converges to highly exploitable policies. A recently proposed method, Deep CFR, applies deep learning directly to CFR, allowing the agent to intrinsically abstract and generalize over the state-space from samples, without requiring expert knowledge. In this paper, we introduce Single Deep CFR (SD-CFR), a simplified variant of Deep CFR that has a lower overall approximation error by avoiding the training of an average strategy network. We show that SD-CFR is more attractive from a theoretical perspective and empirically outperforms Deep CFR with respect to exploitability and one-on-one play in poker.

...read moreread less

Posted Content•

Algorithms for Competitive Division of Chores

[...]

Simina Brânzei, Fedor Sandomirskiy¹•Institutions (1)

National Research University – Higher School of Economics¹

03 Jul 2019-arXiv: Computer Science and Game Theory

TL;DR: It is shown that all the outcomes of the competitive rule for chores can be computed in strongly polynomial time if either the number of agents or thenumber of chores is fixed.

...read moreread less

Abstract: We study the problem of allocating divisible bads (chores) among multiple agents with additive utilities, when money transfers are not allowed. The competitive rule is known to be the best mechanism for goods with additive utilities and was recently extended to chores by Bogomolnaia et al (2017). For both goods and chores, the rule produces Pareto optimal and envy-free allocations. In the case of goods, the outcome of the competitive rule can be easily computed. Competitive allocations solve the Eisenberg-Gale convex program; hence the outcome is unique and can be approximately found by standard gradient methods. An exact algorithm that runs in polynomial time in the number of agents and goods was given by Orlin. In the case of chores, the competitive rule does not solve any convex optimization problem; instead, competitive allocations correspond to local minima, local maxima, and saddle points of the Nash Social Welfare on the Pareto frontier of the set of feasible utilities. The rule becomes multivalued and none of the standard methods can be applied to compute its outcome. In this paper, we show that all the outcomes of the competitive rule for chores can be computed in strongly polynomial time if either the number of agents or the number of chores is fixed. The approach is based on a combination of three ideas: all consumption graphs of Pareto optimal allocations can be listed in polynomial time; for a given consumption graph, a candidate for a competitive allocation can be constructed via explicit formula; and a given allocation can be checked for being competitive using a maximum flow computation as in Devanur et al (2002). Our algorithm immediately gives an approximately-fair allocation of indivisible chores by the rounding technique of Barman and Krishnamurthy (2018).

...read moreread less

Posted Content•

Schelling Games on Graphs

[...]

Edith Elkind¹, Jiarui Gan¹, Ayumi Igarashi², Warut Suksompong¹, Alexandros A. Voudouris¹ - Show less +1 more•Institutions (2)

University of Oxford¹, University of Tokyo²

21 Feb 2019-arXiv: Computer Science and Game Theory

TL;DR: In this paper, the authors consider the existence of equilibria in these games, study the complexity of finding an equilibrium outcome or an outcome with high social welfare, and also provide upper and lower bounds on the price of anarchy and stability.

...read moreread less

Abstract: We consider strategic games that are inspired by Schelling's model of residential segregation. In our model, the agents are partitioned into k types and need to select locations on an undirected graph. Agents can be either stubborn, in which case they will always choose their preferred location, or strategic, in which case they aim to maximize the fraction of agents of their own type in their neighborhood. We investigate the existence of equilibria in these games, study the complexity of finding an equilibrium outcome or an outcome with high social welfare, and also provide upper and lower bounds on the price of anarchy and stability. Some of our results extend to the setting where the preferences of the agents over their neighbors are defined by a social network rather than a partition into types.

...read moreread less

Posted Content•

Envy-freeness up to any item with high Nash welfare: The virtue of donating items

[...]

Ioannis Caragiannis¹, Nick Gravin², Xin Huang³•Institutions (3)

University of Patras¹, Shanghai University of Finance and Economics², The Chinese University of Hong Kong³

12 Feb 2019-arXiv: Computer Science and Game Theory

TL;DR: In this article, the authors show that for every instance with additive valuations, there is an EFX allocation of a subset of items with a Nash welfare that is at least half of the maximum possible Nash welfare for the original set of items.

...read moreread less

Abstract: Several fairness concepts have been proposed recently in attempts to approximate envy-freeness in settings with indivisible goods. Among them, the concept of envy-freeness up to any item (EFX) is arguably the closest to envy-freeness. Unfortunately, EFX allocations are not known to exist except in a few special cases. We make significant progress in this direction. We show that for every instance with additive valuations, there is an EFX allocation of a subset of items with a Nash welfare that is at least half of the maximum possible Nash welfare for the original set of items. That is, after donating some items to a charity, one can distribute the remaining items in a fair way with high efficiency. This bound is proved to be best possible. Our proof is constructive and highlights the importance of maximum Nash welfare allocation. Starting with such an allocation, our algorithm decides which items to donate and redistributes the initial bundles to the agents, eventually obtaining an allocation with the claimed efficiency guarantee. The application of our algorithm to large markets, where the valuations of an agent for every item is relatively small, yields EFX with almost optimal Nash welfare. To the best of our knowledge, this is the first use of large market assumptions in the fair division literature. We also show that our algorithm can be modified to compute, in polynomial-time, EFX allocations that approximate optimal Nash welfare within a factor of at most $2\rho$, using a $\rho$-approximate allocation on input instead of the maximum Nash welfare one.

...read moreread less

Posted Content•

Multi-Agent Learning in Network Zero-Sum Games is a Hamiltonian System

[...]

James P. Bailey¹, Georgios Piliouras¹•Institutions (1)

Singapore University of Technology and Design¹

05 Mar 2019-arXiv: Computer Science and Game Theory

TL;DR: This work establishes a formal and robust connection between multi-agent systems and Hamiltonian dynamics -- the same dynamics that describe conservative systems in physics and provides a type of a Rosetta stone that helps to translate results and techniques between online optimization, convex analysis, games theory, and physics.

...read moreread less

Abstract: Zero-sum games are natural, if informal, analogues of closed physical systems where no energy/utility can enter or exit. This analogy can be extended even further if we consider zero-sum network (polymatrix) games where multiple agents interact in a closed economy. Typically, (network) zero-sum games are studied from the perspective of Nash equilibria. Nevertheless, this comes in contrast with the way we typically think about closed physical systems, e.g., Earth-moon systems which move perpetually along recurrent trajectories of constant energy. We establish a formal and robust connection between multi-agent systems and Hamiltonian dynamics -- the same dynamics that describe conservative systems in physics. Specifically, we show that no matter the size, or network structure of such closed economies, even if agents use different online learning dynamics from the standard class of Follow-the-Regularized-Leader, they yield Hamiltonian dynamics. This approach generalizes the known connection to Hamiltonians for the special case of replicator dynamics in two agent zero-sum games developed by Hofbauer. Moreover, our results extend beyond zero-sum settings and provide a type of a Rosetta stone (see e.g. Table 1) that helps to translate results and techniques between online optimization, convex analysis, games theory, and physics.

...read moreread less

Journal Article•DOI•

The Price of Fairness for Indivisible Goods

[...]

Xiaohui Bei¹, Xinhang Lu¹, Pasin Manurangsi², Warut Suksompong³•Institutions (3)

Nanyang Technological University¹, University of California, Berkeley², National University of Singapore³

13 May 2019-arXiv: Computer Science and Game Theory

TL;DR: This paper investigates the efficiency of fair allocations of indivisible goods using the well-studied price of fairness concept, and provides tight or asymptotically tight bounds on the worst-case efficiency loss for allocations satisfying notions with guaranteed existence, including envy-freeness up to one good, balancedness, maximum Nash welfare, and leximin.

...read moreread less

Abstract: We investigate the efficiency of fair allocations of indivisible goods using the well-studied price of fairness concept. Previous work has focused on classical fairness notions such as envy-freeness, proportionality, and equitability. However, these notions cannot always be satisfied for indivisible goods, leading to certain instances being ignored in the analysis. In this paper, we focus instead on notions with guaranteed existence, including envy-freeness up to one good (EF1), balancedness, maximum Nash welfare (MNW), and leximin. We also introduce the concept of strong price of fairness, which captures the efficiency loss in the worst fair allocation as opposed to that in the best fair allocation as in the price of fairness. We mostly provide tight or asymptotically tight bounds on the worst-case efficiency loss for allocations satisfying these notions, for both the price of fairness and the strong price of fairness.

...read moreread less

Proceedings Article•DOI•

Bid Optimization by Multivariable Control in Display Advertising

[...]

Xun Yang¹, Yasong Li¹, Hao Wang¹, Di Wu¹, Qing Tan¹, Jian Xu¹, Kun Gai¹ - Show less +3 more•Institutions (1)

Alibaba Group¹

24 May 2019-arXiv: Computer Science and Game Theory

TL;DR: In this article, the authors proposed a primal-dual method to derive the optimal bidding strategy for real-time RTB campaigns, where advertisers aim to maximize the quantity of conversions, and set cost-per-click (CPC) as a KPI constraint.

...read moreread less

Abstract: Real-Time Bidding (RTB) is an important paradigm in display advertising, where advertisers utilize extended information and algorithms served by Demand Side Platforms (DSPs) to improve advertising performance. A common problem for DSPs is to help advertisers gain as much value as possible with budget constraints. However, advertisers would routinely add certain key performance indicator (KPI) constraints that the advertising campaign must meet due to practical reasons. In this paper, we study the common case where advertisers aim to maximize the quantity of conversions, and set cost-per-click (CPC) as a KPI constraint. We convert such a problem into a linear programming problem and leverage the primal-dual method to derive the optimal bidding strategy. To address the applicability issue, we propose a feedback control-based solution and devise the multivariable control system. The empirical study based on real-word data from this http URL verifies the effectiveness and superiority of our approach compared with the state of the art in the industry practices.

...read moreread less

Posted Content•

Almost Group Envy-free Allocation of Indivisible Goods and Chores

[...]

Haris Aziz¹, Simon Rey²•Institutions (2)

University of New South Wales¹, University of Amsterdam²

16 Jul 2019-arXiv: Computer Science and Game Theory

TL;DR: This work takes the group envy-freeness concept that is well-established in the literature and presents stronger and relaxed versions that are especially suitable for the allocation of indivisible items, and presents a clear taxonomy of the fairness concepts.

...read moreread less

Abstract: We consider a multi-agent resource allocation setting in which an agent's utility may decrease or increase when an item is allocated. We take the group envy-freeness concept that is well-established in the literature and present stronger and relaxed versions that are especially suitable for the allocation of indivisible items. Of particular interest is a concept called group envy-freeness up to one item (GEF1). We then present a clear taxonomy of the fairness concepts. We study which fairness concepts guarantee the existence of a fair allocation under which preference domain. For two natural classes of additive utilities, we design polynomial-time algorithms to compute a GEF1 allocation. We also prove that checking whether a given allocation satisfies GEF1 is coNP-complete when there are either only goods, only chores or both.

...read moreread less

Posted Content•

Correlation in Extensive-Form Games: Saddle-Point Formulation and Benchmarks

[...]

Gabriele Farina¹, Chun Kai Ling¹, Fei Fang¹, Tuomas Sandholm¹•Institutions (1)

Carnegie Mellon University¹

29 May 2019-arXiv: Computer Science and Game Theory

TL;DR: It is shown that an EFCE can be formulated as the solution to a bilinear saddle-point problem and to showcase how this novel formulation can inspire new algorithms to compute EFCEs, a simple subgradient descent method is proposed which exploits this formulation and structural properties of E FCEs.

...read moreread less

Abstract: While Nash equilibrium in extensive-form games is well understood, very little is known about the properties of extensive-form correlated equilibrium (EFCE), both from a behavioral and from a computational point of view. In this setting, the strategic behavior of players is complemented by an external device that privately recommends moves to agents as the game progresses; players are free to deviate at any time, but will then not receive future recommendations. Our contributions are threefold. First, we show that an EFCE can be formulated as the solution to a bilinear saddle-point problem. To showcase how this novel formulation can inspire new algorithms to compute EFCEs, we propose a simple subgradient descent method which exploits this formulation and structural properties of EFCEs. Our method has better scalability than the prior approach based on linear programming. Second, we propose two benchmark games, which we hope will serve as the basis for future evaluation of EFCE solvers. These games were chosen so as to cover two natural application domains for EFCE: conflict resolution via a mediator, and bargaining and negotiation. Third, we document the qualitative behavior of EFCE in our proposed games. We show that the social-welfare-maximizing equilibria in these games are highly nontrivial and exhibit surprisingly subtle sequential behavior that so far has not received attention in the literature.

...read moreread less

Posted Content•

The Price of Connectivity in Fair Division

[...]

Xiaohui Bei¹, Ayumi Igarashi², Xinhang Lu¹, Warut Suksompong³•Institutions (3)

Nanyang Technological University¹, National Institute of Informatics², National University of Singapore³

15 Aug 2019-arXiv: Computer Science and Game Theory

TL;DR: The price of connectivity is introduced to capture the largest gap between the graph-specific and the unconstrained maximin share, and bounds on this quantity are derived which are tight for large classes of graphs in the case of two agents and for paths and stars in the general case.

...read moreread less

Abstract: We study the allocation of indivisible goods that form an undirected graph and quantify the loss of fairness when we impose a constraint that each agent must receive a connected subgraph. Our focus is on well-studied fairness notions including envy-freeness and maximin share fairness. We introduce the price of connectivity to capture the largest gap between the graph-specific and the unconstrained maximin share, and derive bounds on this quantity which are tight for large classes of graphs in the case of two agents and for paths and stars in the general case. For instance, with two agents we show that for biconnected graphs it is possible to obtain at least $3/4$ of the maximin share with connected allocations, while for the remaining graphs the guarantee is at most $1/2$. In addition, we determine the optimal relaxation of envy-freeness that can be obtained with each graph for two agents, and characterize the set of trees and complete bipartite graphs that always admit an allocation satisfying envy-freeness up to one good (EF1) for three agents. Our work demonstrates several applications of graph-theoretic tools and concepts to fair division problems.

...read moreread less

Journal Article•DOI•

Interdependent Strategic Security Risk Management with Bounded Rationality in the Internet of Things.

[...]

Juntao Chen¹, Quanyan Zhu¹•Institutions (1)

New York University¹

22 May 2019-arXiv: Computer Science and Game Theory

TL;DR: In this paper, a game-in-game framework is proposed to characterize the decisions of agents and quantify their risk of bounded perception due to the limited attention in the IoT environment.

...read moreread less

Abstract: With the increasing connectivity enabled by the Internet of Things (IoT), security becomes a critical concern, and the users should invest to secure their IoT applications. Due to the massive devices in the IoT network, users cannot be aware of the security policies taken by all its connected neighbors. Instead, a user makes security decisions based on the cyber risks he perceives by observing a selected number of nodes. To this end, we propose a model which incorporates the limited attention or bounded rationality nature of players in the IoT. Specifically, each individual builds a sparse cognitive network of nodes to respond to. Based on this simplified cognitive network representation, each user then determines his security management policy by minimizing his own real-world security cost. The bounded rational decision-makings of players and their cognitive network formations are interdependent and thus should be addressed in a holistic manner. We establish a games-in-games framework and propose a Gestalt Nash equilibrium (GNE) solution concept to characterize the decisions of agents, and quantify their risk of bounded perception due to the limited attention. In addition, we design a proximal-based iterative algorithm to compute the GNE. With case studies of smart communities, the designed algorithm can successfully identify the critical users whose decisions need to be taken into account by the other users during the security management.

...read moreread less

Proceedings Article•DOI•

Improved Metric Distortion for Deterministic Social Choice Rules

[...]

Kamesh Munagala¹, Kangning Wang¹•Institutions (1)

Duke University¹

04 May 2019-arXiv: Computer Science and Game Theory

TL;DR: In this article, the distortion of the best deterministic social choice rule was known to be between $3$ and $5$ and it was conjectured that any rule that only looks at the weighted tournament graph on the candidates cannot have distortion better than $5$.

...read moreread less

Abstract: In this paper, we study the metric distortion of deterministic social choice rules that choose a winning candidate from a set of candidates based on voter preferences. Voters and candidates are located in an underlying metric space. A voter has cost equal to her distance to the winning candidate. Ordinal social choice rules only have access to the ordinal preferences of the voters that are assumed to be consistent with the metric distances. Our goal is to design an ordinal social choice rule with minimum distortion, which is the worst-case ratio, over all consistent metrics, between the social cost of the rule and that of the optimal omniscient rule with knowledge of the underlying metric space. The distortion of the best deterministic social choice rule was known to be between $3$ and $5$. It had been conjectured that any rule that only looks at the weighted tournament graph on the candidates cannot have distortion better than $5$. In our paper, we disprove it by presenting a weighted tournament rule with distortion of $4.236$. We design this rule by generalizing the classic notion of uncovered sets, and further show that this class of rules cannot have distortion better than $4.236$. We then propose a new voting rule, via an alternative generalization of uncovered sets. We show that if a candidate satisfying the criterion of this voting rule exists, then choosing such a candidate yields a distortion bound of $3$, matching the lower bound. We present a combinatorial conjecture that implies distortion of $3$, and verify it for small numbers of candidates and voters by computer experiments. Using our framework, we also show that selecting any candidate guarantees distortion of at most $3$ when the weighted tournament graph is cyclically symmetric.

...read moreread less

Posted Content•

An Axiomatic Approach to Block Rewards

[...]

Xi Chen¹, Christos H. Papadimitriou¹, Tim Roughgarden¹•Institutions (1)

Columbia University¹

23 Sep 2019-arXiv: Computer Science and Game Theory

TL;DR: It is shown that Bitcoin's proportional allocation rule is the unique allocation rule satisfying a certain system of properties, but this does not hold for slightly weaker sets of properties; and a rich class of allocation rules can be approximately implemented in a proof-of-work blockchain.

...read moreread less

Abstract: Proof-of-work blockchains reward each miner for one completed block by an amount that is, in expectation, proportional to the number of hashes the miner contributed to the mining of the block. Is this proportional allocation rule optimal? And in what sense? And what other rules are possible? In particular, what are the desirable properties that any "good" allocation rule should satisfy? To answer these questions, we embark on an axiomatic theory of incentives in proof-of-work blockchains at the time scale of a single block. We consider desirable properties of allocation rules including: symmetry; budget balance (weak or strong); sybil-proofness; and various grades of collusion-proofness. We show that Bitcoin's proportional allocation rule is the unique allocation rule satisfying a certain system of properties, but this does not hold for slightly weaker sets of properties, or when the miners are not risk-neutral. We also point out that a rich class of allocation rules can be approximately implemented in a proof-of-work blockchain.

...read moreread less

Posted Content•

Dominantly Truthful Multi-task Peer Prediction with a Constant Number of Tasks

[...]

Yuqing Kong¹•Institutions (1)

Peking University¹

01 Nov 2019-arXiv: Computer Science and Game Theory

TL;DR: DMI-Mechanism is the first dominantly truthful mechanism that works for a finite number of tasks, not to say a small constant number of roles, and can be transferred into an information evaluation rule that identifies high-quality information without verification when there are at least 3 participants.

...read moreread less

Abstract: In the setting where participants are asked multiple similar possibly subjective multi-choice questions (e.g. Do you like Panda Express? Y/N; do you like Chick-fil-A? Y/N), a series of peer prediction mechanisms are designed to incentivize honest reports and some of them achieve dominantly truthfulness: truth-telling is a dominant strategy and strictly dominate other "non-permutation strategy" with some mild conditions. However, a major issue hinders the practical usage of those mechanisms: they require the participants to perform an infinite number of tasks. When the participants perform a finite number of tasks, these mechanisms only achieve approximated dominant truthfulness. The existence of a dominantly truthful multi-task peer prediction mechanism that only requires a finite number of tasks remains to be an open question that may have a negative result, even with full prior knowledge. This paper answers this open question by proposing a new mechanism, Determinant based Mutual Information Mechanism (DMI-Mechanism), that is dominantly truthful when the number of tasks is at least 2C and the number of participants is at least 2. C is the number of choices for each question (C=2 for binary-choice questions). In addition to incentivizing honest reports, DMI-Mechanism can also be transferred into an information evaluation rule that identifies high-quality information without verification when there are at least 3 participants. To the best of our knowledge, DMI-Mechanism is the first dominantly truthful mechanism that works for a finite number of tasks, not to say a small constant number of tasks.

...read moreread less

Book Chapter•DOI•

Oceanic Games: Centralization Risks and Incentives in Blockchain Mining

[...]

Nikos Leonardos¹, Stefanos Leonardos², Georgios Piliouras²•Institutions (2)

National and Kapodistrian University of Athens¹, Singapore University of Technology and Design²

04 Apr 2019-arXiv: Computer Science and Game Theory

TL;DR: In this article, the authors employ the concept of Oceanic Games, Milnor and Shapley (1978), to analyze decision making in corporate settings with small numbers of dominant players (shareholders) and large numbers of individually insignificant players, the ocean.

...read moreread less

Abstract: To participate in the distributed consensus of permissionless blockchains, prospective nodes -- or miners -- provide proof of designated, costly resources. However, in contrast to the intended decentralization, current data on blockchain mining unveils increased concentration of these resources in a few major entities, typically mining pools. To study strategic considerations in this setting, we employ the concept of Oceanic Games, Milnor and Shapley (1978). Oceanic Games have been used to analyze decision making in corporate settings with small numbers of dominant players (shareholders) and large numbers of individually insignificant players, the ocean. Unlike standard equilibrium models, they focus on measuring the value (or power) per entity and per unit of resource} in a given distribution of resources. These values are viewed as strategic components in coalition formations, mergers and resource acquisitions. Considering such issues relevant to blockchain governance and long-term sustainability, we adapt oceanic games to blockchain mining and illustrate the defined concepts via examples. The application of existing results reveals incentives for individual miners to merge in order to increase the value of their resources. This offers an alternative perspective to the observed centralization and concentration of mining power. Beyond numerical simulations, we use the model to identify issues relevant to the design of future cryptocurrencies and formulate prospective research questions.

...read moreread less

Collapse