Showing papers in "arXiv: Computer Science and Game Theory in 2018"

PDF

Open Access

Posted Content•

Solving Imperfect-Information Games via Discounted Regret Minimization

[...]

Noam Brown¹, Tuomas Sandholm¹•Institutions (1)

11 Sep 2018-arXiv: Computer Science and Game Theory

TL;DR: This paper introduces novel CFR variants that 1) discount regrets from earlier iterations in various ways, 2) reweight iterations inVarious ways to obtain the output strategies, 3) use a non-standard regret minimizer and/or 4) leverage "optimistic regret matching".

...read moreread less

Abstract: Counterfactual regret minimization (CFR) is a family of iterative algorithms that are the most popular and, in practice, fastest approach to approximately solving large imperfect-information games In this paper we introduce novel CFR variants that 1) discount regrets from earlier iterations in various ways (in some cases differently for positive and negative regrets), 2) reweight iterations in various ways to obtain the output strategies, 3) use a non-standard regret minimizer and/or 4) leverage "optimistic regret matching" They lead to dramatically improved performance in many settings For one, we introduce a variant that outperforms CFR+, the prior state-of-the-art algorithm, in every game tested, including large-scale realistic settings CFR+ is a formidable benchmark: no other algorithm has been able to outperform it Finally, we show that, unlike CFR+, many of the important new variants are compatible with modern imperfect-information-game pruning techniques and one is also compatible with sampling in the game tree

...read moreread less

81 citations

Posted Content•

Congestion Pricing in a World of Self-driving vehicles: an Analysis of Different Strategies in Alternative Future Scenarios

[...]

Michele D. Simoni¹, Kara M. Kockelman¹, Krishna Murthy Gurumurthy¹, Joschka Bischoff²•Institutions (2)

University of Texas at Austin¹, Technical University of Berlin²

28 Mar 2018-arXiv: Computer Science and Game Theory

TL;DR: Results suggest that, while all pricing strategies reduce congestion, their social welfare impacts differ in meaningful ways, and more complex and advanced strategies perform better in terms of traffic conditions and traveler welfare, depending on the development of the mobility landscape of autonomous driving.

...read moreread less

Abstract: The introduction of autonomous (self-driving) and shared autonomous vehicles (AVs and SAVs) will affect travel destinations and distances, mode choice, and congestion. From a traffic perspective, although some congestion reduction may be achieved (thanks to fewer crashes and tighter headways), car-trip frequencies and vehicle miles traveled (VMT) are likely to rise significantly, reducing the benefits of driverless vehicles. Congestion pricing (CP) and road tolls are key tools for moderating demand and incentivizing more socially and environmentally optimal travel choices. This work develops multiple CP and tolling strategies in alternative future scenarios, and investigates their effects on the Austin, Texas network conditions and traveler welfare, using the agent-based simulation model MATSim. Results suggest that, while all pricing strategies reduce congestion, their social welfare impacts differ in meaningful ways. More complex and advanced strategies perform better in terms of traffic conditions and traveler welfare, depending on the development of the mobility landscape of autonomous driving. The possibility to refund users by reinvesting toll revenues as traveler budgets plays a salient role in the overall efficiency of each CP strategy as well as in the public acceptability.

...read moreread less

75 citations

Posted Content•

Fair Allocation of Indivisible Public Goods

[...]

Brandon Fain¹, Kamesh Munagala¹, Nisarg Shah²•Institutions (2)

Duke University¹, University of Toronto²

08 May 2018-arXiv: Computer Science and Game Theory

TL;DR: This work is the first to approximate the core of a groupwise fairness notion called the core, which generalizes well-studied notions of proportionality and Pareto efficiency, and requires that each subset of agents must receive an outcome that is fair relative to its size.

...read moreread less

Abstract: We consider the problem of fairly allocating indivisible public goods. We model the public goods as elements with feasibility constraints on what subsets of elements can be chosen, and assume that agents have additive utilities across elements. Our model generalizes existing frameworks such as fair public decision making and participatory budgeting. We study a groupwise fairness notion called the core, which generalizes well-studied notions of proportionality and Pareto efficiency, and requires that each subset of agents must receive an outcome that is fair relative to its size. In contrast to the case of divisible public goods (where fractional allocations are permitted), the core is not guaranteed to exist when allocating indivisible public goods. Our primary contributions are the notion of an additive approximation to the core (with a tiny multiplicative loss), and polynomial time algorithms that achieve a small additive approximation, where the additive factor is relative to the largest utility of an agent for an element. If the feasibility constraints define a matroid, we show an additive approximation of 2. A similar approach yields a constant additive bound when the feasibility constraints define a matching. More generally, if the feasibility constraints define an arbitrary packing polytope with mild restrictions, we show an additive guarantee that is logarithmic in the width of the polytope. Our algorithms are based on variants of the convex program for maximizing the Nash social welfare, but differ significantly from previous work in how it is used. Our guarantees are meaningful even when there are fewer elements than the number of agents. As far as we are aware, our work is the first to approximate the core in indivisible settings.

...read moreread less

74 citations

Posted Content•

Price-based Resource Allocation for Edge Computing: A Market Equilibrium Approach

[...]

Duong Tung Nguyen¹, Long Bao Le², Vijay K. Bhargava¹•Institutions (2)

University of British Columbia¹, Université du Québec à Montréal²

08 May 2018-arXiv: Computer Science and Game Theory

TL;DR: In this paper, a market-based framework for efficiently allocating resources of heterogeneous capacity-limited edge nodes (EN) to multiple competing services at the network edge is proposed, and the proposed framework generates a market equilibrium (ME) solution that not only maximizes the edge computing resource utilization but also allocates optimal resource bundles to the services given their budget constraints.

...read moreread less

Abstract: The emerging edge computing paradigm promises to deliver superior user experience and enable a wide range of Internet of Things (IoT) applications. In this work, we propose a new market-based framework for efficiently allocating resources of heterogeneous capacity-limited edge nodes (EN) to multiple competing services at the network edge. By properly pricing the geographically distributed ENs, the proposed framework generates a market equilibrium (ME) solution that not only maximizes the edge computing resource utilization but also allocates optimal (i.e., utility-maximizing) resource bundles to the services given their budget constraints. When the utility of a service is defined as the maximum revenue that the service can achieve from its resource allotment, the equilibrium can be computed centrally by solving the Eisenberg-Gale (EG) convex program. drawn from the economics literature. We further show that the equilibrium allocation is Pareto-optimal and satisfies desired fairness properties including sharing incentive, proportionality, and envy-freeness. Also, two distributed algorithms are introduced, which efficiently converge to an ME. When each service aims to maximize its net profit (i.e., revenue minus cost) instead of the revenue, we derive a novel convex optimization problem and rigorously prove that its solution is exactly an ME. Extensive numerical results are presented to validate the effectiveness of the proposed techniques.

...read moreread less

67 citations

Posted Content•

Peer-to-Peer Energy Trading with Sustainable User Participation: A Game Theoretic Approach

[...]

Wayes Tushar¹, Tapan Kumar Saha¹, Chau Yuen², Paul Liddell, Richard Bean, H. Vincent Poor³ - Show less +2 more•Institutions (3)

University of Queensland¹, Singapore University of Technology and Design², Princeton University³

28 Sep 2018-arXiv: Computer Science and Game Theory

TL;DR: In this paper, the feasibility of social cooperation between prosumers within an energy network in establishing their sustainable participation in peer-to-peer (P2P) energy trading is explored.

...read moreread less

Abstract: This paper explores the feasibility of social cooperation between prosumers within an energy network in establishing their sustainable participation in peer-to-peer (P2P) energy trading. In particular, a canonical coalition game (CCG) is utilized to propose a P2P energy trading scheme, in which a set of participating prosumers form a coalition group to trade their energy, if there is any, with one another. By exploring the concept of the core of the designed CCG framework, the mid-market rate is utilized as a pricing mechanism of the proposed P2P trading to confirm the stability of the coalition as well as to guarantee the benefit to the prosumers for forming the social coalition. The paper further introduces the motivational psychology models that are relevant to the proposed P2P scheme and it is shown that the outcomes of proposed P2P energy trading scheme satisfy the discussed models. Consequently, it is proven that the proposed scheme is consumer-centric that has the potential to corroborate sustainable prosumers participation in P2P energy trading. Finally, some numerical examples are provided to demonstrate the beneficial properties of the proposed scheme.

...read moreread less

64 citations

Journal Article•DOI•

Auction Mechanisms in Cloud/Fog Computing Resource Allocation for Public Blockchain Networks

[...]

Yutao Jiao¹, Ping Wang², Dusit Niyato¹, Kongrath Suankaewmanee¹•Institutions (2)

Nanyang Technological University¹, York University²

26 Apr 2018-arXiv: Computer Science and Game Theory

TL;DR: In this article, an auction-based market model for efficient computing resource allocation in a proof-of-work-based blockchain network is proposed, where the objective is to maximize the social welfare of the blockchain network and provide effective strategies for the cloud/fog computing service provider.

...read moreread less

Abstract: As an emerging decentralized secure data management platform, blockchain has gained much popularity recently. To maintain a canonical state of blockchain data record, proof-of-work based consensus protocols provide the nodes, referred to as miners, in the network with incentives for confirming new block of transactions through a process of "block mining" by solving a cryptographic puzzle. Under the circumstance of limited local computing resources, e.g., mobile devices, it is natural for rational miners, i.e., consensus nodes, to offload computational tasks for proof of work to the cloud/fog computing servers. Therefore, we focus on the trading between the cloud/fog computing service provider and miners, and propose an auction-based market model for efficient computing resource allocation. In particular, we consider a proof-of-work based blockchain network. Due to the competition among miners in the blockchain network, the allocative externalities are particularly taken into account when designing the auction mechanisms. Specifically, we consider two bidding schemes: the constant-demand scheme where each miner bids for a fixed quantity of resources, and the multi-demand scheme where the miners can submit their preferable demands and bids. For the constant-demand bidding scheme, we propose an auction mechanism that achieves optimal social welfare. In the multi-demand bidding scheme, the social welfare maximization problem is NP-hard. Therefore, we design an approximate algorithm which guarantees the truthfulness, individual rationality and computational efficiency. Through extensive simulations, we show that our proposed auction mechanisms with the two bidding schemes can efficiently maximize the social welfare of the blockchain network and provide effective strategies for the cloud/fog computing service provider.

...read moreread less

48 citations

Posted Content•

On profitability of selfish mining

[...]

Cyril Grunspan, Ricardo Pérez-Marco¹•Institutions (1)

Institut de Mathématiques de Jussieu¹

16 May 2018-arXiv: Computer Science and Game Theory

TL;DR: An improvement of Bitcoin protocol is proposed making it immune to selfish mining attacks, and a rigorous profitability model for repetition games is built that is out of reach by the previous Markov models.

...read moreread less

Abstract: We review the so called selfish mining strategy in the Bitcoin network and compare its profitability to honest mining.We build a rigorous profitability model for repetition games. The time analysis of the attack has been ignored in the previous literature based on a Markov model,but is critical. Using martingale's techniques and Doob Stopping Time Theorem we compute the expected duration of attack cycles. We discover a remarkable property of the bitcoin network: no strategy is more profitable than the honest strategy before a difficulty adjustment. So selfish mining can only become profitable afterwards, thus it is an attack on the difficulty adjustment algorithm. We propose an improvement of Bitcoin protocol making it immune to selfish mining attacks. We also study miner's attraction to selfish mining pools. We calculate the expected duration time before profit for the selfish miner, a computation that is out of reach by the previous Markov models.

...read moreread less

48 citations

Posted Content•

Fair Division Under Cardinality Constraints

[...]

Siddharth Barman¹, Arpita Biswas¹•Institutions (1)

Indian Institute of Science¹

25 Apr 2018-arXiv: Computer Science and Game Theory

TL;DR: In this paper, the authors considered the problem of fairly allocating indivisible goods among agents, under cardinality constraints and additive valuations, and established the existence and algorithmic guarantees established for these solution concepts in the unconstrained setting.

...read moreread less

Abstract: We consider the problem of fairly allocating indivisible goods, among agents, under cardinality constraints and additive valuations. In this setting, we are given a partition of the entire set of goods---i.e., the goods are categorized---and a limit is specified on the number of goods that can be allocated from each category to any agent. The objective here is to find a fair allocation in which the subset of goods assigned to any agent satisfies the given cardinality constraints. This problem naturally captures a number of resource-allocation applications, and is a generalization of the well-studied (unconstrained) fair division problem. The two central notions of fairness, in the context of fair division of indivisible goods, are envy freeness up to one good (EF1) and the (approximate) maximin share guarantee (MMS). We show that the existence and algorithmic guarantees established for these solution concepts in the unconstrained setting can essentially be achieved under cardinality constraints. Specifically, we develop efficient algorithms which compute EF1 and approximately MMS allocations in the constrained setting. Furthermore, focusing on the case wherein all the agents have the same additive valuation, we establish that EF1 allocations exist and can be computed efficiently even under laminar matroid constraints.

...read moreread less

41 citations

Posted Content•

Adaptive Strategic Cyber Defense for Advanced Persistent Threats in Critical Infrastructure Networks

[...]

Linan Huang¹, Quanyan Zhu¹•Institutions (1)

New York University¹

06 Sep 2018-arXiv: Computer Science and Game Theory

TL;DR: A multi-stage Bayesian game framework to capture incomplete information of deceptive APTs and their multistage multi-phase movement and the analysis of the perfect Bayesian Nash equilibrium (PBNE) enables a prediction of attacker's behaviors and a design of defensive strategies that can deter the adversaries and mitigate the security risks.

...read moreread less

Abstract: Advanced Persistent Threats (APTs) have created new security challenges for critical infrastructures due to their stealthy, dynamic, and adaptive natures. In this work, we aim to lay a game-theoretic foundation by establishing a multi-stage Bayesian game framework to capture incomplete information of deceptive APTs and their multi-stage multi-phase movement. The analysis of the perfect Bayesian Nash equilibrium (PBNE) enables a prediction of attacker's behaviors and a design of defensive strategies that can deter the adversaries and mitigate the security risks. A conjugate-prior method allows online computation of the belief and reduces Bayesian update into an iterative parameter update. The forwardly updated parameters are assimilated into the backward dynamic programming computation to characterize a computationally tractable and time-consistent equilibrium solution based on the expanded state space. The Tennessee Eastman (TE) process control problem is used as a case study to demonstrate the dynamic game under the information asymmetry and show that APTs tend to be stealthy and deceptive during their transitions in the cyber layer and behave aggressively when reaching the targeted physical plant. The online update of the belief allows the defender to learn the behavior of the attacker and choose strategic defensive actions that can thwart adversarial behaviors and mitigate APTs. Numerical results illustrate the defender's tradeoff between the immediate reward and the future expectation as well as the attacker's goal to reach an advantageous system state while making the defender form a positive belief.

...read moreread less

40 citations

Posted Content•

Strategyproof Linear Regression in High Dimensions.

[...]

Yiling Chen¹, Chara Podimata¹, Ariel D. Procaccia², Nisarg Shah³•Institutions (3)

Harvard University¹, Carnegie Mellon University², University of Toronto³

27 May 2018-arXiv: Computer Science and Game Theory

TL;DR: This paper focuses on the ubiquitous problem of linear regression, where strategyproof mechanisms have previously been identified in two dimensions, and finds a family of group strategyproof linear regression mechanisms in any number of dimensions, which are called generalized resistant hyperplane mechanisms.

...read moreread less

Abstract: This paper is part of an emerging line of work at the intersection of machine learning and mechanism design, which aims to avoid noise in training data by correctly aligning the incentives of data sources. Specifically, we focus on the ubiquitous problem of linear regression, where strategyproof mechanisms have previously been identified in two dimensions. In our setting, agents have single-peaked preferences and can manipulate only their response variables. Our main contribution is the discovery of a family of group strategyproof linear regression mechanisms in any number of dimensions, which we call generalized resistant hyperplane mechanisms. The game-theoretic properties of these mechanisms -- and, in fact, their very existence -- are established through a connection to a discrete version of the Ham Sandwich Theorem.

...read moreread less

40 citations

Posted Content•

Privacy Preserving and Cost Optimal Mobile Crowdsensing using Smart Contracts on Blockchain

[...]

Dimitris Chatzopoulos¹, Sujit Gujar², Boi Faltings³, Pan Hui¹•Institutions (3)

Hong Kong University of Science and Technology¹, International Institute of Information Technology, Hyderabad², École Polytechnique³

13 Aug 2018-arXiv: Computer Science and Game Theory

TL;DR: This work proposes to use blockchain technologies and smart contracts to orchestrate the interactions between mobile crowdsensing providers and mobile users for the case of spatial crowdsensing, where mobile users need to be at specific locations to perform the tasks.

...read moreread less

Abstract: The popularity and applicability of mobile crowdsensing applications are continuously increasing due to the widespread of mobile devices and their sensing and processing capabilities. However, we need to offer appropriate incentives to the mobile users who contribute their resources and preserve their privacy. Blockchain technologies enable semi-anonymous multi-party interactions and can be utilized in crowdsensing applications to maintain the privacy of the mobile users while ensuring first-rate crowdsensed data. In this work, we propose to use blockchain technologies and smart contracts to orchestrate the interactions between mobile crowdsensing providers and mobile users for the case of spatial crowdsensing, where mobile users need to be at specific locations to perform the tasks. Smart contracts, by operating as processes that are executed on the blockchain, are used to preserve users' privacy and make payments. Furthermore, for the assignment of the crowdsensing tasks to the mobile users, we design a truthful, cost-optimal auction that minimizes the payments from the crowdsensing providers to the mobile users. Extensive experimental results show that the proposed privacy preserving auction outperforms state-of-the-art proposals regarding cost by ten times for high numbers of mobile users and tasks.

...read moreread less

Book Chapter•DOI•

The Fluid Mechanics of Liquid Democracy

[...]

Paul Gölz¹, Anson Kahng¹, Simon Mackenzie¹, Ariel D. Procaccia¹•Institutions (1)

Carnegie Mellon University¹

06 Aug 2018-arXiv: Computer Science and Game Theory

TL;DR: This work analyzes the problem of choosing delegations to approximately minimize the maximum number of votes entrusted to any agent, by drawing connections to the literature on confluent flow and introduces a random graph model for liquid democracy.

...read moreread less

Abstract: Liquid democracy is the principle of making collective decisions by letting agents transitively delegate their votes. Despite its significant appeal, it has become apparent that a weakness of liquid democracy is that a small subset of agents may gain massive influence. To address this, we propose to change the current practice by allowing agents to specify multiple delegation options instead of just one. Much like in nature, where --- fluid mechanics teaches us --- liquid maintains an equal level in connected vessels, so do we seek to control the flow of votes in a way that balances influence as much as possible. Specifically, we analyze the problem of choosing delegations to approximately minimize the maximum number of votes entrusted to any agent, by drawing connections to the literature on confluent flow. We also introduce a random graph model for liquid democracy, and use it to demonstrate the benefits of our approach both theoretically and empirically.

...read moreread less

Posted Content•

Reward Sharing Schemes for Stake Pools

[...]

Lars Brunjes, Aggelos Kiayias, Elias Koutsoupias, Aikaterini-Panagiota Stouka

30 Jul 2018-arXiv: Computer Science and Game Theory

TL;DR: RSS is introduced and study and fast convergence to equilibria is experimentally demonstrated in dynamic environments where players react to each other's strategic moves over an indefinite period of interactive play, showing how simple reward sharing schemes that are seemingly more “fair”, perhaps counterin-tuitively, converge to centralizedEquilibria.

...read moreread less

Abstract: We introduce and study reward sharing schemes (RSS) that promote the fair formation of {\em stake pools}\ in collaborative projects that involve a large number of stakeholders such as the maintenance of a proof-of-stake (PoS) blockchain. Our mechanisms are parameterized by a target value for the desired number of pools. We show that by properly incentivizing participants, the desired number of stake pools is a Nash equilibrium arising from rational play. Our equilibria also exhibit an efficiency / security tradeoff via a parameter that calibrates between including pools with the smallest cost and providing protection against Sybil attacks, the setting where a single stakeholder creates a large number of pools in the hopes to dominate the collaborative project. We then describe how RSS can be deployed in the PoS setting, mitigating a number of potential deployment attacks and protocol deviations that include censoring transactions, performing Sybil attacks with the objective to control the majority of stake, lying about the actual cost and others. Finally, we experimentally demonstrate fast convergence to equilibria in dynamic environments where players react to each other's strategic moves over an indefinite period of interactive play. We also show how simple reward sharing schemes that are seemingly more "fair", perhaps counterintuitively, converge to centralized equilibria.

...read moreread less

Posted Content•

Greedy Algorithms for Maximizing Nash Social Welfare

[...]

Siddharth Barman¹, Sanath Kumar Krishnamurthy², Rohit Vaish¹•Institutions (2)

Indian Institute of Science¹, Chennai Mathematical Institute²

27 Jan 2018-arXiv: Computer Science and Game Theory

TL;DR: In this article, the authors study the problem of fairly allocating a set of indivisible goods among agents with additive valuations, and show that a simple greedy algorithm provides a 1.061-approximation guarantee when agents have identical valuations.

...read moreread less

Abstract: We study the problem of fairly allocating a set of indivisible goods among agents with additive valuations. The extent of fairness of an allocation is measured by its Nash social welfare, which is the geometric mean of the valuations of the agents for their bundles. While the problem of maximizing Nash social welfare is known to be APX-hard in general, we study the effectiveness of simple, greedy algorithms in solving this problem in two interesting special cases. First, we show that a simple, greedy algorithm provides a 1.061-approximation guarantee when agents have identical valuations, even though the problem of maximizing Nash social welfare remains NP-hard for this setting. Second, we show that when agents have binary valuations over the goods, an exact solution (i.e., a Nash optimal allocation) can be found in polynomial time via a greedy algorithm. Our results in the binary setting extend to provide novel, exact algorithms for optimizing Nash social welfare under concave valuations. Notably, for the above mentioned scenarios, our techniques provide a simple alternative to several of the existing, more sophisticated techniques for this problem such as constructing equilibria of Fisher markets or using real stable polynomials.

...read moreread less

Posted Content•

Fair allocation of combinations of indivisible goods and chores.

[...]

Haris Aziz¹, Ioannis Caragiannis², Ayumi Igarashi³•Institutions (3)

University of New South Wales¹, Aarhus University², National Institute of Informatics³

27 Jul 2018-arXiv: Computer Science and Game Theory

TL;DR: This paper considers a more general scenario where an agent may have negative or positive utility for each item, e.g., fair task assignment, where agents can have both positive and negative utilities for each task.

...read moreread less

Abstract: We consider the problem of fairly dividing a set of items Much of the fair division literature assumes that the items are `goods' ie, they yield positive utility for the agents There is also some work where the items are `chores' that yield negative utility for the agents In this paper, we consider a more general scenario where an agent may have negative or positive utility for each item This framework captures, eg, fair task assignment, where agents can have both positive and negative utilities for each task We show that whereas some of the positive axiomatic and computational results extend to this more general setting, others do not We present several new and efficient algorithms for finding fair allocations in this general setting We also point out several gaps in the literature regarding the existence of allocations satisfying certain fairness and efficiency properties and further study the complexity of computing such allocations

...read moreread less

Posted Content•

Formal Barriers to Longest-Chain Proof-of-Stake Protocols.

[...]

Jonah Brown-Cohen, Arvind Narayanan, Christos-Alexandros Psomas, S. Matthew Weinberg

18 Sep 2018-arXiv: Computer Science and Game Theory

TL;DR: The main results of this paper are several formal barriers to designing incentive-compatible proof-of-stake cryptocurrencies (that don't apply to proof- of-work).

...read moreread less

Abstract: The security of most existing cryptocurrencies is based on a concept called Proof-of-Work, in which users must solve a computationally hard cryptopuzzle to authorize transactions (`one unit of computation, one vote'). This leads to enormous expenditure on hardware and electricity in order to collect the rewards associated with transaction authorization. Proof-of-Stake is an alternative concept that instead selects users to authorize transactions proportional to their wealth (`one coin, one vote'). Some aspects of the two paradigms are the same. For instance, obtaining voting power in Proof-of-Stake has a monetary cost just as in Proof-of-Work: a coin cannot be freely duplicated any more easily than a unit of computation. However some aspects are fundamentally different. In particular, exactly because Proof-of-Stake is wasteless, there is no inherent resource cost to deviating (commonly referred to as the `Nothing-at-Stake' problem). In contrast to prior work, we focus on incentive-driven deviations (any participant will deviate if doing so yields higher revenue) instead of adversarial corruption (an adversary may take over a significant fraction of the network, but the remaining players follow the protocol). The main results of this paper are several formal barriers to designing incentive-compatible proof-of-stake cryptocurrencies (that don't apply to proof-of-work).

...read moreread less

Proceedings Article•DOI•

Pacing Equilibrium in First-Price Auction Markets

[...]

Vincent Conitzer¹, Christian Kroer², Debmalya Panigrahi¹, Okke Schrijvers³, Eric Sodomka³, Nicolas E. Stier-Moses³, Chris Wilkens - Show less +3 more•Institutions (3)

Duke University¹, Columbia University², Facebook³

17 Nov 2018-arXiv: Computer Science and Game Theory

TL;DR: This paper uses simulations to demonstrate that while there are incentives to misreport in thin markets (where budgets aren't constraining), a bidder's incentive to deviate vanishes in thick markets.

...read moreread less

Abstract: In the isolated auction of a single item, second price often dominates first price in properties of theoretical interest. But, single items are rarely sold in true isolation, so considering the broader context is critical when adopting a pricing strategy. In this paper, we study a model centrally relevant to Internet advertising and show that when items (ad impressions) are individually auctioned within the context of a larger system that is managing budgets, theory offers surprising endorsement for using a first price auction to sell each individual item. In particular, first price auctions offer theoretical guarantees of equilibrium uniqueness, monotonicity, and other desirable properties, as well as efficient computability as the solution to the well-studied Eisenberg-Gale convex program. We also use simulations to demonstrate that a bidder's incentive to deviate vanishes in thick markets.

...read moreread less

Posted Content•

Bandit learning in concave $N$-person games

[...]

Mario Bravo¹, David S. Leslie, Panayotis Mertikopoulos•Institutions (1)

University of Santiago, Chile¹

03 Oct 2018-arXiv: Computer Science and Game Theory

TL;DR: In this paper, the authors examined the long-run behavior of learning with bandit feedback in non-cooperative concave games and derived an upper bound for the convergence rate of the process that nearly matches the best attainable rate for single-agent bandit stochastic optimization.

...read moreread less

Abstract: This paper examines the long-run behavior of learning with bandit feedback in non-cooperative concave games. The bandit framework accounts for extremely low-information environments where the agents may not even know they are playing a game; as such, the agents' most sensible choice in this setting would be to employ a no-regret learning algorithm. In general, this does not mean that the players' behavior stabilizes in the long run: no-regret learning may lead to cycles, even with perfect gradient information. However, if a standard monotonicity condition is satisfied, our analysis shows that no-regret learning based on mirror descent with bandit feedback converges to Nash equilibrium with probability $1$. We also derive an upper bound for the convergence rate of the process that nearly matches the best attainable rate for single-agent bandit stochastic optimization.

...read moreread less

Posted Content•

Schelling Segregation with Strategic Agents

[...]

Ankit Chauhan¹, Pascal Lenzner¹, Louise Molitor¹•Institutions (1)

University of Potsdam¹

22 Jun 2018-arXiv: Computer Science and Game Theory

TL;DR: This work introduces and analyzes generalized game-theoretic models of Schelling segregation, where the agents can also have individual location preferences, and closes the gap in the model where rational agents strategically choose their location.

...read moreread less

Abstract: Schelling's segregation model is a landmark model in sociology. It shows the counter-intuitive phenomenon that residential segregation between individuals of different groups can emerge even when all involved individuals are tolerant. Although the model is widely studied, no pure game-theoretic version where rational agents strategically choose their location exists. We close this gap by introducing and analyzing generalized game-theoretic models of Schelling segregation, where the agents can also have individual location preferences. For our models, we investigate the convergence behavior and the efficiency of their equilibria. In particular, we prove guaranteed convergence to an equilibrium in the version which is closest to Schelling's original model. Moreover, we provide tight bounds on the Price of Anarchy.

...read moreread less

Posted Content•

Downstream Effects of Affirmative Action

[...]

Sampath Kannan, Aaron Roth, Juba Ziani

27 Aug 2018-arXiv: Computer Science and Game Theory

TL;DR: In this article, the authors study a two-stage model, in which students are admitted to college on the basis of an entrance exam which is a noisy signal about their qualifications (type), and then those students who were accepted to college can be hired by an employer as a function of their college grades, which are an independently drawn noisy signal of their type.

...read moreread less

Abstract: We study a two-stage model, in which students are 1) admitted to college on the basis of an entrance exam which is a noisy signal about their qualifications (type), and then 2) those students who were admitted to college can be hired by an employer as a function of their college grades, which are an independently drawn noisy signal of their type. Students are drawn from one of two populations, which might have different type distributions. We assume that the employer at the end of the pipeline is rational, in the sense that it computes a posterior distribution on student type conditional on all information that it has available (college admissions, grades, and group membership), and makes a decision based on posterior expectation. We then study what kinds of fairness goals can be achieved by the college by setting its admissions rule and grading policy. For example, the college might have the goal of guaranteeing equal opportunity across populations: that the probability of passing through the pipeline and being hired by the employer should be independent of group membership, conditioned on type. Alternately, the college might have the goal of incentivizing the employer to have a group blind hiring rule. We show that both goals can be achieved when the college does not report grades. On the other hand, we show that under reasonable conditions, these goals are impossible to achieve even in isolation when the college uses an (even minimally) informative grading policy.

...read moreread less

Posted Content•

Proportionality Degree of Multiwinner Rules.

[...]

Piotr Skowron¹•Institutions (1)

University of Warsaw¹

20 Oct 2018-arXiv: Computer Science and Game Theory

TL;DR: This work investigates a number of election rules and investigates whether the committees that they return represent the voters proportionally, and measures the tradeoff between proportionality and utilitarian efficiency for a broad subclass of committee election rules.

...read moreread less

Abstract: We study multiwinner elections with approval-based preferences. An instance of a multiwinner election consists of a set of alternatives, a population of voters---each voter approves a subset of alternatives, and the desired committee size $k$; the goal is to select a committee (a~subset) of $k$ alternatives according to the preferences of the voters. We investigate a number of election rules and ask whether the committees that they return represent the voters proportionally. In contrast to the classic literature, we employ quantitative techniques that allow to measure the extent to which the considered rules are proportional. This allows us to arrange the rules in a clear hierarchy. For example, we find that Proportional Approval Voting (PAV) has better proportionality guarantees than its sequential counterpart, and that Phragmen's Sequential Rule is worse than Sequential PAV. Yet, the loss of proportionality for the two sequential rules is moderate and in some contexts can be outweighed by their other appealing properties. Finally, we measure the tradeoff between proportionality and utilitarian efficiency for a broad subclass of committee election rules.

...read moreread less

Proceedings Article•DOI•

Dancing Pigs or Externalities? Measuring the Rationality of Security Decisions

[...]

Elissa M. Redmiles¹, Michelle L. Mazurek¹, John P. Dickerson¹•Institutions (1)

University of Maryland, College Park¹

16 May 2018-arXiv: Computer Science and Game Theory

TL;DR: In this article, behavioral economics experiments were conducted to model the rationality of end-user security decision-making in a realistic online experimental system simulating a bank account, where participants were asked to make a financially impactful security choice, in the face of transparent risks of account compromise and benefits offered by an optional security behavior (two-factor authentication).

...read moreread less

Abstract: Accurately modeling human decision-making in security is critical to thinking about when, why, and how to recommend that users adopt certain secure behaviors. In this work, we conduct behavioral economics experiments to model the rationality of end-user security decision-making in a realistic online experimental system simulating a bank account. We ask participants to make a financially impactful security choice, in the face of transparent risks of account compromise and benefits offered by an optional security behavior (two-factor authentication). We measure the cost and utility of adopting the security behavior via measurements of time spent executing the behavior and estimates of the participant's wage. We find that more than 50% of our participants made rational (e.g., utility optimal) decisions, and we find that participants are more likely to behave rationally in the face of higher risk. Additionally, we find that users' decisions can be modeled well as a function of past behavior (anchoring effects), knowledge of costs, and to a lesser extent, users' awareness of risks and context (R2=0.61). We also find evidence of endowment effects, as seen in other areas of economic and psychological decision-science literature, in our digital-security setting. Finally, using our data, we show theoretically that a "one-size-fits"-all emphasis on security can lead to market losses, but that adoption by a subset of users with higher risks or lower costs can lead to market gains.

...read moreread less

Posted Content•

The Schulze Method of Voting.

[...]

Markus Schulze¹•Institutions (1)

Technical University of Berlin¹

15 Mar 2018-arXiv: Computer Science and Game Theory

TL;DR: A new single-winner election method ("Schulze method") is proposed and it is proved that it satisfies many academic criteria and a generalization of the Condorcet criterion to multi-winner elections is proposed.

...read moreread less

Abstract: We propose a new single-winner election method ("Schulze method") and prove that it satisfies many academic criteria (e.g. monotonicity, reversal symmetry, resolvability, independence of clones, Condorcet criterion, k-consistency, polynomial runtime). We then generalize this method to proportional representation by the single transferable vote ("Schulze STV") and to methods to calculate a proportional ranking ("Schulze proportional ranking"). Furthermore, we propose a generalization of the Condorcet criterion to multi-winner elections. This paper contains a large number of examples to illustrate the proposed methods.

...read moreread less

Posted Content•

Depth-Limited Solving for Imperfect-Information Games

[...]

Noam Brown¹, Tuomas Sandholm¹, Brandon Amos¹•Institutions (1)

Carnegie Mellon University¹

21 May 2018-arXiv: Computer Science and Game Theory

TL;DR: A principled way to conduct depth-limited solving in imperfect-information games by allowing the opponent to choose among a number of strategies for the remainder of the game at the depth limit, which defeats two prior top agents using only a 4-core CPU and 16 GB of memory.

...read moreread less

Abstract: A fundamental challenge in imperfect-information games is that states do not have well-defined values. As a result, depth-limited search algorithms used in single-agent settings and perfect-information games do not apply. This paper introduces a principled way to conduct depth-limited solving in imperfect-information games by allowing the opponent to choose among a number of strategies for the remainder of the game at the depth limit. Each one of these strategies results in a different set of values for leaf nodes. This forces an agent to be robust to the different strategies an opponent may employ. We demonstrate the effectiveness of this approach by building a master-level heads-up no-limit Texas hold'em poker AI that defeats two prior top agents using only a 4-core CPU and 16 GB of memory. Developing such a powerful agent would have previously required a supercomputer.

...read moreread less

Posted Content•

A Collaborative Multi-agent Reinforcement Learning Anti-jamming Algorithm in Wireless Networks

[...]

Fuqiang Yao¹, Luliang Jia•Institutions (1)

National University of Defense Technology¹

12 Sep 2018-arXiv: Computer Science and Game Theory

TL;DR: Simulation results show that the proposed CMAA is superior to both sensing-based method and independent ${Q}$ -learning method, and has the highest normalized rate.

...read moreread less

Abstract: In this letter, we investigate the anti-jamming defense problem in multi-user scenarios, where the coordination among users is taken into consideration. The Markov game framework is employed to model and analyze the anti-jamming defense problem, and a collaborative multi-agent anti-jamming algorithm (CMAA) is proposed to obtain the optimal anti-jamming strategy. In sweep jamming scenarios, on the one hand, the proposed CMAA can tackle the external malicious jamming. On the other hand, it can effectively cope with the mutual interference among users. Simulation results show that the proposed CMAA is superior to both sensing based method and independent Q-learning method, and has the highest normalized rate.

...read moreread less

Posted Content•

A Game-Theoretic Approach to Recommendation Systems with Strategic Content Providers

[...]

Omer Ben-Porat¹, Moshe Tennenholtz¹•Institutions (1)

Technion – Israel Institute of Technology¹

04 Jun 2018-arXiv: Computer Science and Game Theory

TL;DR: In this paper, a game-theoretic approach to the study of recommendation systems with strategic content providers is proposed. But the Shapley mediator does not satisfy the fairness and stability requirements.

...read moreread less

Abstract: We introduce a game-theoretic approach to the study of recommendation systems with strategic content providers. Such systems should be fair and stable. Showing that traditional approaches fail to satisfy these requirements, we propose the Shapley mediator. We show that the Shapley mediator fulfills the fairness and stability requirements, runs in linear time, and is the only economically efficient mechanism satisfying these properties.

...read moreread less

Posted Content•

Simple versus Optimal Contracts

[...]

Paul Dütting¹, Tim Roughgarden², Inbal Talgam-Cohen³•Institutions (3)

London School of Economics and Political Science¹, Columbia University², Technion – Israel Institute of Technology³

10 Aug 2018-arXiv: Computer Science and Game Theory

TL;DR: This paper considers the classic principal-agent model of contract theory, and proves that linear contracts are guaranteed to be worst-case optimal, ranging over all reward distributions consistent with the given moments.

...read moreread less

Abstract: We consider the classic principal-agent model of contract theory, in which a principal designs an outcome-dependent compensation scheme to incentivize an agent to take a costly and unobservable action. When all of the model parameters---including the full distribution over principal rewards resulting from each agent action---are known to the designer, an optimal contract can in principle be computed by linear programming. In addition to their demanding informational requirements, such optimal contracts are often complex and unintuitive, and do not resemble contracts used in practice. This paper examines contract theory through the theoretical computer science lens, with the goal of developing novel theory to explain and justify the prevalence of relatively simple contracts, such as linear (pure commission) contracts. First, we consider the case where the principal knows only the first moment of each action's reward distribution, and we prove that linear contracts are guaranteed to be worst-case optimal, ranging over all reward distributions consistent with the given moments. Second, we study linear contracts from a worst-case approximation perspective, and prove several tight parameterized approximation bounds.

...read moreread less

Posted Content•

Adaptive Game-Theoretic Decision Making for Autonomous Vehicle Control at Roundabouts

[...]

Ran Tian¹, Sisi Li¹, Nan Li¹, Ilya Kolmanovsky¹, Anouck Girard¹, Yildiray Yildiz² - Show less +2 more•Institutions (2)

University of Michigan¹, Bilkent University²

01 Oct 2018-arXiv: Computer Science and Game Theory

TL;DR: In this paper, a decision-making algorithm for autonomous vehicle control at a roundabout intersection is proposed based on a game-theoretic model representing the interactions between the ego vehicle and an opponent vehicle, and adapts to an online estimated driver type of the opponent vehicle.

...read moreread less

Abstract: In this paper, we propose a decision making algorithm for autonomous vehicle control at a roundabout intersection. The algorithm is based on a game-theoretic model representing the interactions between the ego vehicle and an opponent vehicle, and adapts to an online estimated driver type of the opponent vehicle. Simulation results are reported.

...read moreread less

Posted Content•

Optimal sequential contests

[...]

Toomas Hinnosaar¹•Institutions (1)

University of Nottingham¹

13 Feb 2018-arXiv: Computer Science and Game Theory

TL;DR: The total effort is maximized with full transparency and minimized with no transparency in sequential contests where the efforts of earlier players may be disclosed to later players by nature or by design.

...read moreread less

Abstract: I study sequential contests where the efforts of earlier players may be disclosed to later players by nature or by design. The model has a range of applications, including rent seeking, R&D, oligopoly, public goods provision, and tragedy of the commons. I show that information about other players' efforts increases the total effort. Thus, the total effort is maximized with full transparency and minimized with no transparency. I also study the advantages of moving earlier and the limits of large contests.

...read moreread less

Posted Content•

Revisiting CFR+ and Alternating Updates

[...]

Neil Burch, Matej Moravcik, Martin Schmid

26 Oct 2018-arXiv: Computer Science and Game Theory

TL;DR: The CFR+ algorithm for solving imperfect information games is a variant of the popular CFR algorithm, with faster empirical performance on a range of problems, introduced with a theoretical upper bound on solution error, but subsequent work showed an error in one step of the proof.

...read moreread less

Abstract: The CFR+ algorithm for solving imperfect information games is a variant of the popular CFR algorithm, with faster empirical performance on a range of problems. It was introduced with a theoretical upper bound on solution error, but subsequent work showed an error in one step of the proof. We provide updated proofs to recover the original bound.

...read moreread less

Collapse