Source author record

Jérôme Renault

Jérôme Renault appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


math.OC Computer Science and Game Theory math.CO

Catalog footprint

What is connected

9 works

3 topics

4 close collaborators


Preprint, 2022, arXiv

Competition and Recall in Selection Problems

We consider the problem in which n items arrive at a market sequentially over time, where two agents compete to choose the best possible item. When an agent selects an item, he leaves the market and obtains a payoff given by the value of the item, which is represented by a random variable following a known distribution with support contained in [0, 1]. We consider two different settings for this problem. In the first one, namely the competitive selection problem with no recall, agents observe the value of each item upon its arrival and decide whether to accept or reject it; a rejected item cannot be selected later. In the second setting, called the competitive selection problem with recall, agents are allowed to select any of the available items that have arrived so far. For each of these problems, we describe the game induced by the selection problem as a sequential game with imperfect information and study the set of subgame-perfect Nash equilibrium payoffs. We also study the efficiency of the game equilibria. More specifically, we address the question of how much better it is to be able to select any available item rather than to decide in a take-it-or-leave-it fashion. To this end, we define and study the price of anarchy and price of stability of a game instance as the ratio between the maximal sum of payoffs obtained by players under any feasible strategy and the sum of payoffs for the worst and best subgame-perfect Nash equilibrium, respectively. For the no recall case, we prove that if there are two agents and two items arriving sequentially over time, both the price of anarchy and the price of stability are upper bounded by the constant 4/3 for any value distribution. Moreover, we show that this bound is tight.
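The two efficiency ratios described in this abstract can be written out as follows. This is only a sketch of the verbal definition: the symbols $\Gamma$ (game instance), $\Sigma$ (feasible strategy profiles), $\mathrm{SPE}(\Gamma)$ (subgame-perfect Nash equilibria), $u_1, u_2$ (expected payoffs) and $\mathrm{OPT}$ are notation assumed here, not taken from the paper, and the extrema are assumed to be attained with positive denominators.

```latex
\[
\mathrm{OPT}(\Gamma) = \max_{\sigma \in \Sigma} \bigl( u_1(\sigma) + u_2(\sigma) \bigr),
\]
\[
\mathrm{PoA}(\Gamma) = \frac{\mathrm{OPT}(\Gamma)}{\min_{\sigma \in \mathrm{SPE}(\Gamma)} \bigl( u_1(\sigma) + u_2(\sigma) \bigr)},
\qquad
\mathrm{PoS}(\Gamma) = \frac{\mathrm{OPT}(\Gamma)}{\max_{\sigma \in \mathrm{SPE}(\Gamma)} \bigl( u_1(\sigma) + u_2(\sigma) \bigr)}.
\]
% Every equilibrium is a feasible profile, so its total payoff is at most OPT:
\[
1 \le \mathrm{PoS}(\Gamma) \le \mathrm{PoA}(\Gamma).
\]
```

With this notation, the stated result for two agents and two items without recall reads $\mathrm{PoS}(\Gamma) \le \mathrm{PoA}(\Gamma) \le 4/3$ for every value distribution.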

Preprint, 2020, arXiv

Strategic information transmission with sender's approval

We consider a sender-receiver game with an outside option for the sender. After the cheap talk phase, the receiver makes a proposal to the sender, which the latter can reject. We study situations in which the sender's approval is crucial to the receiver. We show that a partitional (perfect Bayesian Nash) equilibrium exists if the sender has only two types or if the receiver's preferences over decisions do not depend on the type of the sender as long as the latter participates. The result does not extend: we construct a counter-example (with three types for the sender and type-dependent affine utility functions) in which there is no mixed equilibrium. In the three-type case, we provide a full characterization of (possibly mediated) equilibria.

Preprint, 2016, arXiv

On values of repeated games with signals

We study the existence of different notions of value in two-person zero-sum repeated games where the state evolves and players receive signals. We provide some examples showing that the limsup value (and the uniform value) may not exist in general. Then we show the existence of the value for any Borel payoff function if the players observe a public signal including the actions played. We also prove two other positive results without assumptions on the signaling structure: the existence of the $\sup$ value in any game and the existence of the uniform value in recursive games with nonnegative payoffs.

Preprint, 2015, arXiv

Limit value for optimal control with general means

We consider an optimal control problem with an integral cost which is a mean of a given function. As a particular case, the cost concerned is the Cesàro average. The limit of the value with Cesàro mean when the horizon tends to infinity is widely studied in the literature. We address the more general question of the existence of a limit when the averaging parameter converges, for values defined with means of general types. We consider a given function and a family of costs defined as the mean of the function with respect to a family of probability measures (the evaluations) on $\mathbb{R}_+$. We provide conditions on the evaluations in order to obtain the uniform convergence of the associated value function (when the parameter of the family converges). Our main result gives a necessary and sufficient condition in terms of the total variation of the family of probability measures on $\mathbb{R}_+$. As a byproduct, we obtain the existence of a limit value (for general means) for control systems having a compact invariant set and satisfying a suitable nonexpansive property.

Preprint, 2014, arXiv

Hidden Stochastic Games and Limit Equilibrium Payoffs

We consider 2-player stochastic games with perfectly observed actions, and study the limit, as the discount factor goes to one, of the set of equilibrium payoffs. In the usual setup where current states are observed by the players, we show that the set of stationary equilibrium payoffs always converges, and provide a simple example where the set of equilibrium payoffs has no limit. We then introduce the more general model of hidden stochastic games, where the players publicly receive imperfect signals over current states. In this setup we present an example where not only does the limit set of equilibrium payoffs not exist, but there is no converging selection of equilibrium payoffs. This second example is robust in many aspects, in particular to perturbations of the payoffs and to the introduction of correlation or communication devices.

Preprint, 2014, arXiv

The value of Markov Chain Games with incomplete information on both sides

We consider zero-sum repeated games with incomplete information on both sides, where the states privately observed by each player follow independent Markov chains. It generalizes the model, introduced by Aumann and Maschler in the sixties and solved by Mertens and Zamir in the seventies, where the private states of the players were fixed. It also includes the model introduced in Renault (2006), of Markov chain repeated games with lack of information on one side, where only one player privately observes the sequence of states. We prove here that the limit value exists, and we obtain a characterization via the Mertens-Zamir system, where the "non revealing value function" plugged in the system is now defined as the limit value of an auxiliary "non revealing" dynamic game. This non revealing game is defined by restricting the players not to reveal any information on the limit behavior of their own Markov chain, as in Renault (2006). There are two key technical difficulties in the proof: 1) proving regularity, in the sense of equicontinuity, of the $T$-stage non revealing value functions, and 2) constructing strategies by blocks in order to link the values of the non revealing games with the original values.

Preprint, 2013, arXiv

General limit value in Dynamic Programming

We consider a dynamic programming problem with arbitrary state space and bounded rewards. Is it possible to define in a unique way a limit value for the problem, where the "patience" of the decision-maker tends to infinity? We consider, for each evaluation $θ$ (a probability distribution over positive integers) the value function $v_θ$ of the problem where the weight of any stage $t$ is given by $θ_t$, and we investigate the uniform convergence of a sequence $(v_{θ^k})_k$ when the "impatience" of the evaluations vanishes, in the sense that $\sum_{t} |θ^k_{t}-θ^k_{t+1}| \rightarrow_{k \to \infty} 0$. We prove that this uniform convergence happens if and only if the metric space $\{v_{θ^k}, k\geq 1\}$ is totally bounded. Moreover there exists a particular function $v^*$, independent of the particular chosen sequence $({θ^k})_k$, such that any limit point of such a sequence of value functions is precisely $v^*$. Consequently, while speaking of uniform convergence of the value functions, $v^*$ may be considered as the unique possible limit when the patience of the decision-maker tends to infinity. The result applies in particular to discounted payoffs when the discount factor vanishes, as well as to average payoffs where the number of stages goes to infinity, and also to models with stochastic transitions. We present tractable corollaries, and we discuss counterexamples and a conjecture.
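The "impatience" quantity $\sum_{t} |θ_{t}-θ_{t+1}|$ used in this abstract can be illustrated numerically. The following is a minimal sketch, not code from the paper: the function names `impatience`, `cesaro` and `discounted` are hypothetical, evaluations are represented as finite lists (weights beyond the end of the list are taken to be 0), and the discounted evaluation is truncated at a finite horizon, so its weights sum to slightly less than 1.

```python
from typing import List, Sequence


def impatience(theta: Sequence[float]) -> float:
    """Return sum_t |theta_t - theta_{t+1}| for a finitely supported evaluation.

    theta[0] is the weight of stage 1; all weights past the end of the
    sequence are assumed to be 0, so the final drop to 0 is counted too.
    """
    padded = list(theta) + [0.0]
    return sum(abs(a - b) for a, b in zip(padded, padded[1:]))


def cesaro(n: int) -> List[float]:
    """Cesaro (uniform) evaluation: weight 1/n on each of the first n stages."""
    return [1.0 / n] * n


def discounted(lam: float, horizon: int) -> List[float]:
    """Discounted evaluation theta_t = lam * (1 - lam)^(t - 1), truncated.

    Truncation at `horizon` stages is an assumption made so the evaluation
    fits in a finite list; the true evaluation has infinite support.
    """
    return [lam * (1.0 - lam) ** t for t in range(horizon)]


if __name__ == "__main__":
    for n in (4, 10, 100):
        print("Cesaro", n, impatience(cesaro(n)))
    for lam in (0.5, 0.1, 0.01):
        print("discounted", lam, impatience(discounted(lam, 1000)))
```

For a nonincreasing evaluation the sum telescopes to the first weight, so the Cesàro evaluation over $n$ stages has impatience $1/n$ and the discounted evaluation with parameter $λ$ has impatience $λ$. Both vanish in the two regimes named in the abstract: the number of stages going to infinity, and the discount parameter going to zero.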

Preprint, 2013, arXiv

Ramsey-type results on singletons, co-singletons and monotone sequences in large collections of sets

We say that a 0-1 matrix $N$ of size $a\times b$ can be found in a collection of sets $\mathcal{H}$ if we can find sets $H_{1}, H_{2}, \dots, H_{a}$ in $\mathcal{H}$ and elements $e_1, e_2, \dots, e_b$ in $\cup_{H \in \mathcal{H}} H$ such that $N$ is the incidence matrix of the sets $H_{1}, H_{2}, \dots, H_{a}$ over the elements $e_1, e_2, \dots, e_b$. We prove the following Ramsey-type result: for every $n\in \mathbb{N}$, there exists a number $S(n)$ such that in any collection of at least $S(n)$ sets, one can find either the incidence matrix of a collection of $n$ singletons, or its complementary matrix, or the incidence matrix of a collection of $n$ sets completely ordered by inclusion. We give several results of the same extremal set theoretical flavour. For some of these, we give the exact value of the number of sets required.
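The notion of a matrix being "found" in a collection of sets can be made concrete with a brute-force search. This is a sketch under assumptions that the abstract does not spell out: the chosen sets are taken to be distinct members of the collection, the chosen elements are taken to be distinct, and entry $(i, j)$ of the matrix is 1 exactly when $e_j \in H_i$. The function name `find_matrix` is hypothetical, and the search is exponential, so it is only suitable for tiny examples.

```python
from itertools import permutations
from typing import FrozenSet, Hashable, List, Optional, Sequence, Tuple

SetT = FrozenSet[Hashable]


def find_matrix(
    matrix: Sequence[Sequence[int]],
    collection: Sequence[SetT],
) -> Optional[Tuple[List[SetT], List[Hashable]]]:
    """Search for sets H_1..H_a and elements e_1..e_b realising `matrix`.

    Returns (sets, elements) such that matrix[i][j] == 1 exactly when
    elements[j] is in sets[i], or None if no such choice exists.
    Assumption: the sets are distinct entries of `collection` and the
    elements are distinct members of the union of the collection.
    """
    a = len(matrix)
    b = len(matrix[0]) if a else 0
    # Ground set: union of all sets; sorted by repr only for determinism.
    ground = sorted(set().union(*collection), key=repr)
    for sets in permutations(collection, a):
        for elems in permutations(ground, b):
            if all(
                (elems[j] in sets[i]) == bool(matrix[i][j])
                for i in range(a)
                for j in range(b)
            ):
                return list(sets), list(elems)
    return None


if __name__ == "__main__":
    family = [frozenset({1}), frozenset({2}), frozenset({1, 2})]
    identity = [[1, 0], [0, 1]]  # incidence matrix of two singletons
    chain = [[1, 0], [1, 1]]     # two sets ordered by inclusion
    print(find_matrix(identity, family))
    print(find_matrix(chain, family))
```

In this small family both patterns are present: the identity matrix is realised by the sets {1} and {2}, and the chain matrix by {1} inside {1, 2}. In the two-set family {1}, {1, 2} the identity matrix cannot be found, because no element lies in {1} but outside {1, 2}.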

Preprint, 2012, arXiv

A distance for probability spaces, and long-term values in Markov Decision Processes and Repeated Games

Given a finite set $K$, we denote by $X=Δ(K)$ the set of probabilities on $K$ and by $Z=Δ_f(X)$ the set of Borel probabilities on $X$ with finite support. Studying a Markov Decision Process with partial information on $K$ naturally leads to a Markov Decision Process with full information on $X$. We introduce a new metric $d_*$ on $Z$ such that the transitions become 1-Lipschitz from $(X, \|\cdot\|_1)$ to $(Z,d_*)$. In the first part of the article, we define and prove several properties of the metric $d_*$. In particular, $d_*$ satisfies a Kantorovich-Rubinstein type duality formula and can be characterized by using disintegrations. In the second part, we characterize the limit values in several classes of "compact non expansive" Markov Decision Processes. In particular we use the metric $d_*$ to characterize the limit value in Partial Observation MDPs with finitely many states and in Repeated Games with an informed controller with finite sets of states and actions. Moreover, in each case we can prove the existence of a generalized notion of uniform value where we consider not only the Cesàro mean when the number of stages is large enough but any evaluation function $θ\in Δ(\mathbb{N}^*)$ when the impatience $I(θ)=\sum_{t\geq 1} |θ_{t+1}-θ_t|$ is small enough.
