Source author record

Nan Rong

Nan Rong appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computer Science and Game Theory Robotics

Catalog footprint

What is connected

4works

3topics

2close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2020arXiv

MDPs with Unawareness in Robotics

We formalize decision-making problems in robotics and automated control using continuous MDPs and actions that take place over continuous time intervals. We then approximate the continuous MDP using finer and finer discretizations. Doing this results in a family of systems, each of which has an extremely large action space, although only a few actions are "interesting". We can view the decision maker as being unaware of which actions are "interesting". We can model this using MDPUs, MDPs with unawareness, where the action space is much smaller. As we show, MDPUs can be used as a general framework for learning tasks in robotic problems. We prove results on the difficulty of learning a near-optimal policy in an an MDPU for a continuous task. We apply these ideas to the problem of having a humanoid robot learn on its own how to walk.

preprint2014arXiv

Cooperative Equilibrium: A solution predicting cooperative play

Nash equilibrium (NE) assumes that players always make a best response. However, this is not always true; sometimes people cooperate even it is not a best response to do so. For example, in the Prisoner's Dilemma, people often cooperate. Are there rules underlying cooperative behavior? In an effort to answer this question, we propose a new equilibrium concept: perfect cooperative equilibrium (PCE), and two related variants: max-PCE and cooperative equilibrium. PCE may help explain players' behavior in games where cooperation is observed in practice. A player's payoff in a PCE is at least as high as in any NE. However, a PCE does not always exist. We thus consider α-PCE, where α takes into account the degree of cooperation; a PCE is a 0-PCE. Every game has a Pareto-optimal max-PCE (M-PCE); that is, an α-PCE for a maximum α. We show that M-PCE does well at predicting behavior in quite a few games of interest. We also consider cooperative equilibrium (CE), another generalization of PCE that takes punishment into account. Interestingly, all Pareto-optimal M-PCE are CE. We prove that, in 2-player games, a PCE (if it exists), a M-PCE, and a CE can all be found in polynomial time using bilinear programming. This is a contrast to Nash equilibrium, which is PPAD complete even in 2-player games [Chen, Deng, and Teng 2009]. We compare M-PCE to the coco value [Kalai and Kalai 2009], another solution concept that tries to capture cooperation, both axiomatically and in terms of an algebraic characterization, and show that the two are closely related, despite their very different definitions.

preprint2014arXiv

MDPs with Unawareness

Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision maker (DM) knows all states and actions. However, this may not be true in many situations of interest. We define a new framework, MDPs with unawareness (MDPUs) to deal with the possibilities that a DM may not be aware of all possible actions. We provide a complete characterization of when a DM can learn to play near-optimally in an MDPU, and give an algorithm that learns to play near-optimally when it is possible to do so, as efficiently as possible. In particular, we characterize when a near-optimal solution can be found in polynomial time.

preprint2010arXiv

Nan Rong

What is connected

Connect this record

See the researcher in context

Building this map preview

4 published item(s)

MDPs with Unawareness in Robotics

Cooperative Equilibrium: A solution predicting cooperative play

MDPs with Unawareness

MDPs with Unawareness