Source author record

Jakub Cerny

Jakub Cerny appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

astro-ph.EP; Computer Science and Game Theory

Catalog footprint

What is connected

2 works

2 topics

4 close collaborators

Preprint, 2022, arXiv

A Unified Perspective on Deep Equilibrium Finding

Extensive-form games provide a versatile framework for modeling interactions of multiple agents subject to imperfect observations and stochastic events. In recent years, two paradigms, policy space response oracles (PSRO) and counterfactual regret minimization (CFR), showed that extensive-form games may indeed be solved efficiently. Both are capable of leveraging deep neural networks to tackle the scalability issues inherent to extensive-form games, and we refer to them as deep equilibrium-finding algorithms. Even though PSRO and CFR share some similarities, they are often regarded as distinct, and it remains unclear which of the two is superior. Instead of answering this question directly, in this work we propose a unified perspective on deep equilibrium finding that generalizes both PSRO and CFR. Our four main contributions are: i) a novel response oracle (RO) which computes Q values as well as reaching probability values and baseline values; ii) two transform modules -- a pre-transform and a post-transform -- represented by neural networks transforming the outputs of RO to a latent additive space (LAS), and then the LAS to action probabilities for execution; iii) two average oracles -- local average oracle (LAO) and global average oracle (GAO) -- where LAO operates on LAS and GAO is used for evaluation only; and iv) a novel method inspired by fictitious play that optimizes the transform modules and average oracles, and automatically selects the optimal combination of components of the two frameworks. Experiments on the Leduc poker game demonstrate that our approach can outperform both frameworks.
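The abstract above names fictitious play as the inspiration for its optimization method. For readers unfamiliar with the idea, here is a minimal sketch of classical fictitious play on a small zero-sum matrix game. It is not the paper's algorithm and involves no neural networks, response oracles or latent additive space. The function name `fictitious_play`, the matching-pennies payoff matrix, the initial actions and the iteration count are all illustrative assumptions.

```python
def fictitious_play(payoff, iterations=10_000):
    """Classical fictitious play on a two-player zero-sum matrix game.

    payoff[i][j] is the row player's payoff; the column player gets the negative.
    Returns the empirical (time-averaged) strategies of both players.
    """
    n_rows, n_cols = len(payoff), len(payoff[0])
    row_counts = [0] * n_rows
    col_counts = [0] * n_cols
    # Assumption of this sketch: both players start by playing action 0.
    row_counts[0] = 1
    col_counts[0] = 1

    for _ in range(iterations - 1):
        # Value of each row action against the column player's empirical counts.
        row_values = [
            sum(payoff[i][j] * col_counts[j] for j in range(n_cols))
            for i in range(n_rows)
        ]
        # Row player's payoff for each column action against the row counts.
        col_values = [
            sum(payoff[i][j] * row_counts[i] for i in range(n_rows))
            for j in range(n_cols)
        ]
        # Row player maximizes, column player minimizes (ties go to the lowest index).
        row_action = max(range(n_rows), key=row_values.__getitem__)
        col_action = min(range(n_cols), key=col_values.__getitem__)
        row_counts[row_action] += 1
        col_counts[col_action] += 1

    return (
        [count / iterations for count in row_counts],
        [count / iterations for count in col_counts],
    )


# Hypothetical example game: matching pennies, whose unique equilibrium is uniform play.
MATCHING_PENNIES = [[1, -1], [-1, 1]]
row_avg, col_avg = fictitious_play(MATCHING_PENNIES)
print(row_avg, col_avg)
```

In this toy game the averaged strategies drift toward uniform play even though each individual best response is deterministic. Averaging plays a similarly central role in both PSRO-style and CFR-style methods.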

Preprint, 2014, arXiv

Unexpected fading of comet C/2003 T4 (LINEAR) and disintegration of C/2012 S1 (ISON)

Comet C/2003 T4 (LINEAR) exhibits a large asymmetry in brightness before and after perihelion, becoming much fainter after it. Large non-gravitational forces show that the mass of the nucleus does not exceed 2.51 × 10^11 kg, which is nearly double that of the previously disintegrated comets C/2012 S1 (ISON) and C/1999 S4 (LINEAR). The water mass lost by C/2003 T4 in the interval from 70 days before to 60 days after perihelion was nearly 3.16 (± 0.60) × 10^10 kg, or >13 % of the total nucleus mass (>21 % for C/2012 S1), which is much larger than the value of >7 % previously stated for C/1999 S4.
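The mass-loss fraction quoted for C/2003 T4 can be checked from the two figures given in the abstract. Because the nucleus mass is an upper limit, dividing the water mass loss by it gives a lower bound on the fraction lost. The short calculation below is only a sketch: the variable names are illustrative, and propagating the ± 0.60 uncertainty by simple end-point substitution is an assumption, not the authors' stated procedure.

```python
# Figures taken from the abstract (C/2003 T4).
max_nucleus_mass_kg = 2.51e11   # upper limit on the nucleus mass
water_loss_kg = 3.16e10         # water mass lost around perihelion
water_loss_err_kg = 0.60e10     # quoted uncertainty on the mass loss

# Upper limit on the mass => lower bound on the fraction lost.
min_fraction = water_loss_kg / max_nucleus_mass_kg

# Assumption: end-point substitution for the quoted uncertainty.
min_fraction_low = (water_loss_kg - water_loss_err_kg) / max_nucleus_mass_kg
min_fraction_high = (water_loss_kg + water_loss_err_kg) / max_nucleus_mass_kg

print(f"central lower bound: {min_fraction:.1%}")
print(f"range from uncertainty: {min_fraction_low:.1%} to {min_fraction_high:.1%}")
```

The central value comes out at roughly 12.6 %, which appears consistent with the abstract's rounded figure of >13 %.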