Source author record

Marcin Michalski

Marcin Michalski appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Machine Learning · math.GN · Computer Vision

Catalog footprint

What is connected

6 works

3 topics

4 close collaborators

preprint · 2020 · arXiv

A Large-scale Study of Representation Learning with the Visual Task Adaptation Benchmark

Representation learning promises to unlock deep learning for the long tail of vision tasks without expensive labelled datasets. Yet, the absence of a unified evaluation for general visual representations hinders progress. Popular protocols are often too constrained (linear classification), limited in diversity (ImageNet, CIFAR, Pascal-VOC), or only weakly related to representation quality (ELBO, reconstruction error). We present the Visual Task Adaptation Benchmark (VTAB), which defines good representations as those that adapt to diverse, unseen tasks with few examples. With VTAB, we conduct a large-scale study of many popular publicly-available representation learning algorithms. We carefully control confounders such as architecture and tuning budget. We address questions like: How effective are ImageNet representations beyond standard natural datasets? How do representations trained via generative and discriminative models compare? To what extent can self-supervision replace labels? And, how close are we to general visual representations?

preprint · 2020 · arXiv

Google Research Football: A Novel Reinforcement Learning Environment

Recent progress in the field of reinforcement learning has been accelerated by virtual learning environments such as video games, where novel algorithms and ideas can be quickly tested in a safe and reproducible manner. We introduce the Google Research Football Environment, a new reinforcement learning environment where agents are trained to play football in an advanced, physics-based 3D simulator. The resulting environment is challenging, easy to use and customize, and it is available under a permissive open-source license. In addition, it provides support for multiplayer and multi-agent experiments. We propose three full-game scenarios of varying difficulty with the Football Benchmarks and report baseline results for three commonly used reinforcement algorithms (IMPALA, PPO, and Ape-X DQN). We also provide a diverse set of simpler scenarios with the Football Academy and showcase several promising research directions.

preprint · 2020 · arXiv

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference

We present a modern scalable reinforcement learning agent called SEED (Scalable, Efficient Deep-RL). By effectively utilizing modern accelerators, we show that it is not only possible to train on millions of frames per second but also to lower the cost of experiments compared to current methods. We achieve this with a simple architecture that features centralized inference and an optimized communication layer. SEED adopts two state of the art distributed algorithms, IMPALA/V-trace (policy gradients) and R2D2 (Q-learning), and is evaluated on Atari-57, DeepMind Lab and Google Research Football. We improve the state of the art on Football and are able to reach state of the art on Atari-57 three times faster in wall-time. For the scenarios we consider, a 40% to 80% cost reduction for running experiments is achieved. The implementation along with experiments is open-sourced so results can be reproduced and novel ideas tried out.

preprint · 2020 · arXiv

What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study

In recent years, on-policy reinforcement learning (RL) has been successfully applied to many different continuous control tasks. While RL algorithms are often conceptually simple, their state-of-the-art implementations take numerous low- and high-level design decisions that strongly affect the performance of the resulting agents. Those choices are usually not extensively discussed in the literature, leading to discrepancy between published descriptions of algorithms and their implementations. This makes it hard to attribute progress in RL and slows down overall progress [Engstrom'20]. As a step towards filling that gap, we implement >50 such "choices" in a unified on-policy RL framework, allowing us to investigate their impact in a large-scale empirical study. We train over 250,000 agents in five continuous control environments of different complexity and provide insights and practical recommendations for on-policy training of RL agents.

preprint · 2015 · arXiv

Some properties of $\mathcal{I}$-Luzin sets

In this paper we consider a notion of $\mathcal{I}$-Luzin set which generalizes the classical notion of Luzin set and Sierpiński set on Euclidean spaces. We show that there is a translation invariant $\sigma$-ideal $\mathcal{I}$ with Borel base for which $\mathcal{I}$-Luzin set can be $\mathcal{I}$-measurable. If we additionally assume that $\mathcal{I}$ has Smital property (or its weaker version) then $\mathcal{I}$-Luzin sets are $\mathcal{I}$-nonmeasurable. We give some constructions of $\mathcal{I}$-Luzin sets involving additive structure of $\mathbb{R}^n$. Moreover, we show that if $L$ is a Luzin set and $S$ is a Sierpiński set then the complex sum $L+S$ cannot be a Bernstein set.

preprint · 2014 · arXiv

Luzin and Sierpiński sets, some nonmeasurable subsets of the plane and additive properties on the line

In this paper we shall introduce some nonmeasurable and completely nonmeasurable subsets of the plane with various additional properties, e.g. being a Hamel basis or intersecting each line in a strong Luzin / Sierpiński set. Some additive properties of Luzin and Sierpiński sets, and of their generalization, I-Luzin sets, on the line are also investigated.
