Source author record

Vladimir Egorov

Vladimir Egorov appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Artificial Intelligence physics.app-ph cond-mat.mtrl-sci math.RA Multiagent Systems quant-ph

Catalog footprint

What is connected

7works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Scalable Multi-Agent Model-Based Reinforcement Learning

Recent Multi-Agent Reinforcement Learning (MARL) literature has been largely focused on Centralized Training with Decentralized Execution (CTDE) paradigm. CTDE has been a dominant approach for both cooperative and mixed environments due to its capability to efficiently train decentralized policies. While in mixed environments full autonomy of the agents can be a desirable outcome, cooperative environments allow agents to share information to facilitate coordination. Approaches that leverage this technique are usually referred as communication methods, as full autonomy of agents is compromised for better performance. Although communication approaches have shown impressive results, they do not fully leverage this additional information during training phase. In this paper, we propose a new method called MAMBA which utilizes Model-Based Reinforcement Learning (MBRL) to further leverage centralized training in cooperative environments. We argue that communication between agents is enough to sustain a world model for each agent during execution phase while imaginary rollouts can be used for training, removing the necessity to interact with the environment. These properties yield sample efficient algorithm that can scale gracefully with the number of agents. We empirically confirm that MAMBA achieves good performance while reducing the number of interactions with the environment up to an orders of magnitude compared to Model-Free state-of-the-art approaches in challenging domains of SMAC and Flatland.

preprint2022arXiv

Self-Imitation Learning from Demonstrations

Despite the numerous breakthroughs achieved with Reinforcement Learning (RL), solving environments with sparse rewards remains a challenging task that requires sophisticated exploration. Learning from Demonstrations (LfD) remedies this issue by guiding the agent's exploration towards states experienced by an expert. Naturally, the benefits of this approach hinge on the quality of demonstrations, which are rarely optimal in realistic scenarios. Modern LfD algorithms require meticulous tuning of hyperparameters that control the influence of demonstrations and, as we show in the paper, struggle with learning from suboptimal demonstrations. To address these issues, we extend Self-Imitation Learning (SIL), a recent RL algorithm that exploits the agent's past good experience, to the LfD setup by initializing its replay buffer with demonstrations. We denote our algorithm as SIL from Demonstrations (SILfD). We empirically show that SILfD can learn from demonstrations that are noisy or far from optimal and can automatically adjust the influence of demonstrations throughout the training without additional hyperparameters or handcrafted schedules. We also find SILfD superior to the existing state-of-the-art LfD algorithms in sparse environments, especially when demonstrations are highly suboptimal.

preprint2021arXiv

Balancing Rational and Other-Regarding Preferences in Cooperative-Competitive Environments

Recent reinforcement learning studies extensively explore the interplay between cooperative and competitive behaviour in mixed environments. Unlike cooperative environments where agents strive towards a common goal, mixed environments are notorious for the conflicts of selfish and social interests. As a consequence, purely rational agents often struggle to achieve and maintain cooperation. A prevalent approach to induce cooperative behaviour is to assign additional rewards based on other agents' well-being. However, this approach suffers from the issue of multi-agent credit assignment, which can hinder performance. This issue is efficiently alleviated in cooperative setting with such state-of-the-art algorithms as QMIX and COMA. Still, when applied to mixed environments, these algorithms may result in unfair allocation of rewards. We propose BAROCCO, an extension of these algorithms capable to balance individual and social incentives. The mechanism behind BAROCCO is to train two distinct but interwoven components that jointly affect each agent's decisions. Our meta-algorithm is compatible with both Q-learning and Actor-Critic frameworks. We experimentally confirm the advantages over the existing methods and explore the behavioural aspects of BAROCCO in two mixed multi-agent setups.

preprint2020arXiv

Architected Porous Metals in Electrochemical Energy Storage

Porous metallic structures are regularly used in electrochemical energy storage devices as supports, current collectors or active electrode materials. Bulk metal porosification, dealloying, welding or chemical synthesis routes involving crystal growth or self-assembly for example, can sometimes provide limited control of porous length scale, ordering, periodicity, reproducibility, porosity and surface area. Additive manufacturing and 3D printing has shown the potential to revolutionize the fabrication of architected metals many forms, allowing complex geometries not usually possible by traditional methods, but enabling complete design freedom of a porous metal based on the required physical or chemical property to be exploited. We discuss properties of porous metal structures in EES devices and provide some opinions on how architected metals may alleviate issues with electrochemically active porous metal current collectors, and provide opportunities for optimum design based on electrochemical characteristics required by batteries, supercapacitors or other electrochemical devices.

preprint2020arXiv

Laser damage attack against optical attenuators in quantum key distribution

Many quantum key distribution systems employ a laser followed by an optical attenuator to prepare weak coherent states in the source. Their mean photon number must be pre-calibrated to guarantee the security of key distribution. Here we experimentally show that this calibration can be broken with a high-power laser attack. We have tested four fiber-optic attenuator types used in quantum key distribution systems, and found that two of them exhibit a permanent decrease in attenuation after laser damage. This results in higher mean photon numbers in the prepared states and may allow an eavesdropper to compromise the key.

preprint2019arXiv

Evolution of 3D Printing Methods and Materials for Electrochemical Energy Storage

Additive manufacturing has revolutionized the building of materials direct from design, allowing high resolution rapid prototyping in complex 3D designs with many materials. 3D printing hasenabled high strength damage-tolerant structures, bioprinted artificial organs and tissues, ultralight metals, medicine, education, prosthetics, architecture, consumer electronics,and as a prototyping tool for engineers and hobbyists alike. 3D printing has emerged as a useful tool for complex electrode and material assembly method for batteries and supercapacitors in recent years. The field initially grew from extrusion-based methods such as fused deposition modelling, and evolved to photopolymerization printing of intricate composites, while supercapacitor technologies less sensitive to solvents more often involved material jetting processes. Underpinning every part of a 3D printable battery and many other devices is the printing method and the nature of the feed material. Material purity, printing fidelity, accuracy, complexity, and the ability to form conductive, ceramic, glassy, or solvent-stable plastics relies on the nature of the feed material or composite to such an extent, that the future of 3D printable batteries and electrochemical energy storage devices will depend on materials and printing methods that are co-operatively informed by the requirements of the device and how it is fabricated. In this Perspective, we address the materials and methods requirements in 3D printable batteries and supercapacitors and outline requirements for the future of the field by linking existing performance limitations to the requirements of printable energy storage materials, casing materials and the direct printing of electrodes and electrolytes. We also look to the future by taking inspiration from additive manufacturing, to posit links between materials and printing methods to allow new form factor cells.

preprint2012arXiv

A cohomological proof of Peterson-Kac's theorem on conjugacy of Cartan subalgebras of affine Kac-Moody Lie algebras

This paper deals with the problem of conjugacy of Cartan subalgebras for affine Kac-Moody Lie algebras. Unlike the methods used by Peterson and Kac, our approach is entirely cohomological and geometric. It is deeply rooted on the theory of reductive group schemes developed by Demazure and Grothendieck, and on the work of J. Tits on buildings.