Source author record

Jian Yuan

Jian Yuan appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher | Unclaimed source record

Information Theory; math.IT; Machine Learning; Computational Engineering, Finance, and Science; cond-mat.mtrl-sci; Cryptography and Security; hep-ex; math.DS; Multiagent Systems; nlin.CD; nucl-ex; physics.class-ph; physics.ins-det; physics.optics

Catalog footprint

What is connected

11 works

14 topics

4 close collaborators


preprint | 2022 | arXiv

Learning to Advise and Learning from Advice in Cooperative Multi-Agent Reinforcement Learning

Learning to coordinate is a daunting problem in multi-agent reinforcement learning (MARL). Previous works have explored it from many facets, including cognition between agents, credit assignment, communication, expert demonstration, etc. However, less attention has been paid to agents' decision structure and the hierarchy of coordination. In this paper, we explore the spatiotemporal structure of agents' decisions and consider the hierarchy of coordination from the perspective of multilevel emergence dynamics, based on which a novel approach, Learning to Advise and Learning from Advice (LALA), is proposed to improve MARL. Specifically, by distinguishing the hierarchy of coordination, we propose to enhance decision coordination at the meso level with an advisor and to leverage a policy discriminator to advise agents' learning at the micro level. The advisor learns to aggregate decision information in both spatial and temporal domains and generates coordinated decisions by employing a spatiotemporal dual graph convolutional neural network with a task-oriented objective function. Each agent learns from the advice via a policy generative adversarial learning method in which a discriminator distinguishes between the policies of the agent and the advisor and boosts both of them based on its judgement. Experimental results indicate the advantage of LALA over baseline approaches in terms of both learning efficiency and coordination capability. The coordination mechanism is investigated from the perspectives of multilevel emergence dynamics and mutual information, which provides a novel perspective and method to analyze and improve MARL algorithms.

preprint | 2022 | arXiv

Supervised Off-Policy Ranking

Off-policy evaluation (OPE) evaluates a target policy with data generated by other policies. Most previous OPE methods focus on precisely estimating the true performance of a policy. We observe that in many applications, (1) the end goal of OPE is to compare two or more candidate policies and choose a good one, which is a much simpler task than precisely evaluating their true performance; and (2) there are usually multiple policies that have been deployed to serve users in real-world systems, and thus the true performance of these policies can be known. Inspired by the two observations, in this work, we study a new problem, supervised off-policy ranking (SOPR), which aims to rank a set of target policies based on supervised learning by leveraging off-policy data and policies with known performance. We propose a method to solve SOPR, which learns a policy scoring model by minimizing a ranking loss of the training policies rather than estimating the precise policy performance. The scoring model in our method, a hierarchical Transformer-based model, maps a set of state-action pairs to a score, where the state of each pair comes from the off-policy data and the action is taken by a target policy on the state in an offline manner. Extensive experiments on public datasets show that our method outperforms baseline methods in terms of rank correlation, regret value, and stability. Our code is publicly available on GitHub.
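To make the idea of training on a ranking loss rather than on exact performance estimates concrete, here is a minimal sketch. The abstract does not say which ranking loss is used, so the pairwise logistic loss below is an assumption, and the function name `pairwise_ranking_loss` is hypothetical. The scores stand in for the outputs of a scoring model, which is not implemented here.

```python
import math


def pairwise_ranking_loss(scores, performances):
    """Mean pairwise logistic loss over policy pairs with different known performance.

    scores:       model scores for the training policies (hypothetical inputs)
    performances: known true performance of the same policies
    Assumption: a pairwise logistic loss; the source only says "a ranking loss".
    """
    total, pairs = 0.0, 0
    for i in range(len(scores)):
        for j in range(len(scores)):
            if performances[i] > performances[j]:
                # Policy i is truly better, so penalize a small or negative score margin.
                total += math.log1p(math.exp(-(scores[i] - scores[j])))
                pairs += 1
    return total / pairs if pairs else 0.0


if __name__ == "__main__":
    known = [0.9, 0.5, 0.1]
    print(pairwise_ranking_loss([2.0, 1.0, 0.0], known))  # scores agree with the ranking
    print(pairwise_ranking_loss([0.0, 1.0, 2.0], known))  # scores reverse the ranking
```

Only the order of the scores matters for this loss, which is why it can be easier to optimize than a regression onto the true performance values.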

preprint | 2020 | arXiv

Topological-darkness-assisted phase regulation for atomically thin meta-optics

Two-dimensional (2D) noble-metal dichalcogenides have emerged as a new platform for the realization of versatile flat optics with a considerable degree of miniaturization. However, light field manipulation at the atomic scale is widely considered unattainable since the vanishing thickness and intrinsic losses of 2D materials completely suppress both resonances and phase accumulation effects. Empowered by conventionally perceived adverse effects of intrinsic losses, we show that the structured PtSe2 films integrated with a uniform substrate can regulate nontrivial singular phase and realize atomic-thick meta-optics in the presence of topological darkness. We experimentally demonstrate a series of atomic-thick binary meta-optics that allows angle-robust and high unit-thickness diffraction efficiency of 0.96%/nm in visible frequencies, given its thickness of merely 4.3 nm. Our results unlock the potential of a new class of 2D flat optics for light field manipulation at an atomic thickness.

preprint | 2019 | arXiv

Beam Test of the PIN-diode Readout Units with Electron Energies from 5 to 40 GeV at CERN SPS

The Chinese large-area violet-light-sensitive silicon PIN photodiode is one of the candidates for the readout component of the lead tungstate crystal detector in the photon spectrometer of the large heavy ion collision experiment. The PIN diode was assembled with the lead tungstate crystal and the low-noise preamplifier into a complete detector unit. The beam test was carried out on the SPS accelerator at CERN. The energy resolution was measured with the electron beam energy ranging from 5 to 40 GeV. The summation correction method was discussed, and an excellent linearity of the nominal beam energy versus the peak position of the detector was obtained, showing that the punch-through effect can be ignored.

preprint | 2016 | arXiv

Chernoff Information of Bottleneck Gaussian Trees

In this paper, our objective is to find the determining factors of Chernoff information in distinguishing a set of Gaussian trees. In this set, each tree can be obtained from another tree via an edge removal and grafting operation. This is equivalent to asking for the Chernoff information between the most likely confused, i.e. "bottleneck", Gaussian trees, as recently shown to be the case for ML-estimated Gaussian tree graphs. We prove that the Chernoff information between two Gaussian trees related through an edge removal and a grafting operation is the same as that between two three-node Gaussian trees, whose topologies and edge weights are subject to the underlying graph operation. In addition, such Chernoff information is shown to be determined only by the maximum generalized eigenvalue of the two Gaussian covariance matrices. The Chernoff information of scalar Gaussian variables resulting from a linear transformation (LT) of the original Gaussian vectors is also uniquely determined by the same maximum generalized eigenvalue. What is even more interesting is that, after incorporating the cost of measurements into a normalized Chernoff information, Gaussian variables from LT have larger normalized Chernoff information than the one based on the original Gaussian vectors, as shown in our proved bounds.
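For readers unfamiliar with the quantity, the standard definition of Chernoff information and its form for two zero-mean Gaussians can be written as below. The notation ($p$, $q$, $\lambda$, $\Sigma_1$, $\Sigma_2$) is an assumption for illustration and is not taken from the paper, and the paper's eigenvalue characterization is not reproduced here.

```latex
% Chernoff information between densities p and q (standard definition)
C(p, q) = \max_{\lambda \in [0,1]} \; -\log \int p(x)^{\lambda}\, q(x)^{1-\lambda}\, dx

% For p = N(0, \Sigma_1) and q = N(0, \Sigma_2), the Gaussian integral gives
-\log \int p^{\lambda} q^{1-\lambda}\, dx
  = \tfrac{1}{2} \log \left| \lambda \Sigma_1^{-1} + (1-\lambda) \Sigma_2^{-1} \right|
  + \tfrac{\lambda}{2} \log |\Sigma_1|
  + \tfrac{1-\lambda}{2} \log |\Sigma_2|
```

As a sanity check, setting $\Sigma_1 = \Sigma_2$ makes the right-hand side zero for every $\lambda$, as expected for two identical distributions.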

preprint | 2016 | arXiv

Mechanical energy and mean equivalent viscous damping for SDOF fractional oscillators

This paper addresses the total mechanical energy of a single-degree-of-freedom fractional oscillator. Based on the energy storage and dissipation properties of the Caputo fractional derivatives, the expression for the total mechanical energy of the single-degree-of-freedom fractional oscillator is first presented. The energy regeneration due to the external exciting force and the energy loss due to the fractional damping force during the vibratory motion are analyzed. Furthermore, based on the mean energy dissipation of the fractional damping element in steady-state vibration, a new concept of mean equivalent viscous damping is suggested and the value of the damping coefficient is evaluated.

preprint | 2016 | arXiv

Sliding control for single-degree-of-freedom fractional oscillators

This paper proposes fractional sliding control designs for single-degree-of-freedom fractional oscillators of the Kelvin-Voigt type, the modified Kelvin-Voigt type and the Duffing type, whose dynamical behaviors are described by second-order differential equations involving fractional derivatives. Firstly, the differential equations of motion are transformed into non-commensurate fractional state equations by introducing state variables with physical significance. Secondly, fractional sliding manifolds are constructed and the stability of the corresponding sliding dynamics is addressed via the infinite state approach and Lyapunov stability theory. Thirdly, sliding control laws and adaptive sliding laws are designed for fractional oscillators for the cases in which the bound of the external exciting force is known or unknown, respectively. Finally, numerical simulations are carried out to validate the above control designs.

preprint | 2015 | arXiv

Asymptotic Error Free Partitioning over Noisy Boolean Multiaccess Channels

In this paper, we consider the problem of partitioning active users in a manner that facilitates multi-access without collision. The setting is a noisy, synchronous, Boolean, multi-access channel where $K$ active users (out of a total of $N$ users) seek access. A solution to the partition problem places each of the $N$ users in one of $K$ groups (or blocks) such that no two active nodes are in the same block. We consider a simple, but non-trivial and illustrative, case of $K=2$ active users and study the number of steps $T$ used to solve the partition problem. By random coding and a suboptimal decoding scheme, we show that for any $T \geq (C_1 + \xi_1)\log N$, where $C_1$ and $\xi_1$ are positive constants (independent of $N$) and $\xi_1$ can be arbitrarily small, the partition problem can be solved with error probability $P_e^{(N)} \to 0$ for large $N$. Under the same scheme, we also bound $T$ from the other direction, establishing that, for any $T \leq (C_2 - \xi_2)\log N$, the error probability $P_e^{(N)} \to 1$ for large $N$; again $C_2$ and $\xi_2$ are constants and $\xi_2$ can be arbitrarily small. These bounds on the number of steps are lower than the tight achievable lower bound of $T \geq (C_g + \xi)\log N$ for group testing (in which all active users are identified, rather than just partitioned). Thus, partitioning may prove to be a more efficient approach for multi-access than group testing.
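The success condition of the partition problem described above can be sketched as a small check. This is only an illustration of the definition (each user assigned to one of $K$ blocks, no two active users in the same block); the function name `is_valid_partition` and the list-based encoding are assumptions, and the coding and decoding schemes of the paper are not modeled.

```python
def is_valid_partition(assignment, active_users, num_blocks):
    """Check the partition condition from the abstract.

    assignment:   list where assignment[u] is the block index of user u (hypothetical encoding)
    active_users: indices of the active users
    num_blocks:   number of blocks K
    Returns True if every user is placed in a block 0..K-1 and no two
    active users share a block.
    """
    if any(not 0 <= block < num_blocks for block in assignment):
        return False
    active_blocks = [assignment[u] for u in active_users]
    # All active users must land in pairwise distinct blocks.
    return len(set(active_blocks)) == len(active_blocks)


if __name__ == "__main__":
    # N = 6 users, K = 2 blocks, active users 1 and 4.
    assignment = [0, 0, 1, 0, 1, 1]
    print(is_valid_partition(assignment, [1, 4], 2))
    print(is_valid_partition(assignment, [0, 1], 2))
```

Note that the check never needs to know which users are active beyond their block labels, which reflects why partitioning asks for less information than identifying the active users outright.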

preprint | 2014 | arXiv

Partition Information and its Transmission over Boolean Multi-Access Channels

In this paper, we propose a novel partition reservation system to study the partition information and its transmission over a noise-free Boolean multi-access channel. The objective of transmission is not message restoration, but to partition active users into distinct groups so that they can, subsequently, transmit their messages without collision. We first calculate (by mutual information) the amount of information needed for the partitioning without channel effects, and then propose two different coding schemes to obtain achievable transmission rates over the channel. The first one is the brute force method, where the codebook design is based on centralized source coding; the second method uses random coding where the codebook is generated randomly and optimal Bayesian decoding is employed to reconstruct the partition. Both methods shed light on the internal structure of the partition problem. A novel hypergraph formulation is proposed for the random coding scheme, which intuitively describes the information in terms of a strong coloring of a hypergraph induced by a sequence of channel operations and interactions between active users. An extended Fibonacci structure is found for a simple, but non-trivial, case with two active users. A comparison between these methods and group testing is conducted to demonstrate the uniqueness of our problem.

preprint | 2012 | arXiv

Enhancement of Secrecy of Block Ciphered Systems by Deliberate Noise

This paper considers the problem of end-to-end security enhancement by resorting to deliberate noise injected into ciphertexts. The main goal is to generate a degraded wiretap channel in the application layer over which Wyner-type secrecy encoding is invoked to deliver additional secure information. More specifically, we study secrecy enhancement of the DES block cipher operating in cipher feedback (CFB) mode when adjustable and intentional noise is introduced into encrypted data in the application layer. A verification strategy in the exhaustive search step of the linear attack is designed to allow Eve to mount a successful attack in the noisy environment. Thus, a controllable wiretap channel is created over multiple frames by taking advantage of errors in Eve's cryptanalysis, whose secrecy capacity is found for the case of known channel states at receivers. As a result, additional secure information can be delivered by performing Wyner-type secrecy encoding over super-frames ahead of encryption, namely, our proposed secrecy encoding-then-encryption scheme. These secrecy bits could be taken as symmetric keys for upcoming frames. Numerical results indicate that a sufficiently large secrecy rate can be achieved by selective noise addition.

preprint | 2010 | arXiv

A Multi-Interference-Channel Matrix Pair Beamformer for CDMA Systems

The matrix pair beamformer (MPB) is a promising blind beamformer which exploits the temporal signature of the signal of interest (SOI) to acquire its spatial statistical information. It does not need any knowledge of directional information or training sequences. However, the major problem of the existing MPBs is that they have serious threshold effects, and the thresholds grow as the interference power increases or even approach infinity. In particular, this issue prevails in scenarios with structured interference, such as periodically repeated white noise, tones, or MAIs in multipath channels. In this paper, we first present the principles for designing the projection space of the MPB, which are closely correlated with the ability to suppress structured interference and with the finite-sample performance of the system. Then a multiple-interference-channel based matrix pair beamformer (MIC-MPB) for CDMA systems is developed according to the principles. In order to adapt to dynamic channels, an adaptive algorithm for the beamformer is also proposed. Theoretical analysis and simulation results show that the proposed beamformer has a small and bounded threshold when the interference power increases. Performance comparisons of the MIC-MPB and the existing MPBs in various scenarios via a number of numerical examples are also presented.
