Source author record

Tianxian Zhang

Tianxian Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Information Theory math.IT Machine Learning Multiagent Systems

Catalog footprint

What is connected

3works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Maximum Correntropy Value Decomposition for Multi-agent Deep Reinforcemen Learning

We explore value decomposition solutions for multi-agent deep reinforcement learning in the popular paradigm of centralized training with decentralized execution(CTDE). As the recognized best solution to CTDE, Weighted QMIX is cutting-edge on StarCraft Multi-agent Challenge (SMAC), with a weighting scheme implemented on QMIX to place more emphasis on the optimal joint actions. However, the fixed weight requires manual tuning according to the application scenarios, which painfully prevents Weighted QMIX from being used in broader engineering applications. In this paper, we first demonstrate the flaw of Weighted QMIX using an ordinary One-Step Matrix Game (OMG), that no matter how the weight is chosen, Weighted QMIX struggles to deal with non-monotonic value decomposition problems with a large variance of reward distributions. Then we characterize the problem of value decomposition as an Underfitting One-edged Robust Regression problem and make the first attempt to give a solution to the value decomposition problem from the perspective of information-theoretical learning. We introduce the Maximum Correntropy Criterion (MCC) as a cost function to dynamically adapt the weight to eliminate the effects of minimum in reward distributions. We simplify the implementation and propose a new algorithm called MCVD. A preliminary experiment conducted on OMG shows that MCVD could deal with non-monotonic value decomposition problems with a large tolerance of kernel bandwidth selection. Further experiments are carried out on Cooperative-Navigation and multiple SMAC scenarios, where MCVD exhibits unprecedented ease of implementation, broad applicability, and stability.

preprint2016arXiv

Optimal Deployment of Multistatic Radar System Using Multi-Objective Particle Swarm Optimization

We consider an optimization deployment problem of multistatic radar system (MSRS). Through the antenna placing and the transmitted power allocating, we optimally deploy the MSRS for two goals: 1) the first one is to improve the coverage ratio of surveillance region; 2) the second goal is to get a even distribution of signal energy in surveillance region. In two typical working modes of MSRS, we formulate the optimization problem by introducing two objective functions according to the two mentioned goals, respectively. Addressing on two main challenges of applying multi-objective particle swarm optimization (MOPSO) in solving the proposed optimization problem, we propose a deployment algorithm based on multiobjective particle swarm optimization with non-dominated relative crowding distance (MOPSO-NRCD). For the challenge of value difference, we propose a novel selection method with a non-dominated relative crowding distance. For the challenge of particle allocation, a multi-swarm structure of MOPSO is also introduced. Finally, simulation results are given out to prove the advantages and validity of the proposed deployment algorithm. It is shown that with same number of employed particles, the proposed MOPSO-NRCD algorithm can achieve better optimization performance than that of traditional multiobjective particle swarm optimization with crowding distance (MOPSO-CD).

preprint2014arXiv

MIMO OFDM Radar IRCI Free Range Reconstruction with Sufficient Cyclic Prefix

In this paper, we propose MIMO OFDM radar with sufficient cyclic prefix (CP), where all OFDM pulses transmitted from different transmitters share the same frequency band and are orthogonal to each other for every subcarrier in the discrete frequency domain. The orthogonality is not affected by time delays from transmitters. Thus, our proposed MIMO OFDM radar has the same range resolution as single transmitter radar and achieves full spatial diversity. Orthogonal designs are used to achieve this orthogonality across the transmitters, with which it is only needed to design OFDM pulses for the first transmitter. We also propose a joint pulse compression and pulse coherent integration for range reconstruction. In order to achieve the optimal SNR for the range reconstruction, we apply the paraunitary filterbank theory to design the OFDM pulses. We then propose a modified iterative clipping and filtering (MICF) algorithm for the designs of OFDM pulses jointly, when other important factors, such as peak-to-average power ratio (PAPR) in time domain, are also considered. With our proposed MIMO OFDM radar, there is no interference for the range reconstruction not only across the transmitters but also across the range cells in a swath called inter-range-cell interference (IRCI) free that is similar to our previously proposed CP based OFDM radar for single transmitter. Simulations are presented to illustrate our proposed theory and show that the CP based MIMO OFDM radar outperforms the existing frequency-band shared MIMO radar with polyphase codes and also frequency division MIMO radar.