Source author record

Shuai Han

Shuai Han appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Machine Learning, Artificial Intelligence, hep-ph, Information Theory, math.IT, quant-ph

Catalog footprint

What is connected

5 works

6 topics

4 close collaborators

preprint · 2022 · arXiv

Aspects of $Z_{cs}(3985)$ and $Z_{cs}(4000)$

In the present work we investigate the $η_c K$, $J/ψK$, $η_c K^*$ and $J/ψK^*$ hidden-charm decay modes of the $c\bar{c}s\bar{u}$ four-quark system in the molecular and compact tetraquark scenarios using the quark-exchange model. Our theoretical results indicate that if the newly observed states $Z_{cs}(3985)$ and $Z_{cs}(4000)$ are two different states, $Z_{cs}(4000)$ may be interpreted as the mixture $\frac{1}{\sqrt{2}}(D^0D_s^{*-}+D^{*0}D_s^{-})$, whose $J/ψK$ partial decay width is about $Γ\sim2.89$ MeV, while $Z_{cs}(3985)$ may be explained as the mixture $\frac{1}{\sqrt{2}}(-D^0D_s^{*-}+D^{*0}D_s^{-})$, whose $J/ψK$ partial decay width is small or even zero. Moreover, if the state $Z_{cs}(4000)$ can indeed be explained as the mixed state $\frac{1}{\sqrt{2}}(D^0D_s^{*-}+D^{*0}D_s^{-})$, the ratio of the partial decay widths into $J/ψK$ and $η_cK^*$ is close to unity, which indicates that the decay channel $η_cK^*$ may also be an ideal channel for decoding the inner structure of $Z_{cs}(4000)$. In addition, the partial decay width of the tensor molecular state $|D^{*0}D_s^{*-}\rangle_{2^+}$ into $J/ψK^*$ can reach a few MeV, which shows that this tensor molecular state has good potential to be observed in this decay channel.

preprint · 2022 · arXiv

Weighted Sum-Rate Maximization for Rate-Splitting Multiple Access Based Secure Communication

As investigations of physical layer security evolve from point-to-point systems to multi-user scenarios, multi-user interference (MUI) is introduced and becomes an unavoidable issue. Unlike conventional secure communications, which treat MUI entirely as noise, in this paper we propose a rate-splitting multiple access (RSMA)-based secure beamforming design, where user messages are split and encoded into common and private streams. Each user not only decodes the common stream and its intended private stream, but also tries to eavesdrop on the private streams of other users. We formulate a weighted sum-rate (WSR) maximization problem subject to the secrecy rate requirements of all users. To tackle the non-convexity of the formulated problem, a successive convex approximation (SCA)-based approach is adopted, which yields a low-complexity suboptimal iterative algorithm for the original non-convex and intractable problem. Numerical results demonstrate that the proposed secure beamforming scheme outperforms the conventional multi-user linear precoding (MULP) technique in terms of WSR performance while ensuring the users' secrecy rate requirements.

preprint · 2020 · arXiv

NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration

Deep reinforcement learning is increasingly widely applied, especially in complex control tasks. Effective exploration for noisy networks is one of the most important issues in deep reinforcement learning. Noisy networks tend to produce stable outputs for agents. However, this tendency is not always enough to find a stable policy for an agent, which decreases efficiency and stability during the learning process. Based on NoisyNets, this paper proposes an algorithm called NROWAN-DQN, i.e., Noise Reduction and Online Weight Adjustment NoisyNet-DQN. First, we develop a novel noise reduction method for NoisyNet-DQN to make the agent perform stable actions. Second, we design an online weight adjustment strategy for noise reduction, which improves stability and achieves higher scores for the agent. Finally, we evaluate this algorithm in four standard domains and analyze the properties of its hyper-parameters. Our results show that NROWAN-DQN outperforms prior algorithms in all these domains. In addition, NROWAN-DQN shows better stability: the variance of the NROWAN-DQN score is significantly reduced, especially in some action-sensitive environments. This means that in environments where high stability is required, NROWAN-DQN will be more appropriate than NoisyNet-DQN.

preprint · 2020 · arXiv

Optimally Displaced Threshold Detection for Discriminating Binary Coherent States Using Imperfect Devices

Because of its potential applications in quantum information processing tasks, discrimination of binary coherent states using a generalized Kennedy receiver with maximum a posteriori probability (MAP) detection has attracted increasing attention in recent years. In this paper, we analytically study the performance of the generalized Kennedy receiver with optimally displaced threshold detection (ODTD) in a realistic situation with noise and imperfect devices. We first prove that MAP detection for a generalized Kennedy receiver is equivalent to threshold detection in this realistic situation. Then we analyze the properties of the optimum threshold and the optimum displacement for ODTD, and propose a heuristic greedy search algorithm to obtain them. We prove that ODTD degenerates to the Kennedy receiver with threshold detection when the signal power is large, and we also clarify the connection between the generalized Kennedy receiver with threshold detection and one-port homodyne detection. Numerical results show that the proposed heuristic greedy search algorithm obtains a lower and smoother error probability than existing works.

preprint · 2020 · arXiv

Regularly Updated Deterministic Policy Gradient Algorithm

The Deep Deterministic Policy Gradient (DDPG) algorithm is one of the most well-known reinforcement learning methods. However, this method is inefficient and unstable in practical applications. In addition, the bias and variance of the Q estimation in the target function are sometimes difficult to control. This paper proposes a Regularly Updated Deterministic (RUD) policy gradient algorithm to address these problems. It theoretically proves that the learning procedure with RUD makes better use of new data in the replay buffer than the traditional procedure. In addition, the low variance of the Q value in RUD is better suited to the current Clipped Double Q-learning strategy. The paper presents a comparison experiment against previous methods, an ablation experiment with the original DDPG, and other analytical experiments in MuJoCo environments. The experimental results demonstrate the effectiveness and superiority of RUD.
