Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
56works
0followers
32topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

56 published item(s)

preprint2026arXiv

A Survey of Bargmann Invariants: Geometric Foundations and Applications

Bargmann invariants, a class of gauge-invariant quantities arising from the overlaps of quantum state vectors, provide a profound and unifying framework for understanding the geometric structure of quantum mechanics. This survey offers a comprehensive overview of Bargmann invariants, with a particular focus on their role in shaping the informational geometry of the state space. The core of this review demonstrates how these invariants serve as a powerful tool for characterizing the intrinsic geometry of the space of quantum states, leading to applications in determining local unitary equivalence and constructing a complete set of polynomial invariants for mixed states. Furthermore, we explore their pivotal role in modern quantum information science, specifically in developing operational methods for entanglement detection without the need for full state tomography. By synthesizing historical context with recent advances, this survey aims to highlight Bargmann invariants not merely as mathematical curiosities, but as essential instruments for probing the relational and geometric features of quantum systems.

preprint2026arXiv

Representing Sounds as Neural Amplitude Fields: A Benchmark of Coordinate-MLPs and A Fourier Kolmogorov-Arnold Framework

Although Coordinate-MLP-based implicit neural representations have excelled in representing radiance fields, 3D shapes, and images, their application to audio signals remains underexplored. To fill this gap, we investigate existing implicit neural representations, from which we extract 3 types of positional encoding and 16 commonly used activation functions. Through combinatorial design, we establish the first benchmark for Coordinate-MLPs in audio signal representations. Our benchmark reveals that Coordinate-MLPs require complex hyperparameter tuning and frequency-dependent initialization, limiting their robustness. To address these issues, we propose Fourier-ASR, a novel framework based on the Fourier series theorem and the Kolmogorov-Arnold representation theorem. Fourier-ASR introduces Fourier Kolmogorov-Arnold Networks (Fourier-KAN), which leverage periodicity and strong nonlinearity to represent audio signals, eliminating the need for additional positional encoding. Furthermore, a Frequency-adaptive Learning Strategy (FaLS) is proposed to enhance the convergence of Fourier-KAN by capturing high-frequency components and preventing overfitting of low-frequency signals. Extensive experiments conducted on natural speech and music datasets reveal that: (1) well-designed positional encoding and activation functions in Coordinate-MLPs can effectively improve audio representation quality; and (2) Fourier-ASR can robustly represent complex audio signals without extensive hyperparameter tuning. Looking ahead, the continuity and infinite resolution of implicit audio representations make our research highly promising for tasks such as audio compression, synthesis, and generation. The source code will be released publicly to ensure reproducibility. The code is available at https://github.com/lif314/Fourier-ASR.

preprint2025arXiv

DRL-TH: Jointly Utilizing Temporal Graph Attention and Hierarchical Fusion for UGV Navigation in Crowded Environments

Deep reinforcement learning (DRL) methods have demonstrated potential for autonomous navigation and obstacle avoidance of unmanned ground vehicles (UGVs) in crowded environments. Most existing approaches rely on single-frame observation and employ simple concatenation for multi-modal fusion, which limits their ability to capture temporal context and hinders dynamic adaptability. To address these challenges, we propose a DRL-based navigation framework, DRL-TH, which leverages temporal graph attention and hierarchical graph pooling to integrate historical observations and adaptively fuse multi-modal information. Specifically, we introduce a temporal-guided graph attention network (TG-GAT) that incorporates temporal weights into attention scores to capture correlations between consecutive frames, thereby enabling the implicit estimation of scene evolution. In addition, we design a graph hierarchical abstraction module (GHAM) that applies hierarchical pooling and learnable weighted fusion to dynamically integrate RGB and LiDAR features, achieving balanced representation across multiple scales. Extensive experiments demonstrate that our DRL-TH outperforms existing methods in various crowded environments. We also implemented DRL-TH control policy on a real UGV and showed that it performed well in real world scenarios.

preprint2025arXiv

Hybrid A* Path Planning with Multi-Modal Motion Extension for Four-Wheel Steering Mobile Robots

Four-wheel independent steering (4WIS) systems provide mobile robots with a rich set of motion modes, such as Ackermann steering, lateral steering, and parallel movement, offering superior maneuverability in constrained environments. However, existing path planning methods generally assume a single kinematic model and thus fail to fully exploit the multi-modal capabilities of 4WIS platforms. To address this limitation, we propose an extended Hybrid A* framework that operates in a four-dimensional state space incorporating both spatial states and motion modes. Within this framework, we design multi-modal Reeds-Shepp curves tailored to the distinct kinematic constraints of each motion mode, develop an enhanced heuristic function that accounts for mode-switching costs, and introduce a terminal connection strategy with intelligent mode selection to ensure smooth transitions between different steering patterns. The proposed planner enables seamless integration of multiple motion modalities within a single path, significantly improving flexibility and adaptability in complex environments. Results demonstrate significantly improved planning performance for 4WIS robots in complex environments.

preprint2025arXiv

SmartSplat: Feature-Smart Gaussians for Scalable Compression of Ultra-High-Resolution Images

Recent advances in generative AI have accelerated the production of ultra-high-resolution visual content, posing significant challenges for efficient compression and real-time decoding on end-user devices. Inspired by 3D Gaussian Splatting, recent 2D Gaussian image models improve representation efficiency, yet existing methods struggle to balance compression ratio and reconstruction fidelity in ultra-high-resolution scenarios. To address this issue, we propose SmartSplat, a highly adaptive and feature-aware GS-based image compression framework that supports arbitrary image resolutions and compression ratios. SmartSplat leverages image-aware features such as gradients and color variances, introducing a Gradient-Color Guided Variational Sampling strategy together with an Exclusion-based Uniform Sampling scheme to improve the non-overlapping coverage of Gaussian primitives in pixel space. In addition, we propose a Scale-Adaptive Gaussian Color Sampling method to enhance color initialization across scales. Through joint optimization of spatial layout, scale, and color initialization, SmartSplat efficiently captures both local structures and global textures using a limited number of Gaussians, achieving high reconstruction quality under strong compression. Extensive experiments on DIV8K and a newly constructed 16K dataset demonstrate that SmartSplat consistently outperforms state-of-the-art methods at comparable compression ratios and exceeds their compression limits, showing strong scalability and practical applicability. The code is publicly available at https://github.com/lif314/SmartSplat.

preprint2023arXiv

Solving Math Word Problems via Cooperative Reasoning induced Language Models

Large-scale pre-trained language models (PLMs) bring new opportunities to challenging problems, especially those that need high-level intelligence, such as the math word problem (MWPs). However, directly applying existing PLMs to MWPs can fail as the generation process lacks sufficient supervision and thus lacks fast adaptivity as humans. We notice that human reasoning has a dual reasoning framework that consists of an immediate reaction system (system 1) and a delicate reasoning system (system 2), where the entire reasoning is determined by their interaction. This inspires us to develop a cooperative reasoning-induced PLM for solving MWPs, called Cooperative Reasoning (CoRe), resulting in a human-like reasoning architecture with system 1 as the generator and system 2 as the verifier. In our approach, the generator is responsible for generating reasoning paths, and the verifiers are used to supervise the evaluation in order to obtain reliable feedback for the generator. We evaluate our CoRe framework on several mathematical reasoning datasets and achieve decent improvement over state-of-the-art methods, up to 9.6% increase over best baselines. Our codes are available at https://github.com/TianHongZXY/CoRe

preprint2022arXiv

$ \text{T}^3 $OMVP: A Transformer-based Time and Team Reinforcement Learning Scheme for Observation-constrained Multi-Vehicle Pursuit in Urban Area

Smart Internet of Vehicles (IoVs) combined with Artificial Intelligence (AI) will contribute to vehicle decision-making in the Intelligent Transportation System (ITS). Multi-Vehicle Pursuit games (MVP), a multi-vehicle cooperative ability to capture mobile targets, is becoming a hot research topic gradually. Although there are some achievements in the field of MVP in the open space environment, the urban area brings complicated road structures and restricted moving spaces as challenges to the resolution of MVP games. We define an Observation-constrained MVP (OMVP) problem in this paper and propose a Transformer-based Time and Team Reinforcement Learning scheme ($ \text{T}^3 $OMVP) to address the problem. First, a new multi-vehicle pursuit model is constructed based on decentralized partially observed Markov decision processes (Dec-POMDP) to instantiate this problem. Second, by introducing and modifying the transformer-based observation sequence, QMIX is redefined to adapt to the complicated road structure, restricted moving spaces and constrained observations, so as to control vehicles to pursue the target combining the vehicle's observations. Third, a multi-intersection urban environment is built to verify the proposed scheme. Extensive experimental results demonstrate that the proposed $ \text{T}^3 $OMVP scheme achieves significant improvements relative to state-of-the-art QMIX approaches by 9.66%~106.25%. Code is available at https://github.com/pipihaiziguai/T3OMVP.

preprint2022arXiv

A characterization of maximally entangled two-qubit states

As already known by Rana's result \href{https://doi.org/10.1103/PhysRevA.87.054301}{[\pra {\bf87} (2013) 054301]}, all eigenvalues of any partial-transposed bipartite state fall within the closed interval $[-\frac12,1]$. In this note, we study a family of bipartite quantum states whose minimal eigenvalues of partial-transposed states being $-\frac12$. For a two-qubit system, we find that the minimal eigenvalue of its partial-transposed state is $-\frac12$ if and only if such two-qubit state must be maximally entangled. However this result does not hold in general for a two-qudit system when the dimensions of the underlying space are larger than two.

preprint2022arXiv

A discussion of measuring the top-1 percent most-highly cited publications: Quality and impact of Chinese papers

The top 1 percent most highly cited articles are watched closely as the vanguards of the sciences. Using Web of Science data, one can find that China had overtaken the USA in the relative participation in the top 1 percent in 2019, after outcompeting the EU on this indicator in 2015. However, this finding contrasts with repeated reports of Western agencies that the quality of Chinese output in science is lagging other advanced nations, even as it has caught up in numbers of articles. The difference between the results presented here and the previous results depends mainly upon field normalizations, which classify source journals by discipline. Average citation rates of these subsets are commonly used as a baseline so that one can compare among disciplines. However, the expected value of the top 1 percent of a sample of N papers is N 100, ceteris paribus. Using the average citation rates as expected values, errors are introduced by using the mean of highly skewed distributions and a specious precision in the delineations of the subsets. Classifications can be used for the decomposition, but not for the normalization. When the data is thus decomposed, the USA ranks ahead of China in biomedical fields such as virology. Although the number of papers is smaller, China outperforms the US in the field of Business and Finance in the Social Sciences Citation Index when p is less than .05. Using percentile ranks, subsets other than indexing based classifications can be tested for the statistical significance of differences among them.

preprint2022arXiv

AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results

This paper reviews the Challenge on Super-Resolution of Compressed Image and Video at AIM 2022. This challenge includes two tracks. Track 1 aims at the super-resolution of compressed image, and Track~2 targets the super-resolution of compressed video. In Track 1, we use the popular dataset DIV2K as the training, validation and test sets. In Track 2, we propose the LDV 3.0 dataset, which contains 365 videos, including the LDV 2.0 dataset (335 videos) and 30 additional videos. In this challenge, there are 12 teams and 2 teams that submitted the final results to Track 1 and Track 2, respectively. The proposed methods and solutions gauge the state-of-the-art of super-resolution on compressed image and video. The proposed LDV 3.0 dataset is available at https://github.com/RenYang-home/LDV_dataset. The homepage of this challenge is at https://github.com/RenYang-home/AIM22_CompressSR.

preprint2022arXiv

An empirical learning-based validation procedure for simulation workflow

Simulation workflow is a top-level model for the design and control of simulation process. It connects multiple simulation components with time and interaction restrictions to form a complete simulation system. Before the construction and evaluation of the component models, the validation of upper-layer simulation workflow is of the most importance in a simulation system. However, the methods especially for validating simulation workflow is very limit. Many of the existing validation techniques are domain-dependent with cumbersome questionnaire design and expert scoring. Therefore, this paper present an empirical learning-based validation procedure to implement a semi-automated evaluation for simulation workflow. First, representative features of general simulation workflow and their relations with validation indices are proposed. The calculation process of workflow credibility based on Analytic Hierarchy Process (AHP) is then introduced. In order to make full use of the historical data and implement more efficient validation, four learning algorithms, including back propagation neural network (BPNN), extreme learning machine (ELM), evolving new-neuron (eNFN) and fast incremental gaussian mixture model (FIGMN), are introduced for constructing the empirical relation between the workflow credibility and its features. A case study on a landing-process simulation workflow is established to test the feasibility of the proposed procedure. The experimental results also provide some useful overview of the state-of-the-art learning algorithms on the credibility evaluation of simulation models.

preprint2022arXiv

Changeable Rate and Novel Quantization for CSI Feedback Based on Deep Learning

Deep learning (DL)-based channel state information (CSI) feedback improves the capacity and energy efficiency of massive multiple-input multiple-output (MIMO) systems in frequency division duplexing mode. However, multiple neural networks with different lengths of feedback overhead are required by time-varying bandwidth resources. The storage space required at the user equipment (UE) and the base station (BS) for these models increases linearly with the number of models. In this paper, we propose a DL-based changeable-rate framework with novel quantization scheme to improve the efficiency and feasibility of CSI feedback systems. This framework can reutilize all the network layers to achieve overhead-changeable CSI feedback to optimize the storage efficiency at the UE and the BS sides. Designed quantizer in this framework can avoid the normalization and gradient problems faced by traditional quantization schemes. Specifically, we propose two DL-based changeable-rate CSI feedback networks CH-CsiNetPro and CH-DualNetSph by introducing a feedback overhead control unit. Then, a pluggable quantization block (PQB) is developed to further improve the encoding efficiency of CSI feedback in an end-to-end way. Compared with existing CSI feedback methods, the proposed framework saves the storage space by about 50% with changeable-rate scheme and improves the encoding efficiency with the quantization module.

preprint2022arXiv

Combining Predictions of Auto Insurance Claims

This paper aims to better predict highly skewed auto insurance claims by combining candidate predictions. We analyze a version of the Kangaroo Auto Insurance company data and study the effects of combining different methods using five measures of prediction accuracy. The results show the following. First, when there is an outstanding (in terms of Gini Index) prediction among the candidates, the "forecast combination puzzle" phenomenon disappears. The simple average method performs much worse than the more sophisticated model combination methods, indicating that combining different methods could help us avoid performance degradation. Second, the choice of the prediction accuracy measure is crucial in defining the best candidate prediction for "low frequency and high severity" (LFHS) data. For example, mean square error (MSE) does not distinguish well between model combination methods, as the values are close. Third, the performances of different model combination methods can differ drastically. We propose using a new model combination method, named ARM-Tweedie, for such LFHS data; it benefits from an optimal rate of convergence and exhibits a desirable performance in several measures for the Kangaroo data. Fourth, overall, model combination methods improve the prediction accuracy for auto insurance claim costs. In particular, Adaptive Regression by Mixing (ARM), ARM-Tweedie, and constrained Linear Regression can improve forecast performance when there are only weak learners or when no dominant learner exists.

preprint2022arXiv

Confidence Estimation Transformer for Long-term Renewable Energy Forecasting in Reinforcement Learning-based Power Grid Dispatching

The expansion of renewable energy could help realizing the goals of peaking carbon dioxide emissions and carbon neutralization. Some existing grid dispatching methods integrating short-term renewable energy prediction and reinforcement learning (RL) have been proved to alleviate the adverse impact of energy fluctuations risk. However, these methods omit the long-term output prediction, which leads to stability and security problems on the optimal power flow. This paper proposes a confidence estimation Transformer for long-term renewable energy forecasting in reinforcement learning-based power grid dispatching (Conformer-RLpatching). Conformer-RLpatching predicts long-term active output of each renewable energy generator with an enhanced Transformer to boost the performance of hybrid energy grid dispatching. Furthermore, a confidence estimation method is proposed to reduce the prediction error of renewable energy. Meanwhile, a dispatching necessity evaluation mechanism is put forward to decide whether the active output of a generator needs to be adjusted. Experiments carried out on the SG-126 power grid simulator show that Conformer-RLpatching achieves great improvement over the second best algorithm DDPG in security score by 25.8% and achieves a better total reward compared with the golden medal team in the power grid dispatching competition sponsored by State Grid Corporation of China under the same simulation environment. Codes are outsourced in https://github.com/buptlxh/Conformer-RLpatching.

preprint2022arXiv

CS-Rep: Making Speaker Verification Networks Embracing Re-parameterization

Automatic speaker verification (ASV) systems, which determine whether two speeches are from the same speaker, mainly focus on verification accuracy while ignoring inference speed. However, in real applications, both inference speed and verification accuracy are essential. This study proposes cross-sequential re-parameterization (CS-Rep), a novel topology re-parameterization strategy for multi-type networks, to increase the inference speed and verification accuracy of models. CS-Rep solves the problem that existing re-parameterization methods are unsuitable for typical ASV backbones. When a model applies CS-Rep, the training-period network utilizes a multi-branch topology to capture speaker information, whereas the inference-period model converts to a time-delay neural network (TDNN)-like plain backbone with stacked TDNN layers to achieve the fast inference speed. Based on CS-Rep, an improved TDNN with friendly test and deployment called Rep-TDNN is proposed. Compared with the state-of-the-art model ECAPA-TDNN, which is highly recognized in the industry, Rep-TDNN increases the actual inference speed by about 50% and reduces the EER by 10%. The code will be released.

preprint2022arXiv

Experimental quantum simulation of non-Hermitian dynamical topological states using stochastic Schrödinger equation

Noise is ubiquitous in real quantum systems, leading to non-Hermitian quantum dynamics, and may affect the fundamental states of matter. Here we report in experiment a quantum simulation of the two-dimensional non-Hermitian quantum anomalous Hall (QAH) model using the nuclear magnetic resonance processor. Unlike the usual experiments using auxiliary qubits, we develop a stochastic average approach based on the stochastic Schrödinger equation to realize the non-Hermitian dissipative quantum dynamics, which has advantages in saving the quantum simulation sources and simplifies implementation of quantum gates. We demonstrate the stability of dynamical topology against weak noise, and observe two types of dynamical topological transitions driven by strong noise. Moreover, a region that the emergent topology is always robust regardless of the noise strength is observed. Our work shows a feasible quantum simulation approach for dissipative quantum dynamics with stochastic Schrödinger equation and opens a route to investigate non-Hermitian dynamical topological physics.

preprint2022arXiv

Global Speed-of-Sound Prediction Using Transmission Geometry

Most ultrasound (US) imaging techniques use spatially-constant speed-of-sound (SoS) values for beamforming. Having a discrepancy between the actual and used SoS value leads to aberration artifacts, e.g., reducing the image resolution, which may affect diagnostic usability. Accuracy and quality of different US imaging modalities, such as tomographic reconstruction of local SoS maps, also depend on a good initial beamforming SoS. In this work, we develop an analytical method for estimating mean SoS in an imaged medium. We show that the relative shifts between beamformed frames depend on the SoS offset and the geometric disparities in transmission paths. Using this relation, we estimate a correction factor and hence a corrected mean SoS in the medium. We evaluated our proposed method on a set of numerical simulations, demonstrating its utility both for global SoS prediction and for local SoS tomographic reconstruction. For our evaluation dataset, for an initial SoS under- and over-assumption of 5% the medium SoS, our method is able to predict the actual mean SoS within 0.3% accuracy. For the tomographic reconstruction of local SoS maps, the reconstruction accuracy is improved on average by 78.5% and 87%, respectively, compared to an initial SoS under- and over-assumption of 5%.

preprint2022arXiv

Learning from Temporal Gradient for Semi-supervised Action Recognition

Semi-supervised video action recognition tends to enable deep neural networks to achieve remarkable performance even with very limited labeled data. However, existing methods are mainly transferred from current image-based methods (e.g., FixMatch). Without specifically utilizing the temporal dynamics and inherent multimodal attributes, their results could be suboptimal. To better leverage the encoded temporal information in videos, we introduce temporal gradient as an additional modality for more attentive feature extraction in this paper. To be specific, our method explicitly distills the fine-grained motion representations from temporal gradient (TG) and imposes consistency across different modalities (i.e., RGB and TG). The performance of semi-supervised action recognition is significantly improved without additional computation or parameters during inference. Our method achieves the state-of-the-art performance on three video action recognition benchmarks (i.e., Kinetics-400, UCF-101, and HMDB-51) under several typical semi-supervised settings (i.e., different ratios of labeled data).

preprint2022arXiv

Probability density functions of quantum mechanical observable uncertainties

We study the uncertainties of quantum mechanical observables, quantified by the standard deviation (square root of variance) in Haar-distributed random pure states. We derive analytically the probability density functions (PDFs) of the uncertainties of arbitrary qubit observables. Based on these PDFs, the uncertainty regions of the observables are characterized by the supports of the PDFs. The state-independent uncertainty relations are then transformed into the optimization problems over uncertainty regions, which opens a new vista for studying state independent uncertainty relations. Our results may be generalized to multiple observable case in higher dimensional spaces.

preprint2022arXiv

Running coupling constant at finite chemical potential and magnetic field from holography

According to the gauge/gravity duality, we use an Einstein-Maxwell-dilaton(EMD) model to study the running coupling constant at finite chemical potential and magnetic field. First, we calculate the effect of temperature on the running coupling constant and find the results are in consistent with lattice qualitatively. Subsequently, we calculate the effect of chemical potential and magnetic field on running coupling. It is found that the chemical potential and magnetic field both suppress the running coupling constant, however, the effect of magnetic field is slightly larger than chemical potential for a fixed temperature. Compared with the confinement phase, the magnetic field has a large influence on the running coupling in the deconfinement phase.

preprint2022arXiv

Safe Reinforcement Learning for a Robot Being Pursued but with Objectives Covering More Than Capture-avoidance

Reinforcement Learning (RL) algorithms show amazing performance in recent years, but placing RL in real-world applications such as self-driven vehicles may suffer safety problems. A self-driven vehicle moving to a target position following a learned policy may suffer a vehicle with unpredictable aggressive behaviors or even being pursued by a vehicle following a Nash strategy. To address the safety issue of the self-driven vehicle in this scenario, this paper conducts a preliminary study based on a system of robots. A safe RL framework with safety guarantees is developed for a robot being pursued but with objectives covering more than capture-avoidance. Simulations and experiments are conducted based on the system of robots to evaluate the effectiveness of the developed safe RL framework.

preprint2022arXiv

Scalable K-FAC Training for Deep Neural Networks with Distributed Preconditioning

The second-order optimization methods, notably the D-KFAC (Distributed Kronecker Factored Approximate Curvature) algorithms, have gained traction on accelerating deep neural network (DNN) training on GPU clusters. However, existing D-KFAC algorithms require to compute and communicate a large volume of second-order information, i.e., Kronecker factors (KFs), before preconditioning gradients, resulting in large computation and communication overheads as well as a high memory footprint. In this paper, we propose DP-KFAC, a novel distributed preconditioning scheme that distributes the KF constructing tasks at different DNN layers to different workers. DP-KFAC not only retains the convergence property of the existing D-KFAC algorithms but also enables three benefits: reduced computation overhead in constructing KFs, no communication of KFs, and low memory footprint. Extensive experiments on a 64-GPU cluster show that DP-KFAC reduces the computation overhead by 1.55x-1.65x, the communication cost by 2.79x-3.15x, and the memory footprint by 1.14x-1.47x in each second-order update compared to the state-of-the-art D-KFAC methods.

preprint2022arXiv

Sharp decay estimates for massless Dirac fields on a Schwarzschild background

We consider the explicit asymptotic profile of massless Dirac fields on a Schwarzschild background. First, we prove for the spin $s=\pm \frac{1}{2}$ components of the Dirac field a uniform bound of a positive definite energy and an integrated local energy decay estimate from a symmetric hyperbolic wave system. Based on these estimates, we further show that these components have globally pointwise decay $fv^{-3/2-s}τ^{-5/2+s}$ as both an upper and a lower bound outside the black hole, with function $f$ finite and explicitly expressed in terms of the initial data and the coordinates. This establishes the validity of the conjectured Price's law for massless Dirac fields outside a Schwarzschild black hole.

preprint2022arXiv

Skew information-based coherence generating power of quantum channels

We study the ability of a quantum channel to generate quantum coherence when it applies to incoherent states. We define the measure of coherence generating power (CGP) for a generic quantum channel to be the average coherence generated by the quantum channel acting on a uniform ensemble of incoherent states based on the skew information-based coherence measure. We present explicitly the analytical formulae of the CGP for any arbitrary finite dimensional unitary channels. We derive the mean value of the CGP over the unitary groups and investigate the typicality of the normalized CGP. Furthermore, we give an upper bound of the CGP for the convex combinations of unitary channels. Detailed examples are provided to calculate exactly the values of the CGP for the unitary channels related to specific quantum gates and for some qubit channels.

preprint2022arXiv

Standard symmetrized variance with applications to coherence, uncertainty and entanglement

Variance is a ubiquitous quantity in quantum information theory. Given a basis, we consider the averaged variances of a fixed diagonal observable in a pure state under all possible permutations on the components of the pure state and call it the symmetrized variance. Moreover we work out the analytical expression of the symmetrized variance and find that such expression is in the factorized form where two factors separately depends on the diagonal observable and quantum state. By shifting the factor corresponding to the diagonal observable, we introduce the notion named the standard symmetrized variance for the pure state which is independent of the diagonal observable. We then extend the standard symmetrized variance to mixed states in three different ways, which characterize the uncertainty, the coherence and the coherence of assistance, respectively. These quantities are evaluated analytically and the relations among them are established. In addition, we show that the standard symmetrized variance is also an entanglement measure for bipartite systems. In this way, these different quantumness of quantum states are unified by the variance.

preprint2022arXiv

The Local Orthogonality between Quantum States and Entanglement Decomposition

In the paper, we show that when a quantum state can be decomposed as a convex combination of locally orthogonal mixed states, its entanglement can be decomposed into the entanglement of these mixed states without losing them. The obtained result generalizes a corresponding one proved by Horodecki [Acta Phys. Slov. 48, 141 (1998).]. But, for the entanglement cost it requires certain conditions for holding the decomposition, and the distillable entanglement only has a week result as inequality. Finally, we presented an example to show that the conditions of our conclusions are existence.

preprint2022arXiv

TMS: A Temporal Multi-scale Backbone Design for Speaker Embedding

Speaker embedding is an important front-end module to explore discriminative speaker features for many speech applications where speaker information is needed. Current SOTA backbone networks for speaker embedding are designed to aggregate multi-scale features from an utterance with multi-branch network architectures for speaker representation. However, naively adding many branches of multi-scale features with the simple fully convolutional operation could not efficiently improve the performance due to the rapid increase of model parameters and computational complexity. Therefore, in the most current state-of-the-art network architectures, only a few branches corresponding to a limited number of temporal scales could be designed for speaker embeddings. To address this problem, in this paper, we propose an effective temporal multi-scale (TMS) model where multi-scale branches could be efficiently designed in a speaker embedding network almost without increasing computational costs. The new model is based on the conventional TDNN, where the network architecture is smartly separated into two modeling operators: a channel-modeling operator and a temporal multi-branch modeling operator. Adding temporal multi-scale in the temporal multi-branch operator needs only a little bit increase of the number of parameters, and thus save more computational budget for adding more branches with large temporal scales. Moreover, in the inference stage, we further developed a systemic re-parameterization method to convert the TMS-based model into a single-path-based topology in order to increase inference speed. We investigated the performance of the new TMS method for automatic speaker verification (ASV) on in-domain and out-of-domain conditions. Results show that the TMS-based model obtained a significant increase in the performance over the SOTA ASV models, meanwhile, had a faster inference speed.

preprint2021arXiv

Average skew information-based coherence and its typicality for random quantum states

We study the average skew information-based coherence for both random pure and mixed states. The explicit formulae of the average skew information-based coherence are derived and shown to be the functions of the dimension N of the state space. We demonstrate that as N approaches to infinity, the average coherence is 1 for random pure states, and a positive constant less than 1/2 for random mixed states. We also explore the typicality of average skew information-based coherence of random quantum states. Furthermore, we identify a coherent subspace such that the amount of the skew information-based coherence for each pure state in this subspace can be bounded from below almost always by a fixed number that is arbitrarily close to the typical value of coherence.

preprint2021arXiv

Estimating coherence with respect to general quantum measurements

The conventional coherence is defined with respect to a fixed orthonormal basis, i.e., to a von Neumann measurement. Recently, generalized quantum coherence with respect to general positive operator-valued measurements (POVMs) has been presented. Several well-defined coherence measures, such as the relative entropy of coherence $C_{r}$, the $l_{1}$ norm of coherence $C_{l_{1}}$ and the coherence $C_{T,α}$ based on Tsallis relative entropy with respect to general POVMs have been obtained. In this work, we investigate the properties of $C_{r}$, $l_{1}$ and $C_{T,α}$. We estimate the upper bounds of $C_{l_{1}}$; we show that the minimal error probability of the least square measurement state discrimination is given by $C_{T,1/2}$; we derive the uncertainty relations given by $C_{r}$, and calculate the average values of $C_{r}$, $C_{T,α}$ and $C_{l_{1}}$ over random pure quantum states. All these results include the corresponding results of the conventional coherence as special cases.

preprint2021arXiv

Learning Ultrasound Rendering from Cross-Sectional Model Slices for Simulated Training

Purpose. Given the high level of expertise required for navigation and interpretation of ultrasound images, computational simulations can facilitate the training of such skills in virtual reality. With ray-tracing based simulations, realistic ultrasound images can be generated. However, due to computational constraints for interactivity, image quality typically needs to be compromised. Methods. We propose herein to bypass any rendering and simulation process at interactive time, by conducting such simulations during a non-time-critical offline stage and then learning image translation from cross-sectional model slices to such simulated frames. We use a generative adversarial framework with a dedicated generator architecture and input feeding scheme, which both substantially improve image quality without increase in network parameters. Integral attenuation maps derived from cross-sectional model slices, texture-friendly strided convolutions, providing stochastic noise and input maps to intermediate layers in order to preserve locality are all shown herein to greatly facilitate such translation task. Results. Given several quality metrics, the proposed method with only tissue maps as input is shown to provide comparable or superior results to a state-of-the-art that uses additional images of low-quality ultrasound renderings. An extensive ablation study shows the need and benefits from the individual contributions utilized in this work, based on qualitative examples and quantitative ultrasound similarity metrics. To that end, a local histogram statistics based error metric is proposed and demonstrated for visualization of local dissimilarities between ultrasound images.

preprint2021arXiv

Spectra of glueballs and oddballs and the equation of state from holographic QCD

We study the spectra of two-gluon glueballs and three-gluon oddballs and corresponding equation of state in $5$-dimensional deformed holographic QCD models in the graviton-dilaton system, where the metric, the dilaton field and dilaton potential are self-consistently solved from each other through the Einstein field equations and the equation of motion of the dilaton field. We compare the models by inputting the dilaton field, inputting the deformed metric and inputting the dilaton potential, and find that with only 2 parameters, the $5$-dimensional holographic QCD model predictions on glueballs/oddballs spectra in general are in good agreement with lattice results except two oddballs $0^{+-}$ and $2^{+-}$. From the results of glueballs/oddballs spectra at zero temperature and the equation of state at finite temperature, we observe that the model with quadratic dilaton field can simultaneously describe glueballs/oddballs spectra as well as equation of state of pure gluon system. The model with quadratic $A_{E}(z)$ can describe glueballs/oddballs spectra, but its corresponding equation of state behaves more like $N_{f}=2+1$ quark matter. These are consistent with dimension analysis at UV boundary.

preprint2021arXiv

Universal topological quench dynamics: Altland-Zirnbauer tenfold classes

Topological phases of the famous Altland-Zirnbauer (AZ) tenfold classes are defined on the equilibrium ground states. Whether such equilibrium topological phases have universal correspondence to far-from-equilibrium quantum dynamics is a fundamental issue of both theoretical and experimental importance. Here we uncover the universal topological quench dynamics linking to the equilibrium topological phases for the complete AZ tenfold classes, with a general framework being established. We show a fundamental result that a $d$-dimensional topological phase of the tenfold class, with an integer invariant or $\mathbb{Z}_{2}$ index defined on high symmetry momenta, is generically characterized by topology reduced to the highest-order band-inversion surfaces located at arbitrary discrete momenta of Brillouin zone. Such dimension-reduced topology is further captured by universal topological patterns emerging in far-from-equilibrium quantum dynamics by quenching the system from trivial phase to the topological regime, rendering the dynamical hallmark of the equilibrium topological phase. This work establishes a universal dynamical characterization for the complete AZ symmetry classes of topological phases, which has broad applications in theory and experiment.

preprint2020arXiv

Accretion disks around naked singularities

We investigate here the thermal properties of accretion disks in a spacetime for some galactic density profiles in spherical symmetry. The matter distributions have a finite outer radius with a naked central singularity. The luminosities of the accretion disks for some density profile models are found to be higher than those for a Schwarzschild black hole of the same mass. The slopes for the luminosity distributions with respect to frequencies are significantly different, especially at higher frequencies, from that in the Schwarzschild black hole case. Such features may be used to distinguish black holes from naked singularities. The efficiencies for the conversion of the mass energy of the accreting gas into radiation and the strength of naked singularities are analyzed. The novel feature that we find is, the strength of the singularity is different depending on the profiles considered, and the stronger the singularity is, the higher is the efficiency for the accretion disk.

preprint2020arXiv

Bayesian Spatial Models for Voxel-wise Prostate Cancer Classification Using Multi-parametric MRI Data

Multi-parametric magnetic resonance imaging (mpMRI) plays an increasingly important role in the diagnosis of prostate cancer. Various computer-aided detection algorithms have been proposed for automated prostate cancer detection by combining information from various mpMRI data components. However, there exist other features of mpMRI, including the spatial correlation between voxels and between-patient heterogeneity in the mpMRI parameters, that have not been fully explored in the literature but could potentially improve cancer detection if leveraged appropriately. This paper proposes novel voxel-wise Bayesian classifiers for prostate cancer that account for the spatial correlation and between-patient heterogeneity in mpMRI. Modeling the spatial correlation is challenging due to the extreme high dimensionality of the data, and we consider three computationally efficient approaches using Nearest Neighbor Gaussian Process (NNGP), knot-based reduced-rank approximation, and a conditional autoregressive (CAR) model, respectively. The between-patient heterogeneity is accounted for by adding a subject-specific random intercept on the mpMRI parameter model. Simulation results show that properly modeling the spatial correlation and between-patient heterogeneity improves classification accuracy. Application to in vivo data illustrates that classification is improved by spatial modeling using NNGP and reduced-rank approximation but not the CAR model, while modeling the between-patient heterogeneity does not further improve our classifier. Among our proposed models, the NNGP-based model is recommended considering its robust classification accuracy and high computational efficiency.

preprint2020arXiv

Coherence and complementarity based on modified generalized skew information

We introduce modified generalized Wigner-Yanase-Dyson (MGWYD) skew information and modified weighted generalized Wigner-Yanase-Dyson (MWGWYD) skew information. By revisiting state-channel interaction based on MGWYD skew information, a family of coherence measures with respect to quantum channels is proposed. Furthermore, explicit analytical expressions of these coherence measures of qubit states are derived with respect to different quantum channels. Moreover, complementarity relations based on MGWYD skew information and MWGWYD skew information are also presented. Specifically, the conservation relations are investigated, while two interpretations of them including symmetry-asymmetry complementarity and wave-particle duality have been proposed.

preprint2020arXiv

Constructing mutually unbiased bases from unextendible maximally entangled bases

We study mutually unbiased bases (MUBs) in which all the bases are unextendible maximally entangled ones. We first present a necessary and sufficient condition of constructing a pair of MUBs in $C^2 \otimes C^4$. Based on this condition, an analytical and necessary condition for constructing MUBs is given. Moreover we illustrate our approach by some detailed examples in $C^2 \otimes C^4$. The results are generalized to $C^2 \otimes C^d$ $(d\geq 3)$ and a concrete example in $C^2 \otimes C^8$ is given.

preprint2020arXiv

ConvGRU in Fine-grained Pitching Action Recognition for Action Outcome Prediction

Prediction of the action outcome is a new challenge for a robot collaboratively working with humans. With the impressive progress in video action recognition in recent years, fine-grained action recognition from video data turns into a new concern. Fine-grained action recognition detects subtle differences of actions in more specific granularity and is significant in many fields such as human-robot interaction, intelligent traffic management, sports training, health caring. Considering that the different outcomes are closely connected to the subtle differences in actions, fine-grained action recognition is a practical method for action outcome prediction. In this paper, we explore the performance of convolutional gate recurrent unit (ConvGRU) method on a fine-grained action recognition tasks: predicting outcomes of ball-pitching. Based on sequences of RGB images of human actions, the proposed approach achieved the performance of 79.17% accuracy, which exceeds the current state-of-the-art result. We also compared different network implementations and showed the influence of different image sampling methods, different fusion methods and pre-training, etc. Finally, we discussed the advantages and limitations of ConvGRU in such action outcome prediction and fine-grained action recognition tasks.

preprint2020arXiv

Deep Image Translation for Enhancing Simulated Ultrasound Images

Ultrasound simulation based on ray tracing enables the synthesis of highly realistic images. It can provide an interactive environment for training sonographers as an educational tool. However, due to high computational demand, there is a trade-off between image quality and interactivity, potentially leading to sub-optimal results at interactive rates. In this work we introduce a deep learning approach based on adversarial training that mitigates this trade-off by improving the quality of simulated images with constant computation time. An image-to-image translation framework is utilized to translate low quality images into high quality versions. To incorporate anatomical information potentially lost in low quality images, we additionally provide segmentation maps to image translation. Furthermore, we propose to leverage information from acoustic attenuation maps to better preserve acoustic shadows and directional artifacts, an invaluable feature for ultrasound image interpretation. The proposed method yields an improvement of 7.2% in Fréchet Inception Distance and 8.9% in patch-based Kullback-Leibler divergence.

preprint2020arXiv

Deep Network for Scatterer Distribution Estimation for Ultrasound Image Simulation

Simulation-based ultrasound training can be an essential educational tool. Realistic ultrasound image appearance with typical speckle texture can be modeled as convolution of a point spread function with point scatterers representing tissue microstructure. Such scatterer distribution, however, is in general not known and its estimation for a given tissue type is fundamentally an ill-posed inverse problem. In this paper, we demonstrate a convolutional neural network approach for probabilistic scatterer estimation from observed ultrasound data. We herein propose to impose a known statistical distribution on scatterers and learn the mapping between ultrasound image and distribution parameter map by training a convolutional neural network on synthetic images. In comparison with several existing approaches, we demonstrate in numerical simulations and with in-vivo images that the synthesized images from scatterer representations estimated with our approach closely match the observations with varying acquisition parameters such as compression and rotation of the imaged domain.

preprint2020arXiv

Deep Reinforcement Learning for Multi-Agent Power Control in Heterogeneous Networks

We consider a typical heterogeneous network (HetNet), in which multiple access points (APs) are deployed to serve users by reusing the same spectrum band. Since different APs and users may cause severe interference to each other, advanced power control techniques are needed to manage the interference and enhance the sum-rate of the whole network. Conventional power control techniques first collect instantaneous global channel state information (CSI) and then calculate sub-optimal solutions. Nevertheless, it is challenging to collect instantaneous global CSI in the HetNet, in which global CSI typically changes fast. In this paper, we exploit deep reinforcement learning (DRL) to design a multi-agent power control algorithm in the HetNet. To be specific, by treating each AP as an agent with a local deep neural network (DNN), we propose a multiple-actor-shared-critic (MASC) method to train the local DNNs separately in an online trial-and-error manner. With the proposed algorithm, each AP can independently use the local DNN to control the transmit power with only local observations. Simulations results show that the proposed algorithm outperforms the conventional power control algorithms in terms of both the converged average sum-rate and the computational complexity.

preprint2020arXiv

Describing Migdal effects in diamond crystal with atom-centered localized Wannier functions

Recent studies have theoretically investigated the atomic excitation and ionization induced by the dark matter (DM)-nucleus scattering, and it is found that the suddenly recoiled atom is much more likely to excite or lose its electrons than expected. Such phenomenon is called the "Migdal effect". In this paper, we extend the established strategy to describe the Migdal effect in isolated atoms to the case in semiconductors under the framework of tight-binding (TB) approximation. Since the localized aspects of electrons are respected in form of the Wannier functions (WFs), the extension of the existing Migdal approach for isolated atoms is much more natural, while the extensive nature of electrons in solids is reflected in the hopping integrals. We take diamond target as a concrete proof of principle for the methodology, and calculate relevant energy spectra and projected sensitivity of such diamond detector. It turns out that our method as a preliminary attempt is practically effective.

preprint2020arXiv

Edge-assisted Viewport Adaptive Scheme for real-time Omnidirectional Video transmission

Omnidirectional applications are immersive and highly interactive, which can improve the efficiency of remote collaborative work among factory workers. The transmission of omnidirectional video (OV) is the most important step in implementing virtual remote collaboration. Compared with the ordinary video transmission, OV transmission requires more bandwidth, which is still a huge burden even under 5G networks. The tile-based scheme can reduce bandwidth consumption. However, it neither accurately obtain the field of view(FOV) area, nor difficult to support real-time OV streaming. In this paper, we propose an edge-assisted viewport adaptive scheme (EVAS-OV) to reduce bandwidth consumption during real-time OV transmission. First, EVAS-OV uses a Gated Recurrent Unit(GRU) model to predict users' viewport. Then, users were divided into multicast clusters thereby further reducing the consumption of computing resources. EVAS-OV reprojects OV frames to accurately obtain users' FOV area from pixel level and adopt a redundant strategy to reduce the impact of viewport prediction errors. All computing tasks were offloaded to edge servers to reduce the transmission delay and improve bandwidth utilization. Experimental results show that EVAS-OV can save more than 60\% of bandwidth compared with the non-viewport adaptive scheme. Compared to a two-layer scheme with viewport adaptive, EVAS-OV still saves 30\% of bandwidth.

preprint2020arXiv

Joint Optimization of Spectrum and Energy Efficiency Considering the C-V2X Security: A Deep Reinforcement Learning Approach

Cellular vehicle-to-everything (C-V2X) communication, as a part of 5G wireless communication, has been considered one of the most significant techniques for Smart City. Vehicles platooning is an application of Smart City that improves traffic capacity and safety by C-V2X. However, different from vehicles platooning travelling on highways, C-V2X could be more easily eavesdropped and the spectrum resource could be limited when they converge at an intersection. Satisfying the secrecy rate of C-V2X, how to increase the spectrum efficiency (SE) and energy efficiency (EE) in the platooning network is a big challenge. In this paper, to solve this problem, we propose a Security-Aware Approach to Enhancing SE and EE Based on Deep Reinforcement Learning, named SEED. The SEED formulates an objective optimization function considering both SE and EE, and the secrecy rate of C-V2X is treated as a critical constraint of this function. The optimization problem is transformed into the spectrum and transmission power selections of V2V and V2I links using deep Q network (DQN). The heuristic result of SE and EE is obtained by the DQN policy based on rewards. Finally, we simulate the traffic and communication environments using Python. The evaluation results demonstrate that the SEED outperforms the DQN-wopa algorithm and the baseline algorithm by 31.83 % and 68.40 % in efficiency. Source code for the SEED is available at https://github.com/BandaidZ/OptimizationofSEandEEBasedonDRL.

preprint2020arXiv

Learning Global and Local Consistent Representations for Unsupervised Image Retrieval via Deep Graph Diffusion Networks

Diffusion has shown great success in improving accuracy of unsupervised image retrieval systems by utilizing high-order structures of image manifold. However, existing diffusion methods suffer from three major limitations: 1) they usually rely on local structures without considering global manifold information; 2) they focus on improving pair-wise similarities within existing images input output transductively while lacking flexibility to learn representations for novel unseen instances inductively; 3) they fail to scale to large datasets due to prohibitive memory consumption and computational burden due to intrinsic high-order operations on the whole graph. In this paper, to address these limitations, we propose a novel method, Graph Diffusion Networks (GRAD-Net), that adopts graph neural networks (GNNs), a novel variant of deep learning algorithms on irregular graphs. GRAD-Net learns semantic representations by exploiting both local and global structures of image manifold in an unsupervised fashion. By utilizing sparse coding techniques, GRAD-Net not only preserves global information on the image manifold, but also enables scalable training and efficient querying. Experiments on several large benchmark datasets demonstrate effectiveness of our method over state-of-the-art diffusion algorithms for unsupervised image retrieval.

preprint2020arXiv

Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling

This paper introduces our approaches for the Mask and Breathing Sub-Challenge in the Interspeech COMPARE Challenge 2020. For the mask detection task, we train deep convolutional neural networks with filter-bank energies, gender-aware features, and speaker-aware features. Support Vector Machines follows as the back-end classifiers for binary prediction on the extracted deep embeddings. Several data augmentation schemes are used to increase the quantity of training data and improve our models' robustness, including speed perturbation, SpecAugment, and random erasing. For the speech breath monitoring task, we investigate different bottleneck features based on the Bi-LSTM structure. Experimental results show that our proposed methods outperform the baselines and achieve 0.746 PCC and 78.8% UAR on the Breathing and Mask evaluation set, respectively.

preprint2020arXiv

Multi-robot Cooperative Object Transportation using Decentralized Deep Reinforcement Learning

Object transportation could be a challenging problem for a single robot due to the oversize and/or overweight issues. A multi-robot system can take the advantage of increased driving power and more flexible configuration to solve such a problem. However, increased number of individuals also changed the dynamics of the system which makes control of a multi-robot system more complicated. Even worse, if the whole system is sitting on a centralized decision making unit, the data flow could be easily overloaded due to the upscaling of the system. In this research, we propose a decentralized control scheme on a multi-robot system with each individual equipped with a deep Q-network (DQN) controller to perform an oversized object transportation task. DQN is a deep reinforcement learning algorithm thus does not require the knowledge of system dynamics, instead, it enables the robots to learn appropriate control strategies through trial-and-error style interactions within the task environment. Since analogous controllers are distributed on the individuals, the computational bottleneck is avoided systematically. We demonstrate such a system in a scenario of carrying an oversized rod through a doorway by a two-robot team. The presented multi-robot system learns abstract features of the task and cooperative behaviors are observed. The decentralized DQN-style controller is showing strong robustness against uncertainties. In addition, We propose a universal metric to assess the cooperation quantitatively.

preprint2020arXiv

One-Shot Object Detection without Fine-Tuning

Deep learning has revolutionized object detection thanks to large-scale datasets, but their object categories are still arguably very limited. In this paper, we attempt to enrich such categories by addressing the one-shot object detection problem, where the number of annotated training examples for learning an unseen class is limited to one. We introduce a two-stage model consisting of a first stage Matching-FCOS network and a second stage Structure-Aware Relation Module, the combination of which integrates metric learning with an anchor-free Faster R-CNN-style detection pipeline, eventually eliminating the need to fine-tune on the support images. We also propose novel training strategies that effectively improve detection performance. Extensive quantitative and qualitative evaluations were performed and our method exceeds the state-of-the-art one-shot performance consistently on multiple datasets.

preprint2020arXiv

Quantum simulation for three-dimensional chiral topological insulator

Quantum simulation, as a state-of-art technique, provides the powerful way to explore topological quantum phases beyond natural limits. Nevertheless, a previously-not-realized three-dimensional (3D) chiral topological insulator, and demonstrate by quantum quenches a complete study of both the bulk and surface topological physics. First, a dynamical bulk-surface correspondence in momentum space is observed, showing that the bulk topology of the 3D phase uniquely corresponds to the nontrivial quench dynamics emerging on 2D momentum hypersurfaces called band inversion surfaces (BISs), equivalent to the bulk-boundary correspondence in real space. Further, the symmetry protection of the 3D chiral phase is uncovered by measuring dynamical spin textures on BISs, which exhibit perfect (broken) topology when the chiral symmetry is preserved (broken). Finally we measure the topological charges to characterize directly the bulk topology, and identify an emergent dynamical topological transition when varying the quenches from deep to shallow regimes. This work opens a new avenue of quantum simulation towards for the complete study of topological quantum phases.

preprint2020arXiv

Real-Time Monitoring of Area Angles with Synchrophasor Measurements

This paper develops a comprehensive framework of Area Angle Monitoring (AAM) to monitor the stress of bulk power transfer across an area of a power transmission system in real-time. Area angle is calculated from synchrophasor measurements to provide alert to system operators if the area angle exceeds pre-defined thresholds. This paper proposes general methods to identify these warning and emergency thresholds, and tests a mitigation strategy to relieve the area stress when the area angle exceeds the threshold. In order to handle the limited coverage of synchrophasor measurements, this paper proposes methods to estimate phase angles for boundary buses without synchrophasor measurements, which extends the application of AAM. AAM is verified for a power transmission area in the Western Electricity Coordinating Council system with both simulated data and synchrophasor measurements recorded from real events. A utility deployment to test the framework for monitoring area angle with live-stream and recorded synchrophasor data is described.

preprint2020arXiv

Reinforcement Learning Driven Adaptive VR Streaming with Optical Flow Based QoE

With the merit of containing full panoramic content in one camera, Virtual Reality (VR) and 360-degree videos have attracted more and more attention in the field of industrial cloud manufacturing and training. Industrial Internet of Things (IoT), where many VR terminals needed to be online at the same time, can hardly guarantee VR's bandwidth requirement. However, by making use of users' quality of experience (QoE) awareness factors, including the relative moving speed and depth difference between the viewpoint and other content, bandwidth consumption can be reduced. In this paper, we propose OFB-VR (Optical Flow Based VR), an interactive method of VR streaming that can make use of VR users' QoE awareness to ease the bandwidth pressure. The Just-Noticeable Difference through Optical Flow Estimation (JND-OFE) is explored to quantify users' awareness of quality distortion in 360-degree videos. Accordingly, a novel 360-degree videos QoE metric based on PSNR and JND-OFE (PSNR-OF) is proposed. With the help of PSNR-OF, OFB-VR proposes a versatile-size tiling scheme to lessen the tiling overhead. A Reinforcement Learning(RL) method is implemented to make use of historical data to perform Adaptive BitRate(ABR). For evaluation, we take two prior VR streaming schemes, Pano and Plato, as baselines. Vast evaluations show that our system can increase the mean PSNR-OF score by 9.5-15.8% while maintaining the same rebuffer ratio compared with Pano and Plato in a fluctuate LTE bandwidth dataset. Evaluation results show that OFB-VR is a promising prototype for actual interactive industrial VR. A prototype of OFB-VR can be found in https://github.com/buptexplorers/OFB-VR.

preprint2020arXiv

The influence of anti-chiral edge states on Andreev reflection in graphene-superconductor junction

Using the tight binding model and the non-equilibrium Green function method, we study Andreev reflection in graphene-superconductor junction, where graphene has two nonequal Dirac Cones split in energy and therefore time reversal symmetry is broken. Due to the anti-chiral edge states of the current graphene model, an incident electron travelling along the edges makes distinct contribution to Andreev reflections. In a two-terminal device, because Andreev retro-reflection is not allowed for just the anti-chiral edges, in this case the mutual scattering between edge and bulk states is necessary, which leads that the coefficient of Andreev retro-reflection is always symmetrical about the incident energy. In a four-terminal junction, however, the edges are parallel to the interface of superconductor and graphene, so at the interface an incident electron travelling along the edges can be retro-reflected as a hole into bulk modes, or specularly reflected as a hole into anti-chiral edge states again. It is noted that, the coefficient of specular Andreev reflection keeps symmetric as to the incident energy of electron which is consistent with the reported results before, however the coefficient of Andreev retro-reflection shows an unexpected asymmetrical behavior due to the presence of anti-chiral edge states. Our results present some new ideas to study the anti-chiral edge modes and Andreev reflection for a graphene model with the broken time reversal symmetry.

preprint2020arXiv

Towards Palmprint Verification On Smartphones

With the rapid development of mobile devices, smartphones have gradually become an indispensable part of people's lives. Meanwhile, biometric authentication has been corroborated to be an effective method for establishing a person's identity with high confidence. Hence, recently, biometric technologies for smartphones have also become increasingly sophisticated and popular. But it is noteworthy that the application potential of palmprints for smartphones is seriously underestimated. Studies in the past two decades have shown that palmprints have outstanding merits in uniqueness and permanence, and have high user acceptance. However, currently, studies specializing in palmprint verification for smartphones are still quite sporadic, especially when compared to face- or fingerprint-oriented ones. In this paper, aiming to fill the aforementioned research gap, we conducted a thorough study of palmprint verification on smartphones and our contributions are twofold. First, to facilitate the study of palmprint verification on smartphones, we established an annotated palmprint dataset named MPD, which was collected by multi-brand smartphones in two separate sessions with various backgrounds and illumination conditions. As the largest dataset in this field, MPD contains 16,000 palm images collected from 200 subjects. Second, we built a DCNN-based palmprint verification system named DeepMPV+ for smartphones. In DeepMPV+, two key steps, ROI extraction and ROI matching, are both formulated as learning problems and then solved naturally by modern DCNN models. The efficiency and efficacy of DeepMPV+ have been corroborated by extensive experiments. To make our results fully reproducible, the labeled dataset and the relevant source codes have been made publicly available at https://cslinzhang.github.io/MobilePalmPrint/.

preprint2020arXiv

Value-Decomposition Networks based Distributed Interference Control in Multi-platoon Groupcast

Platooning is considered one of the most representative 5G use cases. Due to the small spacing within the platoon, the platoon needs more reliable transmission to guarantee driving safety while improving fuel and driving efficiency. However, efficient resource allocation between platoons has been a challenge, especially considering that the channel and power selected by each platoon will affect other platoons. Therefore, platoons need to coordinate with each other to ensure the groupcast quality of each platoon. To solve these challenges, we model the multi-platoon resource selection problem as Markov games and then propose a distributed resource allocation algorithm based on Value-Decomposition Networks. Our scheme utilizes the historical data of each platoon for centralized training. In distributed execution, agents only need their local observations to make decisions. At the same time, we decrease the training burden by sharing the neural network parameters. Simulation results show that the proposed algorithm has excellent convergence. Compared with another multi-agent algorithm (MARL) and random algorithm, our proposed solution can dramatically reduce the probability of platoon groupcast failure and improve the quality of platoon groupcast.

preprint2019arXiv

Incompatibility probability of random quantum measurements

Incompatibility of quantum measurements is of fundamental importance in quantum mechanics. It is closely related to many nonclassical phenomena such as Bell nonlocality, quantum uncertainty relations, and quantum steering. We study the necessary and sufficient conditions of quantum compatibility for a given collection of $n$ measurements in $d$-dimensional space. From the compatibility criterion for two-qubit measurements, we compute the incompatibility probability of a pair of independent random measurements. For a pair of unbiased random qubit measurements, we derive that the incompatibility probability is exactly $\frac35$. Detailed results are also presented in figures for pairs of general qubit measurements.

preprint2019arXiv

Multipartite Entanglement Measure and Complete Monogamy Relation

Although many different entanglement measures have been proposed so far, much less is known in the multipartite case, which leads to the previous monogamy relations in literatures are not complete. We establish here a strict framework for defining multipartite entanglement measure (MEM): apart from the postulates of bipartite measure, a genuine MEM should additionally satisfy the unification condition and the hierarchy condition. We then come up with a complete monogamy formula for the unified MEM and a tightly complete monogamy relation for the genuine MEM. Consequently, we propose MEMs which are multipartite extensions of entanglement of formation (EoF), concurrence, tangle, Tsallis $q$-entropy of entanglement, Rényi $α$-entropy of entanglement, the convex-roof extension of negativity and negativity, respectively. We show that (i) the extensions of EoF, concurrence, tangle, and Tsallis $q$-entropy of entanglement are genuine MEMs, (ii) multipartite extensions of Rényi $α$-entropy of entanglement, negativity and the convex-roof extension of negativity are unified MEMs but not genuine MEMs, and (iii) all these multipartite extensions are completely monogamous and the ones which are defined by the convex-roof structure (except for the Rényi $α$-entropy of entanglement and the convex-roof extension of negativity) are not only completely monogamous but also tightly completely monogamous. In addition, we find a class of tripartite states that one part can maximally entangled with other two parts simultaneously according to the definition of maximally entangled mixed state (MEMS) in [Quantum Inf. Comput. 12, 0063 (2012)]. Consequently, we improve the definition of maximally entangled state (MES) and prove that there is no MEMS and that the only MES is the pure MES.