Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
37works
0followers
28topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

37 published item(s)

preprint2026arXiv

GEAR: Granularity-Adaptive Advantage Reweighting for LLM Agents via Self-Distillation

Reinforcement learning has become a widely used post-training approach for LLM agents, where training commonly relies on outcome-level rewards that provide only coarse supervision. While finer-grained credit assignment is promising for effective policy updates, obtaining reliable local credit and assigning it to the right parts of the long-horizon trajectory remains an open challenge. In this paper, we propose Granularity-adaptivE Advantage Reweighting (GEAR), an adaptive-granularity credit assignment framework that reshapes the trajectory-level GRPO advantage using token- and segment-level signals derived from self-distillation. GEAR compares an on-policy student with a ground-truth-conditioned teacher to obtain a reference-guided divergence signal for identifying adaptive segment boundaries and modulating local advantage weights. This divergence often spikes at the onset of a semantic deviation, while later tokens in the same autoregressive continuation may return to low divergence. GEAR therefore treats such spikes as anchors for adaptive credit regions: where the student remains aligned with the teacher, token-level resolution is preserved; where it departs, GEAR groups the corresponding continuation into an adaptive segment and uses the divergence at the departure point to modulate the segment' s advantage. Experiments across eight mathematical reasoning and agentic tool-use benchmarks with Qwen3 4B and 8B models show that GEAR consistently outperforms standard GRPO, self-distillation-only baselines, and token- or turn-level credit-assignment methods. The gains are especially strong on benchmarks with lower GRPO baseline accuracy, reaching up to around 20\% over GRPO, suggesting that the proposed adaptive reweighting scheme is especially useful in more challenging long-horizon settings.

preprint2024arXiv

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

Correspondence pruning aims to find correct matches (inliers) from an initial set of putative correspondences, which is a fundamental task for many applications. The process of finding is challenging, given the varying inlier ratios between scenes/image pairs due to significant visual differences. However, the performance of the existing methods is usually limited by the problem of lacking visual cues (\eg texture, illumination, structure) of scenes. In this paper, we propose a Visual-Spatial Fusion Transformer (VSFormer) to identify inliers and recover camera poses accurately. Firstly, we obtain highly abstract visual cues of a scene with the cross attention between local features of two-view images. Then, we model these visual cues and correspondences by a joint visual-spatial fusion module, simultaneously embedding visual cues into correspondences for pruning. Additionally, to mine the consistency of correspondences, we also design a novel module that combines the KNN-based graph and the transformer, effectively capturing both local and global contexts. Extensive experiments have demonstrated that the proposed VSFormer outperforms state-of-the-art methods on outdoor and indoor benchmarks. Our code is provided at the following repository: https://github.com/sugar-fly/VSFormer.

preprint2023arXiv

A First Search for Solar $^8$B Neutrino in the PandaX-4T Experiment using Neutrino-Nucleus Coherent Scattering

A search for interactions from solar $^8$B neutrinos elastically scattering off xenon nuclei using PandaX-4T commissioning data is reported. The energy threshold of this search is further lowered compared with the previous search for dark matter, with various techniques utilized to suppress the background that emerges from data with the lowered threshold. A blind analysis is performed on the data with an effective exposure of 0.48 tonne$\cdot$year, and no significant excess of events is observed. Among results obtained using the neutrino-nucleus coherent scattering, our results give the best constraint on the solar $^8$B neutrino flux. We further provide a more stringent limit on the cross section between dark matter and nucleon in the mass range from 3 to 9 GeV/c$^2$.

preprint2022arXiv

A Gaseous Time Projection Chamber with Micromegas Readout for Low Radioactive Material Screening

Low radioactive material screening is becoming essential for rare event search experiments, such as neutrinoless double beta decay and dark matter searches in underground laboratories. A gaseous time projection chamber (TPC) can be used for such purposes with large active areas and high efficiency. A gaseous TPC with a Micromegas readout plane of approximately 20$\times$20 cm$^2$ is successfully constructed for surface alpha contamination measurements. We have characterized the energy resolution, gain stability, and tracking capability with calibration sources. With the unique track-related background suppression cuts of the gaseous TPC, we have established that the alpha background rate of the TPC is 0.13$\pm$0.03 $μ$Bq/cm$^2$, comparable to the leading commercial solutions.

preprint2022arXiv

A LN2-based cryogenic system prototype for future PandaX experiment

This paper describes results on R&D of an economical and efficient cryogenic system prototype for future liquid xenon detector. The test module of the prototype has a "cold head" attached to a copper rod, which is specially designed to transport heat loads to a free-boiling liquid nitrogen bath. The performance of the test module and commercial refrigerators is compared. The module with an optimized copper rod has demonstrated more than 1500W cooling power at 178K.The temperature of the "cold head" can be kept stable with an error of 0.02K, its fluctuation is within 0.1K.

preprint2022arXiv

A Search for the Cosmic Ray Boosted Sub-GeV Dark Matter at the PandaX-II Experiment

We report a novel search for the cosmic ray boosted dark matter using the 100~tonne$\cdot$day full data set of the PandaX-II detector located at the China Jinping Underground Laboratory. With the extra energy gained from the cosmic rays, sub-GeV dark matter particles can produce visible recoil signals in the detector. The diurnal modulations in rate and energy spectrum are utilized to further enhance the signal sensitivity. Our result excludes the dark matter-nucleon elastic scattering cross section between 10$^{-31}$cm$^{2}$ and 10$^{-28}$cm$^{2}$ for a dark matter masses from 0.1 MeV/$c^2$ to 0.1 GeV/$c^2$, with a large parameter space previously unexplored by experimental collaborations.

preprint2022arXiv

A search for two-component Majorana dark matter in a simplified model using the full exposure data of PandaX-II experiment

In the two-component Majorana dark matter model, one dark matter particle can scatter off the target nuclei, and turn into a slightly heavier component. In the framework of a simplified model with a vector boson mediator, both the tree-level and loop-level processes contribute to the signal in direct detection experiment. In this paper, we report the search results for such dark matter from PandaX-II experiment, using total data of the full 100.7 tonne$\cdot$day exposure. No significant excess is observed, so strong constraints on the combined parameter space of mediator mass and dark matter mass are derived. With the complementary search results from collider experiments, a large range of parameter space can be excluded.

preprint2022arXiv

Curriculum Offline Imitation Learning

Offline reinforcement learning (RL) tasks require the agent to learn from a pre-collected dataset with no further interactions with the environment. Despite the potential to surpass the behavioral policies, RL-based methods are generally impractical due to the training instability and bootstrapping the extrapolation errors, which always require careful hyperparameter tuning via online evaluation. In contrast, offline imitation learning (IL) has no such issues since it learns the policy directly without estimating the value function by bootstrapping. However, IL is usually limited in the capability of the behavioral policy and tends to learn a mediocre behavior from the dataset collected by the mixture of policies. In this paper, we aim to take advantage of IL but mitigate such a drawback. Observing that behavior cloning is able to imitate neighboring policies with less data, we propose \textit{Curriculum Offline Imitation Learning (COIL)}, which utilizes an experience picking strategy for imitating from adaptive neighboring policies with a higher return, and improves the current policy along curriculum stages. On continuous control benchmarks, we compare COIL against both imitation-based and RL-based methods, showing that it not only avoids just learning a mediocre behavior on mixed datasets but is also even competitive with state-of-the-art offline RL methods.

preprint2022arXiv

Design and Operation of the PandaX-4T High Speed Ultra-high Purity Xenon Recuperation System

In order to recuperate the ultra-high purity xenon from PandaX-4T dark matter detector to high-pressure gas cylinders in emergency or at the end-of-run situation, a high speed ultra-high purity xenon recuperation system is designed and developed. This system includes a diaphragm pump, the heat management system, the main recuperation pipeline, the reflux pipeline, the auxiliary recuperation pipeline and the automatic control system. The liquid xenon in the detector is vaporized by the heat management system, and the gaseous xenon is compressed to 6 MPa at the flow rate of 200 standard litres per minute (SLPM) using the diaphragm compressor. The high-pressure xenon is filled into 128 gas cylinders via the main recuperation pipeline. During the recuperation, the low pressure and temperature conditions of 2 ~ 3 atmospheres and 178 ~ 186.5 K in PandaX-4T dark matter detector are kept by the cooperation of the main recuperation pipeline, reflux pipeline and the auxiliary recuperation pipeline to guarantee the safety, and the purity of the recuperated xenon gas is measured to ensure no contamination happened. The development of the high speed ultra-high purity xenon recuperation system is important for the operation of large-scale dark matter detectors with the requirements of strict temperature and pressure environment and low background.

preprint2022arXiv

Glass-patternable notch-shaped microwave architecture for on-chip spin detection in biological samples

We report a notch-shaped coplanar microwave waveguide antenna on a glass plate designed for on-chip detection of optically detected magnetic resonance (ODMR) of fluorescent nanodiamonds (NDs). A lithographically patterned thin wire at the center of the notch area in the coplanar waveguide realizes a millimeter-scale ODMR detection area (1.5 x 2.0 mm^2) and gigahertz-broadband characteristics with low reflection (about 8%). The ODMR signal intensity in the detection area is quantitatively predictable by numerical simulation. Using this chip device, we demonstrate a uniform ODMR signal intensity over the detection area for cells, tissue, and worms. The present demonstration of a chip-based microwave architecture will enable scalable chip integration of ODMR-based quantum sensing technology into various bioassay platforms.

preprint2022arXiv

Inspector: Pixel-Based Automated Game Testing via Exploration, Detection, and Investigation

Deep reinforcement learning (DRL) has attracted much attention in automated game testing. Early attempts rely on game internal information for game space exploration, thus requiring deep integration with games, which is inconvenient for practical applications. In this work, we propose using only screenshots/pixels as input for automated game testing and build a general game testing agent, Inspector, that can be easily applied to different games without deep integration with games. In addition to covering all game space for testing, our agent tries to take human-like behaviors to interact with key objects in a game, since some bugs usually happen in player-object interactions. Inspector is based on purely pixel inputs and comprises three key modules: game space explorer, key object detector, and human-like object investigator. Game space explorer aims to explore the whole game space by using a curiosity-based reward function with pixel inputs. Key object detector aims to detect key objects in a game, based on a small number of labeled screenshots. Human-like object investigator aims to mimic human behaviors for investigating key objects via imitation learning. We conduct experiments on two popular video games: Shooter Game and Action RPG Game. Experiment results demonstrate the effectiveness of Inspector in exploring game space, detecting key objects, and investigating objects. Moreover, Inspector successfully discovers two potential bugs in those two games. The demo video of Inspector is available at https://github.com/Inspector-GameTesting/Inspector-GameTesting.

preprint2022arXiv

Low Radioactive Material Screening and Background Control for the PandaX-4T Experiment

PandaX-4T is a ton-scale dark matter direct detection experiment using a dual-phase TPC technique at the China Jinping Underground Laboratory. Various ultra-low background technologies have been developed and applied to material screening for PandaX-4T, including HPGe gamma spectroscopy, ICP-MS, NAA, radon emanation measurement system, krypton assay station, and alpha detection system. Low background materials were selected to assemble the detector. Surface treatment procedures were investigated to further suppress radioactive background. Combining measured results and Monte Carlo simulation, the total material background rates of PandaX-4T in the energy region of 1-25 keV$\rm{}_{ee}$ are estimated to be (9.9 $\pm$ 1.9) $\times \ 10^{-3}$ mDRU for electron recoil and (2.8 $\pm$ 0.6) $\times \ 10^{-4}$ mDRU for nuclear recoil. In addition, $^{nat}$Kr in the detector is estimated to be <8 ppt.

preprint2022arXiv

Neutron-induced nuclear recoil background in the PandaX-4T experiment

Neutron-induced nuclear recoil background is critical to the dark matter searches in the PandaX-4T liquid xenon experiment. This paper studies the feature of neutron background in liquid xenon and evaluates their contribution in the single scattering nuclear recoil events through three methods. The first method is fully Monte Carlo simulation based. The last two are data-driven methods that also use the multiple scattering signals and high energy signals in the data, respectively. In the PandaX-4T commissioning data with an exposure of 0.63 tonne-year, all these methods give a consistent result that there are $1.15\pm0.57$ neutron-induced background in dark matter signal region within an approximated nuclear recoil energy window between 5 and 100 keV.

preprint2022arXiv

Parallel Multi-Stage Preconditioners with Adaptive Setup for the Black Oil Model

The black oil model is widely used to describe multiphase porous media flow in the petroleum industry. The fully implicit method features strong stability and weak constraints on time step-sizes; hence, commonly used in the current mainstream commercial reservoir simulators. In this paper, a CPR-type preconditioner with an adaptive &#34;setup phase&#34; is developed to improve parallel efficiency of petroleum reservoir simulation. Furthermore, we propose a multi-color Gauss-Seidel (GS) algorithm for algebraic multigrid method based on the coefficient matrix of strong connections. Numerical experiments show that the proposed preconditioner can improve the parallel performance for both OpenMP and CUDA implements. Moreover, the proposed algorithm yields good parallel speedup as well as same convergence behavior as the corresponding single-threaded algorithm. In particular, for a three-phase benchmark problem, the parallel speedup of the OpenMP version is over 6.5 with 16 threads and the CUDA version reaches more than 9.5.

preprint2022arXiv

Shadow and weak deflection angle of a black hole in nonlocal gravity

Black hole shadow and gravitational lensing play important roles in testing gravitational theories in the strong field regime. As the first-order modifications from quantum gravity, the nonlocality can be manifested by black hole shadow and gravitational lensing. For example, the cut-off parameter introduced by nonlocality will affect the shape and size of the black hole shadow, and also affect the deflection angle of light rays. In this paper, we mainly investigate the effects of the nonlocality on the black hole shadow and the gravitational lensing for two types of rotating black holes in nonlocal gravity. It is found that the size of the black hole shadow decreases with the cut-off parameter since the nonlocality weakens the gravitational constant, and the shape of the shadow gets more deformed with the increase of the cut-off parameter. However, if the rotation parameter is small, the shape of the shadow is almost a circle even though the cut-off parameter approaches its maximum. The energy emission rate in both models is also studied. The results show that there is a peak for each curve and the peak decreases and shifts to the low frequency with the increase of the cut-off parameter. Besides, we also explore the shadow of both types of black holes surrounded by a nonmagnetized pressureless plasma which satisfies the separability condition. It is found that the plasma has a frequency-dependent dispersive effect on the size and shape of the black hole shadow. For the gravitational lensing, we find that the cut-off parameter of model A makes a positive contribution to the deflection angle, which can be compared with the contribution of the rotation parameter, while the cut-off parameter of model B makes a negative contribution which can be ignored. These results may be helpful for probing nonlocal gravity in future observations.

preprint2022arXiv

Spontaneously translational symmetry breaking in the excited states of holographic superconductor

We revisit HHH model in [Phys. Rev. Lett. {\bf 101}, 031601 (2008)] and extend the ansatz of matter fields to being of depending on a spatial dimension except the holographic direction. Despite homogeneous solutions of ground and excited states, especially for the excited states, there also exists solutions where the translational invariance is broken. It is worth mentioning that no periodic sources are assigned to the matter fields, so the translational symmetry is broken spontaneously. We investigate how the new solutions and the condensates of excited states develop with the change of temperature. Moreover, since this kind of condensate will decrease at certain temperature and eventually vanish at sufficiently low temperature, we also study the relation between this interval and length of lattice. Besides, we compare the free energies of non-translational invariant solutions and those of translational invariance in the HHH model, and find that the free energies of the former situations are lower.

preprint2022arXiv

Study of background from accidental coincidence signals in the PandaX-II experiment

The PandaX-II experiment employed a 580kg liquid xenon detector to search for the interactions between dark matter particles and the target xenon atoms. The accidental coincidences of isolated signals result in a dangerous background which mimic the signature of the dark matter. We performed a detailed study on the accidental coincidence background in PandaX-II, including the possible origin of the isolated signals, the background level and corresponding background suppression method. With a boosted-decision-tree algorithm, the accidental coincidence background is reduced by 70% in the dark matter signal region, thus the sensitivity of dark matter search at PandaX-II is improved.

preprint2022arXiv

Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality

Deployment efficiency is an important criterion for many real-world applications of reinforcement learning (RL). Despite the community&#39;s increasing interest, there lacks a formal theoretical formulation for the problem. In this paper, we propose such a formulation for deployment-efficient RL (DE-RL) from an &#34;optimization with constraints&#34; perspective: we are interested in exploring an MDP and obtaining a near-optimal policy within minimal \emph{deployment complexity}, whereas in each deployment the policy can sample a large batch of data. Using finite-horizon linear MDPs as a concrete structural model, we reveal the fundamental limit in achieving deployment efficiency by establishing information-theoretic lower bounds, and provide algorithms that achieve the optimal deployment efficiency. Moreover, our formulation for DE-RL is flexible and can serve as a building block for other practically relevant settings; we give &#34;Safe DE-RL&#34; and &#34;Sample-Efficient DE-RL&#34; as two examples, which may be worth future investigation.

preprint2021arXiv

A conjecture of thermo-gravitation based on geometry, classical physics and classical thermodynamics

One of the goals that physicists have been pursuing is to get the same explanation from different angles for the same phenomenon, so as to realize the unity of basic physical laws. Geometry, classical mechanics and classical thermodynamics are three relatively old disciplines. Their research methods and perspectives for the same phenomenon are quite different. However, there must be some undetermined connections and symmetries among them. In previous studies, there is a lack of horizontal analogical research on the basic theories of different disciplines, but revealing the deep connections between them will help to deepen the understanding of the existing system and promote the common development of multiple disciplines. Using the method of analogy analysis, five basic axioms of geometry, four laws of classical mechanics and four laws of thermodynamics are compared and analyzed. The similarity and relevance of basic laws between different disciplines is proposed. Then, by comparing the axiom of circle in geometry and Newton&#39;s law of universal gravitation, the conjecture of the law of thermo-gravitation is put forward.

preprint2021arXiv

Dark Matter Search Results from the PandaX-4T Commissioning Run

We report the first dark matter search results using the commissioning data from PandaX-4T. Using a time projection chamber with 3.7-tonne of liquid xenon target and an exposure of 0.63 tonne$\cdot$year, 1058 candidate events are identified within an approximate nuclear recoil energy window between 5 and 100 keV. No significant excess over background is observed. Our data set a stringent limit to the dark matter-nucleon spin-independent interactions, with a lowest excluded cross section (90% C.L.) of $3.8\times10^{-47} $cm$^2$ at a dark matter mass of 30 GeV/$c^2$.

preprint2021arXiv

First-order formalism and thick branes in mimetic gravity

In this paper, we investigate thick branes generated by a scalar field in mimetic gravity theory. By introducing two auxiliary super-potentials, we transform the second-order field equations of the system into a set of first-order equations. With this first-order formalism, several types of analytical thick brane solutions are obtained. Then, tensor and scalar perturbations are analysed. We find that both kinds of perturbations are stable. The effective potentials for the tensor and scalar perturbations are dual to each other. The tensor zero mode can be localized on the brane while the scalar zero mode cannot. Thus, the four-dimensional Newtonian potential can be recovered on the brane.

preprint2021arXiv

Internal Calibration of the PandaX-II Detector with Radon Gaseous Sources

We have developed a low-energy electron recoil (ER) calibration method with $^{220}$Rn for the PandaX-II detector. $^{220}$Rn, emanated from natural thorium compounds, was fed into the detector through the xenon purification system. From 2017 to 2019, we performed three dedicated calibration campaigns with different radon sources. We studied the detector response to $α$, $β$, and $γ$ particles with focus on low energy ER events. During the runs in 2017 and 2018, the amount of radioactivity of $^{222}$Rn were on the order of 1\% of that of $^{220}$Rn and thorium particulate contamination was negligible, especially in 2018. We also measured the background contribution from $^{214}$Pb for the first time in PandaX-II with the help from a $^{222}$Rn injection. Calibration strategy with $^{220}$Rn and $^{222}$Rn will be implemented in the upcoming PandaX-4T experiment and can be useful for other xenon-based detectors as well.

preprint2021arXiv

Light yield and field dependence measurement in PandaX-II dual-phase xenon detector

The dual-phase xenon time projection chamber (TPC) is one of the most sensitive detector technology for dark matter direct search, where the energy deposition of incoming particle can be converted into photons and electrons through xenon excitation and ionization. The detector response to signal energy deposition varies significantly with the electric field in liquid xenon. We study the detector&#39;s light yield and its dependence on the electric field in the PandaX-II dual-phase detector containing 580~kg liquid xenon in the sensitive volume. From our measurements, the light yield at electric fields from 0~V/cm to 317~V/cm is obtained for energy depositions up to 236~keV.

preprint2021arXiv

Photovoltaic Self-Powered Gas Sensing: A Review

The self-powered sensing system could harness ambient energy to power the sensor without the need for external electrical energy. Recently, the concept of photovoltaic (PV) self-powered gas sensing has aroused wider attentions due to room-temperature operation, low power consumption, small size and potential applications. The PV self-powered gas sensors integrate the photovoltaic effects and the gas sensing function into a single chip, which could truly achieve the goal of zero power consumption for an independent gas sensing device. As an emerging concept, the PV self-powered gas sensing has been achieved by using different strategies, including integrated gas sensor and solar cell, integrated light filter and solar cell, gas-sensitive heterojunction photovoltaics, and gas-sensitive lateral photovoltaics, respectively. The purpose of this review is to summarize recent advances of PV self-powered gas sensing and also remark on the directions for future research in this topic.

preprint2021arXiv

Results of Dark Matter Search using the Full PandaX-II Exposure

We report the dark matter search results obtained using the full 132 ton$\cdot$day exposure of the PandaX-II experiment, including all data from March 2016 to August 2018. No significant excess of events is identified above the expected background. Upper limits are set on the spin-independent dark matter-nucleon interactions. The lowest 90% confidence level exclusion on the spin-independent cross section is $2.2\times 10^{-46}$ cm$^2$ at a WIMP mass of 30 GeV/$c^2$.

preprint2021arXiv

Return-Based Contrastive Representation Learning for Reinforcement Learning

Recently, various auxiliary tasks have been proposed to accelerate representation learning and improve sample efficiency in deep reinforcement learning (RL). However, existing auxiliary tasks do not take the characteristics of RL problems into consideration and are unsupervised. By leveraging returns, the most important feedback signals in RL, we propose a novel auxiliary task that forces the learnt representations to discriminate state-action pairs with different returns. Our auxiliary loss is theoretically justified to learn representations that capture the structure of a new form of state-action abstraction, under which state-action pairs with similar return distributions are aggregated together. In low data regime, our algorithm outperforms strong baselines on complex tasks in Atari games and DeepMind Control suite, and achieves even better performance when combined with existing auxiliary tasks.

preprint2021arXiv

Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data

This paper presents a novel framework to build a voice conversion (VC) system by learning from a text-to-speech (TTS) synthesis system, that is called TTS-VC transfer learning. We first develop a multi-speaker speech synthesis system with sequence-to-sequence encoder-decoder architecture, where the encoder extracts robust linguistic representations of text, and the decoder, conditioned on target speaker embedding, takes the context vectors and the attention recurrent network cell output to generate target acoustic features. We take advantage of the fact that TTS system maps input text to speaker independent context vectors, and reuse such a mapping to supervise the training of latent representations of an encoder-decoder voice conversion system. In the voice conversion system, the encoder takes speech instead of text as input, while the decoder is functionally similar to TTS decoder. As we condition the decoder on speaker embedding, the system can be trained on non-parallel data for any-to-any voice conversion. During voice conversion training, we present both text and speech to speech synthesis and voice conversion networks respectively. At run-time, the voice conversion network uses its own encoder-decoder architecture. Experiments show that the proposed approach outperforms two competitive voice conversion baselines consistently, namely phonetic posteriorgram and variational autoencoder methods, in terms of speech quality, naturalness, and speaker similarity.

preprint2020arXiv

A Vision of C-V2X: Technologies, Field Testing and Challenges with Chinese Development

C-V2X (Cellular Vehicle-to-Everything) is the important enabling technology for autonomous driving and intelligent transportation systems. It evolves from LTE (Long Term Evolution)-V2X to NR (New Radio)-V2X, which will coexist and be complementary with each other to provide low latency, high reliability, and high throughput communications for various C-V2X applications. In this article, a vision of C-V2X is presented. The requirements of the basic road safety and advanced applications, the architecture, the key technologies, and the standards of C-V2X are introduced, highlighting the technical evolution path from LTE-V2X to NR-V2X. Especially, based on the continual and active promotion of C-V2X research, field testing and development in China, the related works and progresses are also presented. Lastly, the trends of C-V2X applications with technical challenges are envisioned.

preprint2020arXiv

Excited states of holographic superconductors

In this paper we re-investigate the model of the anti-de Sitter gravity coupled to Maxwell and charged scalar fields, which has been studied as the gravitational dual to a superconductor for a long time since the famous work [Phys.\ Rev.\ Lett.\ {\bf 101}, 031601 (2008)]. By numerical method, we present a novel family of solutions of holographical superconductor with excited states, and find there exists a lower critical temperature in the corresponding excited state. Moreover, we study the condensate and conductivity in the excited states. It is very interesting that the conductivity $σ$ of each excited state has an additional pole in $\text{Im}[σ]$ and a delta function in $\text{Re}[σ]$ arising at the low temperature inside the gap, which is just the evidence of the existence of excited states.

preprint2020arXiv

Experimental Search for Dark Matter in China

The nature of dark matter is one of the greatest mysteries in modern physics and astronomy. A wide variety of experiments have been carried out worldwide to search for the evidence of particle dark matter. Chinese physicists started experimental search for dark matter about ten years ago, and have produced results with high scientific impact. In this paper, we present an overview of the dark matter program in China, and discuss recent results and future directions.

preprint2020arXiv

Fully Parameterized Quantile Function for Distributional Reinforcement Learning

Distributional Reinforcement Learning (RL) differs from traditional RL in that, rather than the expectation of total returns, it estimates distributions and has achieved state-of-the-art performance on Atari Games. The key challenge in practical distributional RL algorithms lies in how to parameterize estimated distributions so as to better approximate the true continuous distribution. Existing distributional RL algorithms parameterize either the probability side or the return value side of the distribution function, leaving the other side uniformly fixed as in C51, QR-DQN or randomly sampled as in IQN. In this paper, we propose fully parameterized quantile function that parameterizes both the quantile fraction axis (i.e., the x-axis) and the value axis (i.e., y-axis) for distributional RL. Our algorithm contains a fraction proposal network that generates a discrete set of quantile fractions and a quantile value network that gives corresponding quantile values. The two networks are jointly trained to find the best approximation of the true distribution. Experiments on 55 Atari Games show that our algorithm significantly outperforms existing distributional RL algorithms and creates a new record for the Atari Learning Environment for non-distributed agents.

preprint2020arXiv

Kaluza-Klein modes of $U(1)$ gauge vector field on brane with codimension-$d$

From the paper [JHEP 01 (2019) 021], it is known that the effective action of a massless $U(1)$ gauge vector field on a codimension-2 brane is gauge invariant due to the coupling between the vector Kaluza-Klein (KK) modes with two types of scalar KK modes. It is interesting to generalize this result to a brane world model with an arbitrary number of extra dimensions. In this work, we first investigate the case with three extra dimensions. After KK decomposition, there are three types of scalar KK modes. In addition to the mutual coupling between these scalar modes, there are also coupling between the scalar and the vector KK modes. The coupling constants are not all independent. The relationships between the coupling constants enable us to obtain a gauge invariant effective action, from which we can see that the masses of the vector KK modes depend on all the three extra dimensions. The masses of the scalar modes, however, depend only on two of the three extra dimensions. Then we generalize the results into branes with codimension $d$ ($d=1, 2...$), and find that $d$ will directly affect the masses of the KK modes. But there is always a gauge invariant effective action for the massive vector KK modes.

preprint2020arXiv

Spherical Accretion Flow onto General Parameterized Spherically Symmetric Black Hole Spacetimes

The transonic phenomenon of black hole accretion and the existence of the photon sphere are the characteristics of strong gravitational fields near a black hole horizon. In this work, we study spherical accretion flow onto a general parametrized spherically symmetric black hole spacetimes. For this purpose, we analyze the accretion process of various perfect fluids, such as the isothermal fluid of ultra-stiff, ultra-relativistic, and sub-relativistic types and polytropic fluid, respectively. The influences of extra parameters beyond the Schwarzschild black hole in the general parameterized spherically symmetric black hole on the flow behaviors of the above-mentioned test fluids are studied in detail. In addition, by studying the accretion of ideal photon gas, we further discuss the correspondence between the sonic radius of accreting photon gas and the photon sphere for the general parameterized spherically symmetric black hole. Some possible future extensions of our analysis are also discussed.

preprint2020arXiv

Suphx: Mastering Mahjong with Deep Reinforcement Learning

Artificial Intelligence (AI) has achieved great success in many domains, and game AI is widely regarded as its beachhead since the dawn of AI. In recent years, studies on game AI have gradually evolved from relatively simple environments (e.g., perfect-information games such as Go, chess, shogi or two-player imperfect-information games such as heads-up Texas hold&#39;em) to more complex ones (e.g., multi-player imperfect-information games such as multi-player Texas hold&#39;em and StartCraft II). Mahjong is a popular multi-player imperfect-information game worldwide but very challenging for AI research due to its complex playing/scoring rules and rich hidden information. We design an AI for Mahjong, named Suphx, based on deep reinforcement learning with some newly introduced techniques including global reward prediction, oracle guiding, and run-time policy adaptation. Suphx has demonstrated stronger performance than most top human players in terms of stable rank and is rated above 99.99% of all the officially ranked human players in the Tenhou platform. This is the first time that a computer program outperforms most top human players in Mahjong.

preprint2020arXiv

Trimming the Sail: A Second-order Learning Paradigm for Stock Prediction

Nowadays, machine learning methods have been widely used in stock prediction. Traditional approaches assume an identical data distribution, under which a learned model on the training data is fixed and applied directly in the test data. Although such assumption has made traditional machine learning techniques succeed in many real-world tasks, the highly dynamic nature of the stock market invalidates the strict assumption in stock prediction. To address this challenge, we propose the second-order identical distribution assumption, where the data distribution is assumed to be fluctuating over time with certain patterns. Based on such assumption, we develop a second-order learning paradigm with multi-scale patterns. Extensive experiments on real-world Chinese stock data demonstrate the effectiveness of our second-order learning paradigm in stock prediction.

preprint2019arXiv

Searching for Neutrino-less Double Beta Decay of $^{136}$Xe with PandaX-II Liquid Xenon Detector

We report the Neutrino-less Double Beta Decay (NLDBD) search results from PandaX-II dual-phase liquid xenon time projection chamber. The total live time used in this analysis is 403.1 days from June 2016 to August 2018. With NLDBD-optimized event selection criteria, we obtain a fiducial mass of 219 kg of natural xenon. The accumulated xenon exposure is 242 kg$\cdot$yr, or equivalently 22.2 kg$\cdot$yr of $^{136}$Xe exposure. At the region around $^{136}$Xe decay Q-value of 2458 keV, the energy resolution of PandaX-II is 4.2%. We find no evidence of NLDBD in PandaX-II and establish a lower limit for decay half-life of 2.4 $ \times 10^{23} $ yr at the 90% confidence level, which corresponds to an effective Majorana neutrino mass $m_{ββ} < (1.3 - 3.5)$ eV. This is the first NLDBD result reported from a dual-phase xenon experiment.

preprint2019arXiv

Thick brane in reduced Horndeski theory

Horndeski theory is the most general scalar-tensor theory retaining second-order field equations, although the action includes higher-order terms. This is achieved by a special choice of coupling constants. In this paper, we investigate thick brane system in reduced Horndeski theory, especially the effect of the non-minimal derivative coupling on thick brane. First, the equations of motion are presented and a set of analytic background solutions are obtained. Then, to investigate the stability of the background scalar profile, we present a novel canonically normalized method, and show that although the original background scalar field is unstable, the canonical one is stable. The stability of the thick brane under tensor perturbation is also considered. It is shown that the tachyon is absent and the graviton zero mode can be localized on the brane. The localized graviton zero mode recovers the four-dimensional Newtonian potential and the presence of the non-minimal derivative coupling results in a splitting of its wave function. The correction of the massive graviton KK modes to the Newtonian potential is also analyzed briefly.