Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
17topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance

Reinforcement Learning with Verifiable Rewards (RLVR) has achieved great success in developing Large Language Models (LLMs) with chain-of-thought rollouts for many tasks such as math and coding. Nevertheless, RLVR struggles with sample efficiency on difficult problems where correct rollouts are hard to generate. Prior works propose to address this issue via demonstration-guided RLVR, i.e., to conduct Supervised FineTuning (SFT) when RL fails; however, SFT often requires a lot of data, which can be expensive to acquire. In this paper, we propose FEST, a FEw-ShoT demonstration-guided RLVR algorithm. It attains compelling results with only 128 demonstrations randomly selected from an SFT dataset. We find that three components are vital for the success: supervised signal, on-policy signal, and decaying weights on the few-shot SFT dataset to prevent overfitting from multiple-epoch training. On several benchmarks, FEST outperforms baselines with magnitudes less SFT data, even matching their performance with full dataset.

preprint2022arXiv

Improving the estimation of directional area scattering factor (DASF) from canopy reflectance: theoretical basis and validation

Directional area scattering factor (DASF) is a critical canopy structural parameter for vegetation monitoring. It provides an efficient tool for decoupling of canopy structure and leaf optics from canopy reflectance. Current standard approach to estimate DASF from canopy bidirectional reflectance factor (BRF) is based on the assumption that in the weakly absorbing 710 to 790 nm spectral interval, leaf scattering does not change much with the concentration of dry matter and thus its variation can be neglected. This results in biased estimates of DASF and consequently leads to uncertainty in DASF-related applications. This study proposes a new approach to account for variations in concentrations of this biochemical constituent, which additionally uses the canopy BRF at 2260 nm. In silico analysis of the proposed approach suggests significant increase in accuracy over the standard technique by a relative root mean square error (rRMSE) of 49% and 34% for one- and three dimensional scenes, respectively. When compared with indoor multi-angular hyperspectral measurements reported in literature, the mean absolute error has reduced by 68% for needle leaf and 20% for broadleaf canopies. Thus, the proposed DASF estimation approach outperforms the current one and can be used more reliably in DASF-related applications, such as vegetation monitoring of functional traits, dynamics, and radiation budget.

preprint2022arXiv

Physics-Based Inverse Rendering using Combined Implicit and Explicit Geometries

Mathematically representing the shape of an object is a key ingredient for solving inverse rendering problems. Explicit representations like meshes are efficient to render in a differentiable fashion but have difficulties handling topology changes. Implicit representations like signed-distance functions, on the other hand, offer better support of topology changes but are much more difficult to use for physics-based differentiable rendering. We introduce a new physics-based inverse rendering pipeline that uses both implicit and explicit representations. Our technique enjoys the benefit of both representations by supporting both topology changes and differentiable rendering of complex effects such as environmental illumination, soft shadows, and interreflection. We demonstrate the effectiveness of our technique using several synthetic and real examples.

preprint2022arXiv

Prospect of detecting TeV halos with LHAASO: in the framework of the anisotropic diffusion model

The particle diffusion coefficients of three TeV pulsar halos observed so far are inferred to be significantly smaller than the typical value of the interstellar medium (ISM). The anisotropic diffusion model ascribes the slow diffusion to the cross-field diffusion assuming sub-Alfv{é}nic turbulence in the ISM around the pulsar if the viewing angle between the observer's line-of-sight (LOS) to the pulsar and the local mean field direction is small. In general, the TeV halo's morphology under this model highly depends on the viewing angle, and an elongated, asymmetric morphology is predicted if the LOS is not approximately aligned with the local mean field direction. While the specific requirement of a small viewing angle is supposedly established only for a small fraction of TeV halos, TeV halos with apparent asymmetric morphology has not been detected. In this paper we will study the expectation of TeV halos measured by the TeV-PeV gamma-ray detector LHAASO in the framework of anisotropic diffusion model, with a particular focus on the influence of the viewing angle on the detectability. We show that a TeV halo is more detectable with a smaller viewing angle and this selection effect may explain why the morphologies of all three detected TeV halos so far are consistent being spherical. We also demonstrate that LHAASO is capable of detecting asymmetric TeV halos after several-year operation with reasonable source parameters. This can serve as a critical test of the anisotropic diffusion model.

preprint2022arXiv

Realization of fast all-microwave CZ gates with a tunable coupler

The development of high-fidelity two-qubit quantum gates is essential for digital quantum computing. Here, we propose and realize an all-microwave parametric Controlled-Z (CZ) gates by coupling strength modulation in a superconducting Transmon qubit system with tunable couplers. After optimizing the design of the tunable coupler together with the control pulse numerically, we experimentally realized a 100 ns CZ gate with high fidelity of 99.38%$ \pm$0.34% and the control error being 0.1%. We note that our CZ gates are not affected by pulse distortion and do not need pulse correction, {providing a solution for the real-time pulse generation in a dynamic quantum feedback circuit}. With the expectation of utilizing our all-microwave control scheme to reduce the number of control lines through frequency multiplexing in the future, our scheme draws a blueprint for the high-integrable quantum hardware design.

preprint2021arXiv

Decoherence of Dirac-particle quantumness for fermionic fields in a dilatonic black hole

The quantumness of Dirac paticles for quantized fields in a dilatonic black hole is estimated by means of quantum channel. We develop a general Bloch vector representation of quantum channel in black hole spacetimes beyond single mode approximation. The nonclassicality of Dirac particles can be measured by the minimization of quantum coherence over all orthonormal basis sets. The quantumness of the channel decreases as the dilaton parameter increases. The interplay between the external reservoir noise and dilaton black hole on the dynamical behavior of quantum coherence and steerability is investigated in the Pauli basis. The external environment is modelled by a random telegraph noise channel. The monotonous decay of quantum nonlocality occurs in the weak coupling case. The degradation and revival of quantum nonlocality are observed in the strong coupling condition. It is found that quantum fluctuation effects of the external reservoir can protect quantum coherence and steerability from the information loss of the black hole.

preprint2020arXiv

Algorithms in Multi-Agent Systems: A Holistic Perspective from Reinforcement Learning and Game Theory

Deep reinforcement learning (RL) has achieved outstanding results in recent years, which has led a dramatic increase in the number of methods and applications. Recent works are exploring learning beyond single-agent scenarios and considering multi-agent scenarios. However, they are faced with lots of challenges and are seeking for help from traditional game-theoretic algorithms, which, in turn, show bright application promise combined with modern algorithms and boosting computing power. In this survey, we first introduce basic concepts and algorithms in single agent RL and multi-agent systems; then, we summarize the related algorithms from three aspects. Solution concepts from game theory give inspiration to algorithms which try to evaluate the agents or find better solutions in multi-agent systems. Fictitious self-play becomes popular and has a great impact on the algorithm of multi-agent reinforcement learning. Counterfactual regret minimization is an important tool to solve games with incomplete information, and has shown great strength when combined with deep learning.

preprint2020arXiv

Deriving canonical differential equations for Feynman integrals from a single uniform weight integral

Differential equations are a powerful tool for evaluating Feynman integrals. Their solution is straightforward if a transformation to a canonical form is found. In this paper, we present an algorithm for finding such a transformation. This novel technique is based on a method due to Hoschele et al. and relies only on the knowledge of a single integral of uniform transcendental weight. As a corollary, the algorithm can also be used to test the uniform transcendentality of a given integral. We discuss the application to several cutting-edge examples, including non-planar four-loop HQET and non-planar two-loop five-point integrals. A Mathematica implementation of our algorithm is made available together with this paper.

preprint2020arXiv

Subleading Power Resummation of Rapidity Logarithms: The Energy-Energy Correlator in $\mathcal{N}=4$ SYM

We derive and solve renormalization group equations that allow for the resummation of subleading power rapidity logarithms. Our equations involve operator mixing into a new class of operators, which we term the "rapidity identity operators", that will generically appear at subleading power in problems involving both rapidity and virtuality scales. To illustrate our formalism, we analytically solve these equations to resum the power suppressed logarithms appearing in the back-to-back (double light cone) limit of the Energy-Energy Correlator (EEC) in $\mathcal{N}$=4 super-Yang-Mills. These logarithms can also be extracted to $\mathcal{O}(α_s^3)$ from a recent perturbative calculation, and we find perfect agreement to this order. Instead of the standard Sudakov exponential, our resummed result for the subleading power logarithms is expressed in terms of Dawson's integral, with an argument related to the cusp anomalous dimension. We call this functional form "Dawson's Sudakov". Our formalism is widely applicable for the resummation of subleading power rapidity logarithms in other more phenomenologically relevant observables, such as the EEC in QCD, the $p_T$ spectrum for color singlet boson production at hadron colliders, and the resummation of power suppressed logarithms in the Regge limit.

preprint2020arXiv

The full angle-dependence of the four-loop cusp anomalous dimension in QED

The angle-dependent cusp anomalous dimension governs divergences coming from soft gluon exchanges between heavy particles, such as top quarks. We focus on the matter-dependent contributions and compute the first truly non-planar terms. They appear at four loops and are proportional to a quartic Casimir operator in color space. Specializing our general gauge theory result to U(1), we obtain the full QED four-loop angle-dependent cusp anomalous dimension. While more complicated functions appear at intermediate steps, the analytic answer depends only on multiple polylogarithms with singularities at fourth roots of unity. It can be written in terms of four rational structures, and contains functions of up to maximal transcendental weight seven. Despite this complexity, we find that numerically the answer is tantalizingly close to the appropriately rescaled one-loop formula, over most of the kinematic range. We take several limits of our analytic result, which serves as a check and allows us to obtain new, power-suppressed terms. In the anti-parallel lines limit, which corresponds to production of two massive particles at threshold, we find that the subleading power correction vanishes. Finally, we compute the quartic Casimir contribution for scalars in the loop. Taking into account a supersymmetric decomposition, we derive the first non-planar corrections to the quark anti-quark potential in maximally supersymmetric gauge theory.

preprint2020arXiv

The Influence of Elastic Strain on Catalytic Activity Towards the Hydrogen Evolution Reaction

Understanding the role of elastic strain in modifying catalytic reaction rates is crucial for catalyst design, but experimentally, this effect is often coupled with a ligand effect. To isolate the strain effect, we have investigated the influence of externally applied elastic strain on the catalytic activity of metal films towards the hydrogen evolution reaction (HER). We show that elastic strain tunes the catalytic activity in a controlled and predictable way. Both theory and experiment show strain controls reactivity in a controlled manner consistent with the qualitative predictions of the HER volcano plot and the d-band theory: Ni and Pt activity were accelerated by compression, while Cu activity was accelerated by tension. By isolating the elastic strain effect from the ligand effect, this study provides a greater insight into the role of elastic strain in controlling electrocatalytic activity.