Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
17works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

17 published item(s)

preprint2026arXiv

Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning

LLM agents operating over massive, dynamic tool libraries rely on effective retrieval, yet standard single-shot dense retrievers struggle with complex requests. These failures primarily stem from the disconnect between abstract user goals and technical documentation, and the limited capacity of fixed-size embeddings to model combinatorial tool compositions. To address these challenges, we propose TOOLQP, a lightweight framework that models retrieval as iterative query planning. Instead of single-shot matching, TOOLQP decomposes instructions into sub-tasks and dynamically generates queries to interact with the retriever, effectively bridging the semantic gap by targeting the specific sub-tasks required for composition. We train TOOLQP using synthetic query trajectories followed by optimization via Reinforcement Learning with Verifiable Rewards (RLVR). Experiments demonstrate that TOOLQP achieves state-of-the-art performance, exhibiting superior zero-shot generalization, robustness across diverse retrievers, and significant improvements in downstream agentic execution.

preprint2024arXiv

Parallel Spiking Neurons with High Efficiency and Ability to Learn Long-term Dependencies

Vanilla spiking neurons in Spiking Neural Networks (SNNs) use charge-fire-reset neuronal dynamics, which can only be simulated serially and can hardly learn long-time dependencies. We find that when removing reset, the neuronal dynamics can be reformulated in a non-iterative form and parallelized. By rewriting neuronal dynamics without reset to a general formulation, we propose the Parallel Spiking Neuron (PSN), which generates hidden states that are independent of their predecessors, resulting in parallelizable neuronal dynamics and extremely high simulation speed. The weights of inputs in the PSN are fully connected, which maximizes the utilization of temporal information. To avoid the use of future inputs for step-by-step inference, the weights of the PSN can be masked, resulting in the masked PSN. By sharing weights across time-steps based on the masked PSN, the sliding PSN is proposed to handle sequences of varying lengths. We evaluate the PSN family on simulation speed and temporal/static data classification, and the results show the overwhelming advantage of the PSN family in efficiency and accuracy. To the best of our knowledge, this is the first study about parallelizing spiking neurons and can be a cornerstone for the spiking deep learning research. Our codes are available at \url{https://github.com/fangwei123456/Parallel-Spiking-Neuron}.

preprint2024arXiv

Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN Ticket

Spiking Neural Networks (SNNs), known for their biologically plausible architecture, face the challenge of limited performance. The self-attention mechanism, which is the cornerstone of the high-performance Transformer and also a biologically inspired structure, is absent in existing SNNs. To this end, we explore the potential of leveraging both self-attention capability and biological properties of SNNs, and propose a novel Spiking Self-Attention (SSA) and Spiking Transformer (Spikformer). The SSA mechanism eliminates the need for softmax and captures the sparse visual feature employing spike-based Query, Key, and Value. This sparse computation without multiplication makes SSA efficient and energy-saving. Further, we develop a Spiking Convolutional Stem (SCS) with supplementary convolutional layers to enhance the architecture of Spikformer. The Spikformer enhanced with the SCS is referred to as Spikformer V2. To train larger and deeper Spikformer V2, we introduce a pioneering exploration of Self-Supervised Learning (SSL) within the SNN. Specifically, we pre-train Spikformer V2 with masking and reconstruction style inspired by the mainstream self-supervised Transformer, and then finetune the Spikformer V2 on the image classification on ImageNet. Extensive experiments show that Spikformer V2 outperforms other previous surrogate training and ANN2SNN methods. An 8-layer Spikformer V2 achieves an accuracy of 80.38% using 4 time steps, and after SSL, a 172M 16-layer Spikformer V2 reaches an accuracy of 81.10% with just 1 time step. To the best of our knowledge, this is the first time that the SNN achieves 80+% accuracy on ImageNet. The code will be available at Spikformer V2.

preprint2022arXiv

Deep Residual Learning in Spiking Neural Networks

Deep Spiking Neural Networks (SNNs) present optimization difficulties for gradient-based approaches due to discrete binary activation and complex spatial-temporal dynamics. Considering the huge success of ResNet in deep learning, it would be natural to train deep SNNs with residual learning. Previous Spiking ResNet mimics the standard residual block in ANNs and simply replaces ReLU activation layers with spiking neurons, which suffers the degradation problem and can hardly implement residual learning. In this paper, we propose the spike-element-wise (SEW) ResNet to realize residual learning in deep SNNs. We prove that the SEW ResNet can easily implement identity mapping and overcome the vanishing/exploding gradient problems of Spiking ResNet. We evaluate our SEW ResNet on ImageNet, DVS Gesture, and CIFAR10-DVS datasets, and show that SEW ResNet outperforms the state-of-the-art directly trained SNNs in both accuracy and time-steps. Moreover, SEW ResNet can achieve higher performance by simply adding more layers, providing a simple method to train deep SNNs. To our best knowledge, this is the first time that directly training deep SNNs with more than 100 layers becomes possible. Our codes are available at https://github.com/fangwei123456/Spike-Element-Wise-ResNet.

preprint2022arXiv

Dynamical Stability of the Power Law K-essence Dark Energy Model with a New Interaction

We investigate the cosmological evolution of the power law K-essence dark energy (DE) model $F(X)= -\sqrt{X} + X$ with a new interaction $Q = αρ_mρ_{ϕ}H^{-1}$ in FRWL spacetime. The evolution behavior of dark energy under this interaction is analyzed by using dynamical systems method, and ten critical points are obtained. Among those critical points, a new stable point, which we called Scaling-like dark energy(DE) solution, is very important and interesting. The cosmological meaning of this attractor is different from the Scaling solution and dark energy dominated solution. For some value of model parameters, the universe will evolve to the attractor solution with the dark energy density parameter $Ω_ϕ=0.682946$ and the the equation of state $w_ϕ=-0.99$, which can be in good agreement with the observed data, and alleviate the Coincidence Problem.

preprint2022arXiv

General analytical nuclear force and molecular potential energy surface from full configuration interaction quantum Monte Carlo

Full configuration interaction quantum Monte Carlo (FCIQMC) is a state-of-the-art stochastic electronic structure method, providing a methodology to compute FCI-level state energies of molecular systems within a quantum chemical basis. However, especially to probe {\em dynamics} at the FCIQMC level, it is necessary to devise more efficient schemes to produce nuclear forces and potential energy surfaces (PES) from FCIQMC. In this work, we derive the general formula for nuclear force from FCIQMC, and clarify different contributions of the total force. This method to obtain FCIQMC forces eliminates previous restrictions, and can be used with frozen core approximation and free selection of orbitals, making it promising for more efficient nuclear force calculations. After numerical check of this procedure on the binding curve of N$_2$ molecule, we use the FCIQMC energy and force to obtain the full-dimensional ground state PES of water molecule via Gaussian processes regression. The new water FCIQMC PES can be used as the basis for H$_2$O ground state nuclear dynamics, structure optimization, and rotation-vibrational spectrum calculation.

preprint2022arXiv

Quantum tunnelling driven H$_2$ formation on graphene

It is commonly believed that it is unfavourable for adsorbed H atoms on carbonaceous surfaces to form H$_2$ without the help of incident H atoms. Using ring-polymer instanton theory to describe multidimensional tunnelling effects, combined with ab initio electronic structure calculations, we find that these quantum-mechanical simulations reveal a qualitatively different picture. Recombination of adsorbed H atoms, which was believed to be irrelevant at low temperature due to high barriers, is enabled by deep tunnelling, with reaction rates enhanced by tens of orders of magnitude. Furthermore, we identify a new path for H recombination that proceeds via multidimensional tunnelling, but would have been predicted to be unfeasible by a simple one-dimensional description of the reaction. The results suggest that hydrogen molecule formation at low temperatures are rather fast processes that should not be ignored in experimental settings and natural environments with graphene, graphite and other planar carbon segments.

preprint2021arXiv

Both qubits of the singlet state can be steered simultaneously by multiple independent observers via sequential measurement

Quantum correlation is a fundamental property which distinguishes quantum systems from classical ones, and it is also a fragile resource under projective measurement. Recently, it has been shown that a subsystem in entangled pairs can share nonlocality with multiple observers in sequence. Here we present a new steering scenario where both subsystems are accessible by multiple observers. And it is found that the two qubits in singlet state can be simultaneously steered by two sequential observers, respectively.

preprint2021arXiv

Broadband highly efficient nonlinear optical processes in on-chip integrated lithium niobate microdisk resonators of Q-factor above 10^8

We demonstrated broadband highly efficient optical nonlinear processes in on-chip integrated lithium niobate (LN) microdisk resonators. The Q factors of the micro-resonators fabricated by femtosecond laser writing and chemo-mechanical polishing are reliably above 10^8, approaching the intrinsic material absorption limit of LN. Broadband nonlinear processes, including optical parametric oscillation (OPO), second harmonic generation (SHG), third harmonic generation, and fourth harmonic generation, were observed with ultrahigh efficiencies in the same LN microdisk without introducing domain inversion, thanks to the natural quasi phase-matching and the dense spectral modes of the X-cut LN microdisk with millimeter diameter. The threshold of OPO and the absolute conversion efficiency of SHG are 19.6 microwatt and 66%, both surpass the state-of-the-art values among on-chip LN micro-resonators demonstrated so far. The broadband and highly efficient nonlinear frequency conversions achieved with the ultrahigh-Q LN microdisk resonators promise high-density integration of nonlinear photonic devices such as frequency convertors and entangled photon sources.

preprint2021arXiv

Rapid water diffusion at cryogenic temperatures through an inchworm-like mechanism

Water diffusion across the surfaces of materials is of importance to disparate processes such as water purification, ice formation, and more. Despite reports of rapid water diffusion on surfaces the molecular-level details of such processes remain unclear. Here, with scanning tunneling microscopy, we observe structural rearrangements and diffusion of water trimers at unexpectedly low temperatures (< 10 K) on a copper surface; temperatures at which water monomers or other clusters do not diffuse. Density functional theory calculations reveal a facile trimer diffusion process involving transformations between elongated and almost cyclic conformers in an inchworm-like manner. These subtle intermolecular reorientations maintain an optimal balance of hydrogen-bonding and water-surface interactions throughout the process. This work shows that the diffusion of hydrogen-bonded clusters can occur at exceedingly low temperatures without the need for hydrogen bond breakage or exchange; findings that will influence Ostwald ripening of ice nanoclusters and hydrogen bonded clusters in general.

preprint2020arXiv

Deciphering exciton-generation processes in quantum-dot electroluminescence

Electroluminescence (EL) of colloidal nanocrystals promises a new generation of high-performance and solution-processable light-emitting diodes (LEDs). The operation of nanocrystal-based LEDs relies on the recombination of electrically-generated excitons. However, a fundamental question, i.e, how excitons are electrically generated in individual nanocrystals, remains unanswered. Here, we reveal a molecular mechanism of sequential electron-hole injection for the exciton generation in nanocrystal-based EL devices. To decipher the corresponding elementary processes, we develop electrically-pumped single-nanocrystal spectroscopy. While hole injection into neutral quantum dots (QDs) is generally-considered to be inefficient, we find that the intermediate negatively-charged state of QD triggers confinement-enhanced Coulomb interactions, which simultaneously accelerate hole injection and hinder excessive electron injection. In-situ/operando spectroscopy on state-of-the-art QD-LEDs demonstrate that exciton generation at the ensemble level is consistent with the charge-confinement-enabled sequential electron-hole injection mechanism revealed at the single-nanocrystal level. Our findings provide a universal mechanism for enhancing charge balance in nanocrystal-based EL devices.

preprint2020arXiv

Efficient light coupling between an ultra-low loss lithium niobate waveguide and an adiabatically tapered single mode optical fiber

A lithium niobate on insulator ridge waveguide allows constructing high-density photonic integrated circuits thanks to its small bending radius offered by the high index contrast. Meanwhile, the significant mode-field mismatch between an optical fiber and the single-mode lithium niobate waveguide leads to low coupling efficiencies. Here, we demonstrate, both numerically and experimentally, that the problem can be solved with a tapered single mode fiber of an optimized mode field profile. Numerical simulation shows that the minimum coupling losses for the TE and TM mode are 0.32 dB and 0.86 dB, respectively. Experimentally, though without anti-reflection coating, the measured coupling losses for TE and TM mode are 1.32 dB and 1.88 dB, respectively. Our technique paves a way for a broad range of on-chip lithium niobate applications.

preprint2020arXiv

Importance Sampling for Pathwise Sensitivity of Stochastic Chaotic Systems

This paper proposes a new pathwise sensitivity estimator for chaotic SDEs. By introducing a spring term between the original and perturbated SDEs, we derive a new estimator by importance sampling. The variance of the new estimator increases only linearly in time $T,$ compared with the exponential increase of the standard pathwise estimator. We compare our estimator with the Malliavin estimator and extend both of them to the Multilevel Monte Carlo method, which further improves the computational efficiency. Finally, we also consider using this estimator for the SDE with small volatility to approximate the sensitivities of the invariant measure of chaotic ODEs. Furthermore, Richardson-Romberg extrapolation on the volatility parameter gives a more accurate and efficient estimator. Numerical experiments support our analysis.

preprint2020arXiv

Observation of photon antibunching with a single conventional detector

The second-order photon correlation function is of great importance in quantum optics which is typically measured with the Hanbury Brown and Twiss interferometer which employs a pair of single-photon detectors and a dual-channel time acquisition module. Here we demonstrate a new method to measure and extract the second-order correlation function with a standard single-photon avalanche photodiode (dead-time = 22 ns) and a single-channel time acquisition module. This is realized by shifting the informative coincidence counts near the zero-time delay to a time window which is not obliterated by the dead-time and after-pulse of detection system. The new scheme is verified by measuring the second-order correlation from a single colloidal nanocrystal. Photon antibunching is unambiguously observed and agrees well with the result measured using the standard HBT setup. Our scheme simplifies the higher-order correlation technique and might be favored in cost-sensitive circumstances.

preprint2020arXiv

Revisiting nuclear tunnelling in the aqueous ferrous-ferric electron transfer

The aqueous ferrous-ferric system provides a classic example of an electron-transfer process in solution. There has been a long standing argument spanning more than three decades around the importance of nuclear tunnelling in this system, with estimates based on Wolynes theory suggesting a quantum correction factor of 65, while estimates based on a related spin-boson model suggest a smaller factor of 7-36. Recently, we have shown that Wolynes theory can break down for systems with multiple transition states leading to an overestimation of the rate, and we suggest that a liquid system such as the one investigated here may be particularly prone to this. We re-investigate this old yet interesting system with the first application of the recently developed golden-rule quantum transition-state theory (GR-QTST). We find that GR-QTST can be applied to this complex system without apparent difficulties and that it gives a prediction for the quantum rate 6 times smaller than that from Wolynes theory. The fact that these theories give different results suggests that although it is well known that the system can be treated using linear response and therefore resembles a spin-boson model in the classical limit, this approximation is questionable in the quantum case. It also intriguingly suggests the possibility that the previous predictions were overestimating the rate due to a break down of Wolynes theory.

preprint2020arXiv

The color center singlet state of oxygen vacancies in TiO$_2$

Oxygen vacancies are ubiquitous in TiO$_2$ and play key roles in catalysis and magnetism applications.Despite being extensively investigated, the electronic structure of oxygen vacancies in TiO$_2$ remains controversial both experimentally and theoretically.Here we report a study of a neutral oxygen vacancy in TiO$_2$ using state-of-the-art quantum chemical electronic structure methods.We find that the ground state is a color center singlet state in both the rutile and the anatase phase of TiO$_2$. Specifically, embedded CCSD(T) calculations find, for an oxygen vacancy in rutile, that the lowest triplet state energy is 0.6 eV above the singlet state, and in anatase the triplet state energy is higher by 1.4 eV. Our study provides fresh insights on the electronic structure of the oxygen vacancy in TiO$_2$, clarifying earlier controversies and potentially inspiring future studies of defects with correlated wave function theories.

preprint2019arXiv

Nonadiabatic quantum transition-state theory in the golden-rule limit. II. Overcoming the pitfalls of the saddle-point and semiclassical approximations

We describe a path-integral molecular dynamics implementation of our recently developed golden-rule quantum transition-state theory (GR-QTST). The method is applied to compute the reaction rate in various models of electron transfer and benchmarked against exact results. We demonstrate that for systems exhibiting two or more transition states, rates computed using Wolynes theory [P. G. Wolynes, J.\ Chem.\ Phys.\ 87, 6559 (1987)] can be overestimated by orders of magnitude, whereas the GR-QTST predictions are numerically accurate. This is the case both at low temperature, where nuclear tunneling makes a considerable contribution, and also in the classical limit, where only GR-QTST rigorously tends to the correct result. Analysis shows that the saddle-point approximation employed by Wolynes theory is not valid in this case, which results in predictions of unphysical reaction pathways, whilst the energy constraint employed by GR-QTST resolves this problem. The GR-QTST method is also seen to give accurate results for a strongly anharmonic system by sampling configurations around the instanton pathway without making the semiclassical approximation. These promising results indicate that the GR-QTST method could be an efficient and accurate approach for simulating electron-transfer reactions in complex molecular systems.