Source author record

Yan Jiang

Yan Jiang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

quant-ph Machine Learning eess.SY Systems and Control math.AP math.NA physics.optics Artificial Intelligence Computation and Language Computer Vision eess.SP math-ph math.MP math.OC math.SP Networking and Internet Architecture Numerical Analysis physics.atom-ph physics.comp-ph

Catalog footprint

What is connected

21works

19topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Block-R1: Rethinking the Role of Block Size in Multi-domain Reinforcement Learning for Diffusion Large Language Models

Recently, reinforcement learning (RL) has been widely applied during post-training for diffusion large language models (dLLMs) to enhance reasoning with block-wise semi-autoregressive generation. Block size has therefore become a vital factor in dLLMs, since it determines the parallel decoding granularity and affects the rollout trajectories during RL optimisation, e.g., GRPO. Instead of investigating the effect of block size during inference on individual domains, this paper studies block size from a domain conflict perspective for dLLM RL post-training in multi-domain scenarios. The main contributions are: (1) a formulation of domain block size conflict in multi-domain RL for dLLMs, which will largely affect the post-training effectiveness for rollout-based RL methods; (2) a novel dataset, Block-R1-41K is constructed with a best-improved training block size for each sample, which also induces a Block Size Conflict Score to quantitatively measure the domain conflict; (3) a new benchmark, Block-R1, for flexible RL post-training for dLLMs in both single and cross domain; and (4) a simple yet powerful cross-domain post-training method with sample-level best-improved training block sizes. Extensive experiments on 13 distinct datasets, 7 latest RL algorithms and diverse dLLM backbones are comprehensively covered in Block-R1. The benchmark is open-sourced at https://github.com/YanJiangJerry/Block-R1 with the dataset released at https://huggingface.co/datasets/YanJiangJerry/Block-R1-41K.

preprint2026arXiv

Break the Block: Dynamic-size Reasoning Blocks for Diffusion Large Language Models via Monotonic Entropy Descent with Reinforcement Learning

Recent diffusion large language models (dLLMs) have demonstrated both effectiveness and efficiency in reasoning via a block-based semi-autoregressive generation paradigm. Despite their progress, the fixed-size block generations remain a critical bottleneck for effective and coherent reasoning. 1. From a global perspective, different reasoning tasks would correspond to different optimal decoding block sizes, which makes a ``one-size-fits-all'' assumption ineffective. 2. Even within a single reasoning task, the rigid block partitioning would break the logical flow and reduce reasoning coherence. Through empirical observations, we reveal that for block-wise entropy, incorrect reasoning exhibits a fluctuating and unsteady trend between blocks, whereas the correctly generated tasks follow a consistent descending trend. Therefore, this paper proposes b1, a novel post-training framework for dLLMs that learns dynamic-size reasoning blocks via a Monotonic Entropy Descent objective with reinforcement learning to enhance reasoning coherence.b1 integrates seamlessly as a plug-and-play module with existing dLLM's post-training algorithms. Extensive experiments across various reasoning benchmarks showcase b1's consistent improvement over existing fixed-size block baselines. Our code has been released at https://github.com/YanJiangJerry/Block-R1.

preprint2026arXiv

GFMate: Empowering Graph Foundation Models with Test-time Prompt Tuning

Graph prompt tuning has shown great potential in graph learning by introducing trainable prompts to enhance the model performance in conventional single-domain scenarios. Recent research has extended graph prompts to improve Graph Foundation Models (GFMs) by few-shot tuning auxiliary prompts. Despite their progress, most existing methods embed source-domain information into prompts, which serve either as input to GFMs or encoded during model pre-training. Such prompt entanglement with specific source domains and GFM pre-training strategy restricts their generalisability to other domains and different GFMs. Furthermore, existing GFM prompts merely rely on few-shot tuning for adaptation, neglecting the rich information in unlabelled target domain test data. Motivated by these insights, this paper aims to empower GFMs with pre-training-agnostic test-time graph prompt tuning, named GFMate. GFMate introduces centroid and layer prompts applied after pre-training on target domains, avoiding entanglement with specific source domains and model pre-training. In addition, a test-time complementary learning objective is devised to exploit both labelled and unlabelled target domain data for effective test-time prompt tuning. Extensive experiments on 12 benchmark datasets demonstrate the superior performance and efficiency of GFMate, achieving improvements of up to 30.63%. Code is available at https://github.com/YanJiangJerry/GFMate.

preprint2026arXiv

Hierarchical Attacks for Multi-Modal Multi-Agent Reasoning

Multi-modal multi-agent systems (MM-MAS) have gained increasing attention for their capacity to enable complex reasoning and coordination across diverse modalities. As these systems continue to expand in scale and functionality, investigating their potential vulnerabilities has become increasingly important. However, existing studies on adversarial attacks in multi-agent systems primarily focus on isolated agents or unimodal settings, leaving the vulnerabilities of MM-MAS largely underexplored. To bridge this gap, we introduce HAM$^{3}$, a Hierarchical Attack framework for multi-modal multi-agent systems that decomposes attacks into three interconnected layers. Specifically, at the perception layer, HAM$^{3}$ mounts attacks by perturbing visual inputs, textual inputs, and their fused visual-textual representations. At the communication layer, it performs communication-level attacks that corrupt message content and interaction topology, such as manipulating shared context or communication links to distort collective information flow. At the reasoning layer, it conducts reasoning-level attacks that interfere with each agent's cognitive pipeline, biasing reasoning trajectories and ultimately compromising final decisions. We evaluate HAM$^{3}$ on the GQA benchmark through multi-agent systems built on distinct reasoning paradigms including ReAct, Plan-and-Solve, and Reflexion. Experiments demonstrate that our framework achieves an Attack Success Rate of up to 78.3%, with reasoning-layer attacks being the most effective. More than half of the successful attacks lead multiple agents to produce consistent errors. These findings offer valuable insights for building more robust and interpretable multi-agent intelligence.

preprint2026arXiv

Inverse Spectral Problem With Low Regularity Refractive Index

This article investigates the unique determination of a radial refractive index n from spectral data. First, we demonstrate that for piecewise twice continuously differentiable functions, n is not uniquely determined by the special transmission eigenvalues associated with radially symmetric eigenfunctions. Subsequently we prove that if n \in M is twice continuously differentiable functions(or continuously differentiable functions with Lipschitz continuous derivative), then n is uniquely determined on [0,1] by all special transmission eigenvalues when supplemented by partial a priori information on the refractive index.

preprint2024arXiv

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

Large-scale text-to-image diffusion models achieve unprecedented success in image generation and editing. However, how to extend such success to video editing is unclear. Recent initial attempts at video editing require significant text-to-video data and computation resources for training, which is often not accessible. In this work, we propose vid2vid-zero, a simple yet effective method for zero-shot video editing. Our vid2vid-zero leverages off-the-shelf image diffusion models, and doesn't require training on any video. At the core of our method is a null-text inversion module for text-to-video alignment, a cross-frame modeling module for temporal consistency, and a spatial regularization module for fidelity to the original video. Without any training, we leverage the dynamic nature of the attention mechanism to enable bi-directional temporal modeling at test time. Experiments and analyses show promising results in editing attributes, subjects, places, etc., in real-world videos. Code is made available at \url{https://github.com/baaivision/vid2vid-zero}.

preprint2022arXiv

Boundary localization of transmission eigenfunctions in spherically stratified media

Consider the transmission eigenvalue problem for $u \in H^1(Ω)$ and $v\in H^1(Ω)$ associated with $(Ω; σ, \mathbf{n}^2)$, where $Ω$ is a ball in $\mathbb{R}^N$, $N=2,3$. If $σ$ and $\mathbf{n}$ are both radially symmetric, namely they are functions of the radial parameter $r$ only, we show that there exists a sequence of transmission eigenfunctions $\{u_m, v_m\}_{m\in\mathbb{N}}$ associated with $k_m\rightarrow+\infty$ as $m\rightarrow+\infty$ such that the $L^2$-energies of $v_m$'s are concentrated around $\partialΩ$. If $σ$ and $\mathbf{n}$ are both constant, we show the existence of transmission eigenfunctions $\{u_j, v_j\}_{j\in\mathbb{N}}$ such that both $u_j$ and $v_j$ are localized around $\partialΩ$. Our results extend the recent studies in [15,16]. Through numerics, we also discuss the effects of the medium parameters, namely $σ$ and $\mathbf{n}$, on the geometric patterns of the transmission eigenfunctions.

preprint2022arXiv

Few-Shot Stance Detection via Target-Aware Prompt Distillation

Stance detection aims to identify whether the author of a text is in favor of, against, or neutral to a given target. The main challenge of this task comes two-fold: few-shot learning resulting from the varying targets and the lack of contextual information of the targets. Existing works mainly focus on solving the second issue by designing attention-based models or introducing noisy external knowledge, while the first issue remains under-explored. In this paper, inspired by the potential capability of pre-trained language models (PLMs) serving as knowledge bases and few-shot learners, we propose to introduce prompt-based fine-tuning for stance detection. PLMs can provide essential contextual information for the targets and enable few-shot learning via prompts. Considering the crucial role of the target in stance detection task, we design target-aware prompts and propose a novel verbalizer. Instead of mapping each label to a concrete word, our verbalizer maps each label to a vector and picks the label that best captures the correlation between the stance and the target. Moreover, to alleviate the possible defect of dealing with varying targets with a single hand-crafted prompt, we propose to distill the information learned from multiple prompts. Experimental results show the superior performance of our proposed model in both full-data and few-shot scenarios.

preprint2022arXiv

On the Performance of Data Compression in Clustered Fog Radio Access Networks

The fog-radio-access-network (F-RAN) has been proposed to address the strict latency requirements, which offloads computation tasks generated in user equipments (UEs) to the edge to reduce the processing latency. However, it incorporates the task transmission latency, which may become the bottleneck of latency requirements. Data compression (DC) has been considered as one of the promising techniques to reduce the transmission latency. By compressing the computation tasks before transmitting, the transmission delay is reduced due to the shrink transmitted data size, and the original computing task can be retrieved by employing data decompressing (DD) at the edge nodes or the centre cloud. Nevertheless, the DC and DD incorporate extra processing latency, and the latency performance has not been investigated in the large-scale DC-enabled F-RAN. Therefore, in this work, the successful data compression probability (SDCP) is defined to analyse the latency performance of the F-RAN. Moreover, to analyse the effect of compression offloading ratio (COR), a novel hybrid compression mode is proposed based on the queueing theory. Based on this, the closed-form result of SDCP in the large-scale DC-enabled F-RAN is derived by combining the Matern cluster process and M/G/1 queueing model, and validated by Monte Carlo simulations. Based on the derived SDCP results, the effects of COR on the SDCP is analysed numerically. The results show that the SDCP with the optimal COR can be enhanced with a maximum value of 0.3 and 0.55 as compared with the cases of compressing all computing tasks at the edge and at the UE, respectively. Moreover, for the system requiring the minimal latency, the proposed hybrid compression mode can alleviate the requirement on the backhaul capacity.

preprint2020arXiv

Free-stream preserving finite difference schemes for ideal magnetohydrodynamics on curvilinear meshes

In this paper, a high order free-stream preserving finite difference weighted essentially non-oscillatory (WENO) scheme is developed for the ideal magnetohydrodynamic (MHD) equations on curvilinear meshes. Under the constrained transport framework, magnetic potential evolved by a Hamilton-Jacobi (H-J) equation is introduced to control the divergence error. In this work, we use the alternative formulation of WENO scheme [10] for the nonlinear hyperbolic conservation law, and design a novel method to solve the magnetic potential. Theoretical derivation and numerical results show that the scheme can preserve free-stream solutions of MHD equations, and reduce error more effectively than the standard finite difference WENO schemes for such problems.

preprint2020arXiv

Grid-forming frequency shaping control

As power systems transit to a state of high renewable penetration, little or no presence of synchronous generators makes the prerequisite of well-regulated frequency for grid-following inverters unrealistic. Thus, there is a trend to resort to grid-forming inverters which set frequency directly. We propose a novel grid-forming frequency shaping control that is able to shape the aggregate system frequency dynamics into a first-order one with the desired steady-state frequency deviation and Rate of Change of Frequency (RoCoF) after a sudden power imbalance. The no overshoot property resulting from the first-order dynamics allows the system frequency to monotonically move towards its new steady-state without experiencing frequency Nadir, which largely improves frequency security. We prove that our grid-forming frequency-shaping control renders the system internally stable under mild assumptions. The performance of the proposed control is verified via numerical simulations on a modified Icelandic Power Network test case.

preprint2020arXiv

Modified Gaussian Process Regression Models for Cyclic Capacity Prediction of Lithium-ion Batteries

This paper presents the development of machine learning-enabled data-driven models for effective capacity predictions for lithium-ion batteries under different cyclic conditions. To achieve this, a model structure is first proposed with the considerations of battery ageing tendency and the corresponding operational temperature and depth-of-discharge. Then based on a systematic understanding of covariance functions within the Gaussian process regression, two related data-driven models are developed. Specifically, by modifying the isotropic squared exponential kernel with an automatic relevance determination structure, 'Model A' could extract the highly relevant input features for capacity predictions. Through coupling the Arrhenius law and a polynomial equation into a compositional kernel, 'Model B' is capable of considering the electrochemical and empirical knowledge of battery degradation. The developed models are validated and compared on the Nickel Manganese Cobalt Oxide (NMC) lithium-ion batteries with various cycling patterns. Experimental results demonstrate that the modified Gaussian process regression model considering the battery electrochemical and empirical ageing signature outperforms other counterparts and is able to achieve satisfactory results for both one-step and multi-step predictions. The proposed technique is promising for battery capacity predictions under various cycling cases.

preprint2020arXiv

Storage-Based Frequency Shaping Control

With the decrease in system inertia, frequency security becomes an issue for power systems around the world. Energy storage systems (ESS), due to their excellent ramping capabilities, are considered as a natural choice for the improvement of frequency response following major contingencies. In this manuscript, we propose a new strategy for energy storage -- frequency shaping control -- that allows to completely eliminate the frequency Nadir, one of the main issue in frequency security, and at the same time tune the rate of change of frequency (RoCoF) to a desired value. With Nadir eliminated, the frequency security assessment can be performed via simple algebraic calculations, as opposed to dynamic simulations for conventional control strategies. Moreover, our proposed control is also very efficient in terms of the requirements on storage peak power, requiring up to 40% less power than conventional virtual inertia approach for the same performance.

preprint2018arXiv

Robust Decentralized Secondary Frequency Control in Power Systems: Merits and Trade-Offs

Frequency restoration in power systems is conventionally performed by broadcasting a centralized signal to local controllers. As a result of the energy transition, technological advances, and the scientific interest in distributed control and optimization methods, a plethora of distributed frequency control strategies have been proposed recently that rely on communication amongst local controllers. In this paper we propose a fully decentralized leaky integral controller for frequency restoration that is derived from a classic lag element. We study steady-state, asymptotic optimality, nominal stability, input-to-state stability, noise rejection, transient performance, and robustness properties of this controller in closed loop with a nonlinear and multivariable power system model. We demonstrate that the leaky integral controller can strike an acceptable trade-off between performance and robustness as well as between asymptotic disturbance rejection and transient convergence rate by tuning its DC gain and time constant. We compare our findings to conventional decentralized integral control and distributed-averaging-based integral control in theory and simulations.

preprint2016arXiv

A WENO-based Method of Line Transpose Approach for Vlasov Simulations

In this paper, a high order implicit Method of Line Transpose (MOL$^T$ ) method based on a weighted essentially non-oscillatory (WENO) methodology is developed for one-dimensional linear transport equations and further applied to the Vlasov-Poisson (VP) simulations via dimensional splitting. In the MOL$^T$ framework, the time variable is first discretized by a diagonally implicit strong-stability-preserving Runge-Kutta method, resulting in a boundary value problem (BVP) at the discrete time levels. Then an integral formulation coupled with a high order WENO methodology is employed to solve the BVP. As a result, the proposed scheme is high order accurate in both space and time and free of oscillations even though the solution is discontinuous or has sharp gradients. Moreover, the scheme is able to take larger time step evolution compared with an explicit MOL WENO scheme with the same order of accuracy. The desired positivity-preserving (PP) property of the scheme is further attained by incorporating a newly proposed high order PP limiter. We perform numerical experiments on several benchmarks including linear advection, solid body rotation problem; and on the Landau damping, two-stream instabilities, bump-on-tail, and plasma sheath by solving the VP system. The efficacy and efficiency of the proposed scheme is numerically verified.

preprint2015arXiv

Arbitrary Rotation of a Single Spinwave Qubit in an Atomic-Ensemble Quantum Memory

We report the first experimental realization of single-qubit manipulation for single spinwaves stored in an atomic ensemble quantum memory. In order to have high-fidelity gate operations, we make use of stimulated Raman transition and controlled Lamor precession jointly. We characterize the gate performances with quantum state tomography and quantum process tomography, both of which imply that high-fidelity operations have been achieved. Our work complements the experimental toolbox of atomic-ensemble quantum memories by adding the capability of single-qubit manipulation, thus may have important applications in future scalable quantum networks.

preprint2015arXiv

Experimental demonstration of quantum lithography beyond diffraction limit via Rabi oscillations

Diffraction of light sets the fundamental limit for optical lithography. Many quantum lithography schemes have so far been proposed to overcome this limit either by making use of highly entangled photons, multi-photon processes or multiple Lambda transitions, which are all experimentally high-demanding. Recently, Liao et al. proposed a novel quantum lithography scheme which merely employs Rabi oscillation to surpass the diffraction limit. Here we report a faithful experimental realization of this scheme. Resolution up to ninth of the Rayleigh diffraction limit has been observed. Possibility of creating an arbitrary pattern is also tested experimentally by demonstrating the peak narrowing process using several Rabi floppings together with state-selective optical depletion. Our work may have direct applications in atom pattern engineering for quantum information or quantum simulation applications, and will also possibly boost the adoption of quantum lithography into real-world applications in the near future.

preprint2015arXiv

Operating Spin Echo in the Quantum Regime for an Atomic-Ensemble Quantum Memory

Spin echo is a powerful technique to extend atomic or nuclear coherence time by overcoming the dephasing due to inhomogeneous broadening. However, applying this technique to an ensemble-based quantum memory at single-quanta level remains challenging. In our experimental study we find that noise due to imperfection of the rephasing pulses is highly directional. By properly arranging the beam directions and optimizing the pulse fidelities, we have successfully managed to operate the spin echo technique in the quantum regime and observed nonclassical photon-photon correlations. In comparison to the case without applying the rephasing pulses, quantum memory lifetime is extended by 5 folds. Our work for the first time demonstrates the feasibility of harnessing the spin echo technique to extend lifetime of ensemble-based quantum memories at single-quanta level.

preprint2014arXiv

Self-organized Voids Revisited: Experimental Verification of the Formation Mechanism*

In this paper, several experiments were conducted to further clarify the formation mechanism of self organized void array induced by a single laser beam, including energy-related experiments, refractive-index-contrast-related experiments, depth-related experiments and effective-numerical-aperture experiment. These experiments indicate that the interface spherical aberration is indeed responsible for the formation of void arrays.

preprint2013arXiv

Quantum teleportation and entanglement distribution over 100-kilometre free-space channels

A long standing goal for quantum communication is to transfer a quantum state over arbitrary distances. Free-space quantum communication provides a promising solution towards this challenging goal. Here, through a 97-km free space channel, we demonstrate long distance quantum teleportation over a 35-53 dB loss one-link channel, and entanglement distribution over a 66-85 dB high-loss two-link channel. We achieve an average fidelity of {80.4(9)}% for teleporting six distinct initial states and observe the violation of the Clauser-Horne-Shimony-Holt inequality after distributing entanglement. Besides being of fundamental interest, our result represents a significant step towards a global quantum network. Moreover, the high-frequency and high-accuracy acquiring, pointing and tracking technique developed in our experiment provides an essential tool for future satellite-based quantum communication.

preprint2012arXiv

Implementing the Deutsch's algorithm with spin-orbital angular momentum of photon without interferometer

Deutsch's algorithm is the simplest quantum algorithm which shows the acceleration of quantum computer. In this paper, we theoretically advance a scheme to implement quantum Deutsch's algorithm in spin-orbital angular momentum space. Our scheme exploits a newly developed optical device called "q-plate", which can couple and manipulate the spin-orbital angular momentum simultaneously. This experimental setup is of high stability and efficiency theoretically for there is no interferometer in it.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint

Fields this researcher appears in

Source provenance

Where this author record came from

arxivconfidence 95%

external id: arxiv:2605.02263:author:1:yan-jiang

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.11726:author:1:yan-jiang

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.14809:author:1:yan-jiang

Imported May 20, 2026Synced May 21, 2026

arxivconfidence 95%

external id: arxiv:2605.13213:author:3:yan-jiang

Imported May 20, 2026Synced May 20, 2026

4 works

Jian-Wei Pan

Researcher

Jian-Wei Pan contributes to research discovery and scholarly infrastructure.

Open to collaborate

3 works

Bo Zhao

Researcher

Bo Zhao contributes to research discovery and scholarly infrastructure.

Open to collaborate

3 works

Enrique Mallada

Researcher

Enrique Mallada contributes to research discovery and scholarly infrastructure.

Open to collaborate

3 works

Jun Rui

Researcher

Jun Rui contributes to research discovery and scholarly infrastructure.

Open to collaborate

Yan Jiang

What is connected

Connect this record

See the researcher in context

Building this map preview

21 published item(s)

Block-R1: Rethinking the Role of Block Size in Multi-domain Reinforcement Learning for Diffusion Large Language Models

Break the Block: Dynamic-size Reasoning Blocks for Diffusion Large Language Models via Monotonic Entropy Descent with Reinforcement Learning

GFMate: Empowering Graph Foundation Models with Test-time Prompt Tuning

Hierarchical Attacks for Multi-Modal Multi-Agent Reasoning

Inverse Spectral Problem With Low Regularity Refractive Index

Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models

Boundary localization of transmission eigenfunctions in spherically stratified media

Few-Shot Stance Detection via Target-Aware Prompt Distillation

On the Performance of Data Compression in Clustered Fog Radio Access Networks

Free-stream preserving finite difference schemes for ideal magnetohydrodynamics on curvilinear meshes

Grid-forming frequency shaping control

Modified Gaussian Process Regression Models for Cyclic Capacity Prediction of Lithium-ion Batteries

Storage-Based Frequency Shaping Control

Robust Decentralized Secondary Frequency Control in Power Systems: Merits and Trade-Offs

A WENO-based Method of Line Transpose Approach for Vlasov Simulations

Arbitrary Rotation of a Single Spinwave Qubit in an Atomic-Ensemble Quantum Memory

Experimental demonstration of quantum lithography beyond diffraction limit via Rabi oscillations

Operating Spin Echo in the Quantum Regime for an Atomic-Ensemble Quantum Memory

Self-organized Voids Revisited: Experimental Verification of the Formation Mechanism*

Quantum teleportation and entanglement distribution over 100-kilometre free-space channels

Implementing the Deutsch's algorithm with spin-orbital angular momentum of photon without interferometer