Source author record

Xiaojun Jia

Xiaojun Jia appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

quant-ph Computer Vision Cryptography and Security Machine Learning Artificial Intelligence Multimedia physics.optics

Catalog footprint

What is connected

22works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Adversarial Attacks Against MLLMs via Progressive Resolution Processing and Adaptive Feature Alignment

Adversarial perturbations can mislead Multimodal Large Language Models (MLLMs) recognize a benign image as a specific target object, posing serious risks in safety-critical scenarios such as autonomous driving and medical diagnosis. This makes transfer-based targeted attacks crucial for understanding and improving black-box MLLM robustness. Existing transfer-based targeted attack methods typically rely on the final global features of the surrogate encoder and anchor optimization to original-resolution target crops, leading to their limited transferability and robustness. To address these challenges, we propose Progressive Resolution Processing and Adaptive Feature Alignment (PRAF-Attack), a targeted transfer-based attack framework that integrates multi-scale global semantic guidance with robust intermediate-layer local alignment. Unlike prior methods that align only the surrogate encoder's final layer, we design an adaptive feature alignment strategy that leverages intermediate representations to enhance transferability. Specifically, we introduce an adaptive intermediate layer selection mechanism to identify transferable hierarchical features across surrogate ensembles via gradient consistency, along with an adaptive patch-level optimization strategy that preserves highly correlated local regions through efficient patch filtering. To overcome the reliance on fixed original-resolution target crops, we propose a progressive resolution processing strategy that gradually refines optimization from coarse to fine, enabling the attack to better exploit target information at multiple scales and achieve stronger transferability. We evaluate PRAF-Attack on a diverse suite of black-box MLLMs, including six open-source models and six closed-source commercial APIs. Compared with seven state-of-the-art targeted attack baselines, the proposed PRAF-Attack consistently achieves superior transferability.

preprint2026arXiv

AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin

Fine-tuning large language models (LLMs) improves performance but introduces critical safety vulnerabilities: even minimal harmful data can severely compromise safety measures. We observe that perturbations orthogonal to the alignment direction - defined by weight differences between aligned (safe) and unaligned models - rapidly compromise model safety. In contrast, updates along the alignment direction largely preserve it, revealing the parameter space as a "narrow safety basin". To address this, we propose AsFT (Anchoring Safety in Fine-Tuning) to maintain safety by explicitly constraining update directions during fine-tuning. By penalizing updates orthogonal to the alignment direction, AsFT effectively constrains the model within the "narrow safety basin," thus preserving its inherent safety. Extensive experiments on multiple datasets and models show that AsFT reduces harmful behaviors by up to 7.60%, improves task performance by 3.44%, and consistently outperforms existing methods across multiple tasks.

preprint2026arXiv

Knowledge-Driven Multi-Turn Jailbreaking on Large Language Models

Large Language Models (LLMs) face a significant threat from multi-turn jailbreak attacks, where adversaries progressively steer conversations to elicit harmful outputs. However, the practical effectiveness of existing attacks is undermined by several critical limitations: they struggle to maintain a coherent progression over long interactions, often losing track of what has been accomplished and what remains to be done; they rely on rigid or pre-defined patterns, and fail to adapt to the LLM's dynamic and unpredictable conversational state. To address these shortcomings, we introduce Mastermind, a multi-turn jailbreak framework that adopts a dynamic and self-improving approach. Mastermind operates in a closed loop of planning, execution, and reflection, enabling it to autonomously build and refine its knowledge of model vulnerabilities through interaction. It employs a hierarchical planning architecture that decouples high-level attack objectives from low-level tactical execution, ensuring long-term focus and coherence. This planning is guided by a knowledge repository that autonomously discovers and refines effective attack patterns by reflecting on interactive experiences. Mastermind leverages this accumulated knowledge to dynamically recombine and adapt attack vectors, dramatically improving both effectiveness and resilience. We conduct comprehensive experiments against state-of-the-art models, including GPT-5 and Claude 3.7 Sonnet. The results demonstrate that Mastermind significantly outperforms existing baselines, achieving substantially higher attack success rates and harmfulness ratings. Moreover, our framework exhibits notable resilience against multiple advanced defense mechanisms.

preprint2023arXiv

Does Few-shot Learning Suffer from Backdoor Attacks?

The field of few-shot learning (FSL) has shown promising results in scenarios where training data is limited, but its vulnerability to backdoor attacks remains largely unexplored. We first explore this topic by first evaluating the performance of the existing backdoor attack methods on few-shot learning scenarios. Unlike in standard supervised learning, existing backdoor attack methods failed to perform an effective attack in FSL due to two main issues. Firstly, the model tends to overfit to either benign features or trigger features, causing a tough trade-off between attack success rate and benign accuracy. Secondly, due to the small number of training samples, the dirty label or visible trigger in the support set can be easily detected by victims, which reduces the stealthiness of attacks. It seemed that FSL could survive from backdoor attacks. However, in this paper, we propose the Few-shot Learning Backdoor Attack (FLBA) to show that FSL can still be vulnerable to backdoor attacks. Specifically, we first generate a trigger to maximize the gap between poisoned and benign features. It enables the model to learn both benign and trigger features, which solves the problem of overfitting. To make it more stealthy, we hide the trigger by optimizing two types of imperceptible perturbation, namely attractive and repulsive perturbation, instead of attaching the trigger directly. Once we obtain the perturbations, we can poison all samples in the benign support set into a hidden poisoned support set and fine-tune the model on it. Our method demonstrates a high Attack Success Rate (ASR) in FSL tasks with different few-shot learning paradigms while preserving clean accuracy and maintaining stealthiness. This study reveals that few-shot learning still suffers from backdoor attacks, and its security should be given attention.

preprint2022arXiv

Boosting Fast Adversarial Training with Learnable Adversarial Initialization

Adversarial training (AT) has been demonstrated to be effective in improving model robustness by leveraging adversarial examples for training. However, most AT methods are in face of expensive time and computational cost for calculating gradients at multiple steps in generating adversarial examples. To boost training efficiency, fast gradient sign method (FGSM) is adopted in fast AT methods by calculating gradient only once. Unfortunately, the robustness is far from satisfactory. One reason may arise from the initialization fashion. Existing fast AT generally uses a random sample-agnostic initialization, which facilitates the efficiency yet hinders a further robustness improvement. Up to now, the initialization in fast AT is still not extensively explored. In this paper, we boost fast AT with a sample-dependent adversarial initialization, i.e., an output from a generative network conditioned on a benign image and its gradient information from the target network. As the generative network and the target network are optimized jointly in the training phase, the former can adaptively generate an effective initialization with respect to the latter, which motivates gradually improved robustness. Experimental evaluations on four benchmark databases demonstrate the superiority of our proposed method over state-of-the-art fast AT methods, as well as comparable robustness to advanced multi-step AT methods. The code is released at https://github.com//jiaxiaojunQAQ//FGSM-SDI.

preprint2022arXiv

High-performance cavity-enhanced quantum memory with warm atomic cell

High-performance quantum memory for quantized states of light is a prerequisite building block of quantum information technology. Despite great progresses of optical quantum memories based on interactions of light and atoms, physical features of these memories still cannot satisfy requirements for applications in practical quantum information systems, since all of them suffer from trade-off between memory efficiency and excess noise. Here, we report a high-performance cavity-enhanced electromagnetically-induced-transparency memory with warm atomic cell in which a scheme of optimizing the spatial and temporal modes based on the time-reversal approach is applied. The memory efficiency up to 67% is directly measured and a noise level close to quantum noise limit is simultaneously reached. It has been experimentally demonstrated that the average fidelities for a set of input coherent states with different phases and amplitudes within a Gaussian distribution have exceeded the classical benchmark fidelities. Thus the realized quantum memory platform has been capable of preserving quantized optical states, and is ready to be applied in quantum information systems, such as distributed quantum logic gates and quantum-enhanced atomic magnetometry.

preprint2022arXiv

LAS-AT: Adversarial Training with Learnable Attack Strategy

Adversarial training (AT) is always formulated as a minimax problem, of which the performance depends on the inner optimization that involves the generation of adversarial examples (AEs). Most previous methods adopt Projected Gradient Decent (PGD) with manually specifying attack parameters for AE generation. A combination of the attack parameters can be referred to as an attack strategy. Several works have revealed that using a fixed attack strategy to generate AEs during the whole training phase limits the model robustness and propose to exploit different attack strategies at different training stages to improve robustness. But those multi-stage hand-crafted attack strategies need much domain expertise, and the robustness improvement is limited. In this paper, we propose a novel framework for adversarial training by introducing the concept of "learnable attack strategy", dubbed LAS-AT, which learns to automatically produce attack strategies to improve the model robustness. Our framework is composed of a target network that uses AEs for training to improve robustness and a strategy network that produces attack strategies to control the AE generation. Experimental evaluations on three benchmark databases demonstrate the superiority of the proposed method. The code is released at https://github.com/jiaxiaojunQAQ/LAS-AT.

preprint2022arXiv

Prior-Guided Adversarial Initialization for Fast Adversarial Training

Fast adversarial training (FAT) effectively improves the efficiency of standard adversarial training (SAT). However, initial FAT encounters catastrophic overfitting, i.e.,the robust accuracy against adversarial attacks suddenly and dramatically decreases. Though several FAT variants spare no effort to prevent overfitting, they sacrifice much calculation cost. In this paper, we explore the difference between the training processes of SAT and FAT and observe that the attack success rate of adversarial examples (AEs) of FAT gets worse gradually in the late training stage, resulting in overfitting. The AEs are generated by the fast gradient sign method (FGSM) with a zero or random initialization. Based on the observation, we propose a prior-guided FGSM initialization method to avoid overfitting after investigating several initialization strategies, improving the quality of the AEs during the whole training process. The initialization is formed by leveraging historically generated AEs without additional calculation cost. We further provide a theoretical analysis for the proposed initialization method. We also propose a simple yet effective regularizer based on the prior-guided initialization,i.e., the currently generated perturbation should not deviate too much from the prior-guided initialization. The regularizer adopts both historical and current adversarial perturbations to guide the model learning. Evaluations on four datasets demonstrate that the proposed method can prevent catastrophic overfitting and outperform state-of-the-art FAT methods. The code is released at https://github.com/jiaxiaojunQAQ/FGSM-PGI.

preprint2022arXiv

Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal

As a common security tool, visible watermarking has been widely applied to protect copyrights of digital images. However, recent works have shown that visible watermarks can be removed by DNNs without damaging their host images. Such watermark-removal techniques pose a great threat to the ownership of images. Inspired by the vulnerability of DNNs on adversarial perturbations, we propose a novel defence mechanism by adversarial machine learning for good. From the perspective of the adversary, blind watermark-removal networks can be posed as our target models; then we actually optimize an imperceptible adversarial perturbation on the host images to proactively attack against watermark-removal networks, dubbed Watermark Vaccine. Specifically, two types of vaccines are proposed. Disrupting Watermark Vaccine (DWV) induces to ruin the host image along with watermark after passing through watermark-removal networks. In contrast, Inerasable Watermark Vaccine (IWV) works in another fashion of trying to keep the watermark not removed and still noticeable. Extensive experiments demonstrate the effectiveness of our DWV/IWV in preventing watermark removal, especially on various watermark removal networks.

preprint2020arXiv

Adv-watermark: A Novel Watermark Perturbation for Adversarial Examples

Recent research has demonstrated that adding some imperceptible perturbations to original images can fool deep learning models. However, the current adversarial perturbations are usually shown in the form of noises, and thus have no practical meaning. Image watermark is a technique widely used for copyright protection. We can regard image watermark as a king of meaningful noises and adding it to the original image will not affect people's understanding of the image content, and will not arouse people's suspicion. Therefore, it will be interesting to generate adversarial examples using watermarks. In this paper, we propose a novel watermark perturbation for adversarial examples (Adv-watermark) which combines image watermarking techniques and adversarial example algorithms. Adding a meaningful watermark to the clean images can attack the DNN models. Specifically, we propose a novel optimization algorithm, which is called Basin Hopping Evolution (BHE), to generate adversarial watermarks in the black-box attack mode. Thanks to the BHE, Adv-watermark only requires a few queries from the threat models to finish the attacks. A series of experiments conducted on ImageNet and CASIA-WebFace datasets show that the proposed method can efficiently generate adversarial examples, and outperforms the state-of-the-art attack methods. Moreover, Adv-watermark is more robust against image transformation defense methods.

preprint2020arXiv

Quantum enhanced optical phase estimation with a squeezed thermal state

Quantum phase estimation protocols can provide a measuring method of phase shift with precision superior to standard quantum limit (SQL) due to the application of a nonclassical state of light. A squeezed vacuum state, whose variance in one quadrature is lower than the corresponding SQL, has been pointed out a sensitive resource for quantum phase estimation and the estimation accuracy is directly influenced by the properties of the squeezed state. Here we detailedly analyze the influence of the purity and squeezing level of the squeezed state on the accuracy of quantum phase estimation. The maximum precision that can be achieved for a squeezed thermal state is evaluated, and the experimental results are in agreement with the theoretical analyses. It is also found that the width of the phase estimation interval $Δθ$ beyond SQL is correlated with the purity of the squeezed state.

preprint2020arXiv

Quantum interferometer combining squeezing and parametric amplification

High precision interferometers are the building blocks of precision metrology and the ultimate interferometric sensitivity is limited by the quantum noise. Here we propose and experimentally demonstrate a compact quantum interferometer involving two optical parametric amplifiers and the squeezed states generated within the interferometer are directly used for the phase-sensing quantum state. By both squeezing shot noise and amplifying phase-sensing intensity the sensitivity improvement of $4.86\pm 0.24$ dB beyond the standard quantum limit is deterministically realized and a minimum detectable phase smaller than that of all present interferometers under the same phase-sensing intensity is achieved. This interferometric system has significantly potential applications in a variety of measurements for tiny variances of physical quantities.

preprint2015arXiv

Experimental generation of 8.4 dB entangled state with an optical cavity involving a wedged type-II nonlinear crystal

Entangled state of light is one of the essential quantum resources in quantum information science and technology. Especially, when the fundamental principle experiments have been achieved in labs and the applications of continuous variable quantum information in the real world are considered, it is crucial to design and construct the generation devices of entangled states with high entanglement and compact configuration. We have designed and built an efficient and compact light source of entangled state, which is a non-degenerate optical parametric amplifier (NOPA) with the triple resonance of the pump and two subharmonic modes. A wedged type-II KTP crystal inside the NOPA is used for implementing frequency-down-conversion of the pump field to generate the optical entangled state and achieving the dispersion compensation between the pump and the subharmonic waves. The EPR entangled state of light with quantum correlations of 8.4 dB for both amplitude and phase quadratures are experimentally produced by a single NOPA under the pump power of 75 mW.

preprint2014arXiv

Gates for one-way quantum computation based on Einstein-Podolsky-Rosen entanglement

Single-mode squeezing and Fourier transformation operations are two essential logical gates in continuous-variable quantum computation, which have been experimentally implemented by means of an optical four-mode cluster state. In this paper, we present a simpler and more efficient protocol based on the use of Einstein-Podolsky-Rosen two-mode entangled states to realize the same operations. The theoretical calculations and the experimental results demonstrate that the presented scheme not only decreases the requirement to the resource quantum states at the largest extent but also enhances significantly the squeezing degree and the fidelity of the resultant modes under an identical resource condition. That is because in our system the influence of the excess noises deriving from the imperfect squeezing of the resource states is degraded. The gate operations applying two-mode entanglement can be utilized as a basic element in a future quantum computer involving a large-scale cluster state.

preprint2013arXiv

Gate sequence for continuous variable one-way quantum computation

Measurement-based one-way quantum computation (QC) using cluster states as resources provides an efficient model to perform computation and information processing of quantum codes. Arbitrary Gaussian QC can be implemented by sufficiently long single-mode and two-mode gate sequences. However, continuous variable (CV) gate sequences have not been realized so far due to an absence of cluster states larger than four submodes. Here we present the first CV gate sequence consisting of a single-mode squeezing gate and a two-mode controlled-phase gate based on a six-mode cluster state. The quantum property of this gate sequence is confirmed by the fidelities and the quantum entanglement of two output modes, which depend on both the squeezing and controlled-phase gates. The experiment demonstrates the feasibility of implementing Gaussian QC by means of accessible gate sequences.

preprint2012arXiv

Cascaded Entanglement Enhancement

We present a cascaded system consisting of three non-degenerate optical parametric amplifiers (NOPAs) for the generation and the enhancement of quantum entanglement of continuous variables. The entanglement of optical fields produced by the first NOPA is successively enhanced by the second and the third NOPAs from -5.3 $dB$ to -8.1 $dB$ below the quantum noise limit. The dependence of the enhanced entanglement on the physical parameters of the NOPAs and the reachable entanglement limitation for a given cascaded NOPA system are calculated. The calculation results are in good agreement with the experimental measurements.

preprint2012arXiv

Coherent feedback control of multipartite quantum entanglement for optical fields

Coherent feedback control (CFC) of multipartite optical entangled states produced by a non-degenerate optical parametric amplifier is theoretically studied. The features of the quantum correlations of amplitude and phase quadratures among more than two entangled optical modes can be controlled by tuning the transmissivity of the optical beam splitter in CFC loop. The physical conditions to enhance continuous variable multipartite entanglement of optical fields utilizing CFC loop are obtained. The numeric calculations based on feasible physical parameters of realistic systems provide direct references for the design of experimental devices.

preprint2012arXiv

Experimental preparation of eight-partite linear and two-diamond shape cluster states for photonic qumodes

The preparation of multipartite entangled states is the prerequisite for exploring quantum information networks and quantum computation. In this letter, we present the first experimental demonstration of eight-partite spatially separated CV entangled states. The initial resource quantum states are eight squeezed states of light, through the linearly optical transformation of which two types of the eight-partite cluster entangled states are prepared, respectively. The generated eight entangled photonic qumodes are spatially separated, which provide valuable quantum resources to implement more complicated quantum information task.

preprint2012arXiv

Experimental realization of three-color entanglement at optical fiber communication and atomic storage wavelengths

Multi-color entangled states of light including low-loss optical fiber transmission and atomic resonance frequencies are essential resources for future quantum information network. We present the experimental achievement on the three-color entanglement generation at 852 nm, 1550 nm and 1440 nm wavelengths for optical continuous variables. The entanglement generation system consists of two cascaded non-degenerated optical parametric oscillators (NOPOs). The flexible selectivity of nonlinear crystals in the two NOPOs and the tunable property of NOPO provide large freedom for the frequency selection of three entangled optical beams, so the present system is possible to be developed as practical devices used for quantum information networks with atomic storage units and long fiber transmission lines.

preprint2011arXiv

Continuous variable quantum key distribution based on optical entangled states without signal modulation

In this paper, we present the first experimental demonstration on continuous variable quantum key distribution using determinant Einstein-Podolsky-Rosen entangled states of optical field. By means of the instantaneous measurements of the quantum fluctuations of optical modes respectively distributed at sender and receiver, the random bits of secret key are obtained without the need for signal modulation. The post-selection boundaries for the presented entanglement-based scheme against both Gaussian collective and individual attacks are theoretically concluded. The final secret key rates of 84 kbits/s and 3 kbits/s are completed under the collective attack for the transmission efficiency of 80% and 40%, respectively.

preprint2010arXiv

Continuous variable entanglement enhancement and manipulation by a sub-threshold type-II optical parametric amplifier

We experimentally demonstrate that the quantum entanglement between amplitude and phase quadratures of optical modes produced from a non-degenerate optical parametric amplifier (NOPA) can be enhanced and manipulated phase-sensitively by means of another NOPA. When both NOPAs operate at de-amplification, the entanglement degree is increased at the cavity resonance of the second NOPA. When the first NOPA operates at de-amplification and the second one at amplification, the spectral features of the correlation variances are significantly changed. The experimental results are in good agreement with the theoretical expectation.

preprint2009arXiv

Quantum Communication Network Utilizing Quadripartite Entangled States of Optical Field

We propose two types of quantum dense coding communication networks with optical continuous variables, in which a quadripartite entangled state of the optical field with totally three-party correlations of quadrature amplitudes is utilized. In the networks, the exchange of information between any two participants can be manipulated by one or two of the remaining participants. The channel capacities for a variety of communication protocols are numerically calculated. Due to the fact that the quadripartite entangled states applied in the communication systems have been successfully prepared already in the laboratory, the proposed schemes are experimentally accessible at present.

Xiaojun Jia

What is connected

Connect this record

See the researcher in context

Building this map preview

22 published item(s)

Adversarial Attacks Against MLLMs via Progressive Resolution Processing and Adaptive Feature Alignment

AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin

Knowledge-Driven Multi-Turn Jailbreaking on Large Language Models

Does Few-shot Learning Suffer from Backdoor Attacks?

Boosting Fast Adversarial Training with Learnable Adversarial Initialization

High-performance cavity-enhanced quantum memory with warm atomic cell

LAS-AT: Adversarial Training with Learnable Attack Strategy

Prior-Guided Adversarial Initialization for Fast Adversarial Training

Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal

Adv-watermark: A Novel Watermark Perturbation for Adversarial Examples

Quantum enhanced optical phase estimation with a squeezed thermal state

Quantum interferometer combining squeezing and parametric amplification

Experimental generation of 8.4 dB entangled state with an optical cavity involving a wedged type-II nonlinear crystal

Gates for one-way quantum computation based on Einstein-Podolsky-Rosen entanglement

Gate sequence for continuous variable one-way quantum computation

Cascaded Entanglement Enhancement

Coherent feedback control of multipartite quantum entanglement for optical fields

Experimental preparation of eight-partite linear and two-diamond shape cluster states for photonic qumodes

Experimental realization of three-color entanglement at optical fiber communication and atomic storage wavelengths

Continuous variable quantum key distribution based on optical entangled states without signal modulation

Continuous variable entanglement enhancement and manipulation by a sub-threshold type-II optical parametric amplifier

Quantum Communication Network Utilizing Quadripartite Entangled States of Optical Field