Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
14 works
0 followers
14 topics
4 close collaborators

Published work

14 published items

preprint, 2026, arXiv

Beyond Prompts: Space-Time Decoupling Control-Plane Jailbreaks in LLM Structured Output

Content Warning: This paper may contain unsafe or harmful content generated by LLMs that may be offensive to readers. Large Language Models (LLMs) are extensively used as tooling platforms through structured output APIs to ensure syntax compliance so that robust integration with existing software, like agent systems, can be achieved. However, the feature enabling the functionality of grammar-guided structured output presents significant security vulnerabilities. In this work, we reveal a critical control-plane attack surface orthogonal to traditional data-plane vulnerabilities. We introduce Constrained Decoding Attack (CDA), a novel jailbreak class that weaponizes structured output constraints to bypass both external auditing and internal safety alignment. Unlike prior attacks focused on input prompt designs, CDA operates by embedding malicious intent in schema-level grammar rules (control-plane) while maintaining benign surface prompts (data-plane). We instantiate this with two proof-of-concept attacks: EnumAttack, which embeds malicious content in enum fields; and the more evasive DictAttack, which decouples the malicious payload across a benign prompt and a dictionary-based grammar. Our evaluation spans a broad spectrum of 13 proprietary/open-weight models. In particular, DictAttack achieves 94.3–99.5% ASR across five benchmarks on gpt-5, gemini-2.5-pro, deepseek-r1, and gpt-oss-120b. Furthermore, we demonstrate the significant challenge in defending against these threats: while basic grammar auditing mitigates EnumAttack, the more sophisticated DictAttack maintains a 75.8% ASR even against multiple state-of-the-art jailbreak guardrails. This exposes a critical "semantic gap" in current safety architectures and underscores the urgent need for cross-plane defenses that can bridge the data and control planes to secure the LLM generation pipeline.
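
The abstract says that basic grammar auditing mitigates EnumAttack but does not describe the audit. The following is a minimal defensive sketch of what such an audit could look like, not the paper's method: it walks a JSON-Schema-like dictionary and collects every string listed under an `enum` key, so that those strings can be passed to the same content filter as the prompt. The function name `collect_enum_strings` and the example schema are hypothetical.

```python
from typing import Any, List


def collect_enum_strings(schema: Any) -> List[str]:
    """Recursively gather all string values listed under "enum" keys."""
    found: List[str] = []
    if isinstance(schema, dict):
        for key, value in schema.items():
            if key == "enum" and isinstance(value, list):
                # Keep only strings; numbers and booleans carry no text payload.
                found.extend(v for v in value if isinstance(v, str))
            else:
                found.extend(collect_enum_strings(value))
    elif isinstance(schema, list):
        for item in schema:
            found.extend(collect_enum_strings(item))
    return found


# Hypothetical schema used only to exercise the function.
example_schema = {
    "type": "object",
    "properties": {
        "color": {"type": "string", "enum": ["red", "green"]},
        "items": {"type": "array", "items": {"enum": ["a", 1]}},
    },
}
print(collect_enum_strings(example_schema))
```

As the abstract notes, a check of this kind addresses EnumAttack only; DictAttack splits the payload across the prompt and the grammar, so inspecting enum values alone would not be expected to catch it.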

preprint, 2026, arXiv

The New Compiler Stack: A Survey on the Synergy of LLMs and Compilers

This survey has provided a systematic overview of the emerging field of LLM-enabled compilation by addressing several key research questions. We first answered how LLMs are being integrated by proposing a comprehensive, multi-dimensional taxonomy that categorizes works based on their Design Philosophy (Selector, Translator, Generator), LLM Methodology, their operational Level of Code Abstraction, and the specific Task Type they address. In answering what advancements these approaches offer, we identified three primary benefits: the democratization of compiler development, the discovery of novel optimization strategies, and the broadening of the compiler's traditional scope. Finally, in addressing the field's challenges and opportunities, we highlighted the critical hurdles of ensuring correctness and achieving scalability, while identifying the development of hybrid systems as the most promising path forward. By providing these answers, this survey serves as a foundational roadmap for researchers and practitioners, charting the course for a new generation of LLM-powered, intelligent, adaptive and synergistic compilation tools.

preprint, 2022, arXiv

A narrow-stencil framework for convergent numerical approximations of fully nonlinear second order PDEs

This paper develops a unified general framework for designing convergent finite difference and discontinuous Galerkin methods for approximating viscosity and regular solutions of fully nonlinear second order PDEs. Unlike the well-known monotone (finite difference) framework, the proposed new framework allows for the use of narrow stencils and unstructured grids which makes it possible to construct high order methods. The general framework is based on the concepts of consistency and g-monotonicity which are both defined in terms of various numerical derivative operators. Specific methods that satisfy the framework are constructed using numerical moments. Admissibility, stability, and convergence properties are proved, and numerical experiments are provided along with some computer implementation details.

preprint, 2022, arXiv

A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling

Automatic speech recognition (ASR) systems used on smart phones or vehicles are usually required to process speech queries from very different domains. In such situations, a vanilla ASR system usually fails to perform well on every domain. This paper proposes a multi-domain ASR framework for Tencent Map, a navigation app used on smart phones and in-vehicle infotainment systems. The proposed framework consists of three core parts: a basic ASR module to generate n-best lists of a speech query, a text classification module to determine which domain the speech query belongs to, and a reranking module to rescore n-best lists using domain-specific language models. In addition, an instance-sampling-based method for training neural network language models (NNLMs) is proposed to address the data imbalance problem in multi-domain ASR. In experiments, the proposed framework was evaluated on the navigation and music domains, since navigating and playing music are two main features of Tencent Map. Compared to a general ASR system, the proposed framework achieves a relative character error rate reduction of 13% to 22% on several test sets collected from Tencent Map and our in-car voice assistant.
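
The three-part pipeline described in this abstract (n-best generation, domain classification, domain-specific reranking) can be sketched as follows. This is a toy illustration under stated assumptions, not the authors' system: the names `rerank`, `toy_classify`, `toy_lms` and `lm_weight` are hypothetical, the classifier and language models are keyword stand-ins, and classifying the domain from the top base hypothesis is an assumption the abstract does not confirm.

```python
from typing import Callable, Dict, List, Tuple

Hypothesis = Tuple[str, float]  # (text, base ASR score; higher is better)


def rerank(
    nbest: List[Hypothesis],
    classify: Callable[[str], str],
    domain_lms: Dict[str, Callable[[str], float]],
    lm_weight: float = 1.0,
) -> str:
    """Return the hypothesis text with the best combined score."""
    # Assumption: the domain is decided from the top base hypothesis.
    top_text = max(nbest, key=lambda h: h[1])[0]
    lm = domain_lms[classify(top_text)]
    # Rescore every hypothesis with the selected domain language model.
    return max(nbest, key=lambda h: h[1] + lm_weight * lm(h[0]))[0]


def toy_classify(text: str) -> str:
    """Keyword stand-in for the text classification module."""
    return "music" if "play" in text else "navigation"


# Keyword stand-ins for domain-specific language model scores.
toy_lms = {
    "music": lambda t: 1.0 if "song" in t else 0.0,
    "navigation": lambda t: 1.0 if "road" in t else 0.0,
}

nbest = [("play the sung", -1.0), ("play the song", -1.2)]
print(rerank(nbest, toy_classify, toy_lms))
```

In this toy run the base recognizer prefers the misrecognized first hypothesis, and the music-domain score moves the second hypothesis to the top.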

preprint, 2022, arXiv

An efficient multi-modes Monte Carlo homogenization method for random materials

In this paper, we propose and analyze a new stochastic homogenization method for diffusion equations with random and fast oscillatory coefficients. In the proposed method, the homogenized solutions are sought through a two-stage procedure. In the first stage, the original oscillatory diffusion equation is approximated, for each fixed random sample w, by a spatially homogenized diffusion equation with piecewise constant coefficients, resulting in a random diffusion equation. In the second stage, the resulting random diffusion equation is approximated and computed by using an efficient multi-modes Monte Carlo method which only requires solving a diffusion equation with a constant diffusion coefficient and a random right-hand side. The main advantage of the proposed method is that it separates the computational difficulty caused by the spatial fast oscillation of the solution from that caused by the randomness of the solution, so they can be overcome separately using different strategies. The convergence of the solution of the spatially homogenized equation (from the first stage) to the solution of the original random diffusion equation is established and the optimal rate of convergence is also obtained for the proposed multi-modes Monte Carlo method. Numerical experiments on some benchmark test problems for random composite materials are also presented to gauge the efficiency and accuracy of the proposed two-stage stochastic homogenization method.

preprint, 2022, arXiv

Higher order time discretization for the stochastic semilinear wave equation with multiplicative noise

In this paper, a higher-order time-discretization scheme is proposed, where the iterates approximate the solution of the stochastic semilinear wave equation driven by multiplicative noise with general drift and diffusion. We employ a variational method for its error analysis and prove an improved convergence order of 3/2 for the approximates of the solution. The core of the analysis is Hölder continuity in time and moment bounds for the solutions of the continuous and the discrete problem. Computational experiments are also presented.

preprint, 2021, arXiv

High-order mass- and energy-conserving SAV-Gauss collocation finite element methods for the nonlinear Schrödinger equation

A family of arbitrarily high-order fully discrete space-time finite element methods is proposed for the nonlinear Schrödinger equation based on the scalar auxiliary variable formulation, consisting of a Gauss collocation temporal discretization and a finite element spatial discretization. The proposed methods are proved to be well-posed and to conserve both mass and energy at the discrete level. An error bound of the form $O(h^p+\tau^{k+1})$ in the $L^\infty(0,T;H^1)$-norm is established, where $h$ and $\tau$ denote the spatial and temporal mesh sizes, respectively, and $(p,k)$ is the degree of the space-time finite elements. Numerical experiments are provided to validate the theoretical results on the convergence rates and conservation properties. The effectiveness of the proposed methods in preserving the shape of a soliton wave is also demonstrated by numerical results.

preprint, 2020, arXiv

A new theory of fractional differential calculus

This paper presents a self-contained new theory of weak fractional differential calculus in one dimension. The crux of this new theory is the introduction of a weak fractional derivative notion which is a natural generalization of integer order weak derivatives; it also helps to unify multiple existing fractional derivative definitions and characterize what functions are fractionally differentiable. Various calculus rules including a fundamental theorem of calculus, product and chain rules, and integration by parts formulas are established for weak fractional derivatives. Additionally, relationships with classical fractional derivatives and detailed characterizations of weakly fractional differentiable functions are also established. Furthermore, the notion of weak fractional derivatives is also systematically extended to general distributions instead of only to some special distributions. This new theory lays down a solid theoretical foundation for systematically and rigorously developing new theories of fractional Sobolev spaces, fractional calculus of variations, and fractional PDEs as well as their numerical solutions in subsequent works. This paper is a concise presentation of the materials of Sections 1-4 and 6 of reference [9].

preprint, 2020, arXiv

A new theory of fractional differential calculus and fractional Sobolev spaces: One-dimensional case

This paper presents a self-contained new theory of weak fractional differential calculus and fractional Sobolev spaces in one dimension. The crux of this new theory is the introduction of a weak fractional derivative notion which is a natural generalization of integer order weak derivatives; it also helps to unify multiple existing fractional derivative definitions and characterize what functions are fractionally differentiable. Various calculus rules including a fundamental theorem of calculus, product and chain rules, and integration by parts formulas are established for weak fractional derivatives and relationships with classical derivatives are also obtained. Based on the weak fractional derivative notion, new fractional order Sobolev spaces are introduced and many important theorems and properties, such as density/approximation theorem, extension theorems, trace theorem, and various embedding theorems in these Sobolev spaces are established. Moreover, a few relationships with existing fractional Sobolev spaces are also established. Furthermore, the notion of weak fractional derivatives is also systematically extended to general distributions instead of only to some special distributions. The new theory lays down a solid theoretical foundation for systematically and rigorously developing a fractional calculus of variations theory and a fractional PDE theory as well as their numerical solutions in subsequent works.

preprint, 2020, arXiv

Accelerating Deep Learning Inference with Cross-Layer Data Reuse on GPUs

Accelerating deep learning inference is very important for real-time applications. In this paper, we propose a novel method to fuse the layers of convolutional neural networks (CNNs) on Graphics Processing Units (GPUs), which applies data reuse analysis and access optimization in different levels of the memory hierarchy. To achieve the balance between computation and memory access, we explore the fusion opportunities in the CNN computation graph and propose three fusion modes of convolutional neural networks: straight, merge and split. Then, an approach for generating efficient fused code is designed, which goes deeper in multi-level memory usage for cross-layer data reuse. The effectiveness of our method is evaluated with the network layers from state-of-the-art CNNs on two different GPU platforms, NVIDIA TITAN Xp and Tesla P4. The experiments show that the average speedup is 2.02x on representative structures of CNNs, and 1.57x on end-to-end inference of SqueezeNet.

preprint, 2020, arXiv

Analysis of Fully Discrete Mixed Finite Element Methods for Time-dependent Stochastic Stokes Equations with Multiplicative Noise

This paper is concerned with fully discrete mixed finite element approximations of the time-dependent stochastic Stokes equations with multiplicative noise. A prototypical method, which consists of the Euler-Maruyama scheme for time discretization and the Taylor-Hood mixed element for spatial discretization, is studied in detail. Strong convergence with rates is established not only for the velocity approximation but also for the pressure approximation (in a time-averaged fashion). A stochastic inf-sup condition is established and used in a nonstandard way to obtain the error estimate for the pressure approximation in the time-averaged fashion. Numerical results are also provided to validate the theoretical results and to gauge the performance of the proposed fully discrete mixed finite element methods.
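
For readers unfamiliar with the Euler-Maruyama scheme named in this abstract, here is a minimal sketch of the time-stepping idea on a scalar toy problem with multiplicative noise, dX = -a X dt + sigma X dW. This is a swapped-in illustration, not the paper's Stokes discretization: there is no spatial finite element part, and the function name, parameters and test equation are all assumptions.

```python
import random


def euler_maruyama(x0, a, sigma, t_end, n_steps, rng):
    """Simulate dX = -a*X dt + sigma*X dW with n_steps uniform time steps."""
    dt = t_end / n_steps
    x = x0
    for _ in range(n_steps):
        # Brownian increment over one step: normal with mean 0, variance dt.
        dw = rng.gauss(0.0, dt ** 0.5)
        # Explicit update: drift term times dt plus diffusion term times dW.
        x = x + (-a * x) * dt + sigma * x * dw
    return x


rng = random.Random(0)  # fixed seed so the sample path is reproducible
print(euler_maruyama(1.0, 1.0, 0.5, 1.0, 1000, rng))
```

With `sigma=0` the update reduces to the explicit Euler method for the deterministic equation, which gives a simple sanity check.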

preprint, 2020, arXiv

LANCE: Efficient Low-Precision Quantized Winograd Convolution for Neural Networks Based on Graphics Processing Units

Accelerating deep convolutional neural networks has become an active topic and sparked interest in academia and industry. In this paper, we propose an efficient low-precision quantized Winograd convolution algorithm, called LANCE, which combines the advantages of fast convolution and quantization techniques. By embedding linear quantization operations into the Winograd-domain, the fast convolution can be performed efficiently under low-precision computation on graphics processing units. We test neural network models with LANCE on representative image classification datasets, including SVHN, CIFAR, and ImageNet. The experimental results show that our 8-bit quantized Winograd convolution improves the performance by up to 2.40x over the full-precision convolution with trivial accuracy loss.
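
The idea of quantizing inside the Winograd domain can be illustrated on the smallest standard case, the 1-D Winograd F(2,3) algorithm, which produces two outputs of a 3-tap filter with four multiplications. The sketch below is a toy CPU illustration under stated assumptions, not the LANCE implementation: it uses symmetric per-tensor linear quantization, and the function names and example values are hypothetical.

```python
def quantize(values, bits=8):
    """Symmetric linear quantization: returns (integer values, scale)."""
    qmax = 2 ** (bits - 1) - 1
    peak = max(abs(v) for v in values)
    scale = peak / qmax if peak > 0 else 1.0
    return [max(-qmax, min(qmax, round(v / scale))) for v in values], scale


def winograd_f23_quantized(d, g, bits=8):
    """Two outputs of a 3-tap 1-D filter (CNN-style correlation), F(2,3)."""
    # Input and filter transforms, computed in floating point.
    v = [d[0] - d[2], d[1] + d[2], d[2] - d[1], d[1] - d[3]]
    u = [g[0], (g[0] + g[1] + g[2]) / 2, (g[0] - g[1] + g[2]) / 2, g[2]]
    # Quantize in the Winograd domain and multiply as integers.
    qv, sv = quantize(v, bits)
    qu, su = quantize(u, bits)
    m = [a * b * sv * su for a, b in zip(qv, qu)]  # dequantized products
    # Output transform back to the two filter outputs.
    return [m[0] + m[1] + m[2], m[1] - m[2] - m[3]]


def direct(d, g):
    """Reference: the same two outputs computed directly."""
    return [sum(d[i + k] * g[k] for k in range(3)) for i in range(2)]


d = [1.0, 2.0, 3.0, 4.0]
g = [0.5, -1.0, 0.25]
print(winograd_f23_quantized(d, g), direct(d, g))
```

The quantized result differs from the direct result only by a small rounding error introduced by the 8-bit step; the element-wise products are the part that low-precision hardware would accelerate.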

preprint, 2020, arXiv

On New Families of Fractional Sobolev Spaces

This paper presents three new families of fractional Sobolev spaces and their accompanying theory in one dimension. The new construction and theory are based on a newly developed notion of weak fractional derivatives, which are natural generalizations of the well-established integer order Sobolev spaces and theory. In particular, two new families of one-sided fractional Sobolev spaces are introduced and analyzed; they reveal more insights about another family of so-called symmetric fractional Sobolev spaces. Many key theorems/properties, such as density/approximation theorem, extension theorems, one-sided trace theorem, and various embedding theorems and Sobolev inequalities in those Sobolev spaces are established. Moreover, a few relationships with existing fractional Sobolev spaces are also discovered. The results of this paper lay down a solid theoretical foundation for systematically developing a fractional calculus of variations theory and a fractional PDE theory as well as their numerical solutions in subsequent works. This paper is a concise presentation of the materials of Sections 1, 4 and 5 of reference [7].

preprint, 2020, arXiv

Optimally Convergent Mixed Finite Element Methods for the Stochastic Stokes Equations

We propose some new mixed finite element methods for the time-dependent stochastic Stokes equations with multiplicative noise, which use the Helmholtz decomposition of the driving multiplicative noise. It is known [16] that the pressure solution has a low regularity, which manifests in sub-optimal convergence rates for well-known inf-sup stable mixed finite element methods in numerical simulations, see [10]. We show that eliminating this gradient part from the noise in the numerical scheme leads to optimally convergent mixed finite element methods, and that this conceptual idea may be used to retool numerical methods that are well-known in the deterministic setting, including pressure stabilization methods, so that their optimal convergence properties can still be maintained in the stochastic setting. Computational experiments are also provided to validate the theoretical results and to illustrate the conceptual usefulness of the proposed numerical approach.