Researcher profile

Hongliang Liu

Hongliang Liu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - Baseline
5works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Behavioral Integrity Verification for AI Agent Skills

Agent skills extend LLM agents with privileged third-party capabilities such as filesystem access, credentials, network calls, and shell execution. Existing safety work catches malicious prompts and risky runtime actions, but the skill artifact itself goes unverified. We formalize this as the behavioral integrity verification (BIV) problem: a typed set comparison between declared and actual capabilities over a shared taxonomy that bridges code, instructions, and metadata. The BIV framework instantiates this comparison by pairing deterministic code analysis with LLM-assisted capability extraction. The resulting structured evidence supports three downstream analyses: deviation taxonomy, root-cause classification, and malicious-skill detection. On 49,943 skills from the OpenClaw registry, the deviation taxonomy reveals a pervasive description-implementation gap: 80.0% of skills deviate from declared behavior, with four novel compound-threat categories surfaced. Root-cause classification finds that deviations are mostly oversight, not malice: 81.1% trace to developer oversight and 18.9% to adversarial intent, with 5.0% of skills carrying predicted multi-stage attack chains. On a 906-skill malicious-skill detection benchmark, BIV reaches an F1 of 0.946, outperforming state-of-the-art rule-based and single-pass LLM baselines. These results demonstrate behavioral integrity auditing for agent skills at scale.

preprint2026arXiv

Perturbation Probing: A Two-Pass-per-Prompt Diagnostic for FFN Behavioral Circuits in Aligned LLMs

Perturbation probing generates task-specific causal hypotheses for FFN neurons in large language models using two forward passes per prompt and no backpropagation, followed by a one-time intervention sweep of about 150 passes amortized across all identified neurons. Across eight behavioral circuits, 13 models, and four architecture families, we identify two circuit structures that organize LLM behavior. Opposition circuits appear when RLHF suppresses a pre-training tendency. In safety refusal, about 50 neurons, or 0.014 percent of all neurons, control the refusal template; ablating them changes 80 percent of response formats on 520 AdvBench prompts while producing near-zero harmful compliance, 3 of 520 cases, all with disclaimers. Routing circuits appear for pre-training behaviors distributed through attention. For language selection, residual-stream direction injection switches English to Chinese output on 99.1 percent of 580 benchmark prompts in the 3 of 19 tested models that satisfy three observed conditions: bilingual training, FFN-to-skip signal ratio between 0.3 and 1.1, and linear representability. The same intervention fails on the other 16 models and on math, code, and factual circuits, defining the limits of directional steering. The FFN-to-skip signal ratio, computed from the same two forward passes, distinguishes the two structures and predicts the appropriate intervention. Circuit topology varies by architecture, from Qwen's concentrated FFN bottleneck to Gemma's normalization-shielded circuit. In Qwen3.5-2B, ablating 20 neurons eliminates multi-turn sycophantic capitulation, while amplifying 10 related neurons improves factual correction from 52 percent to 88 percent on 200 TruthfulQA prompts. These results show that perturbation probing offers mechanistic insight into RLHF-organized behavior and a practical toolkit for precision template-layer editing.

preprint2016arXiv

A Bifurcation Monte Carlo Scheme for Rare Event Simulation

The bifurcation method is a way to do rare event sampling -- to estimate the probability of events that are too rare to be found by direct simulation. We describe the bifurcation method and use it to estimate the transition rate of a double well potential problem. We show that the associated constrained path sampling problem can be addressed by a combination of Crooks-Chandler sampling and parallel tempering and marginalization.

preprint2016arXiv

The Effect of Noise on the Propagating Speed of Pre-mixed Laminar Flame Fronts

We study the effect of thermal noise on the propagation speed of a planar flame. We show that this out of equilibrium greatly amplifies the effect of thermal noise to yield macroscopic reductions in the flame speed over what is predicted by the noise-free model. Computations show that noise slows the flame significantly. The flame is modeled using Navier Stokes equations with appropriate diffusive transport terms and chemical kinetic mechanism of hydrogen/oxygen. Thermal noise is modeled within the continuum framework using a system of stochastic partial differential equations, with transport noise from fluctuating hydrodynamics and reaction noise from a poisson model. We use a full chemical kinetics model in order to get quantitatively meaningful results. We compute steady and dynamic flames using an operator split finite volume scheme. New characteristic boundary conditions avoid non-physical boundary layers at computational boundaries. New limiters prevent stochastic terms from introducing non-physical negative concentrations. This represents the first computation of a model with thermal noise is a system with this degree of physical detail.

preprint2011arXiv

Global investigation of odd-even mass differences and radii with isospin dependent pairing interactions

The neutron and proton odd-even mass differences are systematically studied with Hartree-Fock+BCS (HFBCS) calculations with Skyrme interactions and an isospin dependent contact pairing interaction. The strength of pairing interactions is determined to reproduce empirical odd-even mass differences in a wide region of mass table. By using the optimal parameter, we perform global HF+BCS calculations of nuclei and compare with experimental data. The importance of isospin dependence of the pairing interaction is singled out for odd-even mass differences in medium and heavy isotopes. The proton and neutron radii are studied systematically by using the same model.