Source author record

Huilin Zhou

Huilin Zhou appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Artificial Intelligence Computation and Language Computer Vision eess.SP Information Theory math.IT physics.geo-ph

Catalog footprint

What is connected

5works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization

Red teaming is critical for uncovering vulnerabilities in Large Language Models (LLMs). While automated methods have improved scalability, existing approaches often rely on static heuristics or stochastic search, rendering them brittle against advanced safety alignment. To address this, we introduce Metis, a framework that reformulates jailbreaking as inference-time policy optimization within an adversarial Partially Observable Markov Decision Process (POMDP). Metis employs a self-evolving metacognitive loop to perform causal diagnosis of a target's defense logic and leverages structured feedback as a semantic gradient to refine its policy, offering enhanced interpretability through transparent reasoning traces. Extensive evaluations across 10 diverse models demonstrate that Metis achieves the strongest average Attack Success Rate (ASR) among compared methods at 89.2%, maintaining high efficacy on resilient frontier models (e.g., 76.0% on O1 and 78.0% on GPT-5-chat) where traditional baselines exhibit substantial performance degradation. By replacing redundant exploration with directed optimization, Metis reduces token costs by an average of 8.2x and up to 11.4x. Our analysis reveals that current defenses remain vulnerable to internally-steered, closed-loop reasoning trajectories under the tested settings, highlighting a critical need for next-generation defenses capable of reasoning about safety dynamically during inference.

preprint2026arXiv

Model-Driven GPR Inversion Network With Surrogate Forward Solver

Data-driven deep learning is considered a promising solution for ground-penetrating radar (GPR) full-waveform inversion (FWI), while its generalization ability is limited due to the heavy reliance on abundant labeled samples. In contrast, Deep unfolding network (DUN) usually exhibits better generalization by integrating model-driven and data-driven approaches, yet its application to GPR FWI remains challenging due to the high computational cost associated with forward simulations. In this paper, we integrate a deep learning-based (DL-based) forward solver within an unfolding framework to form a fully neural-network-based architecture, UA-Net, for GPR FWI. The forward solver rapidly predicts B-scans given permittivity and conductivity models and enables automatic differentiation to compute gradients for inversion. In the inversion stage, an optimization process based on the Alternating Direction Method of Multipliers (ADMM) is unfolded into a multi-stage network with three interconnected modules: data fitting, regularization, and multiplier update. Specifically, the regularization module is trained end-to-end for adaptive learning of sparse target features. Experimental results demonstrate that UA-Net outperforms classical FWI and data-driven methods in reconstruction accuracy. Moreover, by employing transfer learning to fine-tune the network, UA-Net can be effectively applied to field data and produce reliable results.

preprint2021arXiv

Building Interpretable Interaction Trees for Deep NLP Models

This paper proposes a method to disentangle and quantify interactions among words that are encoded inside a DNN for natural language processing. We construct a tree to encode salient interactions extracted by the DNN. Six metrics are proposed to analyze properties of interactions between constituents in a sentence. The interaction is defined based on Shapley values of words, which are considered as an unbiased estimation of word contributions to the network prediction. Our method is used to quantify word interactions encoded inside the BERT, ELMo, LSTM, CNN, and Transformer networks. Experimental results have provided a new perspective to understand these DNNs, and have demonstrated the effectiveness of our method.

preprint2020arXiv

Interpretable CNNs for Object Classification

This paper proposes a generic method to learn interpretable convolutional filters in a deep convolutional neural network (CNN) for object classification, where each interpretable filter encodes features of a specific object part. Our method does not require additional annotations of object parts or textures for supervision. Instead, we use the same training data as traditional CNNs. Our method automatically assigns each interpretable filter in a high conv-layer with an object part of a certain category during the learning process. Such explicit knowledge representations in conv-layers of CNN help people clarify the logic encoded in the CNN, i.e., answering what patterns the CNN extracts from an input image and uses for prediction. We have tested our method using different benchmark CNNs with various structures to demonstrate the broad applicability of our method. Experiments have shown that our interpretable filters are much more semantically meaningful than traditional filters.

preprint2020arXiv

Robust Trajectory and Transmit Power Optimization for Secure UAV-Enabled Cognitive Radio Networks

Cognitive radio is a promising technology to improve spectral efficiency. However, the secure performance of a secondary network achieved by using physical layer security techniques is limited by its transmit power and channel fading. In order to tackle this issue, a cognitive unmanned aerial vehicle (UAV) communication network is studied by exploiting the high flexibility of a UAV and the possibility of establishing line-of-sight links. The average secrecy rate of the secondary network is maximized by robustly optimizing the UAV's trajectory and transmit power. Our problem formulation takes into account two practical inaccurate location estimation cases, namely, the worst case and the outage-constrained case. In order to solve those challenging non-convex problems, an iterative algorithm based on $\mathcal{S}$-Procedure is proposed for the worst case while an iterative algorithm based on Bernstein-type inequalities is proposed for the outage-constrained case. The proposed algorithms can obtain effective suboptimal solutions of the corresponding problems. Our simulation results demonstrate that the algorithm under the outage-constrained case can achieve a higher average secrecy rate with a low computational complexity compared to that of the algorithm under the worst case. Moreover, the proposed schemes can improve the secure communication performance significantly compared to other benchmark schemes.

Huilin Zhou

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization

Model-Driven GPR Inversion Network With Surrogate Forward Solver

Building Interpretable Interaction Trees for Deep NLP Models

Interpretable CNNs for Object Classification

Robust Trajectory and Transmit Power Optimization for Secure UAV-Enabled Cognitive Radio Networks