Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
20topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2026arXiv

Navigating by Old Maps: The Pitfalls of Static Mechanistic Localization in LLM Post-Training

The "Locate-then-Update" paradigm has become a predominant approach in the post-training of large language models (LLMs), identifying critical components via mechanistic interpretability for targeted parameter updates. However, this paradigm rests on a fundamental yet unverified assumption: can mechanisms derived from current static parameters reliably guide future dynamic parameter updates? To investigate this, we systematically track the structural evolution of Transformer circuits throughout the supervised fine-tuning (SFT) process, revealing the underlying dynamics of task mechanisms. We introduce three novel metrics-Circuit Distance, Circuit Stability, and Circuit Conflict-to analyze circuit evolution across three dimensions: neural migration, semantic stability, and cross-task interference. Our empirical results reveal that circuits inherently exhibit "Free Evolution" during parameter updates. Consequently, static mechanisms extracted from current states inevitably suffer from temporal latency, making them fundamentally inadequate for guiding future states. Moreover, by deconstructing the "illusion of effectiveness" in existing methods, this work underscores the necessity of "foresight" in mechanistic localization and proposes a predictive framework for future research.

preprint2026arXiv

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

We present NextFlow, a unified decoder-only autoregressive transformer trained on 6 trillion interleaved text-image discrete tokens. By leveraging a unified vision representation within a unified autoregressive architecture, NextFlow natively activates multimodal understanding and generation capabilities, unlocking abilities of image editing, interleaved content and video generation. Motivated by the distinct nature of modalities - where text is strictly sequential and images are inherently hierarchical - we retain next-token prediction for text but adopt next-scale prediction for visual generation. This departs from traditional raster-scan methods, enabling the generation of 1024x1024 images in just 5 seconds - orders of magnitude faster than comparable AR models. We address the instabilities of multi-scale generation through a robust training recipe. Furthermore, we introduce a prefix-tuning strategy for reinforcement learning. Experiments demonstrate that NextFlow achieves state-of-the-art performance among unified models and rivals specialized diffusion baselines in visual quality.

preprint2026arXiv

Towards Efficient 3D Object Detection for Vehicle-Infrastructure Collaboration via Risk-Intent Selection

Vehicle-Infrastructure Collaborative Perception (VICP) is pivotal for resolving occlusion in autonomous driving, yet the trade-off between communication bandwidth and feature redundancy remains a critical bottleneck. While intermediate fusion mitigates data volume compared to raw sharing, existing frameworks typically rely on spatial compression or static confidence maps, which inefficiently transmit spatially redundant features from non-critical background regions. To address this, we propose Risk-intent Selective detection (RiSe), an interaction-aware framework that shifts the paradigm from identifying visible regions to prioritizing risk-critical ones. Specifically, we introduce a Potential Field-Trajectory Correlation Model (PTCM) grounded in potential field theory to quantitatively assess kinematic risks. Complementing this, an Intention-Driven Area Prediction Module (IDAPM) leverages ego-motion priors to proactively predict and filter key Bird's-Eye-View (BEV) areas essential for decision-making. By integrating these components, RiSe implements a semantic-selective fusion scheme that transmits high-fidelity features only from high-interaction regions, effectively acting as a feature denoiser. Extensive experiments on the DeepAccident dataset demonstrate that our method reduces communication volume to 0.71\% of full feature sharing while maintaining state-of-the-art detection accuracy, establishing a competitive Pareto frontier between bandwidth efficiency and perception performance.

preprint2022arXiv

A Review and Roadmap of Deep Learning Causal Discovery in Different Variable Paradigms

Understanding causality helps to structure interventions to achieve specific goals and enables predictions under interventions. With the growing importance of learning causal relationships, causal discovery tasks have transitioned from using traditional methods to infer potential causal structures from observational data to the field of pattern recognition involved in deep learning. The rapid accumulation of massive data promotes the emergence of causal search methods with brilliant scalability. Existing summaries of causal discovery methods mainly focus on traditional methods based on constraints, scores and FCMs, there is a lack of perfect sorting and elaboration for deep learning-based methods, also lacking some considers and exploration of causal discovery methods from the perspective of variable paradigms. Therefore, we divide the possible causal discovery tasks into three types according to the variable paradigm and give the definitions of the three tasks respectively, define and instantiate the relevant datasets for each task and the final causal model constructed at the same time, then reviews the main existing causal discovery methods for different tasks. Finally, we propose some roadmaps from different perspectives for the current research gaps in the field of causal discovery and point out future research directions.

preprint2022arXiv

Learning a General Clause-to-Clause Relationships for Enhancing Emotion-Cause Pair Extraction

Emotion-cause pair extraction (ECPE) is an emerging task aiming to extract potential pairs of emotions and corresponding causes from documents. Previous approaches have focused on modeling the pair-to-pair relationship and achieved promising results. However, the clause-to-clause relationship, which fundamentally symbolizes the underlying structure of a document, has still been in its research infancy. In this paper, we define a novel clause-to-clause relationship. To learn it applicably, we propose a general clause-level encoding model named EA-GAT comprising E-GAT and Activation Sort. E-GAT is designed to aggregate information from different types of clauses; Activation Sort leverages the individual emotion/cause prediction and the sort-based mapping to propel the clause to a more favorable representation. Since EA-GAT is a clause-level encoding model, it can be broadly integrated with any previous approach. Experimental results show that our approach has a significant advantage over all current approaches on the Chinese and English benchmark corpus, with an average of $2.1\%$ and $1.03\%$.

preprint2022arXiv

On Random Graph Properties

We consider 15 properties of labeled random graphs that are of interest in the graph-theoretical and the graph mining literature, such as clustering coefficients, centrality measures, spectral radius, degree assortativity, treedepth, treewidth, etc. We analyze relationships and correlations between these properties. Whereas for graphs on a small number of vertices we can exactly compute the average values and range for each property of interest, this becomes infeasible for larger graphs. We show that graphs generated by the \ErdosRenyi graph generator with $p = 1/2$ model well the underlying space of all labeled graphs with a fixed number of vertices. The later observation allows us to analyze properties and correlations between these properties for larger graphs. We then use linear and non-linear models to predict a given property based on the others and for each property, we find the most predictive subset. We experimentally show that pairs and triples of properties have high predictive power, making it possible to estimate computationally expensive to compute properties with ones for which there are efficient algorithms.

preprint2022arXiv

Partial wave analysis of two-body decay with helicity formalism

In this paper, We find that the angular distribution of two-body decay and scattering process that can be studied by introducing the helicity method.It has been argued that the angular distribution only depends on magnitude of the decaying amplitude which final particle momentum is along the $Z$ axis. Therefore, when calculating the angular distribution,we only need to know the helicity amplitude in the $Z$ direction. At the same time, we also discuss the symmetry of physical processes, such as parity and time inversion transformation and so on. The effects of these symmetric transformations on the helicity state and the canonical states are studied in this paper. When it is applied to two body decay process, the symmetry of the amplitude can be obtained by helicity method. In addition, we also introduce the density matrix to study the polarization of the final state particles, and discuss some inherent symmetries of the density matrix. When we study the polarization and angular distribution of final state particles, we can use these properties to simplify density matrix without knowing all the elements of the density matrix.Then we find that there is a close relationship between the partial wave\cite{2003Tree}coupling constants and the angular distribution parameters, indicating that these coupling constants can be obtained by experimental fitting to data.In a word, the properties of two body decay and two body cascade decay process can be analyzed by helicity method, which has a very good application .

preprint2022arXiv

Revealing room temperature ferromagnetism in exfoliated Fe$_5$GeTe$_2$ flakes with quantum magnetic imaging

Van der Waals material Fe$_5$GeTe$_2$, with its long-range ferromagnetic ordering near room temperature, has significant potential to become an enabling platform for implementing novel spintronic and quantum devices. To pave the way for applications, it is crucial to determine the magnetic properties when the thickness of Fe5GeTe2 reaches the few-layers regime. However, this is highly challenging due to the need for a characterization technique that is local, highly sensitive, artifact-free, and operational with minimal fabrication. Prior studies have indicated that Curie temperature TC can reach up to close to room temperature for exfoliated Fe$_5$GeTe$_2$ flakes, as measured via electrical transport; there is a need to validate these results with a measurement that reveals magnetism more directly. In this work, we investigate the magnetic properties of exfoliated thin flakes of van der Waals magnet Fe$_5$GeTe$_2$ via a quantum magnetic imaging technique based on nitrogen vacancy diamond. Through imaging the stray fields, we confirm room-temperature magnetic order in Fe$_5$GeTe$_2$ thin flakes with thickness down to 7 units cell. The stray field patterns and their response to magnetizing fields with different polarities point to a perpendicular easy-axis anisotropy. Furthermore, we perform imaging at different temperatures and determine the Curie temperature of the flakes at Tc~300 K. These results provide the basis for realizing a room-temperature monolayer ferromagnet with Fe$_5$GeTe$_2$. This work also demonstrates that the imaging technique enables a rapid screening of multiple flakes simultaneously, thereby paving the way towards high throughput characterization of potential 2D magnets near room temperature.

preprint2021arXiv

High-Performance HZO/InAlN/GaN MIS-HEMT with fT/fmax of 155/250 GHz

Scaling of GaN high-electron-mobility transistors (HEMTs) usually increases gate leakage current and deteriorates breakdown characteristic, limiting the maximum drain current and output power density. These bottlenecks can be circumvented by inserting a dielectric material under the gate of HEMTs. Doped HfO2 is an excellent dielectric material but unexplored so far as the gate material of HEMTs for high-speed device application. Here we demonstrate that Zr-doped HfO2 (HZO)-gated InAlN/GaN metal-insulator-semiconductor (MIS) HEMTs exhibit remarkable properties. The device with a gate length (Lg) of 50 nm exhibits maximum drain current (Id,max) of 2.15 A/mm, a transconductance (gm) peak of 476 mS/mm, an on/off current ratio (Ion/Ioff) of 9.3*107, a low drain-induced barrier lowing (DIBL) of 45 mV/V. RF characterizations reveal a current gain cutoff frequency (fT) of 155 GHz and a maximum oscillation frequency (fmax) of 250 GHz, resulting in a (fT*fmax)1/2 of 197 GHz.These properties, particularly the high (fT/fmax)1/2 and JFOM are highly desirable for the millimeter-wave power applications, demonstrating the great technological potential of HZO/InAlN/GaN MIS-HEMTs.

preprint2021arXiv

Robust Blockchained Federated Learning with Model Validation and Proof-of-Stake Inspired Consensus

Federated learning (FL) is a promising distributed learning solution that only exchanges model parameters without revealing raw data. However, the centralized architecture of FL is vulnerable to the single point of failure. In addition, FL does not examine the legitimacy of local models, so even a small fraction of malicious devices can disrupt global training. To resolve these robustness issues of FL, in this paper, we propose a blockchain-based decentralized FL framework, termed VBFL, by exploiting two mechanisms in a blockchained architecture. First, we introduced a novel decentralized validation mechanism such that the legitimacy of local model updates is examined by individual validators. Second, we designed a dedicated proof-of-stake consensus mechanism where stake is more frequently rewarded to honest devices, which protects the legitimate local model updates by increasing their chances of dictating the blocks appended to the blockchain. Together, these solutions promote more federation within legitimate devices, enabling robust FL. Our emulation results of the MNIST classification corroborate that with 15% of malicious devices, VBFL achieves 87% accuracy, which is 7.4x higher than Vanilla FL.

preprint2020arXiv

A Novel Application of Boolean Functions with High Algebraic Immunity in Minimal Codes

Boolean functions with high algebraic immunity are important cryptographic primitives in some stream ciphers. In this paper, two methodologies for constructing binary minimal codes from sets, Boolean functions and vectorial Boolean functions with high algebraic immunity are proposed. More precisely, a general construction of new minimal codes using minimal codes contained in Reed-Muller codes and sets without nonzero low degree annihilators is presented. The other construction allows us to yield minimal codes from certain subcodes of Reed-Muller codes and vectorial Boolean functions with high algebraic immunity. Via these general constructions, infinite families of minimal binary linear codes of dimension $m$ and length less than or equal to $m(m+1)/2$ are obtained. In addition, a lower bound on the minimum distance of the proposed minimal linear codes is established. Conjectures and open problems are also presented. The results of this paper show that Boolean functions with high algebraic immunity have nice applications in several fields such as symmetric cryptography, coding theory and secret sharing schemes.

preprint2020arXiv

Correlating Subword Articulation with Lip Shapes for Embedding Aware Audio-Visual Speech Enhancement

In this paper, we propose a visual embedding approach to improving embedding aware speech enhancement (EASE) by synchronizing visual lip frames at the phone and place of articulation levels. We first extract visual embedding from lip frames using a pre-trained phone or articulation place recognizer for visual-only EASE (VEASE). Next, we extract audio-visual embedding from noisy speech and lip videos in an information intersection manner, utilizing a complementarity of audio and visual features for multi-modal EASE (MEASE). Experiments on the TCD-TIMIT corpus corrupted by simulated additive noises show that our proposed subword based VEASE approach is more effective than conventional embedding at the word level. Moreover, visual embedding at the articulation place level, leveraging upon a high correlation between place of articulation and lip shapes, shows an even better performance than that at the phone level. Finally the proposed MEASE framework, incorporating both audio and visual embedding, yields significantly better speech quality and intelligibility than those obtained with the best visual-only and audio-only EASE systems.

preprint2020arXiv

Search for a generic heavy Higgs at the LHC

A generic heavy Higgs has both dim-4 and effective dim-6 interactions with the Standard Model (SM) particles. The former has been the focus of LHC searches in all major Higgs production channels, just as the SM one, but with negative results so far. If the heavy Higgs is connected with Beyond Standard Model (BSM) physics at a few TeV scale, its dim-6 operators will play a very important role - they significantly enhance the Higgs momentum, and reduce the SM background in a special phase space corner to a level such that a heavy Higgs emerges, which is not possible with dim-4 operators only. We focus on the associated VH production channel, where the effect of dim-6 operators is the largest and the SM background is the lowest. Main search regions for this type of signal are identified, and substructure variables of boosted jets are employed to enhance the signal from backgrounds. The parameter space of these operators are scanned over, and expected exclusion regions with 300 fb$^{-1}$ and 3 ab$^{-1}$ LHC data are shown, if no BSM is present. The strategy given in this paper will shed light on a heavy Higgs which may be otherwise hiding in the present and future LHC data.

preprint2019arXiv

Investigation of spin orbit torque driven dynamics in ferromagnetic heterostructures

We use time-resolved (TR) measurements based on the polar magneto-optical Kerr effect (MOKE) to study the magnetization dynamics excited by spin orbit torques in Py (Permalloy)/Pt and Ta/CoFeB bilayers. The analysis reveals that the field-like (FL) spin orbit torque (SOT) dominates the amplitude of the first oscillation cycle of the magnetization precession and the damping-like (DL) torque determines the final steady-state magnetization. In our bilayer samples, we have extracted the effective fields, hFL and hDL, of the two SOTs from the time-resolved magnetization oscillation spectrum. The extracted values are in good agreement with those extracted from time-integrated DCMOKE measurements, suggesting that the SOTs do not change at high frequencies. We also find that the amplitude ratio of the first oscillation to steady state is linearly proportional to the ratio hFL/hDL. The first oscillation amplitude is inversely proportional to, whereas the steady state value is independent of, the applied external field along the current direction.

preprint2018arXiv

Rigidity of minimal submanifolds in space forms

In this paper, we consider the rigidity for an $n(\geq 4)$-dimensional submanfolds $M^n$ with parallel mean curvature in the space form ${\mathbb M}^{n+p}_c$ when the integral Ricci curvature of $M$ has some bound. We prove that, if $c+H^2>0$ and $\|\mathrm{Ric}_{-}^λ\|_{n/2}< ε(n,c, λ, H)$ for $λ$ satisfying $ \frac{n-2}{n-1} (c+H^2) < λ\le c+H^2$, then $M$ is the totally umbilical sphere $\mathbb{S}^n(\tfrac{1}{\sqrt{c+H^2}})$. Here $H$ is the norm of the parallel mean curvature of $M$, and $ε(n,c,λ, H)$ is a positive constant depending only on $n, c,λ$ and $H$. This extends some of the earlier work of [15] from pointwise Ricci curvature lower bound to inetgral Ricci curvature lower bound.