Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

Learning Rate Engineering: From Coarse Single Parameter to Layered Evolution

Learning rate scheduling has evolved from the single global fixed rate of early SGD to sophisticated layer-wise adaptive strategies. We systematize this evolution into five generations: (Gen1) global fixed learning rates, (Gen2) global scheduling, (Gen3) parameter-level adaptation, (Gen4) layer-level differentiation, and (Gen5) joint layer-time scheduling. We trace the fundamental motivation behind each transition, showing how the shift from one-size-fits-all to tailoring by layer and time addresses the impossible trinity of transfer learning: lower layers require small updates to preserve general knowledge while higher layers need large updates to adapt to new tasks. Building on this taxonomy, we propose Discriminative Adaptive Layer Scaling (DALS), a unified framework that integrates phase-adaptive cosine scheduling, depth-aware Grokfast gradient filtering, and LARS-style trust ratios into a single coherent optimizer. We benchmark 18 strategies including three DALS variants across all five generations on five datasets: synthetic, CIFAR-10 (from scratch), RTE, TREC-6, and IMDb (fine-tuning). On synthetic, DALS achieves the best accuracy at 98.0%, while DALS-Fast reaches 90% in just 3 epochs. The cross-dataset analysis reveals striking regime-dependent patterns -- no single strategy wins across all regimes. Critically, STLR+Discriminative, the ULMFiT champion, catastrophically fails on from-scratch tasks (43.6% on TREC-6 from scratch vs. 96.8% with RAdam), confirming that directional decay biases are harmful without pretrained features. DALS avoids either extreme, achieving the best synthetic result while maintaining competitive fine-tuning performance.

preprint2025arXiv

Companion Agents: A Table-Information Mining Paradigm for Text-to-SQL

Large-scale Text-to-SQL benchmarks such as BIRD typically assume complete and accurate database annotations as well as readily available external knowledge, which fails to reflect common industrial settings where annotations are missing, incomplete, or erroneous. This mismatch substantially limits the real-world applicability of state-of-the-art (SOTA) Text-to-SQL systems. To bridge this gap, we explore a database-centric approach that leverages intrinsic, fine-grained information residing in relational databases to construct missing evidence and improve Text-to-SQL accuracy under annotation-scarce conditions. Our key hypothesis is that when a query requires multi-step reasoning over extensive table information, existing methods often struggle to reliably identify and utilize the truly relevant knowledge. We therefore propose to "cache" query-relevant knowledge on the database side in advance, so that it can be selectively activated at inference time. Based on this idea, we introduce Companion Agents (CA), a new Text-to-SQL paradigm that incorporates a group of agents accompanying database schemas to proactively mine and consolidate hidden inter-table relations, value-domain distributions, statistical regularities, and latent semantic cues before query generation. Experiments on BIRD under the fully missing evidence setting show that CA recovers +4.49 / +4.37 / +14.13 execution accuracy points on RSL-SQL / CHESS / DAIL-SQL, respectively, with larger gains on the Challenging subset +9.65 / +7.58 / +16.71. These improvements stem from CA's automatic database-side mining and evidence construction, suggesting a practical path toward industrial-grade Text-to-SQL deployment without reliance on human-curated evidence.

preprint2022arXiv

1T-FeS$_2$$:$ a new type of two-dimensional metallic ferromagnet

Discovery of intrinsic two-dimensional (2D) magnetic materials is crucial for understanding the fundamentals of 2D magnetism and realizing next-generation magnetoelectronic and magneto-optical devices. Although significant efforts have been devoted to identifying 2D magnetism by exfoliating bulk magnetic layered materials, seldom studies are performed to synthesize ultra-thin magnetic materials directly for non-layered magnetic materials. Here, we report the successful synthesis of a new type of theoretically proposed 2D metallic ferromagnet 1T FeS2, through the molten-salt-assisted chemical vapor deposition (CVD) method. The long-range 2D ferromagnetic order is confirmed by the observation of a large anomalous Hall effect (AHE) and a hysteretic magnetoresistance. The experimentally detected out-of-plane ferromagnetic ordering is theoretically suported with Stoner criterion. Our findings open up new possibilities to search novel 2D ferromagnets in non-layered compounds and render opportunities for realizing realistic ultra-thin spintronic devices.

preprint2022arXiv

EEG-based Cross-Subject Driver Drowsiness Recognition with an Interpretable Convolutional Neural Network

In the context of electroencephalogram (EEG)-based driver drowsiness recognition, it is still challenging to design a calibration-free system, since EEG signals vary significantly among different subjects and recording sessions. Many efforts have been made to use deep learning methods for mental state recognition from EEG signals. However, existing work mostly treats deep learning models as black-box classifiers, while what have been learned by the models and to which extent they are affected by the noise in EEG data are still underexplored. In this paper, we develop a novel convolutional neural network combined with an interpretation technique that allows sample-wise analysis of important features for classification. The network has a compact structure and takes advantage of separable convolutions to process the EEG signals in a spatial-temporal sequence. Results show that the model achieves an average accuracy of 78.35% on 11 subjects for leave-one-out cross-subject drowsiness recognition, which is higher than the conventional baseline methods of 53.40%-72.68% and state-of-the-art deep learning methods of 71.75%-75.19%. Interpretation results indicate the model has learned to recognize biologically meaningful features from EEG signals, e.g., Alpha spindles, as strong indicators of drowsiness across different subjects. In addition, we also explore reasons behind some wrongly classified samples with the interpretation technique and discuss potential ways to improve the recognition accuracy. Our work illustrates a promising direction on using interpretable deep learning models to discover meaningful patterns related to different mental states from complex EEG signals.

preprint2021arXiv

Detecting Confined and Deconfined Spinons in Dynamical Quantum Simulations

Dynamical spin-structure factor (DSF) contains fingerprint information of collective excitations in interacting quantum spin systems. In solid state experiments, DSF can be measured through neutron scatterings. However, it is in general challenging to compute the spectral properties accurately via many-body simulations. Currently, quantum simulation and computation constitute a thriving research field, which are believed to provide a very promising platform for simulating quantum many-body systems. In this work, we establish a link between the many-body dynamics and quantum simulations by studying the non-equilibrium DSF (nDSF) measured on direct product states, which are accessible in contemporary quantum simulators with Rydberg atoms, superconducting qubits, etc. Based on the many-body calculations of transverse field Ising chains, we find the nDSF can be used to sensitively probe the multi-spinon continua associated with the two-spinon creation and the spinon-antispinon process, etc. Moreover, we further demonstrate that the low-energy spinons can be confined -- forming spinon bound states -- under a finite longitudinal field. Our results pave the way of quantum simulation and manipulation of fractional excitations in highly-entangled quantum many-body systems.

preprint2021arXiv

Reconsidering the design of planar plasmonic lasers: gain, gap layers, and mode competition

Because surface plasmons can be confined below the diffraction limit, metallic lasers that support plasmonic modes can provide miniaturized sources of electromagnetic waves. Such devices often exploit a multilayer design, in which a semiconductor gain layer is placed near a metallic interface with a gap layer in between. However, despite many experimental demonstrations, key considerations for these planar metallic lasers remain understudied, leading to incorrect conclusions about the optimal design. Here, we pursue a detailed experimental and theoretical study of planar metallic lasers to explore the effect of design parameters on the lasing behavior. We print semiconductor nanoplatelets as a gain layer of controllable thickness onto alumina-coated silver films with integrated planar Fabry-Pérot cavities. Lasing behavior is then monitored with spectrally and polarization-resolved far-field imaging. The results are compared with a theoretical waveguide model and a detailed rate-equation model, which consider both plasmonic and photonic modes. We show that the nature of the lasing mode is dictated by the gain-layer thickness. Moreover, by explicitly treating gain in our waveguide model, we find that, contrary to conventional wisdom, a gap layer with high refractive index is advantageous for plasmonic lasing. Additionally, our rate-equation model reveals a regime where plasmonic and photonic modes compete within the same device, raising the possibility of facile, active mode switching. These findings provide guidance for future designs of metallic lasers and could lead to on-chip lasers with controlled photonic and plasmonic output, switchable at high speeds.

preprint2020arXiv

Diagonal entropy in many-body systems: Volume effect and quantum phase transitions

We investigate the diagonal entropy(DE) of the ground state for quantum many-body systems, including the XY model and the Ising model with next nearest neighbour interactions. We focus on the DE of a subsystem of L continuous spins. We show that the DE in many-body systems, regardless of integrability, can be represented as a volume term plus a logarithmic correction and a constant offset. Quantum phase transition points can be explicitly identified by the three coefficients thereof. Besides, by combining entanglement entropy and the relative entropy of quantum coherence, as two celebrated representatives of quantumness, we simply obtain the DE, which naturally has the potential to reveal the information of quantumness. More importantly, the DE is concerning only the diagonal form of the ground state reduced density matrix, making it feasible to measure in real experiments, and therefore it has immediate applications in demonstrating quantum supremacy on state-of-the-art quantum simulators.

preprint2020arXiv

Quantum Many-Body Simulations of the 2D Fermi-Hubbard Model in Ultracold Optical Lattices

Understanding quantum many-body states of correlated electrons is one main theme in modern condensed matter physics. Given that the Fermi-Hubbard model, the prototype of correlated electrons, has been recently realized in ultracold optical lattices, it is highly desirable to have controlled numerical methodology to provide precise finite-temperature results upon doping, to directly compare with experiments. Here, we demonstrate the exponential tensor renormalization group (XTRG) algorithm [Phys. Rev. X 8, 031082 (2018)], complemented with independent determinant quantum Monte Carlo (DQMC) offer a powerful combination of tools for this purpose. XTRG provides full and accurate access to the density matrix and thus various spin and charge correlations, down to unprecedented low temperature of few percents of the fermion tunneling energy scale. We observe excellent agreement with ultracold fermion measurements at both half-filling and finite-doping, including the sign-reversal behavior in spin correlations due to formation of magnetic polarons, and the attractive hole-doublon and repulsive hole-hole pairs that are responsible for the peculiar bunching and antibunching behavior of the antimoments.

preprint2019arXiv

Phase evolution and superconductivity enhancement in Se-substituted MoTe$_2$ thin films

The strong spin$-$orbit coupling (SOC) and numerous crystal phases in few$-$layer transition metal dichalcogenides (TMDCs) MX$_2$ (M$=$W, Mo, and X$=$Te, Se, S) has led to a variety of novel physics, such as Ising superconductivity and quantum spin Hall effect realized in monolayer 2H$-$ and Td$-$MX$_2$, respectively. Consecutive tailoring of the MX$_2$ structure from 2H to Td phase may realize the long$-$sought topological superconductivity in one material system by incorporating superconductivity and quantum spin Hall effect together. In this work, by combing Raman spectrum, X-ray photoelectron spectrum (XPS), scanning transmission electron microscopy imaging (STEM) as well as electrical transport measurements, we demonstrate that a consecutively structural phase transitions from Td to 1T$'$ to 2H polytype can be realized as the Se-substitution concentration increases. More importantly, the Se$-$substitution has been found to notably enhance the superconductivity of the MoTe$_2$ thin film, which is interpreted as the introduction of the two$-$band superconductivity. The chemical constituent induced phase transition offers a new strategy to study the s$_{+-}$ superconductivity and the possible topological superconductivity as well as to develop phase$-$sensitive devices based on MX$_2$ materials.