Source author record

Zhen Huang

Zhen Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Topics: cond-mat.mes-hall, Computation and Language, cond-mat.str-el, Machine Learning, cond-mat.mtrl-sci, physics.app-ph, Artificial Intelligence, Computer Vision, eess.AS, Networking and Internet Architecture, Neural and Evolutionary Computing, physics.class-ph, quant-ph, Sound

Catalog footprint

What is connected

20 works

14 topics

4 close collaborators


preprint, 2026, arXiv

CTTA-T: Continual Test-Time Adaptation for Text Understanding via Teacher-Student with a Domain-aware and Generalized Teacher

Text understanding often suffers from domain shifts. To handle testing domains, domain adaptation (DA) is trained to adapt to a fixed and observed testing domain; a more challenging paradigm, test-time adaptation (TTA), cannot access the testing domain during training and adapts online to the testing samples during testing, where the samples are from a fixed domain. We aim to explore a more practical and underexplored scenario, continual test-time adaptation (CTTA) for text understanding, which involves a sequence of unobserved testing domains. Current CTTA methods struggle to reduce error accumulation over domains and to enhance generalization to unobserved domains: 1) noise filtering reduces accumulated errors but discards useful information, and 2) accumulating historical domains enhances generalization, but adaptive accumulation is hard to achieve. In this paper, we propose CTTA-T (continual test-time adaptation for text understanding), a framework adaptable to evolving target domains: it adopts a teacher-student framework, where the teacher is domain-aware and generalized for evolving domains. To improve teacher predictions, we propose a refine-then-filter strategy based on dropout-driven consistency, which calibrates predictions and removes unreliable guidance. For the adaptation-generalization trade-off, we construct a domain-aware teacher by dynamically accumulating cross-domain semantics via incremental PCA, which continuously tracks domain shifts. Experiments show that CTTA-T outperforms the baselines.

preprint, 2026, arXiv

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

General reasoning represents a long-standing and formidable challenge in artificial intelligence. Recent breakthroughs, exemplified by large language models (LLMs) and chain-of-thought prompting, have achieved considerable success on foundational reasoning tasks. However, this success is heavily contingent upon extensive human-annotated demonstrations, and models' capabilities are still insufficient for more complex problems. Here we show that the reasoning abilities of LLMs can be incentivized through pure reinforcement learning (RL), obviating the need for human-labeled reasoning trajectories. The proposed RL framework facilitates the emergent development of advanced reasoning patterns, such as self-reflection, verification, and dynamic strategy adaptation. Consequently, the trained model achieves superior performance on verifiable tasks such as mathematics, coding competitions, and STEM fields, surpassing its counterparts trained via conventional supervised learning on human demonstrations. Moreover, the emergent reasoning patterns exhibited by these large-scale models can be systematically harnessed to guide and enhance the reasoning capabilities of smaller models.

preprint, 2024, arXiv

Realizing topological edge states in graphene-like elastic metamaterials

The study of topological states in electronic structures, which allows robust transport properties against impurities and defects, has been recently extended to the realm of elasticity. This work shows that nontrivial topological flexural edge states located on the free boundary of the elastic graphene-like metamaterial can be realized without breaking the time reversal, mirror, or inversion symmetry of the system. Numerical calculations and experimental studies demonstrate the robust transport of flexural waves along the boundaries of the designed structure. The topological edge states on the free boundary are not limited by the size of the finite structure, which can reduce the scale of the topological state system. In addition, unlike the edge states localized on the free boundary in graphene where the group velocity is zero, the edge states on the elastic metamaterial plate have propagation states with non-zero group velocity. There is a frequency range for the edge states, and we introduce the concept of Shannon entropy for elastic waves and use it to assess the frequency range of the edge states in graphene-like elastic metamaterials. This work represents a relevant advance in the study of elastic wave topological states, providing a theoretical basis for engineering applications such as vibration reduction and vibration isolation of mechanical structures.

preprint, 2024, arXiv

Topological transmission in Suzuki phase sonic crystals

This work reports extraordinary topological properties of sound transmission through topological states in sonic crystals known as the Suzuki phase, which consists of a rectangular lattice of vacancies created in a triangular lattice. These low-symmetry crystals exhibit unique properties due to the embedded lattice of vacancies. A generalized folding method explains the band structure and the quasi-type-II Dirac point in the Suzuki phase, which is related to the underlying triangular lattice. In analogy to the acoustic valley Hall effect, the Suzuki phase contains three types of topological edge states on the four possible interfaces separating two Suzuki phase crystals with distinct topological phases. The edge states have defined symmetries with inherent directionality, which affect the topological sound transmission and are different from chirality, valley vorticity or helicity. In particular, the existence of topological deaf bands is reported here. The propagation of topological eigenmodes on the same interface also differs, which is quantified using the acoustic Shannon entropy, making the topological transport dependent on the frequency of the edge states. Based on the abundant topological edge states of Suzuki phase crystals, a multifunctional device with acoustic diodes, multi-channel transmission, and selective acoustic transmission can be designed. Numerical simulations and measurements demonstrate the topological transmission. Our work extends the research platform of acoustic topological states to lattices with low symmetry, which opens new avenues for enriching topological states with broad engineering applications.

preprint, 2023, arXiv

UniFed: All-In-One Federated Learning Platform to Unify Open-Source Frameworks

Federated Learning (FL) has become a practical and widely adopted distributed learning paradigm. However, the lack of a comprehensive and standardized solution covering diverse use cases makes it challenging to use in practice. In addition, selecting an appropriate FL framework for a specific use case can be a daunting task. In this work, we present UniFed, the first unified platform for standardizing existing open-source FL frameworks. The platform streamlines the end-to-end workflow for distributed experimentation and deployment, encompassing 11 popular open-source FL frameworks. In particular, to address the substantial variations in workflows and data formats, UniFed introduces a configuration-based schema-enforced task specification, offering 20 editable fields. UniFed also provides functionalities such as distributed execution management, logging, and data analysis. With UniFed, we evaluate and compare 11 popular FL frameworks from the perspectives of functionality, privacy protection, and performance, through conducting developer surveys and code-level investigation. We collect 15 diverse FL scenario setups (e.g., horizontal and vertical settings) for FL framework evaluation. This comprehensive evaluation allows us to analyze both model and system performance, providing detailed comparisons and offering recommendations for framework selection. UniFed simplifies the process of selecting and utilizing the appropriate FL framework for specific use cases, while enabling standardized distributed experimentation and deployment. Our results and analysis based on experiments with up to 178 distributed nodes provide valuable system design and deployment insights, aiming to empower practitioners in their pursuit of effective FL solutions.

preprint, 2022, arXiv

Cloth-Changing Person Re-identification from A Single Image with Gait Prediction and Regularization

Cloth-changing person re-identification (CC-ReID) aims at matching the same person across different locations over a long duration, e.g., over days, and therefore inevitably meets the challenge of changing clothing. In this paper, we focus on handling the CC-ReID problem under a more challenging setting, i.e., from just a single image, which enables high-efficiency and latency-free pedestrian identification for real-time surveillance applications. Specifically, we introduce gait recognition as an auxiliary task to drive the image ReID model to learn cloth-agnostic representations by leveraging personally unique and cloth-independent gait information; we name this framework GI-ReID. GI-ReID adopts a two-stream architecture that consists of an image ReID-Stream and an auxiliary gait recognition stream (Gait-Stream). The Gait-Stream, which is discarded at inference for high computational efficiency, acts as a regulator to encourage the ReID-Stream to capture cloth-invariant biometric motion features during training. To get temporally continuous motion cues from a single image, we design a Gait Sequence Prediction (GSP) module for the Gait-Stream to enrich gait information. Finally, high-level semantic consistency over the two streams is enforced for effective knowledge regularization. Experiments on multiple image-based cloth-changing ReID benchmarks, e.g., LTCC, PRCC, Real28, and VC-Clothes, demonstrate that GI-ReID performs favorably against the state of the art. Code is available at https://github.com/jinx-USTC/GI-ReID.

preprint, 2022, arXiv

IMCI: Integrate Multi-view Contextual Information for Fact Extraction and Verification

With the rapid development of automatic fake news detection technology, fact extraction and verification (FEVER) has been attracting more attention. The task aims to extract the most relevant evidence from millions of open-domain Wikipedia documents and then verify the credibility of the corresponding claims. Although several strong models have been proposed for the task and have made great progress, we argue that they fail to utilize multi-view contextual information and thus cannot obtain better performance. In this paper, we propose to integrate multi-view contextual information (IMCI) for fact extraction and verification. For each evidence sentence, we define two kinds of context, i.e., intra-document context and inter-document context. Intra-document context consists of the document title and all the other sentences from the same document. Inter-document context consists of all other evidence sentences, which may come from different documents. We then integrate the multi-view contextual information to encode the evidence sentences. Our experimental results on the FEVER 1.0 shared task show that our IMCI framework makes great progress on both fact extraction and verification, and achieves state-of-the-art performance with a winning FEVER score of 72.97% and label accuracy of 75.84% on the online blind test set. We also conduct an ablation study to assess the impact of multi-view contextual information. Our code will be released at https://github.com/phoenixsecularbird/IMCI.

preprint, 2022, arXiv

SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples

Unsupervised sentence embedding aims to obtain the most appropriate embedding for a sentence to reflect its semantics. Contrastive learning has been attracting growing attention. For a sentence, current models utilize diverse data augmentation methods to generate positive samples, while treating other independent sentences as negative samples. They then adopt the InfoNCE loss to pull the embeddings of positive pairs together and push those of negative pairs apart. Although these models have made great progress on sentence embedding, we argue that they may suffer from feature suppression. The models fail to distinguish and decouple textual similarity and semantic similarity, and they may overestimate the semantic similarity of any pair with similar text regardless of the actual semantic difference between them. This is because positive pairs in unsupervised contrastive learning come with similar or even identical text through data augmentation. To alleviate feature suppression, we propose contrastive learning for unsupervised sentence embedding with soft negative samples (SNCSE). Soft negative samples share highly similar text with the original samples but have clearly different semantics. Specifically, we take the negation of original sentences as soft negative samples, and propose Bidirectional Margin Loss (BML) to introduce them into the traditional contrastive learning framework, which involves only positive and negative samples. Our experimental results show that SNCSE obtains state-of-the-art performance on the semantic textual similarity (STS) task, with an average Spearman's correlation coefficient of 78.97% on BERT-base and 79.23% on RoBERTa-base. In addition, we adopt a rank-based error analysis method to detect the weaknesses of SNCSE for future study.
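
The InfoNCE objective mentioned in this abstract can be illustrated with a minimal sketch. It assumes cosine similarity and a temperature of 0.05, neither of which is stated in the abstract, and the function names `cosine` and `info_nce` are hypothetical. The paper's Bidirectional Margin Loss is not reproduced here, since the abstract does not give its form.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(anchor, positive, negatives, temperature=0.05):
    """InfoNCE loss for one anchor: -log softmax score of the positive pair.

    The temperature value is an assumption chosen for illustration.
    """
    logits = [cosine(anchor, positive) / temperature]
    logits += [cosine(anchor, neg) / temperature for neg in negatives]
    # Log-sum-exp with the maximum subtracted for numerical stability.
    top = max(logits)
    log_denominator = top + math.log(sum(math.exp(x - top) for x in logits))
    return log_denominator - logits[0]


# Toy 2-D embeddings: the loss is small when the positive is close to the
# anchor and large when a negative is closer than the positive.
anchor = [1.0, 0.0]
loss_good = info_nce(anchor, [0.9, 0.1], [[0.0, 1.0], [-1.0, 0.0]])
loss_bad = info_nce(anchor, [0.0, 1.0], [[0.9, 0.1], [-1.0, 0.0]])
```

Minimizing this loss raises the similarity of the positive pair relative to every negative, which is the "pull together, push apart" behavior the abstract describes.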

preprint, 2020, arXiv

Characteristic Lengths of Interlayer Charge-Transfer in Correlated Oxide Heterostructures

Using interlayer interaction to control functional heterostructures with atomic-scale designs has become one of the most effective interface-engineering strategies. Here, we demonstrate the effect of a crystalline LaFeO3 buffer layer on amorphous and crystalline LaAlO3/SrTiO3 heterostructures. The LaFeO3 buffer layer acts as an energetically favored electron acceptor in both LaAlO3/SrTiO3 systems, resulting in modulation of the interfacial carrier density and hence a metal-to-insulator transition. For amorphous and crystalline LaAlO3/SrTiO3 heterostructures, the metal-to-insulator transition is found when the LaFeO3 layer thickness crosses 3 and 6 unit cells, respectively. Such different critical LaFeO3 thicknesses are explained in terms of distinct characteristic lengths of the redox-reaction-mediated and polar-catastrophe-dominated charge transfer, controlled by the interfacial atomic contact and the Thomas-Fermi screening effect, respectively. Our results not only shed light on the complex interlayer charge transfer across oxide heterostructures but also provide a new route to precisely tailor the charge-transfer process at a functional interface.

preprint, 2020, arXiv

SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition

Very deep CNNs achieve state-of-the-art results in both computer vision and speech recognition, but are difficult to train. The most popular way to train very deep CNNs is to use shortcut connections (SC) together with batch normalization (BN). Inspired by Self-Normalizing Neural Networks, we propose the self-normalizing deep CNN (SNDCNN) based acoustic model topology, by removing the SC/BN and replacing the typical ReLU activations with scaled exponential linear units (SELU) in ResNet-50. SELU activations make the network self-normalizing and remove the need for both shortcut connections and batch normalization. Compared to ResNet-50, we can achieve the same or lower (up to 4.5% relative) word error rate (WER) while boosting both training and inference speed by 60%-80%. We also explore other model inference optimization schemes to further reduce latency for production use.
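
The self-normalizing property of SELU referred to in this abstract can be checked with a short sketch. The two constants are the standard values from the Self-Normalizing Neural Networks paper; the helper `selu_moments`, the sample count, and the seed are assumptions made for illustration and are not part of SNDCNN.

```python
import math
import random

# Standard SELU constants (Self-Normalizing Neural Networks, Klambauer et al.).
SELU_ALPHA = 1.6732632423543772
SELU_SCALE = 1.0507009873554805


def selu(x):
    """Scaled exponential linear unit applied to one pre-activation value."""
    if x > 0:
        return SELU_SCALE * x
    return SELU_SCALE * SELU_ALPHA * (math.exp(x) - 1.0)


def selu_moments(n_samples=100_000, seed=0):
    """Mean and variance of SELU outputs for standard-normal inputs.

    Hypothetical helper: it samples N(0, 1) inputs and measures the output
    statistics, which should stay close to zero mean and unit variance.
    """
    rng = random.Random(seed)
    outputs = [selu(rng.gauss(0.0, 1.0)) for _ in range(n_samples)]
    mean = sum(outputs) / n_samples
    variance = sum((y - mean) ** 2 for y in outputs) / n_samples
    return mean, variance


mean, variance = selu_moments()
```

Because zero-mean, unit-variance inputs map to outputs with approximately the same statistics, activations stay in a stable range across layers, which is the reason the abstract gives for dropping batch normalization and shortcut connections.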

preprint, 2019, arXiv

Aperiodic quantum oscillations in the two-dimensional electron gas at the LaAlO3/SrTiO3 interface

Despite several attempts, the intimate electronic structure of two-dimensional electron systems buried at the interface between LaAlO3 and SrTiO3 still remains to be experimentally revealed. Here, we investigate the transport properties of a high-mobility quasi-two-dimensional electron gas at this interface under high magnetic field (55 T) and provide new insights for electronic band structure by analyzing the Shubnikov-de Haas oscillations. Interestingly, the quantum oscillations are not 1/B-periodic and produce a highly non-linear Landau plot (Landau level index versus 1/B). Among possible scenarios, the Roth-Gao-Niu equation provides a natural explanation for 1/B-aperiodic oscillations in relation with the magnetic response functions of the system. Overall, the magneto-transport data are discussed in light of high-resolution scanning transmission electron microscopy analysis of the interface as well as calculations from density functional theory.

preprint, 2016, arXiv

Evolution and control of the phase competition morphology in a manganite film

The competition among different phases in perovskite manganites is pronounced since their energies are very close under the interplay of charge, spin, orbital and lattice degrees of freedom. To reveal the roles of underlying interactions, many efforts have been devoted towards directly imaging phase transitions at microscopic scales. Here we show images of the charge-ordered insulator (COI) phase transition from a pure ferromagnetic metal with reducing field or increasing temperature in a strained phase-separated manganite film, using a home-built magnetic force microscope. Compared with the COI melting transition, this reverse transition is sharp, cooperative and martensitic-like with astonishingly unique yet diverse morphologies. The COI domains show variable-dimensional growth at different temperatures and their distribution can illustrate the delicate balance of the underlying interactions in manganites. Our findings also display how phase domain engineering is possible and how the phase competition can be tuned in a controllable manner.

preprint, 2016, arXiv

High field magneto-transport in two-dimensional electron gas LaAlO3/SrTiO3

Transport properties of the complex oxide LaAlO3/SrTiO3 interface are investigated under high magnetic field (55 T). By rotating the sample with respect to the magnetic field, the two-dimensional nature of charge transport is clearly established. Small oscillations of the magnetoresistance with altered periodicity are observed when plotted versus inverse magnetic field. We attribute this effect to Rashba spin-orbit coupling, which remains consistent with the large negative magnetoresistance observed when the field is parallel to the sample plane. A large inconsistency between the carrier density extracted from Shubnikov-de Haas analysis and from the Hall effect is explained by the contribution to transport of at least two bands with different mobility.

preprint, 2016, arXiv

Liquid-Gated High Mobility and Quantum Oscillation of the Two-Dimensional Electron Gas at an Oxide Interface

The electric field effect in the electronic double layer transistor (EDLT) configuration, with ionic liquids as the dielectric materials, is a powerful means of exploring various properties in different materials. Here we demonstrate the modulation of electrical transport properties and extremely high mobility of the two-dimensional electron gas at the LaAlO$_3$/SrTiO$_3$ (LAO/STO) interface through the ionic liquid-assisted electric field effect. By changing the gate voltage, the depletion of charge carriers and the resultant enhancement of electron mobility up to 19380 cm$^2$/Vs are realized, leading to quantum oscillations of the conductivity at the LAO/STO interface. The present results suggest that high-mobility oxide interfaces which exhibit quantum phenomena could be obtained by the ionic liquid-assisted field effect.

preprint, 2015, arXiv

Feature Selection with Redundancy-complementariness Dispersion

Feature selection has attracted significant attention in data mining and machine learning in the past decades. Many existing feature selection methods eliminate redundancy by measuring pairwise inter-correlation of features, whereas the complementariness of features and higher inter-correlation among more than two features are ignored. In this study, a modification item concerning the complementariness of features is introduced into the evaluation criterion of features. Additionally, in order to identify the interference effect of already-selected False Positives (FPs), the redundancy-complementariness dispersion is also taken into account to adjust the measurement of pairwise inter-correlation of features. To illustrate the effectiveness of the proposed method, classification experiments are conducted with four frequently used classifiers on ten datasets. Classification results verify the superiority of the proposed method compared with five representative feature selection methods.

preprint, 2015, arXiv

Maximum a Posteriori Adaptation of Network Parameters in Deep Models

We present a Bayesian approach to adapting parameters of a well-trained context-dependent, deep-neural-network, hidden Markov model (CD-DNN-HMM) to improve automatic speech recognition performance. Given an abundance of DNN parameters but only a limited amount of data, the effectiveness of the adapted DNN model can often be compromised. We formulate maximum a posteriori (MAP) adaptation of parameters of a specially designed CD-DNN-HMM with augmented linear hidden networks connected to the output tied states, or senones, and compare it to the previously proposed feature space MAP linear regression. Experimental evidence on the 20,000-word open vocabulary Wall Street Journal task demonstrates the feasibility of the proposed framework. In supervised adaptation, the proposed MAP adaptation approach provides more than 10% relative error reduction and consistently outperforms the conventional transformation based methods. Furthermore, we present an initial attempt to generate hierarchical priors to improve adaptation efficiency and effectiveness with limited adaptation data by exploiting similarities among senones.

preprint, 2013, arXiv

Effect of growth oxygen pressure on anisotropic-strain-induced phase separation in epitaxial La$_{0.67}$Ca$_{0.33}$MnO$_{3}$/NdGaO$_{3}$(001) films

The effect of deposition oxygen pressure (P$_{O}$) on phase separation (PS) induced in epitaxial La$_{0.67}$Ca$_{0.33}$MnO$_{3}$/NdGaO$_{3}$(001) films was investigated. Fully oxygenated films grown at high P$_{O}$ are anisotropically strained. They exhibit PS over a wide temperature range, because of the large orthorhombicity of NdGaO$_{3}$ substrates. The paramagnetic insulator-to-ferromagnetic metal (FM) and FM-to-antiferromagnetic insulator (AFI) transitions gradually shift to lower temperatures with decreasing P$_{O}$. The AFI state is initially weakened (P$_{O}$ >= 30 Pa), but then becomes more robust against the magnetic field (P$_{O}$ < 30 Pa). The out-of-plane film lattice parameter increases with decreasing P$_{O}$. For films grown at P$_{O}$ >= 30 Pa, the slight oxygen deficiency may enlarge the lattice unit cell, reduce the anisotropic strain and suppress the AFI state. Films deposited at P$_{O}$ < 30 Pa instead experience an average compressive strain. The enhanced compressive strain and structural defects in the films may lead to the robust AFI state. These results aid our understanding of PS in manganite films.

preprint, 2011, arXiv

Data-centric Misbehavior Detection in VANETs

Detecting misbehavior (such as transmissions of false information) in vehicular ad hoc networks (VANETs) is a very important problem with a wide range of implications, including safety-related and congestion-avoidance applications. We discuss several limitations of existing misbehavior detection schemes (MDS) designed for VANETs. Most MDS are concerned with detection of malicious nodes. In most situations, vehicles would send wrong information because of selfish reasons of their owners, e.g., for gaining access to a particular lane. Because of this rational behavior, it is more important to detect false information than to identify misbehaving nodes. We introduce the concept of data-centric misbehavior detection and propose algorithms which detect false alert messages and misbehaving nodes by observing their actions after sending out the alert messages. With the data-centric MDS, each node can independently decide whether received information is correct or false. The decision is based on the consistency of recent messages and new alerts with reported and estimated vehicle positions. No voting or majority decision is needed, making our MDS resilient to Sybil attacks. Instead of revoking all the secret credentials of misbehaving nodes, as done in most schemes, we impose fines on misbehaving nodes (administered by the certification authority), discouraging them from acting selfishly. This reduces the computation and communication costs involved in revoking all the secret credentials of misbehaving nodes.

preprint, 2009, arXiv

Simulated Quantum Computation of Global Minima

Finding the optimal solution to a complex optimization problem is of great importance in practically all fields of science, technology, technical design and econometrics. We demonstrate that a modified Grover's quantum algorithm can be applied to real problems of finding a global minimum using modest numbers of quantum bits. Calculations of the global minimum of simple test functions and Lennard-Jones clusters have been carried out on a quantum computer simulator using a modified Grover's algorithm. The number of function evaluations is reduced from $O(N)$ in classical simulation to $O(\sqrt{N})$ in quantum simulation. We also show how Grover's quantum algorithm can be combined with the classical Pivot method for global optimization to treat larger systems.
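
The square-root scaling quoted in this abstract can be seen in a small classical state-vector simulation of standard Grover search. This is a sketch of the textbook algorithm for a single marked item, not the modified algorithm or the global-minimum procedure of the paper; the function name `grover_search` and the problem size are assumptions for illustration.

```python
import math


def grover_search(n_items, marked, iterations=None):
    """Simulate textbook Grover search over n_items with one marked index.

    Returns the final measurement probabilities and the number of oracle
    calls used. By default the iteration count is floor(pi/4 * sqrt(N)).
    """
    if iterations is None:
        iterations = math.floor(math.pi / 4.0 * math.sqrt(n_items))
    # Start in the uniform superposition over all items.
    amplitudes = [1.0 / math.sqrt(n_items)] * n_items
    for _ in range(iterations):
        # Oracle: flip the sign of the marked item's amplitude.
        amplitudes[marked] = -amplitudes[marked]
        # Diffusion: reflect every amplitude about the mean amplitude.
        mean = sum(amplitudes) / n_items
        amplitudes = [2.0 * mean - a for a in amplitudes]
    probabilities = [a * a for a in amplitudes]
    return probabilities, iterations


# With 64 items, a handful of oracle calls concentrates almost all of the
# probability on the marked index; a classical scan needs up to 64 checks.
probabilities, oracle_calls = grover_search(n_items=64, marked=42)
```

The iteration count grows as the square root of the search-space size, which is the source of the $O(\sqrt{N})$ figure for function evaluations.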

preprint, 2008, arXiv

Nuclear-induced time evolution of entanglement of two-electron spins in anisotropically coupled quantum dot

We study the time evolution of entanglement of two spins in an anisotropically coupled quantum dot interacting with an unpolarized nuclear spin environment. We assume that the exchange coupling strength in the z-direction $J_z$ is different from the lateral one $J_l$. We observe that the entanglement decays as a result of the coupling to the nuclear environment and reaches a saturation value, which depends on the value of the exchange interaction difference $J=|J_l-J_z|$ between the two spins and the strength of the applied external magnetic field. We find that the entanglement exhibits a critical behavior controlled by the competition between the exchange interaction $J$ and the external magnetic field. The entanglement shows a quasi-symmetric behavior above and below a critical value of the exchange interaction. It becomes more symmetric as the external magnetic field increases. The entanglement reaches a large saturation value, close to unity, when the exchange interaction is far above or below its critical value, and a small one as it closely approaches the critical value. Furthermore, we find that the decay rate profile of entanglement is linear when the exchange interaction is much higher or lower than the critical value but converts to a power law and finally to a Gaussian as the critical value is approached from both directions. The dynamics of entanglement is found to be independent of the exchange interaction for an isotropically coupled quantum dot.
