Source author record

Wen Wen

Wen Wen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Topics: cond-mat.mtrl-sci, Machine Learning, Computer Vision

Catalog footprint

What is connected

5 works

3 topics

4 close collaborators

Preprint, 2026, arXiv

Beyond Sharpness: A Flatness Decomposition Framework for Efficient Continual Learning

Continual Learning (CL) aims to enable models to sequentially learn multiple tasks without forgetting previous knowledge. Recent studies have shown that optimizing towards flatter loss minima can improve model generalization. However, existing sharpness-aware methods for CL suffer from two key limitations: (1) they treat sharpness regularization as a unified signal without distinguishing the contributions of its components, and (2) they introduce substantial computational overhead that impedes practical deployment. To address these challenges, we propose FLAD, a novel optimization framework that decomposes sharpness-aware perturbations into gradient-aligned and stochastic-noise components, and show that retaining only the noise component promotes generalization. We further introduce a lightweight scheduling scheme that enables FLAD to maintain significant performance gains even under constrained training time. FLAD can be seamlessly integrated into various CL paradigms and consistently outperforms standard and sharpness-aware optimizers in diverse experimental settings, demonstrating its effectiveness and practicality in CL.
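The decomposition mentioned in this abstract can be pictured with a small projection example. The sketch below is an illustration under stated assumptions, not FLAD's published algorithm: it assumes the sharpness-aware perturbation is the normalized mini-batch gradient scaled by a radius, that "gradient-aligned" means the projection onto a reference gradient (for example a full-batch or running-average gradient), and that the orthogonal remainder plays the role of the stochastic-noise component. The names `decompose_perturbation`, `batch_grad`, `ref_grad` and `rho` are hypothetical.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors given as lists."""
    return sum(a * b for a, b in zip(u, v))


def decompose_perturbation(batch_grad, ref_grad, rho=0.05):
    """Split a SAM-style perturbation into aligned and residual parts.

    Assumptions (not taken from the paper): the perturbation is
    rho * batch_grad / ||batch_grad||, the aligned part is its projection
    onto ref_grad, and the residual is treated as the noise component.
    """
    norm = math.sqrt(dot(batch_grad, batch_grad))
    if norm == 0.0:
        zeros = [0.0] * len(batch_grad)
        return zeros, list(zeros)

    eps = [rho * g / norm for g in batch_grad]

    ref_sq = dot(ref_grad, ref_grad)
    if ref_sq == 0.0:
        # No reference direction: everything counts as residual.
        return [0.0] * len(eps), eps

    coef = dot(eps, ref_grad) / ref_sq
    aligned = [coef * r for r in ref_grad]
    noise = [e - a for e, a in zip(eps, aligned)]
    return aligned, noise


aligned, noise = decompose_perturbation([3.0, 4.0], [1.0, 0.0], rho=1.0)
```

By construction the two parts sum back to the original perturbation and the residual is orthogonal to the reference gradient; whether FLAD defines its components this way is not stated in the abstract.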

Preprint, 2026, arXiv

Information-Theoretic Generalization Bounds of Replay-based Continual Learning

Continual learning (CL) has emerged as a dominant paradigm for acquiring knowledge from sequential tasks while avoiding catastrophic forgetting. Although many CL methods have been proposed to show impressive empirical performance, the theoretical understanding of their generalization behavior remains limited, particularly for replay-based approaches. This paper establishes a unified theoretical framework for replay-based CL, deriving a series of information-theoretic generalization bounds that explicitly elucidate the impact of the memory buffer alongside the current task on generalization performance. Specifically, our hypothesis-based bounds capture the trade-off between the number of selected exemplars and the information dependency between the hypothesis and the memory buffer. Our prediction-based bounds yield tighter and computationally tractable upper bounds on the generalization error by leveraging low-dimensional variables. The theoretical analysis is general and broadly applicable to a wide range of learning algorithms, exemplified by stochastic gradient Langevin dynamics (SGLD) as a representative method. Comprehensive experimental evaluations demonstrate the effectiveness of our derived bounds in capturing the generalization dynamics in replay-based CL settings.
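For readers unfamiliar with SGLD in a replay setting, the following toy sketch may help. It uses one common parameterization of the SGLD update (a gradient step plus Gaussian noise with standard deviation sqrt(2 * lr / beta)) and a hypothetical quadratic loss averaged over current-task samples and replayed exemplars. The function names, the loss, and the way the buffer is mixed in are assumptions for illustration and are not taken from the paper.

```python
import math
import random


def replay_grad(theta, current_batch, memory_buffer):
    """Gradient of the mean of 0.5 * ||theta - x||^2 over both sample sets.

    Toy loss chosen for illustration: the gradient is theta - mean(x).
    """
    samples = list(current_batch) + list(memory_buffer)
    dim = len(theta)
    mean = [sum(x[i] for x in samples) / len(samples) for i in range(dim)]
    return [t - m for t, m in zip(theta, mean)]


def sgld_step(theta, grad, lr, beta, rng):
    """One SGLD update: gradient descent plus isotropic Gaussian noise."""
    noise_std = math.sqrt(2.0 * lr / beta)
    return [t - lr * g + noise_std * rng.gauss(0.0, 1.0)
            for t, g in zip(theta, grad)]


rng = random.Random(0)
theta = [0.0]
current_batch = [[2.0]]   # samples from the current task
memory_buffer = [[0.0]]   # replayed exemplars from earlier tasks
for _ in range(200):
    g = replay_grad(theta, current_batch, memory_buffer)
    theta = sgld_step(theta, g, lr=0.1, beta=1e12, rng=rng)
```

With a very large `beta` the injected noise is negligible and the iterate settles near the mean of the combined samples; smaller `beta` values inject more noise, which is the regime where SGLD differs from plain gradient descent.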

Preprint, 2026, arXiv

Prompt-Anchored Vision-Text Distillation for Lifelong Person Re-identification

Lifelong person re-identification (LReID) aims to train a generalizable model with sequentially collected data. However, such models often suffer from semantic drift, limited adaptability, and catastrophic forgetting as new domains emerge. Existing exemplar-free approaches largely rely on visual-only distillation or parameter regularization, while overlooking the potential of auxiliary modalities, such as text, to preserve semantic stability and enable incremental plasticity. We observe that the frozen text encoder in pretrained vision-language models can serve as a stable semantic anchor across domains. To decouple the roles of vision and text, we propose Prompt-Anchored vision-text Distillation (PAD), an asymmetric vision-text framework for semantic alignment and cross-domain generalization. On the textual side, we distill prompts to preserve vision-text alignment under a fixed semantic space, acting as a global semantic reference rather than a dominant learning signal. On the visual side, an EMA-based teacher with an adaptive prompt pool enables domain-wise adaptation by allocating new slots while freezing past ones. Extensive experiments show that PAD substantially outperforms state-of-the-art methods across seen and unseen domains, achieving a strong balance between stability and plasticity. Project page is available at https://github.com/zu-zi/PAD.

Preprint, 2023, arXiv

Structure Prediction of Epitaxial Organic Interfaces with Ogre, Demonstrated for TCNQ on TTF

Highly ordered epitaxial interfaces between organic semiconductors are considered a promising avenue for enhancing the performance of organic electronic devices including solar cells, light emitting diodes, and transistors, thanks to their well-controlled, uniform electronic properties and high carrier mobilities. Although the phenomenon of organic epitaxy has been known for decades, computational methods for structure prediction of epitaxial organic interfaces have lagged far behind the existing methods for their inorganic counterparts. We present a method for structure prediction of epitaxial organic interfaces based on lattice matching followed by surface matching, implemented in the open-source Python package Ogre. The lattice matching step produces domain-matched interfaces, where commensurability is achieved with different integer multiples of the substrate and film unit cells. In the surface matching step, Bayesian optimization (BO) is used to find the interfacial distance and registry between the substrate and film. The BO objective function is based on dispersion-corrected deep neural network interatomic potentials, shown to be in excellent agreement with density functional theory (DFT). The application of Ogre is demonstrated for an epitaxial interface of 7,7,8,8-tetracyanoquinodimethane (TCNQ) on tetrathiafulvalene (TTF), whose electronic structure has been probed by ultraviolet photoemission spectroscopy (UPS), but whose structure had been hitherto unknown [Organic Electronics 48, 371 (2017)]. We find that TCNQ(001) on top of TTF(100) is the most stable interface configuration, closely followed by TCNQ(010) on top of TTF(100). The density of states, calculated using DFT, is in excellent agreement with UPS, including the presence of an interface charge transfer state.
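The idea of domain matching with integer multiples of the two unit cells can be shown with a deliberately simplified one-dimensional search. This is not Ogre's API and not its algorithm: real lattice matching operates on two-dimensional surface cells (vectors, angles and areas), whereas the hypothetical helper `match_lattice_lengths` below only compares multiples of two lattice lengths against a relative mismatch tolerance.

```python
def match_lattice_lengths(a_sub, a_film, max_mult=10, tol=0.02):
    """Find integer multiples (n, m) with n * a_sub close to m * a_film.

    Simplified 1-D stand-in for lattice matching. Returns tuples
    (n_substrate, m_film, relative_mismatch), best matches first.
    """
    matches = []
    for n in range(1, max_mult + 1):
        for m in range(1, max_mult + 1):
            mismatch = abs(m * a_film - n * a_sub) / (n * a_sub)
            if mismatch <= tol:
                matches.append((n, m, mismatch))
    # Prefer low mismatch, then the smallest supercell.
    matches.sort(key=lambda t: (t[2], t[0] * t[1]))
    return matches


# Illustrative lattice lengths only; not values for TCNQ or TTF.
candidates = match_lattice_lengths(a_sub=4.0, a_film=6.0)
best_n, best_m, best_mismatch = candidates[0]
```

In this toy case three substrate cells match two film cells exactly. A full workflow would then pass each candidate supercell to a surface matching step, such as the Bayesian optimization over interfacial distance and registry that the abstract describes.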

Preprint, 2013, arXiv

Periodic elastic nanodomains in ultrathin tetragonal-like BiFeO3 films

We present a synchrotron grazing incidence x-ray diffraction analysis of the domain structure and polar symmetry of highly strained BiFeO3 (BFO) thin films grown on a LaAlO3 substrate. We revealed the existence of periodic elastic nanodomains in the pure tetragonal-like BFO ultrathin films down to a thickness of 6 nm. A unique shear strain accommodation mechanism is disclosed. We further demonstrated that the periodicity of the nanodomains increases with film thickness but deviates from the classical Kittel's square root law in the ultrathin thickness regime (6-30 nm). Temperature-dependent experiments also reveal the disappearance of periodic modulation above 90 °C due to an MC-MA structural phase transition.
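Kittel's law states that the domain period scales with the square root of the film thickness, so a deviation from it can be checked by fitting a power-law exponent and comparing it with 0.5. The sketch below is a generic log-log least-squares fit under that reading; `fit_power_law` is a hypothetical helper, and the numbers are synthetic values generated from an exact square-root law, not data from the paper.

```python
import math


def fit_power_law(thicknesses, periods):
    """Least-squares fit of period = prefactor * thickness ** exponent.

    The fit is done on log-transformed values; Kittel's law corresponds
    to an exponent of 0.5.
    """
    xs = [math.log(t) for t in thicknesses]
    ys = [math.log(p) for p in periods]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    exponent = sxy / sxx
    prefactor = math.exp(mean_y - exponent * mean_x)
    return prefactor, exponent


# Synthetic thicknesses (nm) and periods following period = 2 * sqrt(d).
thicknesses = [6.0, 12.0, 24.0]
periods = [2.0 * math.sqrt(d) for d in thicknesses]
prefactor, exponent = fit_power_law(thicknesses, periods)
```

On measured data, a fitted exponent that differs clearly from 0.5 over a thickness range would indicate the kind of departure from square-root scaling that the abstract reports.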
