Researcher profile

Ao Sun

Ao Sun contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

CuMA: Aligning LLMs with Sparse Cultural Values via Demographic-Aware Mixture of Adapters

As Large Language Models (LLMs) serve a global audience, alignment must transition from enforcing universal consensus to respecting cultural pluralism. We demonstrate that dense models, when forced to fit conflicting value distributions, suffer from \textbf{Mean Collapse}, converging to a generic average that fails to represent diverse groups. We attribute this to \textbf{Cultural Sparsity}, where gradient interference prevents dense parameters from spanning distinct cultural modes. To resolve this, we propose \textbf{\textsc{CuMA}} (\textbf{Cu}ltural \textbf{M}ixture of \textbf{A}dapters), a framework that frames alignment as a \textbf{conditional capacity separation} problem. By incorporating demographic-aware routing, \textsc{CuMA} internalizes a \textit{Latent Cultural Topology} to explicitly disentangle conflicting gradients into specialized expert subspaces. Extensive evaluations on WorldValuesBench, Community Alignment, and PRISM demonstrate that \textsc{CuMA} achieves state-of-the-art performance, significantly outperforming both dense baselines and semantic-only MoEs. Crucially, our analysis confirms that \textsc{CuMA} effectively mitigates mean collapse, preserving cultural diversity. Our code is available at https://github.com/Throll/CuMA.

preprint2022arXiv

A Generalized Knockoff Procedure for FDR Control in Structural Change Detection

Controlling false discovery rate (FDR) is crucial for variable selection, multiple testing, among other signal detection problems. In literature, there is certainly no shortage of FDR control strategies when selecting individual features, but the relevant works for structural change detection, such as profile analysis for piecewise constant coefficients and integration analysis with multiple data sources, are limited. In this paper, we propose a generalized knockoff procedure (GKnockoff) for FDR control under such problem settings. We prove that the GKnockoff possesses pairwise exchangeability, and is capable of controlling the exact FDR under finite sample sizes. We further explore GKnockoff under high dimensionality, by first introducing a new screening method to filter the high-dimensional potential structural changes. We adopt a data splitting technique to first reduce the dimensionality via screening and then conduct GKnockoff on the refined selection set. Furthermore, the powers of proposed methods are systematically studied. Numerical comparisons with other methods show the superior performance of GKnockoff, in terms of both FDR control and power. We also implement the proposed methods to analyze a macroeconomic dataset for detecting changes of driven effects of economic development on the secondary industry.

preprint2022arXiv

Backtesting Trading Strategies with GAN To Avoid Overfitting

Many works have shown the overfitting hazard of selecting a trading strategy based only on good IS (in sample) performance. But most of them have merely shown such phenomena exist without offering ways to avoid them. We propose an approach to avoid overfitting: A good (meaning non-overfitting) trading strategy should still work well on paths generated in accordance with the distribution of the historical data. We use GAN with LSTM to learn or fit the distribution of the historical time series . Then trading strategies are backtested by the paths generated by GAN to avoid overfitting.(This paper is an tanslated English version of a thesis (10.6342/NTU201801645) which was originally written in Chinese in 2018, where some statements and claims are outdated in 2022)

preprint2021arXiv

On the Entropy of Parabolic Allen-Cahn Equation

We define a (mean curvature flow) entropy for Radon measures in $\mathbb{R}^n$ or in a compact manifold. Moreover, we prove a monotonicity formula of the entropy of the measures associated with the parabolic Allen-Cahn equations. If the ambient manifold is a compact manifold with non-negative sectional curvature and parallel Ricci curvature, this is a consequence of a new monotonicity formula for the parabolic Allen-Cahn equation. As an application, we show that when the entropy of the initial data is small enough (less than twice of the energy of the one-dimensional standing wave), the limit measure of the parabolic Allen-Cahn equation has unit density for all future time.

preprint2020arXiv

Alignment Strength and Correlation for Graphs

When two graphs have a correlated Bernoulli distribution, we prove that the alignment strength of their natural bijection strongly converges to a novel measure of graph correlation $ρ_T$ that neatly combines intergraph with intragraph distribution parameters. Within broad families of the random graph parameter settings, we illustrate that exact graph matching runtime and also matchability are both functions of $ρ_T$, with thresholding behavior starkly illustrated in matchability.

preprint2020arXiv

Entropy in A Closed Manifold and Partial Regularity of Mean Curvature Flow Limit of Surfaces

Inspired by the idea of Colding-Minicozzi in [CM1], we define (mean curvature flow) entropy for submanifolds in a general ambient Riemannian manifold. In particular, this entropy is equivalent to area growth of a closed submanifold in a closed ambient manifold with non-negative Ricci curvature. Moreover, this entropy is monotone along the mean curvature flow in a closed Riemannian manifold with non-negative sectional curvatures and parallel Ricci curvature. As an application, we show the partial regularity of the limit of mean curvature flow of surfaces in a three dimensional Riemannian manifold with non-negative sectional curvatures and parallel Ricci curvature.

preprint2018arXiv

Min-max minimal disks with free boundary in Riemannian manifolds

In this paper, we establish a min-max theory for constructing minimal disks with free boundary in any closed Riemannian manifold. The main result is an effective version of the partial Morse theory for minimal disks with free boundary established by Fraser. Our theory also includes as a special case the min-max theory for Plateau problem of minimal disks, which can be used to generalize the famous work by Morse-Thompkins and Shiffman on minimal surfaces in $\mathbf{R}^n$ to the Riemannian setting. More precisely, we generalize the min-max construction of minimal surfaces using harmonic replacement introduced by Colding and Minicozzi to the free boundary setting. As a key ingredient to this construction, we show an energy convexity for weakly harmonic maps with mixed Dirichlet and free boundaries from the half unit $2$-disk in $\mathbf{R}^2$ into any closed Riemannian manifold, which in particular yields the uniqueness of such weakly harmonic maps. This is a free boundary analogue of the energy convexity and uniqueness for weakly harmonic maps with Dirichlet boundary on the unit $2$-disk proved by Colding and Minicozzi.