Researcher profile

Chunxiao Liu

Chunxiao Liu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
19works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

19 published item(s)

preprint2026arXiv

Stream-T1: Test-Time Scaling for Streaming Video Generation

While Test-Time Scaling (TTS) offers a promising direction to enhance video generation without the surging costs of training, current test-time video generation methods based on diffusion models suffer from exorbitant candidate exploration costs and lack temporal guidance. To address these structural bottlenecks, we propose shifting the focus to streaming video generation. We identify that its chunk-level synthesis and few denoising steps are intrinsically suited for TTS, significantly lowering computational overhead while enabling fine-grained temporal control. Driven by this insight, we introduced Stream-T1, a pioneering comprehensive TTS framework exclusively tailored for streaming video generation. Specifically, Stream-T1 is composed of three units: (1) Stream -Scaled Noise Propagation, which actively refines the initial latent noise of the generating chunk using historically proven, high-quality previous chunk noise, effectively establishes temporal dependency and utilizing the historical Gaussian prior to guide the current generation; (2) Stream -Scaled Reward Pruning, which comprehensively evaluates generated candidates to strike an optimal balance between local spatial aesthetics and global temporal coherence by integrating immediate short-term assessments with sliding-window-based long-term evaluations; (3) Stream-Scaled Memory Sinking, which dynamically routes the context evicted from KV-cache into distinct updating pathways guided by the reward feedback, ensuring that previously generated visual information effectively anchors and guides the subsequent video stream. Evaluated on both 5s and 30s comprehensive video benchmarks, Stream-T1 demonstrates profound superiority, significantly improving temporal consistency, motion smoothness, and frame-level visual quality.

preprint2026arXiv

STT-Arena: A More Realistic Environment for Tool-Using with Spatio-Temporal Dynamics

Large language models (LLMs) deployed in real-world agentic applications must be capable of replanning and adapting when mid-task disruptions invalidate their prior decisions. Existing dynamic benchmarks primarily measure whether LLMs can detect temporal changes in a timely manner, leaving the complementary challenge of adaptive replanning under spatio-temporal dynamics largely unexplored. We introduce STT-Arena (Spatio-Temporal Tool-Use Arena), a benchmark of 227 high-quality interactive tasks spanning nine spatio-temporal conflict types and four solvability levels. Each task is grounded in a realistic, executable environment equipped with injected spatio-temporal triggers that can abruptly invalidate an ongoing plan, forcing the model to detect the state shift and construct a revised execution strategy. Extensive evaluation of frontier LLMs reveals that even the SOTA proprietary models, including Claude-4.6-Opus, achieves less than 40\% overall accuracies, highlighting the fundamental difficulty of spatio-temporal dynamic reasoning. Systematic analysis of failure trajectories uncovers three recurring error modes of existing models: Stale-State Execution, Misdiagnosis of Dynamic Triggers, and Missing Post-Adaptation Verification. Guided by these findings, we propose an iterative trajectory refinement technique that eliminates these failure patterns from training data, and combine it with online RL to produce STT-Agent-4B which outperforms frontier LLMs on STT-Arena.

preprint2026arXiv

Topological BF Theory construction of twisted dihedral quantum double phases from spontaneous symmetry breaking

Nonabelian topological orders host exotic anyons central to quantum computing, yet established realizations rely on case-by-case constructions that are often conceptually involved. In this work, we present a systematic construction of nonabelian dihedral quantum double phases based on a continuous $O(2)$ gauge field. We first formulate a topological $S[O(2)\times O(2)]$ BF theory, and by identifying the Wilson loops and twist operators of this theory with anyons, we show that our topological BF theory reproduces the complete anyon data, and can incorporate all Dijkgraaf--Witten twists. Building on this correspondence, we present a microscopic model with $O(2)$ lattice gauge field coupled to Ising and rotor matter whose Higgsing yields the desired dihedral quantum double phase. A perturbative renormalization group analysis further suggests a direct transition from this phase to a $U(1)$ Coulomb or chiral topological phase at a stable multicritical point with emergent $O(3)$ symmetry. Our proposal offers an alternative route to nonabelian topological order with promising prospects in synthetic gauge field platforms.

preprint2022arXiv

Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization

Self-Driven Particles (SDP) describe a category of multi-agent systems common in everyday life, such as flocking birds and traffic flows. In a SDP system, each agent pursues its own goal and constantly changes its cooperative or competitive behaviors with its nearby agents. Manually designing the controllers for such SDP system is time-consuming, while the resulting emergent behaviors are often not realistic nor generalizable. Thus the realistic simulation of SDP systems remains challenging. Reinforcement learning provides an appealing alternative for automating the development of the controller for SDP. However, previous multi-agent reinforcement learning (MARL) methods define the agents to be teammates or enemies before hand, which fail to capture the essence of SDP where the role of each agent varies to be cooperative or competitive even within one episode. To simulate SDP with MARL, a key challenge is to coordinate agents' behaviors while still maximizing individual objectives. Taking traffic simulation as the testing bed, in this work we develop a novel MARL method called Coordinated Policy Optimization (CoPO), which incorporates social psychology principle to learn neural controller for SDP. Experiments show that the proposed method can achieve superior performance compared to MARL baselines in various metrics. Noticeably the trained vehicles exhibit complex and diverse social behaviors that improve performance and safety of the population as a whole. Demo video and source code are available at: https://decisionforce.github.io/CoPO/

preprint2022arXiv

Structure of domain walls in chiral spin liquids

The chiral spin liquid is one of the canonical examples of a topological state of quantum spins coexisting with symmetry-breaking chiral order; its experimental realization has been actively discussed in the past few years. Here, motivated by the interplay between topology and symmetry breaking, we examine the physics of the interface between two chiral spin liquid domains with opposite chiralities. We show that a self-consistent mean-field description for the spinons exists that describes both the change of chirality at the domain wall and the gapless edge modes living on it. A Ginzburg--Landau theory for the domain wall is formulated based on the mean-field picture, from which we obtain the non-universal properties of the domain wall such as the wall width and tension. We show that the velocity of the topologically protected domain wall edge states can be accessed through the Jackiw-Rebbi mechanism. We further argue that the gapless modes at the edge contribute an extra, non-analytic $|ϕ^3|$ term to the domain wall theory, and find numerical evidence for this non-analyticity.

preprint2022arXiv

Universal Entanglement Transitions of Free Fermions with Long-range Non-unitary Dynamics

Non-unitary evolution can give rise to novel steady states classified by their entanglement properties. In this work, we aim to understand its interplay with long-range hopping that decays with $r^{-α}$ in free-fermion systems. We first study two solvable Brownian models with long-range non-unitary dynamics: a large-$N$ SYK$_2$ chain and a single-flavor fermion chain and we show that they share the same phase diagram. When $α>0.5$, we observe two critical phases with subvolume entanglement scaling: (i) $α>1.5$, a logarithmic phase with dynamical exponent $z=1$ and logarithmic subsystem entanglement, and (ii) $0.5<α<1.5$, a fractal phase with $z=\frac{2α-1}{2}$ and subsystem entanglement $S_A\propto L_A^{1-z}$, where $L_A$ is the length of the subsystem $A$. These two phases cannot be distinguished by the purification dynamics, in which the entropy always decays as $L/T$. We then confirm that the results are also valid for the static SYK$_2$ chain, indicating the phase diagram is universal for general free-fermion systems. We also discuss phase diagrams in higher dimensions and the implication in measurement-induced phase transitions.

preprint2021arXiv

Network Pruning via Resource Reallocation

Channel pruning is broadly recognized as an effective approach to obtain a small compact model through eliminating unimportant channels from a large cumbersome network. Contemporary methods typically perform iterative pruning procedure from the original over-parameterized model, which is both tedious and expensive especially when the pruning is aggressive. In this paper, we propose a simple yet effective channel pruning technique, termed network Pruning via rEsource rEalLocation (PEEL), to quickly produce a desired slim model with negligible cost. Specifically, PEEL first constructs a predefined backbone and then conducts resource reallocation on it to shift parameters from less informative layers to more important layers in one round, thus amplifying the positive effect of these informative layers. To demonstrate the effectiveness of PEEL , we perform extensive experiments on ImageNet with ResNet-18, ResNet-50, MobileNetV2, MobileNetV3-small and EfficientNet-B0. Experimental results show that structures uncovered by PEEL exhibit competitive performance with state-of-the-art pruning algorithms under various pruning settings. Our code is available at https://github.com/cardwing/Codes-for-PEEL.

preprint2021arXiv

Non-unitary dynamics of Sachdev-Ye-Kitaev chain

We construct a series of one-dimensional non-unitary dynamics consisting of both unitary and imaginary evolutions based on the Sachdev-Ye-Kitaev model. Starting from a short-range entangled state, we analyze the entanglement dynamics using the path integral formalism in the large $N$ limit. Among all the results that we obtain, two of them are particularly interesting: (1) By varying the strength of the imaginary evolution, the interacting model exhibits a first order phase transition from the highly entangled volume law phase to an area law phase; (2) The one-dimensional free fermion model displays an extensive critical regime with emergent two-dimensional conformal symmetry.

preprint2021arXiv

Understanding the wiring evolution in differentiable neural architecture search

Controversy exists on whether differentiable neural architecture search methods discover wiring topology effectively. To understand how wiring topology evolves, we study the underlying mechanism of several existing differentiable NAS frameworks. Our investigation is motivated by three observed searching patterns of differentiable NAS: 1) they search by growing instead of pruning; 2) wider networks are more preferred than deeper ones; 3) no edges are selected in bi-level optimization. To anatomize these phenomena, we propose a unified view on searching algorithms of existing frameworks, transferring the global optimization to local cost minimization. Based on this reformulation, we conduct empirical and theoretical analyses, revealing implicit inductive biases in the cost&#39;s assignment mechanism and evolution dynamics that cause the observed phenomena. These biases indicate strong discrimination towards certain topologies. To this end, we pose questions that future differentiable methods for neural wiring discovery need to confront, hoping to evoke a discussion and rethinking on how much bias has been enforced implicitly in existing NAS methods.

preprint2020arXiv

Channel-wise Alignment for Adaptive Object Detection

Generic object detection has been immensely promoted by the development of deep convolutional neural networks in the past decade. However, in the domain shift circumstance, the changes in weather, illumination, etc., often cause domain gap, and thus performance drops substantially when detecting objects from one domain to another. Existing methods on this task usually draw attention on the high-level alignment based on the whole image or object of interest, which naturally, cannot fully utilize the fine-grained channel information. In this paper, we realize adaptation from a thoroughly different perspective, i.e., channel-wise alignment. Motivated by the finding that each channel focuses on a specific pattern (e.g., on special semantic regions, such as car), we aim to align the distribution of source and target domain on the channel level, which is finer for integration between discrepant domains. Our method mainly consists of self channel-wise and cross channel-wise alignment. These two parts explore the inner-relation and cross-relation of attention regions implicitly from the view of channels. Further more, we also propose a RPN domain classifier module to obtain a domain-invariant RPN network. Extensive experiments show that the proposed method performs notably better than existing methods with about 5% improvement under various domain-shift settings. Experiments on different task (e.g. instance segmentation) also demonstrate its good scalability.

preprint2020arXiv

DSNAS: Direct Neural Architecture Search without Parameter Retraining

If NAS methods are solutions, what is the problem? Most existing NAS methods require two-stage parameter optimization. However, performance of the same architecture in the two stages correlates poorly. In this work, we propose a new problem definition for NAS, task-specific end-to-end, based on this observation. We argue that given a computer vision task for which a NAS method is expected, this definition can reduce the vaguely-defined NAS evaluation to i) accuracy of this task and ii) the total computation consumed to finally obtain a model with satisfying accuracy. Seeing that most existing methods do not solve this problem directly, we propose DSNAS, an efficient differentiable NAS framework that simultaneously optimizes architecture and parameters with a low-biased Monte Carlo estimate. Child networks derived from DSNAS can be deployed directly without parameter retraining. Comparing with two-stage methods, DSNAS successfully discovers networks with comparable accuracy (74.4%) on ImageNet in 420 GPU hours, reducing the total time by more than 34%. Our implementation is available at https://github.com/SNAS-Series/SNAS-Series.

preprint2020arXiv

Frustrated Heisenberg $J_1-J_2$ model within the stretched diamond lattice of LiYbO2

We investigate the magnetic properties of LiYbO$_2$, containing a three-dimensionally frustrated, diamond-like lattice via neutron scattering, magnetization, and heat capacity measurements. The stretched diamond network of Yb$^{3+}$ ions in LiYbO$_2$ enters a long-range incommensurate, helical state with an ordering wave vector ${\bf{k}} = (0.384, \pm 0.384, 0)$ that &#34;locks-in&#34; to a commensurate ${\bf{k}} = (1/3, \pm 1/3, 0)$ phase under the application of a magnetic field. The spiral magnetic ground state of LiYbO$_2$ can be understood in the framework of a Heisenberg $J_1-J_2$ Hamiltonian on a stretched diamond lattice, where the propagation vector of the spiral is uniquely determined by the ratio of $J_2/|J_1|$. The pure Heisenberg model, however, fails to account for the relative phasing between the Yb moments on the two sites of the bipartite lattice, and this detail as well as the presence of an intermediate, partially disordered, magnetic state below 1 K suggests interactions beyond the classical Heisenberg description of this material.

preprint2020arXiv

Graph Structured Network for Image-Text Matching

Image-text matching has received growing interest since it bridges vision and language. The key challenge lies in how to learn correspondence between image and text. Existing works learn coarse correspondence based on object co-occurrence statistics, while failing to learn fine-grained phrase correspondence. In this paper, we present a novel Graph Structured Matching Network (GSMN) to learn fine-grained correspondence. The GSMN explicitly models object, relation and attribute as a structured phrase, which not only allows to learn correspondence of object, relation and attribute separately, but also benefits to learn fine-grained correspondence of structured phrase. This is achieved by node-level matching and structure-level matching. The node-level matching associates each node with its relevant nodes from another modality, where the node can be object, relation or attribute. The associated nodes then jointly infer fine-grained correspondence by fusing neighborhood associations at structure-level matching. Comprehensive experiments show that GSMN outperforms state-of-the-art methods on benchmarks, with relative Recall@1 improvements of nearly 7% and 2% on Flickr30K and MSCOCO, respectively. Code will be released at: https://github.com/CrossmodalGroup/GSMN.

preprint2020arXiv

Inter-Region Affinity Distillation for Road Marking Segmentation

We study the problem of distilling knowledge from a large deep teacher network to a much smaller student network for the task of road marking segmentation. In this work, we explore a novel knowledge distillation (KD) approach that can transfer &#39;knowledge&#39; on scene structure more effectively from a teacher to a student model. Our method is known as Inter-Region Affinity KD (IntRA-KD). It decomposes a given road scene image into different regions and represents each region as a node in a graph. An inter-region affinity graph is then formed by establishing pairwise relationships between nodes based on their similarity in feature distribution. To learn structural knowledge from the teacher network, the student is required to match the graph generated by the teacher. The proposed method shows promising results on three large-scale road marking segmentation benchmarks, i.e., ApolloScape, CULane and LLAMAS, by taking various lightweight models as students and ResNet-101 as the teacher. IntRA-KD consistently brings higher performance gains on all lightweight models, compared to previous distillation methods. Our code is available at https://github.com/cardwing/Codes-for-IntRA-KD.

preprint2020arXiv

SNAS: Stochastic Neural Architecture Search

We propose Stochastic Neural Architecture Search (SNAS), an economical end-to-end solution to Neural Architecture Search (NAS) that trains neural operation parameters and architecture distribution parameters in same round of back-propagation, while maintaining the completeness and differentiability of the NAS pipeline. In this work, NAS is reformulated as an optimization problem on parameters of a joint distribution for the search space in a cell. To leverage the gradient information in generic differentiable loss for architecture search, a novel search gradient is proposed. We prove that this search gradient optimizes the same objective as reinforcement-learning-based NAS, but assigns credits to structural decisions more efficiently. This credit assignment is further augmented with locally decomposable reward to enforce a resource-efficient constraint. In experiments on CIFAR-10, SNAS takes less epochs to find a cell architecture with state-of-the-art accuracy than non-differentiable evolution-based and reinforcement-learning-based NAS, which is also transferable to ImageNet. It is also shown that child networks of SNAS can maintain the validation accuracy in searching, with which attention-based NAS requires parameter retraining to compete, exhibiting potentials to stride towards efficient NAS on big datasets. We have released our implementation at https://github.com/SNAS-Series/SNAS-Series.

preprint2020arXiv

Spin excitations in the frustrated triangular lattice antiferromagnet NaYbO$_2$

Here we present a neutron scattering-based study of magnetic excitations and magnetic order in NaYbO$_2$ under the application of an external magnetic field. The crystal electric field-split $J = 7/2$ multiplet structure is determined, revealing a mixed $|m_z>$ ground state doublet and is consistent with a recent report Ding et al. [1]. Our measurements further suggest signatures of exchange effects in the crystal field spectrum, manifested by a small splitting in energy of the transition into the first excited doublet. The field-dependence of the low-energy magnetic excitations across the transition from the quantum disordered ground state into the fluctuation-driven ordered regime is analyzed. Signs of a first-order phase transition into a noncollinear ordered state are revealed at the upper-field phase boundary of the ordered regime, and higher order magnon scattering, suggestive of strong magnon-magnon interactions, is resolved within the previously reported $up-up-down$ phase. Our results reveal a complex phase diagram of field-induced order and spin excitations within NaYbO$_2$ and demonstrate the dominant role of quantum fluctuations cross a broad range of fields within its interlayer frustrated triangular lattice.

preprint2020arXiv

Subsystem Rényi Entropy of Thermal Ensembles for SYK-like models

The Sachdev-Ye-Kitaev model is an $N$-modes fermionic model with infinite range random interactions. In this work, we study the thermal Rényi entropy for a subsystem of the SYK model using the path-integral formalism in the large-$N$ limit. The results are consistent with exact diagonalization [1] and can be well approximated by thermal entropy with an effective temperature [2] when subsystem size $M\leq N/2$. We also consider generalizations of the SYK model with quadratic random hopping term or $U(1)$ charge conservation.

preprint2020arXiv

TSIT: A Simple and Versatile Framework for Image-to-Image Translation

We introduce a simple and versatile framework for image-to-image translation. We unearth the importance of normalization layers, and provide a carefully designed two-stream generative model with newly proposed feature transformations in a coarse-to-fine fashion. This allows multi-scale semantic structure information and style representation to be effectively captured and fused by the network, permitting our method to scale to various tasks in both unsupervised and supervised settings. No additional constraints (e.g., cycle consistency) are needed, contributing to a very clean and simple method. Multi-modal image synthesis with arbitrary style control is made possible. A systematic study compares the proposed method with several state-of-the-art task-specific baselines, verifying its effectiveness in both perceptual quality and quantitative evaluations.

preprint2019arXiv

Topological thermal Hall effect of &#34;magnetic monopoles&#34; in pyrochlore U(1) spin liquid

&#34;Magnetic monopole&#34; is an exotic quantum excitation in pyrochlore U(1) spin liquid, and its emergence is purely of quantum origin and has no classical analogue. We predict topological thermal Hall effect (TTHE) of &#34;magnetic monopoles&#34; and present this prediction through non-Kramers doublets. We observe that, when the external magnetic field polarizes the Ising component of the local moment, internally this corresponds to the induction of emergent dual U(1) gauge flux for the &#34;magnetic monopoles&#34;. The motion of &#34;magnetic monopoles&#34; is then twisted by the induced dual gauge flux. This emergent Lorentz force on &#34;magnetic monopoles&#34; is the fundamental origin of TTHE. Therefore, TTHE would be a direct evidence of the &#34;monopole&#34;-gauge coupling and the emergent U(1) gauge structure in pyrochlore U(1) spin liquid. Our result does not depend strongly on our choice of non-Kramers doublets for our presentation, and can be well extended to Kramers doublets. Our prediction can be readily tested among the pyrochlore spin liquid candidate materials. We give a detailed discussion about the expectation for different pyrochlore magnets.