Researcher profile

Zixuan Yang

Zixuan Yang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Beyond Monolithic Architectures: A Multi-Agent Search and Knowledge Optimization Framework for Agentic Search

Agentic search has emerged as a promising paradigm for complex information seeking by enabling Large Language Models (LLMs) to interleave reasoning with tool use. However, prevailing systems rely on monolithic agents that suffer from structural bottlenecks, including unconstrained reasoning outputs that inflate trajectories, sparse outcome-level rewards that complicate credit assignment, and stochastic search noise that destabilizes learning. To address these challenges, we propose \textbf{M-ASK} (Multi-Agent Search and Knowledge), a framework that explicitly decouples agentic search into two complementary roles: Search Behavior Agents, which plan and execute search actions, and Knowledge Management Agents, which aggregate, filter, and maintain a compact internal context. This decomposition allows each agent to focus on a well-defined subtask and reduces interference between search and context construction. Furthermore, to enable stable coordination, M-ASK employs turn-level rewards to provide granular supervision for both search decisions and knowledge updates. Experiments on multi-hop QA benchmarks demonstrate that M-ASK outperforms strong baselines, achieving not only superior answer accuracy but also significantly more stable training dynamics.\footnote{The source code for M-ASK is available at https://github.com/chenyiqun/M-ASK.}

preprint2026arXiv

Simply Stabilizing the Loop via Fully Looped Transformer

Scaling model performance typically requires increasing model size. Looped Transformer offers a compelling alternative by iteratively reusing the same Transformer blocks, trading additional computation for improved performance without increasing parameter count or context length. Because the number of loop iterations can be adjusted at inference, it also provides a natural mechanism for balancing performance and test-time compute. However, Looped Transformer still suffers from training instability when the number of loop iterations increases. Our analysis reveals that this instability stems from two sources: gradient oscillation and residual explosion. To address these two problems, we propose the Fully Looped Transformer, which introduces two parameter-free modifications: (1) Fully Looped Architecture, which distributes inter-loop signals across all layers to mitigate residual explosion; (2) Attention Injection, which reuses the existing attention block to suppress gradient oscillation. These modifications stabilize training dynamics, enabling the Fully Looped Transformer to be trained stably up to 12 loop iterations, whereas other baseline looped models collapse in this regime. In milder settings where Looped Transformer does not collapse, Fully Looped Transformer still improves average downstream-task performance by up to 13.2\%. Overall, our experiments demonstrate that Fully Looped Transformer improves training stability, enhances downstream performance, and provides preliminary adaptability under different test-time compute budgets by varying loop iterations at inference.

preprint2022arXiv

Controllability of Multilayer Networked Sampled-data Systems

This paper explores the state controllability of multilayer networked sampled-data systems with inter-layer couplings, where zero-order holders (ZOHs) are on the control and transmission channels. The effects of both single- and multi-rate sampling on controllability of multilayer networked linear time-invariant (LTI) systems are analyzed, with some sufficient and/or necessary controllability conditions derived. Under specific conditions, the pathological sampling of single node systems could be eliminated by the network structure and inner couplings among different nodes and different layers. The representative drive-response inter-layer coupling mode is studied, and it reveals that the whole system could be controllable due to the inter-layer couplings even if the response layer is uncontrollable itself. Moreover, simulated examples show that the modification of sampling rate on local channels could lay a positive or negative effect on the controllability of the whole system. All the results indicate that the controllability of the multilayer networked sampled-data system is collectively affected by mutually coupled factors.

preprint2022arXiv

On the wavenumber-frequency spectra of the wall pressure fluctuations in turbulent channel flow

Direct numerical simulations (DNS) of turbulent channel flows up to $Re_τ \approx 1000$ are conducted to investigate the three-dimensional (consisting of streamwise wavenumber, spanwise wavenumber and frequency) spectrum of wall pressure fluctuations. To develop a predictive model of the wavenumber-frequency spectrum from the wavenumber spectrum, the time decorrelation mechanisms of wall pressure fluctuations are investigated. It is discovered that the energy-containing part of the wavenumber-frequency spectrum of wall pressure fluctuations can be well predicted using a similar random sweeping model for streamwise velocity fluctuations. To refine the investigation, we further decompose the spectrum of the total wall pressure fluctuations into the auto spectra of rapid and slow pressure fluctuations, and the cross spectrum between them. We focus on evaluating the assumption applied in many predictive models, that is, the magnitude of the cross spectrum is negligibly small. The present DNS shows that neglecting the cross spectrum causes a maximum error up to 4.7dB in the sub-convective region for all Reynolds numbers under test. Our analyses indicate that the assumption of neglecting the cross spectrum needs to be applied carefully in the investigations of acoustics at low Mach numbers, in which the sub-convective components of wall pressure fluctuations make important contributions to the radiated acoustic power.

preprint2021arXiv

On the wavenumber-frequency spectra of wall pressure fluctuations in turbulent channel flows

The characteristics of the wavenumber-frequency spectra of the rapid, slow and total wall pressure fluctuations are investigated using direct numerical simulation (DNS) of turbulent channel flow up to $\Rey_τ\approx 1000$. For the wavenumber-frequency spectra of the total wall pressure fluctuations, a valley-like behavior of contour lines in the sub-convective region is found, which may be linked to the Kraichnan-Phillips theorem. For the decomposition of the wall pressure spectra, it is commonly assumed in previous studies that the cross spectral density (CSD) between the rapid and slow components of the wall pressure fluctuations can be neglected. Yet no experimental or numerical evidence is available for either confirming or disproving this assumption. In this paper, we use DNS data to quantitatively evaluate this assumption. Emphasizes are put on the error in decibel scale caused by neglecting the CSD between the rapid and slow components of the wall pressure fluctuations. It is found that this assumption is approximately accurate for one- and two-dimensional spectra, but causes a large magnitude of error in the three-dimensional wavenumber-frequency spectra. An error of 5dB is observed in the sub-convective region and such a large error is observed for a wide range of Reynolds numbers ($180\le\Rey_τ\le 1000$). The analyses show that the assumption of neglecting the CSD needs to be applied carefully at the scales falling in the sub-convective region.