Researcher profile

Siyuan Zhang

Siyuan Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

Convergence of Decentralized Stochastic Subgradient-based Methods for Nonsmooth Nonconvex functions

In this paper, we focus on the decentralized stochastic subgradient-based methods in minimizing nonsmooth nonconvex functions without Clarke regularity, especially in the decentralized training of nonsmooth neural networks. We propose a general framework that unifies various decentralized subgradient-based methods, such as decentralized stochastic subgradient descent (DSGD), DSGD with gradient-tracking technique (DSGD-T), and DSGD with momentum (DSGD-M). To establish the convergence properties of our proposed framework, we relate the discrete iterates to the trajectories of a continuous-time differential inclusion, which is assumed to have a coercive Lyapunov function with a stable set $\mathcal{A}$. We prove the asymptotic convergence of the iterates to the stable set $\mathcal{A}$ with sufficiently small and diminishing step-sizes. These results provide first convergence guarantees for some well-recognized of decentralized stochastic subgradient-based methods without Clarke regularity of the objective function. Preliminary numerical experiments demonstrate that our proposed framework yields highly efficient decentralized stochastic subgradient-based methods with convergence guarantees in the training of nonsmooth neural networks.

preprint2026arXiv

EXG: Self-Evolving Agents with Experience Graphs

Large language model (LLM)-based agents have demonstrated strong capabilities in complex reasoning and problem solving through multi-step interactions, yet most deployed agents remain behaviorally static, with knowledge acquired during execution rarely translating into systematic improvement over time. In response, a growing line of work on self-evolving agents explores how agents can improve through experience during deployment, but most existing approaches either rely on ad hoc reflection limited to single-task correction or adopt unstructured memory that accumulates fragmented experience with delayed usability. To address this limitation, we introduce EXG, an experience graph framework for self-evolving agents that explicitly organizes accumulated successes and failures into a structured, relational representation. EXG is the first experience graph designed for self-evolving agents, supporting both online, real-time graph growth during execution for immediate cross-task experience reuse, and offline reuse of a consolidated experience graph as an external memory module. This design also enables EXG to serve as a plug-and-play component for existing self-evolving agents, organizing prior experience into a unified experience graph and improving both solution quality and resource efficiency as deployment progresses. Extensive experiments across code generation and reasoning benchmarks show that EXG attains more favorable performance-efficiency trade-offs than reflection- and memory-based baselines in both online and offline evaluations. Our results suggest that structuring experience as a graph provides a principled foundation for scalable and transferable self-evolving agent behavior.

preprint2026arXiv

GPS-Synchronized Monitoring of Core-collapse Supernova Bursts with PandaX-4T via Coherent Elastic Neutrino Nuclear Scattering

The landmark detection of neutrinos from SN1987A marked the dawn of neutrino astrophysics. The neutrino burst provided essential insights into fundamental properties of neutrinos, and served as key probes of stellar evolution and supernova dynamics. The recent advancement in coherent elastic neutrino-nucleus scattering enables the detection of core-collapse supernova burst neutrinos using tonne-scale liquid xenon detectors originally designed for dark matter direct detection. Leveraging this capability, we developed and deployed an online supernova monitoring system for the PandaX-4T experiment. This system features a GPS module with millisecond-level timing precision, a low false-alarm rate, and high sensitivity to galactic core-collapse supernova explosion events. The methodology is robust, directly scalable, and planned for implementation in the next-generation PandaX-20T experiment.

preprint2026arXiv

STEP3-VL-10B Technical Report

We present STEP3-VL-10B, a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence. STEP3-VL-10B is realized through two strategic shifts: first, a unified, fully unfrozen pre-training strategy on 1.2T multimodal tokens that integrates a language-aligned Perception Encoder with a Qwen3-8B decoder to establish intrinsic vision-language synergy; and second, a scaled post-training pipeline featuring over 1k iterations of reinforcement learning. Crucially, we implement Parallel Coordinated Reasoning (PaCoRe) to scale test-time compute, allocating resources to scalable perceptual reasoning that explores and synthesizes diverse visual hypotheses. Consequently, despite its compact 10B footprint, STEP3-VL-10B rivals or surpasses models 10$\times$-20$\times$ larger (e.g., GLM-4.6V-106B, Qwen3-VL-235B) and top-tier proprietary flagships like Gemini 2.5 Pro and Seed-1.5-VL. Delivering best-in-class performance, it records 92.2% on MMBench and 80.11% on MMMU, while excelling in complex reasoning with 94.43% on AIME2025 and 75.95% on MathVision. We release the full model suite to provide the community with a powerful, efficient, and reproducible baseline.

preprint2022arXiv

Automatic Quantization for Physics-Based Simulation

Quantization has proven effective in high-resolution and large-scale simulations, which benefit from bit-level memory saving. However, identifying a quantization scheme that meets the requirement of both precision and memory efficiency requires trial and error. In this paper, we propose a novel framework to allow users to obtain a quantization scheme by simply specifying either an error bound or a memory compression rate. Based on the error propagation theory, our method takes advantage of auto-diff to estimate the contributions of each quantization operation to the total error. We formulate the task as a constrained optimization problem, which can be efficiently solved with analytical formulas derived for the linearized objective function. Our workflow extends the Taichi compiler and introduces dithering to improve the precision of quantized simulations. We demonstrate the generality and efficiency of our method via several challenging examples of physics-based simulation, which achieves up to 2.5x memory compression without noticeable degradation of visual quality in the results. Our code and data are available at https://github.com/Hanke98/AutoQuantizer.

preprint2022arXiv

Elemental (im-)miscibility determines phase formation of multinary nanoparticles co-sputtered in ionic liquids

Non-equilibrium synthesis methods allow to alloy bulk-immiscible elements into multinary nanoparticles, which broadens the design space for new materials. Whereas sputtering onto solid substrates can combine immiscible elements into thin film solid solutions, this is not clear for sputtering of nanoparticles in ionic liquids. Thus, the suitability of sputtering in ionic liquids for producing nanoparticles of immiscible elements is investigated by co-sputtering the systems Au-Cu (miscible), Au-Ru and Cu-Ru (both immiscible), and Au-Cu-Ru on the surface of the ionic liquid 1-butyl-3-methylimidazolium bis-trifluoromethylsulfonyl)imide [Bmim][(Tf)2N]. The sputtered nanoparticles were analyzed to obtain (i) knowledge concerning the general formation process of nanoparticles when sputtering onto ionic liquid surfaces and (ii) information, if alloy nanoparticles of immiscible elements can be synthesized as well as (iii) evidence if the Hume-Rothery rules for solid solubility are valid for sputtered nanoparticles. Accompanying atomistic simulations using density-functional theory for clusters of different size and ordering confirm that the miscibility of Au-Cu and the immiscibility of Au-Ru and Cu-Ru govern the thermodynamic stability of the nanoparticles. Based on the matching experimental and theoretical results for the NP/IL-systems concerning NP stability, a formation model of multinary NPs in ILs was developed.

preprint2022arXiv

Full-color three-loop three-point form factors in N=4 SYM

We present the detailed computation of full-color three-loop three-point form factors of both the stress-tensor supermultiplet and a length-three BPS operator in N=4 SYM. The integrands are constructed based on the color-kinematics (CK) duality and generalized unitarity method. An interesting observation is that the CK-dual integrands contain a large number of free parameters. We discuss the origin of these free parameters in detail and check that they cancel in the simplified integrands. We further perform the numerical evaluation of the integrals at a special kinematics point using public packages FIESTA and pySecDec based on the sector-decomposition approach. We find that the numerical computation can be significantly simplified by expressing the integrals in terms of uniformly transcendental basis, although the final three-loop computations still require large computational resources. Having the full-color numerical results, we verify that the non-planar infrared divergences reproduce the non-dipole structures, which firstly appear at three loops. As for the finite remainder functions, we check that the numerical planar remainder for the stress-tensor supermultiplet is consistent with the known result of the bootstrap computation. We also obtain for the first time the numerical results of the three-loop non-planar remainder for the stress-tensor supermultiplet as well as the three-loop remainder for the length-three operator.

preprint2022arXiv

Preferred corrosion pathways for oxygen in Al2Ca-twin boundaries and dislocations

With an ongoing discussion on the oxygen diffusion along crystal defects remaining, it is difficult to study this phenomenon in Al containing intermetallic materials due to its rapid and passivating oxide formation. We report here the observation of enhanced oxygen diffusion along crystal defects, i.e. dislocations and twin boundaries, in the C15 Al 2 Ca Laves phase and how the presence of oxygen induces structural changes at these defects. Three main phases were identified and characterized structurally by aberration-corrected, atomic resolution scanning transmission electron microscopy, analytically by energy dispersive X-ray spectroscopy and electron energy loss spectroscopy. Unlike the C15 bulk phase, the twin boundary and dislocation transformed into a few nanometer wide amorphous phase, which depletes in Al and Ca but is highly enriched in oxygen. The dislocation even shows coexistence of the amorphous phase with a simple Al-rich A1 fcc phase. This A1 phase only depletes in Ca, not in Al (Al remains at bulk concentration), and is also enriched in oxygen. The Al-rich A1 phase is coherent with the C15 matrix. Electron energy loss spectroscopy revealed the amorphous phase to be Al 2 O 3 . We thereby show as one of the first studies that oxygen diffusion along crystal defects, especially also at the twin boundary can induce the formation of an amorphous oxide along themselves. The identification of oxygen-induced transformation at strained defects has to be considered when the material is exposed to air during plastic deformation at elevated temperatures.

preprint2021arXiv

Different Photostability of BiVO4 in Near-pH-Neutral Electrolytes

Photoelectrochemical water splitting is a promising route to produce hydrogen from solar energy. However, corrosion of semiconducting photoelectrodes remains a fundamental challenge for their practical application. The stability of BiVO4, one of the best performing photoanode materials, is systematically examined here using an illuminated scanning flow cell to measure its dissolution operando. The dissolution rates of BiVO4 under illumination depend on the electrolyte and decrease in the order: borate (pH=9.3) > phosphate (pH=7.2) > citrate (pH=7.0). BiVO4 exhibits an inherent lack of stability during the oxygen evolution reaction (OER), while hole-scavenging citrate electrolyte offers kinetic protection. The dissolution of Bi peaks at different potentials than the dissolution of V in phosphate buffer, whereas both ions dissolve simultaneously in borate buffer. The life cycle of a 90 nm BiVO4 film is monitored during one hour of light-driven OER in borate buffer. The photocurrent and dissolution rates show independent trends with time, highlighting the importance to measure both quantities operando. Dissolution rates are correlated to the surface morphology and chemistry characterized using electron microscopy, X-ray photoelectron spectroscopy and atom probe tomography. These correlative measurements further the understanding on corrosion processes of photoelectrodes down to the nanoscopic scale to facilitate their future developments.

preprint2020arXiv

A feature-supervised generative adversarial network for environmental monitoring during hazy days

The adverse haze weather condition has brought considerable difficulties in vision-based environmental applications. While, until now, most of the existing environmental monitoring studies are under ordinary conditions, and the studies of complex haze weather conditions have been ignored. Thence, this paper proposes a feature-supervised learning network based on generative adversarial networks (GAN) for environmental monitoring during hazy days. Its main idea is to train the model under the supervision of feature maps from the ground truth. Four key technical contributions are made in the paper. First, pairs of hazy and clean images are used as inputs to supervise the encoding process and obtain high-quality feature maps. Second, the basic GAN formulation is modified by introducing perception loss, style loss, and feature regularization loss to generate better results. Third, multi-scale images are applied as the input to enhance the performance of discriminator. Finally, a hazy remote sensing dataset is created for testing our dehazing method and environmental detection. Extensive experimental results show that the proposed method has achieved better performance than current state-of-the-art methods on both synthetic datasets and real-world remote sensing images.

preprint2020arXiv

Joint Embedding in Named Entity Linking on Sentence Level

Named entity linking is to map an ambiguous mention in documents to an entity in a knowledge base. The named entity linking is challenging, given the fact that there are multiple candidate entities for a mention in a document. It is difficult to link a mention when it appears multiple times in a document, since there are conflicts by the contexts around the appearances of the mention. In addition, it is difficult since the given training dataset is small due to the reason that it is done manually to link a mention to its mapping entity. In the literature, there are many reported studies among which the recent embedding methods learn vectors of entities from the training dataset at document level. To address these issues, we focus on how to link entity for mentions at a sentence level, which reduces the noises introduced by different appearances of the same mention in a document at the expense of insufficient information to be used. We propose a new unified embedding method by maximizing the relationships learned from knowledge graphs. We confirm the effectiveness of our method in our experimental studies.