Researcher profile

Xiaobo Yang

Xiaobo Yang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2026arXiv

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

We introduce Parallel Coordinated Reasoning (PaCoRe), a training-and-inference framework designed to overcome a central limitation of contemporary language models: their inability to scale test-time compute (TTC) far beyond sequential reasoning under a fixed context window. PaCoRe departs from the traditional sequential paradigm by driving TTC through massive parallel exploration coordinated via a message-passing architecture in multiple rounds. Each round launches many parallel reasoning trajectories, compacts their findings into context-bounded messages, and synthesizes these messages to guide the next round and ultimately produce the final answer. Trained end-to-end with large-scale, outcome-based reinforcement learning, the model masters the synthesis abilities required by PaCoRe and scales to multi-million-token effective TTC without exceeding context limits. The approach yields strong improvements across diverse domains, and notably pushes reasoning beyond frontier systems in mathematics: an 8B model reaches 94.5% on HMMT 2025, surpassing GPT-5's 93.2% by scaling effective TTC to roughly two million tokens. We open-source model checkpoints, training data, and the full inference pipeline to accelerate follow-up work.

preprint2026arXiv

STEP3-VL-10B Technical Report

We present STEP3-VL-10B, a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence. STEP3-VL-10B is realized through two strategic shifts: first, a unified, fully unfrozen pre-training strategy on 1.2T multimodal tokens that integrates a language-aligned Perception Encoder with a Qwen3-8B decoder to establish intrinsic vision-language synergy; and second, a scaled post-training pipeline featuring over 1k iterations of reinforcement learning. Crucially, we implement Parallel Coordinated Reasoning (PaCoRe) to scale test-time compute, allocating resources to scalable perceptual reasoning that explores and synthesizes diverse visual hypotheses. Consequently, despite its compact 10B footprint, STEP3-VL-10B rivals or surpasses models 10$\times$-20$\times$ larger (e.g., GLM-4.6V-106B, Qwen3-VL-235B) and top-tier proprietary flagships like Gemini 2.5 Pro and Seed-1.5-VL. Delivering best-in-class performance, it records 92.2% on MMBench and 80.11% on MMMU, while excelling in complex reasoning with 94.43% on AIME2025 and 75.95% on MathVision. We release the full model suite to provide the community with a powerful, efficient, and reproducible baseline.

preprint2019arXiv

Exposing and extending the interior waves field by transformation materials

Based on transformation optics, a strategy is proposed to expose the inner one-dimensional space of a wave field inside a beam volume to the surface of the propagation medium and extend the space from one-dimensional to two-dimensional, allowing the corresponding field distribution to be detected directly and more subtly, which is important in optical signal processing. The method is applied to the quadratic graded index lens to construct a new graded index lens, and its enhanced chirpyness detection ability is demonstrated by numerical simulation.

preprint2018arXiv

Moving mesh finite difference solution of non-equilibrium radiation diffusion equations

A moving mesh finite difference method based on the moving mesh partial differential equation is proposed for the numerical solution of the 2T model for multi-material, non-equilibrium radiation diffusion equations. The model involves nonlinear diffusion coefficients and its solutions stay positive for all time when they are positive initially. Nonlinear diffusion and preservation of solution positivity pose challenges in the numerical solution of the model. A coefficient-freezing predictor-corrector method is used for nonlinear diffusion while a cutoff strategy with a positive threshold is used to keep the solutions positive. Furthermore, a two-level moving mesh strategy and a sparse matrix solver are used to improve the efficiency of the computation. Numerical results for a selection of examples of multi-material non-equilibrium radiation diffusion show that the method is capable of capturing the profiles and local structures of Marshak waves with adequate mesh concentration. The obtained numerical solutions are in good agreement with those in the existing literature. Comparison studies are also made between uniform and adaptive moving meshes and between one-level and two-level moving meshes.