Source author record

Xiaobo Yang

Xiaobo Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Systems and Control Computer Vision Information Theory Machine Learning math.IT math.NA physics.optics

Catalog footprint

What is connected

7works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

We introduce Parallel Coordinated Reasoning (PaCoRe), a training-and-inference framework designed to overcome a central limitation of contemporary language models: their inability to scale test-time compute (TTC) far beyond sequential reasoning under a fixed context window. PaCoRe departs from the traditional sequential paradigm by driving TTC through massive parallel exploration coordinated via a message-passing architecture in multiple rounds. Each round launches many parallel reasoning trajectories, compacts their findings into context-bounded messages, and synthesizes these messages to guide the next round and ultimately produce the final answer. Trained end-to-end with large-scale, outcome-based reinforcement learning, the model masters the synthesis abilities required by PaCoRe and scales to multi-million-token effective TTC without exceeding context limits. The approach yields strong improvements across diverse domains, and notably pushes reasoning beyond frontier systems in mathematics: an 8B model reaches 94.5% on HMMT 2025, surpassing GPT-5's 93.2% by scaling effective TTC to roughly two million tokens. We open-source model checkpoints, training data, and the full inference pipeline to accelerate follow-up work.

preprint2026arXiv

STEP3-VL-10B Technical Report

We present STEP3-VL-10B, a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence. STEP3-VL-10B is realized through two strategic shifts: first, a unified, fully unfrozen pre-training strategy on 1.2T multimodal tokens that integrates a language-aligned Perception Encoder with a Qwen3-8B decoder to establish intrinsic vision-language synergy; and second, a scaled post-training pipeline featuring over 1k iterations of reinforcement learning. Crucially, we implement Parallel Coordinated Reasoning (PaCoRe) to scale test-time compute, allocating resources to scalable perceptual reasoning that explores and synthesizes diverse visual hypotheses. Consequently, despite its compact 10B footprint, STEP3-VL-10B rivals or surpasses models 10$\times$-20$\times$ larger (e.g., GLM-4.6V-106B, Qwen3-VL-235B) and top-tier proprietary flagships like Gemini 2.5 Pro and Seed-1.5-VL. Delivering best-in-class performance, it records 92.2% on MMBench and 80.11% on MMMU, while excelling in complex reasoning with 94.43% on AIME2025 and 75.95% on MathVision. We release the full model suite to provide the community with a powerful, efficient, and reproducible baseline.

preprint2019arXiv

Exposing and extending the interior waves field by transformation materials

Based on transformation optics, a strategy is proposed to expose the inner one-dimensional space of a wave field inside a beam volume to the surface of the propagation medium and extend the space from one-dimensional to two-dimensional, allowing the corresponding field distribution to be detected directly and more subtly, which is important in optical signal processing. The method is applied to the quadratic graded index lens to construct a new graded index lens, and its enhanced chirpyness detection ability is demonstrated by numerical simulation.

preprint2018arXiv

Moving mesh finite difference solution of non-equilibrium radiation diffusion equations

A moving mesh finite difference method based on the moving mesh partial differential equation is proposed for the numerical solution of the 2T model for multi-material, non-equilibrium radiation diffusion equations. The model involves nonlinear diffusion coefficients and its solutions stay positive for all time when they are positive initially. Nonlinear diffusion and preservation of solution positivity pose challenges in the numerical solution of the model. A coefficient-freezing predictor-corrector method is used for nonlinear diffusion while a cutoff strategy with a positive threshold is used to keep the solutions positive. Furthermore, a two-level moving mesh strategy and a sparse matrix solver are used to improve the efficiency of the computation. Numerical results for a selection of examples of multi-material non-equilibrium radiation diffusion show that the method is capable of capturing the profiles and local structures of Marshak waves with adequate mesh concentration. The obtained numerical solutions are in good agreement with those in the existing literature. Comparison studies are also made between uniform and adaptive moving meshes and between one-level and two-level moving meshes.

preprint2016arXiv

Distributed Fusion of Labeled Multi-Object Densities Via Label Spaces Matching

In this paper, we address the problem of the distributed multi-target tracking with labeled set filters in the framework of Generalized Covariance Intersection (GCI). Our analyses show that the label space mismatching (LS-DM) phenomenon, which means the same realization drawn from label spaces of different sensors does not have the same implication, is quite common in practical scenarios and may bring serious problems. Our contributions are two-fold. Firstly, we provide a principled mathematical definition of "label spaces matching (LS-DM)" based on information divergence, which is also referred to as LS-M criterion. Then, to handle the LS-DM, we propose a novel two-step distributed fusion algorithm, named as GCI fusion via label spaces matching (GCI-LSM). The first step is to match the label spaces from different sensors. To this end, we build a ranked assignment problem and design a cost function consistent with LS-M criterion to seek the optimal solution of matching correspondence between label spaces of different sensors. The second step is to perform the GCI fusion on the matched label space. We also derive the GCI fusion with generic labeled multi-object (LMO) densities based on LS-M, which is the foundation of labeled distributed fusion algorithms. Simulation results for Gaussian mixture implementation highlight the performance of the proposed GCI-LSM algorithm in two different tracking scenarios.

preprint2016arXiv

Distributed Fusion with Multi-Bernoulli Filter based on Generalized Covariance Intersection

In this paper, we propose a distributed multi-object tracking algorithm through the use of multi-Bernoulli (MB) filter based on generalized Covariance Intersection (G-CI). Our analyses show that the G-CI fusion with two MB posterior distributions does not admit an accurate closed-form expression. To solve this challenging problem, we firstly approximate the fused posterior as the unlabeled version of $δ$-generalized labeled multi-Bernoulli ($δ$-GLMB) distribution, referred to as generalized multi-Bernoulli (GMB) distribution. Then, to allow the subsequent fusion with another multi-Bernoulli posterior distribution, e.g., fusion with a third sensor node in the sensor network, or fusion in the feedback working mode, we further approximate the fused GMB posterior distribution as an MB distribution which matches its first-order statistical moment. The proposed fusion algorithm is implemented using sequential Monte Carlo technique and its performance is highlighted by numerical results.

preprint2016arXiv

Optimal Deployment of Multistatic Radar System Using Multi-Objective Particle Swarm Optimization

We consider an optimization deployment problem of multistatic radar system (MSRS). Through the antenna placing and the transmitted power allocating, we optimally deploy the MSRS for two goals: 1) the first one is to improve the coverage ratio of surveillance region; 2) the second goal is to get a even distribution of signal energy in surveillance region. In two typical working modes of MSRS, we formulate the optimization problem by introducing two objective functions according to the two mentioned goals, respectively. Addressing on two main challenges of applying multi-objective particle swarm optimization (MOPSO) in solving the proposed optimization problem, we propose a deployment algorithm based on multiobjective particle swarm optimization with non-dominated relative crowding distance (MOPSO-NRCD). For the challenge of value difference, we propose a novel selection method with a non-dominated relative crowding distance. For the challenge of particle allocation, a multi-swarm structure of MOPSO is also introduced. Finally, simulation results are given out to prove the advantages and validity of the proposed deployment algorithm. It is shown that with same number of employed particles, the proposed MOPSO-NRCD algorithm can achieve better optimization performance than that of traditional multiobjective particle swarm optimization with crowding distance (MOPSO-CD).

Xiaobo Yang

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

STEP3-VL-10B Technical Report

Exposing and extending the interior waves field by transformation materials

Moving mesh finite difference solution of non-equilibrium radiation diffusion equations

Distributed Fusion of Labeled Multi-Object Densities Via Label Spaces Matching

Distributed Fusion with Multi-Bernoulli Filter based on Generalized Covariance Intersection

Optimal Deployment of Multistatic Radar System Using Multi-Objective Particle Swarm Optimization