Researcher profile

Boyuan Zhang

Boyuan Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

IGenBench: Benchmarking the Reliability of Text-to-Infographic Generation

Infographics are composite visual artifacts that combine data visualizations with textual and illustrative elements to communicate information. While recent text-to-image (T2I) models can generate aesthetically appealing images, their reliability in generating infographics remains unclear. Generated infographics may appear correct at first glance but contain easily overlooked issues, such as distorted data encoding or incorrect textual content. We present IGENBENCH, the first benchmark for evaluating the reliability of text-to-infographic generation, comprising 600 curated test cases spanning 30 infographic types. We design an automated evaluation framework that decomposes reliability verification into atomic yes/no questions based on a taxonomy of 10 question types. We employ multimodal large language models (MLLMs) to verify each question, yielding question-level accuracy (Q-ACC) and infographic-level accuracy (I-ACC). We comprehensively evaluate 10 state-of-the-art T2I models on IGENBENCH. Our systematic analysis reveals key insights for future model development: (i) a three-tier performance hierarchy with the top model achieving Q-ACC of 0.90 but I-ACC of only 0.49; (ii) data-related dimensions emerging as universal bottlenecks (e.g., Data Completeness: 0.21); and (iii) the challenge of achieving end-to-end correctness across all models. We release IGENBENCH at https://igen-bench.vercel.app/.

preprint2026arXiv

SDiT: Semantic Region-Adaptive for Diffusion Transformers

Diffusion Transformers (DiTs) achieve state-of-the-art performance in text-to-image synthesis but remain computationally expensive due to the iterative nature of denoising and the quadratic cost of global attention. In this work, we observe that denoising dynamics are spatially non-uniform-background regions converge rapidly while edges and textured areas evolve much more actively. Building on this insight, we propose SDiT, a Semantic Region-Adaptive Diffusion Transformer that allocates computation according to regional complexity. SDiT introduces a training-free framework combining (1) semantic-aware clustering via fast Quickshift-based segmentation, (2) complexity-driven regional scheduling to selectively update informative areas, and (3) boundary-aware refinement to maintain spatial coherence. Without any model retraining or architectural modification, SDiT achieves up to 3.0x acceleration while preserving nearly identical perceptual and semantic quality to full-attention inference.

preprint2026arXiv

The Promise of Time-Series Foundation Models for Agricultural Forecasting: Evidence from Commodity Prices

Forecasting agricultural markets remains challenging due to nonlinear dynamics, structural breaks, and sparse data. A long-standing belief holds that simple time-series methods outperform more advanced alternatives. This paper provides the first systematic evidence that this belief no longer holds with modern time-series foundation models (TSFMs). Using USDA ERS monthly commodity price data from 1997-2025, we evaluate 17 forecasting approaches across four model classes, including traditional time-series, machine learning, deep learning, and five state-of-the-art TSFMs (Chronos, Chronos-2, TimesFM 2.5, Time-MoE, Moirai-2), and construct annual marketing year price predictions to compare with USDA's futures-based season-average price (SAP) forecasts. We show that zero-shot foundation models consistently outperform traditional time-series methods, machine learning, and deep learning architectures trained from scratch in both monthly and annual forecasting. Furthermore, foundation models remarkably outperform USDA's futures-based forecasts on three of four major commodities despite USDA's information advantage from forward-looking futures markets. Time-MoE delivers the largest accuracy gains, achieving 54.9% improvement on wheat and 18.5% improvement on corn relative to USDA ERS benchmarks on recent data (2017-2024 excluding COVID). These results point to a paradigm shift in agricultural forecasting.

preprint2022arXiv

An A-Phi Formulation Solver in Electromagnetics Based on Discrete Exterior Calculus

An efficient numerical solver for the A-Phi formulation in electromagnetics based on the discrete exterior calculus (DEC) is proposed in this paper. The A-Phi formulation is immune to low-frequency breakdown and ideal for broadband and multi-scale analysis. The generalized Lorenz gauge is used in this paper, which decouples the A equation and the Phi equation. The A-Phi formulation is discretized by using the DEC, which is the discretized version of the exterior calculus in differential geometry. In general, DEC can be viewed as a generalized version of the finite difference method, where Stokes' theorem and Gauss's theorem are naturally preserved. Furthermore, compared with finite difference method, where rectangular grids are applied, DEC can be implemented with unstructured mesh schemes, such as tetrahedral meshes. Thus, the proposed DEC A-Phi solver is inherently stable, free of spurious solutions and can capture highly complex structures efficiently. In this paper, the background knowledge about the A-Phi formulation and DEC is introduced, as well as technical details in implementing the DEC A-Phi solver with different boundary conditions. Numerical examples are provided for validation purposes as well.

preprint2022arXiv

An FPGA-based Trigger System for CSHINE

A trigger system of general function is designed using the commercial module CAEN V2495 for heavy ion nuclear reaction experiment at Fermi energies. The system has been applied and verified on CSHINE (Compact Spectrometer for Heavy IoN Experiment). Based on the field programmable logic gate array (FPGA) technology of command register access and remote computer control operation, trigger functions can be flexibly configured according to the experimental physical goals. Using the trigger system on CSHINE, we carried out the beam experiment of 25 MeV/u $ ^{86}{\rm Kr}+ ^{124}{\rm Sn}$ on the Radioactive Ion Beam Line 1 in Lanzhou (RIBLL1), China. The online results demonstrate that the trigger system works normally and correctly. The system can be extended to other experiments.

preprint2022arXiv

NeuVV: Neural Volumetric Videos with Immersive Rendering and Editing

Some of the most exciting experiences that Metaverse promises to offer, for instance, live interactions with virtual characters in virtual environments, require real-time photo-realistic rendering. 3D reconstruction approaches to rendering, active or passive, still require extensive cleanup work to fix the meshes or point clouds. In this paper, we present a neural volumography technique called neural volumetric video or NeuVV to support immersive, interactive, and spatial-temporal rendering of volumetric video contents with photo-realism and in real-time. The core of NeuVV is to efficiently encode a dynamic neural radiance field (NeRF) into renderable and editable primitives. We introduce two types of factorization schemes: a hyper-spherical harmonics (HH) decomposition for modeling smooth color variations over space and time and a learnable basis representation for modeling abrupt density and color changes caused by motion. NeuVV factorization can be integrated into a Video Octree (VOctree) analogous to PlenOctree to significantly accelerate training while reducing memory overhead. Real-time NeuVV rendering further enables a class of immersive content editing tools. Specifically, NeuVV treats each VOctree as a primitive and implements volume-based depth ordering and alpha blending to realize spatial-temporal compositions for content re-purposing. For example, we demonstrate positioning varied manifestations of the same performance at different 3D locations with different timing, adjusting color/texture of the performer's clothing, casting spotlight shadows and synthesizing distance falloff lighting, etc, all at an interactive speed. We further develop a hybrid neural-rasterization rendering framework to support consumer-level VR headsets so that the aforementioned volumetric video viewing and editing, for the first time, can be conducted immersively in virtual 3D space.

preprint2022arXiv

On the Aggregation of Probability Assessments: Regularized Mixtures of Predictive Densities for Eurozone Inflation and Real Interest Rates

We propose methods for constructing regularized mixtures of density forecasts. We explore a variety of objectives and regularization penalties, and we use them in a substantive exploration of Eurozone inflation and real interest rate density forecasts. All individual inflation forecasters (even the ex post best forecaster) are outperformed by our regularized mixtures. From the Great Recession onward, the optimal regularization tends to move density forecasts' probability mass from the centers to the tails, correcting for overconfidence.

preprint2022arXiv

Track Recognition for the $ΔE-E$ Telescopes with Silicon Strip Detectors

For the high granularity and high energy resolution, Silicon Strip Detector (SSD) is widely applied in assembling telescopes to measure the charged particles in heavy ion reactions. In this paper, we present a novel method to achieve track recognition in the SSD telescopes of the Compact Spectrometer for Heavy Ion Experiment (CSHINE). Each telescope consists of a single-sided silicon strip detector (SSSSD) and a double-sided silicon strip detector (DSSSD) backed by $3 \times 3$ CsI(Tl) crystals. Detector calibration and track reconstruction are implemented. Special decoding algorithm is developed for the multi-track recognition procedure to deal with the multi-hit effect convoluted by charge sharing and the missing signals with certain probability. It is demonstrated that the track recognition efficiency of the method is approximately 90\% and 80\% for the DSSSD-CsI and SSSSD-DSSSD events, respectively.

preprint2020arXiv

Optimal Combination of Arctic Sea Ice Extent Measures: A Dynamic Factor Modeling Approach

The diminishing extent of Arctic sea ice is a key indicator of climate change as well as an accelerant for future global warming. Since 1978, Arctic sea ice has been measured using satellite-based microwave sensing; however, different measures of Arctic sea ice extent have been made available based on differing algorithmic transformations of the raw satellite data. We propose and estimate a dynamic factor model that combines four of these measures in an optimal way that accounts for their differing volatility and cross-correlations. We then use the Kalman smoother to extract an optimal combined measure of Arctic sea ice extent. It turns out that almost all weight is put on the NSIDC Sea Ice Index, confirming and enhancing confidence in the Sea Ice Index and the NASA Team algorithm on which it is based.