Researcher profile

Jianlong Chen

Jianlong Chen contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 17 - Unverified
Verification L1
Unclaimed author
4 works
0 followers
5 topics
4 close collaborators

Published work

4 published items

Preprint, 2026, arXiv

Flow Matching Transport for Quasi-Monte Carlo Integration

High-dimensional integration with respect to complex target measures remains a fundamental challenge in computational science. While Flow Matching (FM) offers a powerful paradigm for constructing continuous-time transport maps, its deployment in high-precision integration is severely limited by the discretization bias inherent to numerical ODE solvers and the lack of rigorous convergence guarantees when coupled with Quasi-Monte Carlo (QMC) methods. This paper addresses these critical gaps by proposing Flow Matching Importance Sampling Quasi-Monte Carlo (FM-ISQMC), a framework designed to transform biased generative flows into unbiased, high-order integration schemes. Methodologically, we construct a transport map by composing a logistic base transformation with an Euler-discretized neural ODE field and employ importance sampling to correct for residual transport errors. Our central contribution is twofold. First, we establish a general convergence analysis for QMC importance sampling with arbitrary transport maps, identifying sufficient growth conditions for the $\mathcal{O}(N^{-1+\varepsilon})$ root-mean-square error rate. Second, we rigorously prove that the specific transport architecture of Flow Matching satisfies these conditions. Consequently, we establish a $\mathcal{O}(N^{-1+\varepsilon})$ root-mean-square error for the unbiased FM-ISQMC estimator, extending classical QMC theory to the realm of generative models. Numerical experiments validate that FM-ISQMC consistently breaks through the error floor observed in direct transport methods, delivering superior precision. This work thus bridges the divide between deep generative modeling and numerical integration.
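The combination described in this abstract (a logistic base transformation, Euler steps of a velocity field, and an importance-sampling correction on quasi-random points) can be illustrated with a small one-dimensional sketch. This is not the authors' implementation. The linear velocity field `a * x + b` is a hypothetical stand-in for a trained neural field, the Gaussian target is an arbitrary choice, and a randomly shifted van der Corput sequence stands in for whatever QMC point set the paper uses. All function names are illustrative.

```python
import math
import random


def van_der_corput(i, base=2):
    """Radical inverse of the integer i in the given base, a point in [0, 1)."""
    value, denom = 0.0, 1.0
    while i > 0:
        denom *= base
        i, digit = divmod(i, base)
        value += digit / denom
    return value


def transport(u, steps=8, a=-1.0, b=1.0):
    """Map u in (0, 1) to x and return (x, log |dx/du|).

    Assumption: v(x) = a * x + b is a toy, time-independent velocity field
    standing in for a learned one.
    """
    x = math.log(u) - math.log1p(-u)            # logistic quantile function
    log_jac = -math.log(u) - math.log1p(-u)     # log of 1 / (u * (1 - u))
    h = 1.0 / steps
    for _ in range(steps):
        x = x + h * (a * x + b)                 # explicit Euler step
        log_jac += math.log(abs(1.0 + h * a))   # derivative of that step
    return x, log_jac


def log_target(x, mean=1.0, std=0.5):
    """Log density of the (assumed) Gaussian target."""
    z = (x - mean) / std
    return -0.5 * z * z - math.log(std * math.sqrt(2.0 * math.pi))


def isqmc_estimate(f, n, seed=0):
    """Importance-sampling estimate of E_target[f] on shifted QMC points."""
    shift = random.Random(seed).random()        # one random shift for all points
    total = 0.0
    for i in range(n):
        u = (van_der_corput(i) + shift) % 1.0
        u = min(max(u, 1e-12), 1.0 - 1e-12)     # guard against log(0)
        x, log_jac = transport(u)
        # Weight = target density / proposal density, where the proposal
        # density at x is 1 / |dx/du| by the change-of-variables formula.
        total += f(x) * math.exp(log_target(x) + log_jac)
    return total / n


estimate = isqmc_estimate(lambda x: x * x, 4096)
```

For this target the exact value of the integral is mean² + std² = 1.25, so `estimate` should land close to that. The weights correct for the mismatch between the pushed-forward proposal and the target, which is why the Euler discretization of the flow does not need to be exact for the estimate to remain consistent.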

Preprint, 2026, arXiv

Milestones over Outcome: Unlocking Geometric Reasoning with Sub-Goal Verifiable Reward

Multimodal Large Language Models (MLLMs) struggle with complex geometric reasoning, largely because "black box" outcome-based supervision fails to distinguish between lucky guesses and rigorous deduction. To address this, we introduce a paradigm shift towards subgoal-level evaluation and learning. We first construct GeoGoal, a benchmark synthesized via a rigorous formal verification data engine, which converts abstract proofs into verifiable numeric subgoals. This structure reveals a critical divergence between reasoning quality and outcome accuracy. Leveraging this, we propose the Sub-Goal Verifiable Reward (SGVR) framework, which replaces sparse signals with dense rewards based on the Skeleton Rate. Experiments demonstrate that SGVR not only enhances geometric performance (+9.7%) but also exhibits strong generalization, transferring gains to general math (+8.0%) and other general reasoning tasks (+2.8%), demonstrating broad applicability across diverse domains.

Preprint, 2025, arXiv

GeoBench: Rethinking Multimodal Geometric Problem-Solving via Hierarchical Evaluation

Geometric problem solving constitutes a critical branch of mathematical reasoning, requiring precise analysis of shapes and spatial relationships. Current evaluations of geometric reasoning in vision-language models (VLMs) face limitations, including the risk of test data contamination from textbook-based benchmarks, overemphasis on final answers over reasoning processes, and insufficient diagnostic granularity. To address these issues, we present GeoBench, a hierarchical benchmark featuring four reasoning levels in geometric problem-solving: Visual Perception, Goal-Oriented Planning, Rigorous Theorem Application, and Self-Reflective Backtracking. Through six formally verified tasks generated via TrustGeoGen, we systematically assess capabilities ranging from attribute extraction to logical error correction. Experiments reveal that while reasoning models like OpenAI-o3 outperform general MLLMs, performance declines significantly with increasing task complexity. Key findings demonstrate that sub-goal decomposition and irrelevant premise filtering critically influence final problem-solving accuracy, whereas Chain-of-Thought prompting unexpectedly degrades performance in some tasks. These findings establish GeoBench as a comprehensive benchmark while offering actionable guidelines for developing geometric problem-solving systems.