Source author record

Junfeng Liu

Junfeng Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.PR math.FA Computation and Language Computer Vision Distributed, Parallel, and Cluster Computing hep-th Machine Learning physics.optics

Catalog footprint

What is connected

9works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

STEP3-VL-10B Technical Report

We present STEP3-VL-10B, a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence. STEP3-VL-10B is realized through two strategic shifts: first, a unified, fully unfrozen pre-training strategy on 1.2T multimodal tokens that integrates a language-aligned Perception Encoder with a Qwen3-8B decoder to establish intrinsic vision-language synergy; and second, a scaled post-training pipeline featuring over 1k iterations of reinforcement learning. Crucially, we implement Parallel Coordinated Reasoning (PaCoRe) to scale test-time compute, allocating resources to scalable perceptual reasoning that explores and synthesizes diverse visual hypotheses. Consequently, despite its compact 10B footprint, STEP3-VL-10B rivals or surpasses models 10$\times$-20$\times$ larger (e.g., GLM-4.6V-106B, Qwen3-VL-235B) and top-tier proprietary flagships like Gemini 2.5 Pro and Seed-1.5-VL. Delivering best-in-class performance, it records 92.2% on MMBench and 80.11% on MMMU, while excelling in complex reasoning with 94.43% on AIME2025 and 75.95% on MathVision. We release the full model suite to provide the community with a powerful, efficient, and reproducible baseline.

preprint2022arXiv

Contextualized Scene Imagination for Generative Commonsense Reasoning

Humans use natural language to compose common concepts from their environment into plausible, day-to-day scene descriptions. However, such generative commonsense reasoning (GCSR) skills are lacking in state-of-the-art text generation methods. Descriptive sentences about arbitrary concepts generated by neural text generation models (e.g., pre-trained text-to-text Transformers) are often grammatically fluent but may not correspond to human common sense, largely due to their lack of mechanisms to capture concept relations, to identify implicit concepts, and to perform generalizable reasoning about unseen concept compositions. In this paper, we propose an Imagine-and-Verbalize (I&V) method, which learns to imagine a relational scene knowledge graph (SKG) with relations between the input concepts, and leverage the SKG as a constraint when generating a plausible scene description. We collect and harmonize a set of knowledge resources from different domains and modalities, providing a rich auxiliary supervision signal for I&V. The experiments demonstrate the effectiveness of I&V in improving language models on both concept-to-sentence and concept-to-story generation tasks, while enabling the model to learn well from fewer task examples and generate SKGs that make common sense to human annotators.

preprint2022arXiv

Generalized space-time fractional stochastic kinetic equation

In this paper, we study a class of nonlinear space-time fractional stochastic kinetic equations in $\mathbb{R}^d$ with Gaussian noise which is white in time and homogeneous in space. This type of equation constitutes an extension of the non-linear stochastic heat equation involving fractional derivative in time and fractional Laplacian in space. We give a necessary condition on the spatial covariance for the existence and uniqueness of the solution. We also study various properties of the solution: path regularity, the behavior of second moment and the stationarity in the case of linear additive noise.

preprint2022arXiv

The Beurling-type theorem in the Bergman space $A^2_α(D)$ for any $-1<α<+\infty$

In this paper, we use a new method to solve a long-standing problem. More specifically, we show that the Beurling-type theorem holds in the Bergman space $A^2_α(D)$ for any $-1<α< +\infty$. That is, every invariant subspace $H$ for the shift operator $S$ on $A^2_α(D)$ $(-1<α< +\infty)$ has the property $H=[H\ominus zH]_{S,A^2_α\left(D\right)}$.

preprint2021arXiv

Intersecting Surface defects and 3d Superconformal indices

We compute the 3d N = 2 superconformal indices for 3d/1d coupled systems, which arise as the worldvolume theories of intersecting surface defects engineered by Higgsing 5d N = 1 gauge theories. We generalize some known 3d dualities, including non-Abelian 3d mirror symmetry and 3d/3d correspondence, to some of the simple 3d/1d coupled systems. Finally we propose a q-Virasoro construction for the superconformal indices.

preprint2020arXiv

Elastic Bulk Synchronous Parallel Model for Distributed Deep Learning

The bulk synchronous parallel (BSP) is a celebrated synchronization model for general-purpose parallel computing that has successfully been employed for distributed training of machine learning models. A prevalent shortcoming of the BSP is that it requires workers to wait for the straggler at every iteration. To ameliorate this shortcoming of classic BSP, we propose ELASTICBSP a model that aims to relax its strict synchronization requirement. The proposed model offers more flexibility and adaptability during the training phase, without sacrificing on the accuracy of the trained model. We also propose an efficient method that materializes the model, named ZIPLINE. The algorithm is tunable and can effectively balance the trade-off between quality of convergence and iteration throughput, in order to accommodate different environments or applications. A thorough experimental evaluation demonstrates that our proposed ELASTICBSP model converges faster and to a higher accuracy than the classic BSP. It also achieves comparable (if not higher) accuracy than the other sensible synchronization models.

preprint2016arXiv

Coherent and incoherent nonparaxial self-accelerating Weber beams

We investigate the coherent and incoherent nonparaxial Weber beams, theoretically and numerically. We show that the superposition of coherent self-accelerating Weber beams with transverse displacement cannot display the nonparaxial accelerating Talbot effect. The reason is that their lobes do not accelerate in unison, which is a requirement for the appearance of the effect. While for the incoherent Weber beams, they naturally cannot display the accelerating Talbot effect but can display the nonparaxial accelerating properties, although the transverse coherence length is smaller than the beam width, based on the second-order coherence theory. Our research method directly applies to the nonparaxial Mathieu beams as well, and one will obtain similar conclusions as for the Weber beams, although this is not discussed in the paper. Our investigation identifies families of nonparaxial accelerating beams that do not exhibit the accelerating Talbot effect, and in addition broadens the understanding of coherence properties of such nonparaxial accelerating beams.

preprint2012arXiv

The Bouleau-Yor identity for a bi-fractional Brownian motion

Let $B$ be a bi-fractional Brownian motion with indices $H\in (0,1),K\in (0,1]$, $2HK=1$ and let ${\mathscr L}(x,t)$ be its local time process. We construct a Banach space ${\mathscr H}$ of measurable functions such that the quadratic covariation $[f(B),B]$ and the integral $\int_{\mathbb R}f(x){\mathscr L}(dx,t)$ exist provided $f\in {\mathscr H}$. Moreover, the Bouleau-Yor identity $$ [f(B),B]_t=-2^{1-K}\int_{\mathbb R}f(x){\mathscr L}(dx,t),\qquad t\geq 0, $$ holds for all $f\in {\mathscr H}$.

preprint2011arXiv

The generalized quadratic covariation for fractional Brownian motion with Hurst index less than 1/2

Let $B^H$ be a fractional Brownian motion with Hurst index $0<H<1/2$. In this paper we study the {\it generalized quadratic covariation} $[f(B^H),B^H]^{(W)}$ defined by $$ [f(B^H),B^H]^{(W)}_t=\lim_{ε\downarrow 0}\frac{2H}{ε^{2H}}\int_0^t\{f(B^{H}_{s+ε})-f(B^{H}_s)\}(B^{H}_{s+ε}- B^{H}_s)s^{2H-1}ds, $$ where the limit is uniform in probability and $x\mapsto f(x)$ is a deterministic function. We construct a Banach space ${\mathscr H}$ of measurable functions such that the generalized quadratic covariation exists in $L^2$ and the Bouleau-Yor identity takes the form $$ [f(B^H),B^H]_t^{(W)}=-\int_{\mathbb {R}}f(x){\mathscr L}^{H}(dx,t) $$ provided $f\in {\mathscr H}$, where ${\mathscr L}^{H}(x,t)$ is the weighted local time of $B^H$. This allows us to write the fractional Itô formula for absolutely continuous functions with derivative belonging to ${\mathscr H}$. These are also extended to the time-dependent case.

Junfeng Liu

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

STEP3-VL-10B Technical Report

Contextualized Scene Imagination for Generative Commonsense Reasoning

Generalized space-time fractional stochastic kinetic equation

The Beurling-type theorem in the Bergman space $A^2_α(D)$ for any $-1<α<+\infty$

Intersecting Surface defects and 3d Superconformal indices

Elastic Bulk Synchronous Parallel Model for Distributed Deep Learning

Coherent and incoherent nonparaxial self-accelerating Weber beams

The Bouleau-Yor identity for a bi-fractional Brownian motion

The generalized quadratic covariation for fractional Brownian motion with Hurst index less than 1/2