Researcher profile

Lina Chen

Lina Chen contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

STEP3-VL-10B Technical Report

We present STEP3-VL-10B, a lightweight open-source foundation model designed to redefine the trade-off between compact efficiency and frontier-level multimodal intelligence. STEP3-VL-10B is realized through two strategic shifts: first, a unified, fully unfrozen pre-training strategy on 1.2T multimodal tokens that integrates a language-aligned Perception Encoder with a Qwen3-8B decoder to establish intrinsic vision-language synergy; and second, a scaled post-training pipeline featuring over 1k iterations of reinforcement learning. Crucially, we implement Parallel Coordinated Reasoning (PaCoRe) to scale test-time compute, allocating resources to scalable perceptual reasoning that explores and synthesizes diverse visual hypotheses. Consequently, despite its compact 10B footprint, STEP3-VL-10B rivals or surpasses models 10$\times$-20$\times$ larger (e.g., GLM-4.6V-106B, Qwen3-VL-235B) and top-tier proprietary flagships like Gemini 2.5 Pro and Seed-1.5-VL. Delivering best-in-class performance, it records 92.2% on MMBench and 80.11% on MMMU, while excelling in complex reasoning with 94.43% on AIME2025 and 75.95% on MathVision. We release the full model suite to provide the community with a powerful, efficient, and reproducible baseline.

preprint2024arXiv

Can Large Language Models Understand Real-World Complex Instructions?

Large language models (LLMs) can understand human instructions, showing their potential for pragmatic applications beyond traditional NLP tasks. However, they still struggle with complex instructions, which can be either complex task descriptions that require multiple tasks and constraints, or complex input that contains long context, noise, heterogeneous information and multi-turn format. Due to these features, LLMs often ignore semantic constraints from task descriptions, generate incorrect formats, violate length or sample count constraints, and be unfaithful to the input text. Existing benchmarks are insufficient to assess LLMs' ability to understand complex instructions, as they are close-ended and simple. To bridge this gap, we propose CELLO, a benchmark for evaluating LLMs' ability to follow complex instructions systematically. We design eight features for complex instructions and construct a comprehensive evaluation dataset from real-world scenarios. We also establish four criteria and develop corresponding metrics, as current ones are inadequate, biased or too strict and coarse-grained. We compare the performance of representative Chinese-oriented and English-oriented models in following complex instructions through extensive experiments. Resources of CELLO are publicly available at https://github.com/Abbey4799/CELLO.

preprint2022arXiv

Almost maximal volume entropy rigidity for integral Ricci curvature in the non-collapsing case

In this note we will show the almost maximal volume entropy rigidity for manifolds with lower integral Ricci curvature bound in the non-collapsing case: Given $n, d, p>\frac{n}{2}$, there exist $δ(n, d, p), ε(n, d, p)>0$, such that for $δ<δ(n, d, p)$, $ε<ε(n, d, p)$, if a compact $n$-manifold $M$ satisfies that the integral Ricci curvature has lower bound $\bar k(-1, p)\leq δ$, the diameter $diam(M)\leq d$ and volume entropy $h(M)\geq n-1-ε$, then the universal cover of $M$ is Gromov-Hausdorff close to a hyperbolic space form $\Bbb H^k$, $k\leq n$; If in addition the volume of $M$, $vol(M)\geq v>0$, then $M$ is diffeomorphic and Gromov-Hausdorff close to a hyperbolic manifold where $δ, ε$ also depends on $v$.

preprint2022arXiv

Almost volume cone implies almost metric cone for annuluses centered at a compact set in $RCD(K, N)$-spaces

In \cite{CC1}, Cheeger-Colding considered manifolds with lower Ricci curvature bound and gave some almost rigidity results about warped products including almost metric cone rigidity and quantitative splitting theorem. As a generalization of manifolds with lower Ricci curvature bound, for metric measure spaces in $RCD(K, N)$, $1<N<\infty$, splitting theorem \cite{Gi13} and &#34;volume cone implies metric cone&#34; rigidity for balls and annuluses of a point \cite{PG} have been proved. In this paper we will generalize Cheeger-Colding&#39;s \cite{CC1} result about &#34;almost volume cone implies almost metric cone for annuluses of a compact subset &#34; to $RCD(K, N)$-spaces. More precisely, consider a $RCD(K, N)$-space $(X, d, \mathfrak m)$ and a Borel subset $Ω\subset X$. If the closed subset $S=\partial Ω$ has finite outer curvature, the diameter ${diam}(S)\leq D$ and the mean curvature of $S$ satisfies $$m(x)\leq m, \, \forall x\in S,$$ and \begin{equation*}\mathfrak m(A_{a, b}(S))\geq (1-ε)\int_a^b \left({sn}&#39;_H(r)+ \frac{m}{n-1}{sn}_H(r)\right)^{n-1}dr \mathfrak m_S(S)\end{equation*} then $A_{a&#39;, b&#39;}(S)$ is measured Gromov-Hausdorff close to a warped product $(a&#39;, b&#39;)\times_{{sn}&#39;_H(r)+ \frac{m}{n-1}{sn}_H(r)}Y,$ $A_{a, b}(S)=\{x\in X\setminus Ω, \, a<d(x, S)<b\}$, $a<a&#39;<b&#39;<b$, $Y$ is a metric space with finite components with each component is a $RCD(0, N-1)$-space when $m=0, K=0$ and is a $RCD(N-2, N-1)$-space for other cases and $H=\frac{K}{N-1}$. Note that when $m=0, K=0$, our result is a kind of quantitative splitting theorem and in other cases it is an almost metric cone rigidity. To prove this result, different from \cite{Gi13, PG}, we will use \cite{GiT}&#39;s second order differentiation formula and a method similar as \cite{CC1}.

preprint2022arXiv

HistoKT: Cross Knowledge Transfer in Computational Pathology

The lack of well-annotated datasets in computational pathology (CPath) obstructs the application of deep learning techniques for classifying medical images. %Since pathologist time is expensive, dataset curation is intrinsically difficult. Many CPath workflows involve transferring learned knowledge between various image domains through transfer learning. Currently, most transfer learning research follows a model-centric approach, tuning network parameters to improve transfer results over few datasets. In this paper, we take a data-centric approach to the transfer learning problem and examine the existence of generalizable knowledge between histopathological datasets. First, we create a standardization workflow for aggregating existing histopathological data. We then measure inter-domain knowledge by training ResNet18 models across multiple histopathological datasets, and cross-transferring between them to determine the quantity and quality of innate shared knowledge. Additionally, we use weight distillation to share knowledge between models without additional training. We find that hard to learn, multi-class datasets benefit most from pretraining, and a two stage learning framework incorporating a large source domain such as ImageNet allows for better utilization of smaller datasets. Furthermore, we find that weight distillation enables models trained on purely histopathological features to outperform models using external natural image data.

preprint2020arXiv

Maximizing spin-orbit torque efficiency of Ta(O)/Py via modulating oxygen-induced interface orbital hybridization

Spin-orbit torques due to interfacial Rashba and spin Hall effects have been widely considered as a potentially more efficient approach than the conventional spin-transfer torque to control the magnetization of ferromagnets. We report a comprehensive study of spin-orbit torque efficiency in Ta(O)/Ni81Fe19 bilayers by tuning low-oxidation of \b{eta}-phase tantalum, and find that the spin Hall angle θDL increases from ~ -0.18 of the pure Ta/Py to the maximum value ~ -0.30 of Ta(O)/Py with 7.8% oxidation. Furthermore, we distinguish the efficiency of the spin-orbit torque generated by the bulk spin Hall effect and by interfacial Rashba effect, respectively, via a series of Py/Cu(0-2 nm)/Ta(O) control experiments. The latter has more than twofold enhancement, and even more significant than that of the former at the optimum oxidation level. Our results indicate that 65% enhancement of the efficiency should be related to the modulation of the interfacial Rashba-like spin-orbit torque due to oxygen-induced orbital hybridization cross the interface. Our results suggest that the modulation of interfacial coupling via oxygen-induced orbital hybridization can be an alternative method to boost the change-spin conversion rate.