Researcher profile

Jianing Wang

Jianing Wang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

A Vehicle-portable Ultra-stable Laser for Operating on Highways

Portable ultra-stable lasers are essential for high-precision measurements. This study presents a 1550 nm vehicle-portable ultra-stable laser designed for continuous real-time operation on highways. We implement several measures to mitigate environmental impacts, including active temperature control with a standard deviation of mK/day to reduce frequency drift of the optical reference cavity, all-polarization-maintaining fiber devices to enhance the robustness of the optical path, and highly integrated electronic units to diminish thermal effects. The performance of the ultra-stable laser is evaluated through real-time beat frequency measurements with another similar ultra-stable laser over a transport distance of approximately 100 km, encompassing rural roads, national roads, urban roads, and expressways. The results indicate frequency stability of approximately 10-12/(0.01s-100 s) during transport, about 5E-14/s while the vehicle is stationary with the engine running, and around 3E-15/s with the engine off, all without active vibration isolation. This work marks the first recorded instance of a portable ultra-stable laser achieving continuous real-time operation on highways and lays a crucial foundation for non-laboratory applications, such as mobile laser communication and dynamic free-space time-frequency comparison.

preprint2026arXiv

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

Recent advances in agentic harness with orchestration frameworks that coordinate multiple agents with memory, skills, and tool use have achieved remarkable success in complex reasoning tasks. However, the underlying mechanism that truly drives performance remains obscured behind intricate system designs. In this paper, we propose HeavySkill, a perspective that views heavy thinking not only as a minimal execution unit in orchestration harness but also as an inner skill internalized within the model's parameters that drives the orchestrator to solve complex tasks. We identify this skill as a two-stage pipeline, i.e., parallel reasoning then summarization, which can operate beneath any agentic harness. We present a systematic empirical study of HeavySkill across diverse domains. Our results show that this inner skill consistently outperforms traditional Best-of-N (BoN) strategies; notably, stronger LLMs can even approach Pass@N performance. Crucially, we demonstrate that the depth and width of heavy thinking, as a learnable skill, can be further scaled via reinforcement learning, offering a promising path toward self-evolving LLMs that internalize complex reasoning without relying on brittle orchestration layers.

preprint2023arXiv

Value Enhancement of Reinforcement Learning via Efficient and Robust Trust Region Optimization

Reinforcement learning (RL) is a powerful machine learning technique that enables an intelligent agent to learn an optimal policy that maximizes the cumulative rewards in sequential decision making. Most of methods in the existing literature are developed in \textit{online} settings where the data are easy to collect or simulate. Motivated by high stake domains such as mobile health studies with limited and pre-collected data, in this paper, we study \textit{offline} reinforcement learning methods. To efficiently use these datasets for policy optimization, we propose a novel value enhancement method to improve the performance of a given initial policy computed by existing state-of-the-art RL algorithms. Specifically, when the initial policy is not consistent, our method will output a policy whose value is no worse and often better than that of the initial policy. When the initial policy is consistent, under some mild conditions, our method will yield a policy whose value converges to the optimal one at a faster rate than the initial policy, achieving the desired ``value enhancement" property. The proposed method is generally applicable to any parametrized policy that belongs to certain pre-specified function class (e.g., deep neural networks). Extensive numerical studies are conducted to demonstrate the superior performance of our method.

preprint2022arXiv

KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive Question Answering

Extractive Question Answering (EQA) is one of the most important tasks in Machine Reading Comprehension (MRC), which can be solved by fine-tuning the span selecting heads of Pre-trained Language Models (PLMs). However, most existing approaches for MRC may perform poorly in the few-shot learning scenario. To solve this issue, we propose a novel framework named Knowledge Enhanced Contrastive Prompt-tuning (KECP). Instead of adding pointer heads to PLMs, we introduce a seminal paradigm for EQA that transform the task into a non-autoregressive Masked Language Modeling (MLM) generation problem. Simultaneously, rich semantics from the external knowledge base (KB) and the passage context are support for enhancing the representations of the query. In addition, to boost the performance of PLMs, we jointly train the model by the MLM and contrastive learning objectives. Experiments on multiple benchmarks demonstrate that our method consistently outperforms state-of-the-art approaches in few-shot settings by a large margin.

preprint2022arXiv

Math-KG: Construction and Applications of Mathematical Knowledge Graph

Recently, the explosion of online education platforms makes a success in encouraging us to easily access online education resources. However, most of them ignore the integration of massive unstructured information, which inevitably brings the problem of \textit{information overload} and \textit{knowledge trek}. In this paper, we proposed a mathematical knowledge graph named Math-KG, which automatically constructed by the pipeline method with the natural language processing technology to integrate the resources of the mathematics. It is built from the corpora of Baidu Baike, Wikipedia. We implement a simple application system to validate the proposed Math-KG can make contributions on a series of scenes, including faults analysis and semantic search. The system is publicly available at GitHub \footnote{\url{https://github.com/wjn1996/Mathematical-Knowledge-Entity-Recognition}.}.

preprint2022arXiv

The skyrmion bags in an anisotropy gradient

Skyrmion bags as spin textures with arbitrary topological charge are expected to be the carriers in racetrack memory. Here, we theoretically and numerically investigated the dynamics of skyrmion bags in an anisotropy gradient. It is found that, without the boundary potential, the dynamics of skyrmion bags are dependent on the spin textures, and the velocity of skyrmionium with $Q = 0$ is faster than other skyrmion bags. However, when the skyrmion bags move along the boundary, the velocities of all skyrmion bags with different $Q$ are same. This can be attributed to the same value of $u/η_{xx}$, where the $u$ and $η_{xx}$ are the terms related to the magnetization distribution of skyrmion bag. In addition, we theoretically derived the dynamics of skyrmion bags in the two cases using the Thiele approach and discussed the scope of Thiele equation. Within a certain range, the simulation results are in good agreement with the analytically calculated results. Our findings provide an alternative way to manipulate the racetrack memory based on the skyrmion bags.

preprint2022arXiv

Towards Unified Prompt Tuning for Few-shot Text Classification

Prompt-based fine-tuning has boosted the performance of Pre-trained Language Models (PLMs) on few-shot text classification by employing task-specific prompts. Yet, PLMs are unfamiliar with prompt-style expressions during pre-training, which limits the few-shot learning performance on downstream tasks. It would be desirable if the models can acquire some prompting knowledge before adaptation to specific NLP tasks. We present the Unified Prompt Tuning (UPT) framework, leading to better few-shot text classification for BERT-style models by explicitly capturing prompting semantics from non-target NLP datasets. In UPT, a novel paradigm Prompt-Options-Verbalizer is proposed for joint prompt learning across different NLP tasks, forcing PLMs to capture task-invariant prompting knowledge. We further design a self-supervised task named Knowledge-enhanced Selective Masked Language Modeling to improve the PLM's generalization abilities for accurate adaptation to previously unseen tasks. After multi-task learning across multiple tasks, the PLM can be better prompt-tuned towards any dissimilar target tasks in low-resourced settings. Experiments over a variety of NLP tasks show that UPT consistently outperforms state-of-the-arts for prompt-based fine-tuning.

preprint2021arXiv

RH-Net: Improving Neural Relation Extraction via Reinforcement Learning and Hierarchical Relational Searching

Distant supervision (DS) aims to generate large-scale heuristic labeling corpus, which is widely used for neural relation extraction currently. However, it heavily suffers from noisy labeling and long-tail distributions problem. Many advanced approaches usually separately address two problems, which ignore their mutual interactions. In this paper, we propose a novel framework named RH-Net, which utilizes Reinforcement learning and Hierarchical relational searching module to improve relation extraction. We leverage reinforcement learning to instruct the model to select high-quality instances. We then propose the hierarchical relational searching module to share the semantics from correlative instances between data-rich and data-poor classes. During the iterative process, the two modules keep interacting to alleviate the noisy and long-tail problem simultaneously. Extensive experiments on widely used NYT data set clearly show that our method significant improvements over state-of-the-art baselines.

preprint2020arXiv

Interaction between defect and skyrmion in nanodisk

Magnetic skyrmions are topologically protected stable magnetization configurations, which are expected to be a promising candidate as information carrier, while defect is inevitable and plays an important role on the stabilization and movement of skyrmion. In this paper, we investigated the influence of a point defect and a ring defect on the stabilization and dynamics of skyrmion in the nanodisk, where the defect are acquired by local modification magnetic material parameters. Considering the combined action of geometry confinement and pinning effect, we demonstrate the preferred skyrmion position as a function of defect type and distance between skyrmion and a point defect. We also show the confinement effect on skyrmion size in the presence of a ring defect due to circular symmetry. Finally, in the application of spin transfer nano-oscillator, we show that the skyrmion can be pinned or rotate in the nanodisk, the oscillator frequency can be modified in a large variation by the ring defect. These findings provide a complete understanding of interaction between skyrmion and defect in the confined geometry and may provide a good strategy for the design of skyrmion oscillators.

preprint2019arXiv

Spin current pumped by resonant skyrmion

Spin pumping is a widely recognized method to generate the spin current in the spintronics, which is acknowledged as a fundamentally dynamic process equivalent to the spin-transfer torque. In this work, we theoretically verify that the oscillating spin current can be pumped from the microwave-motivated breathing skyrmion. The skyrmion spin pumping can be excited by a relatively low frequency compared with the ferromagnetic resonance (FMR) and the current density is larger than the ordinary FMR spin pumping. Based on the skyrmion spin pumping, we build a high reading-speed racetrack memory model whose reading speed is an order of magnitude higher than the SOT (spin-orbit torque) /STT (spin-transfer torque) skyrmion racetrack. Our work explored the spin pumping phenomenon in the skyrmion, and it may contribute to the applications of the skyrmion-based device.