Researcher profile

Jingyuan Wang

Jingyuan Wang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
16topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

Genie Centurion: Accelerating Scalable Real-World Robot Training with Human Rewind-and-Refine Guidance

While Vision-Language-Action (VLA) models show strong generalizability in various tasks, real-world deployment of robotic policy still requires large-scale, high-quality human expert demonstrations. However, data collection via human teleoperation requires continuous operator attention, which is costly, hard to scale. To address this, we propose Genie Centurion (GCENT), a scalable and general data collection paradigm based on human rewind-and-refine guidance, enabling robots' interactive learning in deployment. GCENT starts at an imperfect policy and improves over time. When the robot execution failures occur, GCENT allows robots to revert to a previous state with a rewind mechanism, after which a teleoperator provides corrective demonstrations to refine the policy. This framework supports a one-human-to-many-robots supervision scheme with a Task Sentinel module, which autonomously predicts task success and solicits human intervention when necessary. Empirical results show that GCENT achieves up to 40% higher task success rates than state-of-the-art data collection methods, and reaches comparable performance using less than half the data in long-horizon and precise tasks. We also quantify the data yield-to-effort ratio under multi-robot scenarios, demonstrating GCENT's potential for scalable and cost-efficient robot policy training in real-world environments.

preprint2026arXiv

SOAR: Real-Time Joint Optimization of Order Allocation and Robot Scheduling in Robotic Mobile Fulfillment Systems

Robotic Mobile Fulfillment Systems (RMFS) rely on mobile robots for automated inventory transportation, coordinating order allocation and robot scheduling to enhance warehousing efficiency. However, optimizing RMFS is challenging due to strict real-time constraints and the strong coupling of multi-phase decisions. Existing methods either decompose the problem into isolated sub-tasks to guarantee responsiveness at the cost of global optimality, or rely on computationally expensive global optimization models that are unsuitable for dynamic industrial environments. To bridge this gap, we propose SOAR, a unified Deep Reinforcement Learning framework for real-time joint optimization. SOAR transforms order allocation and robot scheduling into a unified process by utilizing soft order allocations as observations. We formulate this as an Event-Driven Markov Decision Process, enabling the agent to perform simultaneous scheduling in response to asynchronous system events. Technically, we employ a Heterogeneous Graph Transformer to encode the warehouse state and integrate phased domain knowledge. Additionally, we incorporate a reward shaping strategy to address sparse feedback in long-horizon tasks. Extensive experiments on synthetic and real-world industrial datasets, in collaboration with Geekplus, demonstrate that SOAR reduces global makespan by 7.5\% and average order completion time by 15.4\% with sub-100ms latency. Furthermore, sim-to-real deployment confirms its practical viability and significant performance gains in production environments. The code is available at https://github.com/200815147/SOAR.

preprint2025arXiv

From Target Tracking to Targeting Track -- Part III: Stochastic Process Modeling and Online Learning

This is the third part of a series of studies that model the target trajectory, which describes the target state evolution over continuous time, as a sample path of a stochastic process (SP). By adopting a deterministic-stochastic decomposition framework, we decompose the learning of the trajectory SP into two sequential stages: the first fits the deterministic trend of the trajectory using a curve function of time, while the second estimates the residual stochastic component through parametric learning of either a Gaussian process (GP) or Student's-$t$ process (StP). This leads to a Markov-free data-driven tracking approach that produces the continuous-time trajectory with minimal prior knowledge of the target dynamics. Notably, our approach explicitly models both the temporal correlations of the state sequence and of measurement noises through the SP framework. It does not only take advantage of the smooth trend of the target but also makes use of the long-term temporal correlation of both the data noise and the model fitting error. Simulations in four maneuvering target tracking scenarios have demonstrated its effectiveness and superiority in comparison with existing approaches.

preprint2023arXiv

Absence of spontaneous time-reversal symmetry breaking and ferromagnetism in superconducting NiBi3 single crystal

Recent experiments have pointed to a chiral p-wave-like superconductivity in epitaxial Bi/Ni bilayers that are spontaneously time-reversal symmetry breaking (TRSB), making it a promising platform for exploring physics useful for topologically protected quantum computing. Quite intriguingly, evidence has emerged that in non-epitaxial Bi/Ni bilayers, superconductivity arises due to the formation of NiBi3, which has been reported to host coexisting ferromagnetic and superconducting orders at the surface. We perform high resolution surface magneto-optic Kerr effect (SMOKE) measurements using a Sagnac interferometer on single crystal NiBi3 and find no sign of any spontaneous Kerr signal except for contributions from trapped vortices. This strongly indicates the absence of TRSB in NiBi3, whether due to TRSB in the superconducting state or any coexisting ferromagnetism, and we conclude that the superconductivity found in non-epitaxial Bi/Ni is distinctively different from that in epitaxial Bi/Ni.

preprint2023arXiv

Continuous Trajectory Generation Based on Two-Stage GAN

Simulating the human mobility and generating large-scale trajectories are of great use in many real-world applications, such as urban planning, epidemic spreading analysis, and geographic privacy protect. Although many previous works have studied the problem of trajectory generation, the continuity of the generated trajectories has been neglected, which makes these methods useless for practical urban simulation scenarios. To solve this problem, we propose a novel two-stage generative adversarial framework to generate the continuous trajectory on the road network, namely TS-TrajGen, which efficiently integrates prior domain knowledge of human mobility with model-free learning paradigm. Specifically, we build the generator under the human mobility hypothesis of the A* algorithm to learn the human mobility behavior. For the discriminator, we combine the sequential reward with the mobility yaw reward to enhance the effectiveness of the generator. Finally, we propose a novel two-stage generation process to overcome the weak point of the existing stochastic generation process. Extensive experiments on two real-world datasets and two case studies demonstrate that our framework yields significant improvements over the state-of-the-art methods.

preprint2022arXiv

ElitePLM: An Empirical Study on General Language Ability Evaluation of Pretrained Language Models

Nowadays, pretrained language models (PLMs) have dominated the majority of NLP tasks. While, little research has been conducted on systematically evaluating the language abilities of PLMs. In this paper, we present a large-scale empirical study on general language ability evaluation of PLMs (ElitePLM). In our study, we design four evaluation dimensions, i.e. memory, comprehension, reasoning, and composition, to measure ten widely-used PLMs within five categories. Our empirical results demonstrate that: (1) PLMs with varying training objectives and strategies are good at different ability tests; (2) fine-tuning PLMs in downstream tasks is usually sensitive to the data size and distribution; (3) PLMs have excellent transferability between similar tasks. Moreover, the prediction results of PLMs in our experiments are released as an open resource for more deep and detailed analysis on the language abilities of PLMs. This paper can guide the future work to select, apply, and design PLMs for specific tasks. We have made all the details of experiments publicly available at https://github.com/RUCAIBox/ElitePLM.

preprint2021arXiv

A knowledge transfer model for COVID-19 predicting and non-pharmaceutical intervention simulation

Since December 2019, A novel coronavirus (2019-nCoV) has been breaking out in China, which can cause respiratory diseases and severe pneumonia. Mathematical and empirical models relying on the epidemic situation scale for forecasting disease outbreaks have received increasing attention. Given its successful application in the evaluation of infectious diseases scale, we propose a Susceptible-Undiagnosed-Infected-Removed (SUIR) model to offer the effective prediction, prevention, and control of infectious diseases. Our model is a modified susceptible-infected-recovered (SIR) model that injects undiagnosed state and offers pre-training effective reproduction number. Our SUIR model is more precise than the traditional SIR model. Moreover, we combine domain knowledge of the epidemic to estimate effective reproduction number, which addresses the initial susceptible population of the infectious disease model approach to the ground truth. These findings have implications for the forecasting of epidemic trends in COVID-19 as these could help the growth of estimating epidemic situation.

preprint2020arXiv

Are L2 adversarial examples intrinsically different?

Deep Neural Network (DDN) has achieved notable success in various tasks, including many security concerning scenarios. However, a considerable amount of work has proved its vulnerability to adversaries. We unravel the properties that can intrinsically differentiate adversarial examples and normal inputs through theoretical analysis. That is, adversarial examples generated by $L_2$ attacks usually have larger input sensitivity which can be used to identify them efficiently. We also found that those generated by $L_\infty$ attacks will be different enough in the pixel domain to be detected empirically. To verify our analysis, we proposed a \textbf{G}uided \textbf{C}omplementary \textbf{D}efense module (\textbf{GCD}) integrating detection and recovery processes. When compared with adversarial detection methods, our detector achieves a detection AUC of over 0.98 against most of the attacks. When comparing our guided rectifier with commonly used adversarial training methods and other rectification methods, our rectifier outperforms them by a large margin. We achieve a recovered classification accuracy of up to 99\% on MNIST, 89\% on CIFAR-10, and 87\% on ImageNet subsets against $L_2$ attacks. Furthermore, under the white-box setting, our holistic defensive module shows a promising degree of robustness. Thus, we confirm that at least $L_2$ adversarial examples are intrinsically different enough from normal inputs both theoretically and empirically. And we shed light upon designing simple yet effective defensive methods with these properties.

preprint2020arXiv

Correlation-driven eightfold magnetic anisotropy in a two-dimensional oxide monolayer

Engineering magnetic anisotropy in two-dimensional systems has enormous scientific and technological implications. The uniaxial anisotropy universally exhibited by two-dimensional magnets has only two stable spin directions, demanding 180 degrees spin switching between states. We demonstrate a novel eightfold anisotropy in magnetic SrRuO3 monolayers by inducing a spin reorientation in (SrRuO3)1/(SrTiO3)N superlattices, in which the magnetic easy axis of Ru spins is transformed from uniaxial <001> direction (N = 1 and 2) to eightfold <111> directions (N = 3, 4 and 5). This eightfold anisotropy enables 71 and 109 degrees spin switching in SrRuO3 monolayers, analogous to 71 and 109 degrees polarization switching in ferroelectric BiFeO3. First-principle calculations reveal that increasing the SrTiO3 layer thickness induces an emergent correlation-driven orbital ordering, tuning spin-orbit interactions and reorienting the SrRuO3 monolayer easy axis. Our work demonstrates that correlation effects can be exploited to substantially change spin-orbit interactions, stabilizing unprecedented properties in two-dimensional magnets and opening rich opportunities for low-power, multi-state device applications.

preprint2019arXiv

Tailoring Hybrid Anomalous Hall Response in Engineered Magnetic Topological Insulator Heterostructures

Engineering the anomalous Hall effect (AHE) in the emerging magnetic topological insulators (MTIs) has great potentials for quantum information processing and spintronics applications. In this letter, we synthesize the epitaxial Bi2Te3/MnTe magnetic heterostructures and observe pronounced AHE signals from both layers combined together. The evolution of the resulting hybrid AHE intensity with the top Bi2Te3 layer thickness manifests the presence of an intrinsic ferromagnetic phase induced by the topological surface states at the heterolayer-interface. More importantly, by doping the Bi2Te3 layer with Sb, we are able to manipulate the sign of the Berry phase-associated AHE component. Our results demonstrate the un-paralleled advantages of MTI heterostructures over magnetically doped TI counterparts, in which the tunability of the AHE response can be greatly enhanced. This in turn unveils a new avenue for MTI heterostructure-based multifunctional applications.