Source author record

Zidong Wang

Zidong Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

cond-mat.mes-hall, cond-mat.mtrl-sci, Artificial Intelligence, Computer Vision, Machine Learning, Biomolecules, Computation and Language, eess.IV, Information Retrieval, physics.med-ph

Catalog footprint

What is connected

14 works

10 topics

4 close collaborators

preprint, 2026, arXiv

4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding

Dynamic spatial reasoning from monocular video is essential for bridging visual intelligence and the physical world, yet remains challenging for vision-language models (VLMs). Prior approaches either verbalize spatial-temporal reasoning entirely as text, which is inherently verbose and imprecise for complex dynamics, or rely on external geometric modules that increase inference complexity without fostering intrinsic model capability. In this paper, we present 4DThinker, the first framework that enables VLMs to "think with 4D" through dynamic latent mental imagery, i.e., internally simulating how scenes evolve within the continuous hidden space. Specifically, we first introduce a scalable, annotation-free data generation pipeline that synthesizes 4D reasoning data from raw videos. We then propose Dynamic-Imagery Fine-Tuning (DIFT), which jointly supervises textual tokens and 4D latents to ground the model in dynamic visual semantics. Building on this, 4D Reinforcement Learning (4DRL) further tackles complex reasoning tasks via outcome-based rewards, restricting policy gradients to text tokens to ensure stable optimization. Extensive experiments across multiple dynamic spatial reasoning benchmarks demonstrate that 4DThinker consistently outperforms strong baselines and offers a new perspective toward 4D reasoning in VLMs. Our code is available at https://github.com/zhangquanchen/4DThinker.

preprint, 2026, arXiv

StraTA: Incentivizing Agentic Reinforcement Learning with Strategic Trajectory Abstraction

Large language models (LLMs) are increasingly used as interactive agents, but optimizing them for long-horizon decision making remains difficult because current methods are largely purely reactive, which weakens both exploration and credit assignment over extended trajectories. In this work, we present Strategic Trajectory Abstraction (StraTA), a simple framework that introduces an explicit trajectory-level strategy into agentic reinforcement learning (RL). StraTA samples a compact strategy from the initial task state, conditions subsequent actions on that strategy, and trains strategy generation and action execution jointly with a hierarchical GRPO-style rollout design, further enhanced by diverse strategy rollout and critical self-judgment. Experiments on ALFWorld, WebShop, and SciWorld show that StraTA consistently improves both sample efficiency and final performance over strong baselines. StraTA reaches success rates of 93.1% on ALFWorld and 84.2% on WebShop. On SciWorld, StraTA attains a 63.5% overall score, outperforming frontier closed-source models.
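The abstract describes a "GRPO-style" rollout design without giving details. As a rough illustration only, the sketch below shows the group-relative advantage normalization commonly associated with GRPO: the rewards of a group of rollouts sampled for the same task are centered and scaled by the group statistics. The function name, the use of the population standard deviation, and the `eps` constant are assumptions for this sketch; StraTA's hierarchical variant, its diverse strategy rollout, and its self-judgment step are not reproduced here.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Center and scale the rewards of one rollout group (hypothetical helper).

    Each rollout's advantage is its reward minus the group mean, divided by
    the group standard deviation. eps avoids division by zero when every
    rollout in the group received the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four rollouts for the same task, two of which succeeded.
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Successful rollouts receive positive advantages and failed ones negative advantages, so no separate value network is needed to provide a baseline.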

preprint, 2025, arXiv

KP-Agent: Keyword Pruning in Sponsored Search Advertising via LLM-Powered Contextual Bandits

Sponsored search advertising (SSA) requires advertisers to constantly adjust keyword strategies. While bid adjustment and keyword generation are well studied, keyword pruning (refining keyword sets to enhance campaign performance) remains under-explored. This paper addresses critical inefficiencies in current practices, as evidenced by a dataset containing 0.5 million SSA records from a pharmaceutical advertiser on the search engine of Meituan, China's largest delivery platform. We propose KP-Agent, an LLM agentic system with a domain tool set and a memory module. By modeling keyword pruning within a contextual bandit framework, KP-Agent generates code snippets to refine keyword sets through reinforcement learning. Experiments show KP-Agent improves cumulative profit by up to 49.28% over baselines.

preprint, 2022, arXiv

Data and Physics Driven Learning Models for Fast MRI -- Fundamentals and Methodologies from CNN, GAN to Attention and Transformers

Research studies have shown no qualms about using data driven deep learning models for downstream tasks in medical image analysis, e.g., anatomy segmentation and lesion detection, disease diagnosis and prognosis, and treatment planning. However, deep learning models are not the sovereign remedy for medical image analysis when the upstream imaging is not being conducted properly (with artefacts). This has been manifested in MRI studies, where the scanning is typically slow, prone to motion artefacts, with a relatively low signal to noise ratio, and poor spatial and/or temporal resolution. Recent studies have witnessed substantial growth in the development of deep learning techniques for propelling fast MRI. This article aims to (1) introduce the deep learning based data driven techniques for fast MRI including convolutional neural network and generative adversarial network based methods, (2) survey the attention and transformer based models for speeding up MRI reconstruction, and (3) detail the research in coupling physics and data driven models for MRI acceleration. Finally, we will demonstrate through a few clinical applications, explain the importance of data harmonisation and explainable models for such fast MRI techniques in multicentre and multi-scanner studies, and discuss common pitfalls in current research and recommendations for future research directions.

preprint, 2022, arXiv

PSP: Million-level Protein Sequence Dataset for Protein Structure Prediction

Proteins are an essential component of human life, and their structures are important for function and mechanism analysis. Recent work has shown the potential of AI-driven methods for protein structure prediction. However, the development of new models is restricted by the lack of datasets and benchmark training procedures. To the best of our knowledge, the existing open source datasets fall far short of the needs of modern protein sequence-structure related research. To solve this problem, we present the first million-level protein structure prediction dataset with high coverage and diversity, named PSP. This dataset consists of 570k true structure sequences (10TB) and 745k complementary distillation sequences (15TB). In addition, we provide the benchmark training procedure for a SOTA protein structure prediction model on this dataset. We validate the utility of this dataset for training by participating in the CAMEO contest, in which our model won first place. We hope our PSP dataset, together with the training benchmark, can enable a broader community of AI/biology researchers to pursue AI-driven protein related research.

preprint, 2021, arXiv

AsymptoticNG: A regularized natural gradient optimization algorithm with look-ahead strategy

Optimizers that further adjust the scale of the gradient, such as Adam and Natural Gradient (NG), are widely used by the community but are often found to generalize poorly compared with Stochastic Gradient Descent (SGD). They tend to converge excellently at the beginning of training but are weak at the end. An immediate idea is to complement the strengths of these algorithms with SGD. However, an abrupt replacement of the optimizer often leads to a crash of the update pattern, and new algorithms often require many iterations to stabilize their search direction. Driven by this idea and to address this problem, we design and present a regularized natural gradient optimization algorithm with a look-ahead strategy, named asymptotic natural gradient (ANG). According to the total iteration step, ANG dynamically assembles NG and the Euclidean gradient, and updates parameters along the new direction using the intensity of NG. Validation experiments on the CIFAR10 and CIFAR100 data sets show that ANG can update smoothly and stably at second-order speed and achieve better generalization performance.

preprint, 2016, arXiv

Driving Skyrmions in a Composite Bilayer

Magnetic Skyrmions and multiferroics are the most interesting objects in nanostructure science that have great potential in future spin-electronic technology. The study of multiferroic Skyrmions has attracted much interest in recent years. This article reports magnetic Bloch Skyrmions induced by an electric driving field in a composite bilayer (chiral-magnetic/ferroelectric bilayer) lattice. By using the spin dynamics method, we use a classical magnetic spin model and an electric pseudospin model, which are coupled by a strong magnetoelectric coupling in the dynamical simulations. Interestingly, we observe some skyrmion-like objects in the electric component either during the switching process or by applying a magnetic field, which is due to the connection between the electric and the magnetic structures.

preprint, 2016, arXiv

Dynamic Response in a Finite Size Composite Multiferroic Thin Film

Composite multiferroics, heterostructures of ferromagnetic (FM) and ferroelectric (FE) materials, are characterized by a remarkable magnetoelectric effect at the interface. Previous work has supported the ferromagnetic structure with magnetic spins and the ferroelectric with pseudospins which act as electric dipoles in a microscopic model, coupled with a magnetoelectric interaction [J. Appl. Phys. 118, 124109 (2015)]. In this work, by solving the stochastic Landau-Lifshitz-Gilbert equation, the electric-field-induced magnetization switching under a twisted boundary condition has been studied, and the behavior of the domain wall in the ferromagnetic structure is discussed.

preprint, 2016, arXiv

Ferroelectrics Manipulate Magnetic Bloch Skyrmions in a Composite Bilayer

Theoretical investigation demonstrates that the composite bilayer (i.e., chiral-magnetic/ferroelectric bilayer) offers the possibility of electric-induced magnetic Skyrmions [Phys. Rev. B 94, 014311 (2016)]. In this Article, we propose a micromagnetic model to physically manipulate magnetic Bloch Skyrmions propagating in a chiral-magnetic thin film, with a polarized ferroelectric essential to drive the system through the converse magnetoelectric effect. The propagation velocity, the size of the thin film, and the strength of the magnetoelectric coupling strongly affect the quality and quantity of the magnetic Skyrmions.

preprint, 2016, arXiv

Magnetic Bloch Skyrmion Transport by Electric Fields in a Composite Bilayer

We investigate a mechanical method to manipulate magnetic Bloch Skyrmions by applying an electric field in a composite chiral-magnetic (CM)/ferroelectric (FE) bilayer. The magnetoelectric coupling at the interface allows the electric field to stimulate magnetic ordering. Therefore it offers the possibility to generate Skyrmions [Phys. Rev. B 94, 014311 (2016)]. Here, we design a movable and localized electric field source to drive skyrmion transport along the bilayer. The traveling velocity of the electric field source must be carefully chosen to show the stability and efficiency of this process. The effects of high speed operation will be discussed.

preprint, 2015, arXiv

Magneto-Electric Effect for Multiferroic Thin Film by Monte Carlo Simulation

The magneto-electric effect in a multiferroic heterostructure film, i.e. a coupled ferromagnetic-ferroelectric thin film, has been investigated through the use of the Metropolis algorithm in Monte Carlo simulations. A classical Heisenberg model describes the energy stored in the ferromagnetic film, and we use a pseudo-spin model with a transverse Ising Hamiltonian to characterise the energy of electric dipoles in the ferroelectric film. The purpose of this article is to demonstrate that the dynamic response of the polarisation is driven by an external magnetic field when there is a linear magneto-electric coupling at the interface between the ferromagnetic and ferroelectric components.
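To make the Metropolis step concrete, here is a minimal sketch for the ferromagnetic part only: a periodic chain of classical Heisenberg spins in a field along z. It is an illustration under stated assumptions, not the paper's model. The chain geometry, the parameter names `J`, `h`, `T` (with k_B = 1), and all function names are hypothetical, and the ferroelectric pseudo-spins and the magneto-electric coupling are left out.

```python
import math
import random


def random_unit_vector(rng):
    """Draw a direction uniformly on the unit sphere."""
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    r = math.sqrt(max(0.0, 1.0 - z * z))
    return (r * math.cos(phi), r * math.sin(phi), z)


def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def local_energy(spins, i, J, h):
    """Energy terms involving spin i: nearest-neighbour exchange plus Zeeman."""
    n = len(spins)
    s = spins[i]
    left, right = spins[(i - 1) % n], spins[(i + 1) % n]
    return -J * (dot(s, left) + dot(s, right)) - h * s[2]


def metropolis_sweep(spins, J, h, T, rng):
    """Attempt len(spins) single-spin updates; return the number accepted."""
    n = len(spins)
    accepted = 0
    for _ in range(n):
        i = rng.randrange(n)
        old = spins[i]
        e_old = local_energy(spins, i, J, h)
        spins[i] = random_unit_vector(rng)  # trial orientation
        d_e = local_energy(spins, i, J, h) - e_old
        # Metropolis rule: always accept downhill moves, accept uphill
        # moves with probability exp(-dE / T).
        if d_e <= 0.0 or rng.random() < math.exp(-d_e / T):
            accepted += 1
        else:
            spins[i] = old  # reject: restore the previous orientation
    return accepted


def magnetisation_z(spins):
    return sum(s[2] for s in spins) / len(spins)


rng = random.Random(0)
spins = [random_unit_vector(rng) for _ in range(20)]
for _ in range(400):
    metropolis_sweep(spins, J=1.0, h=2.0, T=0.1, rng=rng)
```

Drawing a fresh random orientation for each trial keeps the sketch short. At low temperature most such proposals are rejected, so practical simulations often propose small rotations around the current orientation instead.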

preprint, 2015, arXiv

Pseudo-Spin Based Dynamical Model for Polarisation Switching in Ferroelectrics

A microscopic view of the response of the electric dipoles to a dynamic external field in a ferroelectric (FE) chain has been studied by two spin dynamics methods. One is the prominent micromagnetic approach, and the other is the micromagnetic approach with a variable size of the pseudo-spin. The energy stored in the ferroelectric chain is described by the transverse Ising model (TIM) with electric pseudo-spins. The simulations are based on a modified Landau-Lifshitz-Gilbert (LLG) equation which is precession free. The results obtained are shown and compared with the result supplemented by Landau-Devonshire (L-D) theory in the Appendix.

preprint, 2015, arXiv

Spin Dynamics in Driven Composite Multiferroics

A spin dynamics approach has been used to study the behavior of the magnetic spins and the electric pseudo-spins in a 1-D composite multiferroic chain with a linear magneto-electric coupling at the interface. The response is investigated with either external magnetic or electric fields driving the system. The spin dynamics is based on the Landau-Lifshitz-Gilbert equation. A Gaussian white noise is later added into the dynamic process to include the thermal effects. The interface requires a closer inspection of the magneto-electric effects. Thus, we construct a 2-D ladder model to describe the behavior of the magnetic spins and the electric pseudo-spins with different magneto-electric couplings.
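For reference, a commonly used form of the Landau-Lifshitz-Gilbert equation for a unit spin, with thermal effects entering as a Gaussian white-noise field, is sketched below. The notation is an assumption of this sketch: \gamma is the gyromagnetic ratio, \alpha the damping constant, \mathbf{H}_i^{\mathrm{eff}} the effective field on site i, and D a noise strength that grows with temperature. The normalisation used in the paper may differ.

```latex
\frac{d\mathbf{S}_i}{dt}
  = -\frac{\gamma}{1+\alpha^{2}}
    \Big[ \mathbf{S}_i \times \mathbf{H}_i^{\mathrm{eff}}
        + \alpha\, \mathbf{S}_i \times
          \big( \mathbf{S}_i \times \mathbf{H}_i^{\mathrm{eff}} \big) \Big],
  \qquad |\mathbf{S}_i| = 1 ,

\mathbf{H}_i^{\mathrm{eff}} \;\to\; \mathbf{H}_i^{\mathrm{eff}} + \boldsymbol{\xi}_i(t),
  \qquad \langle \xi_{i,a}(t) \rangle = 0,
  \qquad \langle \xi_{i,a}(t)\, \xi_{j,b}(t') \rangle
    = 2D\, \delta_{ij}\, \delta_{ab}\, \delta(t - t') .
```

The first term in the bracket describes precession about the effective field and the second describes damping towards it. The noise field is uncorrelated between sites (i, j), between Cartesian components (a, b), and between different times.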

preprint, 2015, arXiv

Spin Dynamics Simulation of the Magneto-Electric Effect in a Composite Multiferroic Chain

A composite multiferroic chain with an interfacial linear magneto-electric coupling is used to study the magnetic and electric responses to an external magnetic or electric field. The simulation uses continuous spin dynamics through the Landau-Lifshitz-Gilbert equations of the magnetic spin and the electric pseudo-spin. The results provide an accurate description of the distributions of the magnetisation and polarisation induced by applied electric and magnetic fields, respectively.