Researcher profile

Shuai Dong

Shuai Dong contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
19works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

19 published item(s)

preprint2026arXiv

Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization

Group Relative Policy Optimization has emerged as essential for aligning video diffusion models with human preferences, but faces a critical computational bottleneck: training a 14B parametered model typically demands hundreds of GPU days per experiment. Existing efficiency methods reduce costs through sliding window subsampling training timesteps, but fundamentally compromise optimization, exhibiting severe instability and failing to reach full trajectory performance. We present Flash-GRPO, a single-step training framework that outperforms full trajectory training in alignment quality under low computational budgets while substantially improving training efficiency. Flash-GRPO addresses two critical challenges: iso-temporal grouping eliminates timestep-confounded variance by enforcing prompt-wise temporal consistency, decoupling policy performance from timestep difficulty; temporal gradient rectification neutralizes the time-dependent scaling factor that causes vastly inconsistent gradient magnitudes across timesteps. Experiments on 1.3B to 14B parameter models validate Flash-GRPO's effectiveness, demonstrating substantial training acceleration with consistent stability and state-of-the-art alignment quality.

preprint2026arXiv

Leveraging Error Diversity in Group Rollouts for Reinforcement Learning

Reinforcement Learning from Verifiable Rewards (RLVR) typically samples multiple responses per prompt and assigns binary rewards based on individual correctness, yet the collective structure of the group output, specifically the distribution of errors, is largely discarded. We identify this as a missed opportunity: empirical analysis reveals that error diversity within a group is a strong predictor of training success, with problems eliciting diverse wrong answers benefiting substantially more from RLVR than those producing homogeneous failures. Motivated by this observation, we propose Error Diversity Advantage Shaping (EDAS), a lightweight, algorithm-agnostic technique that modulates the advantage signal for incorrect rollouts based on intra-group error diversity. EDAS amplifies penalties for dominant, repeated errors and attenuates penalties for rare, exploratory ones, thereby encouraging the model to maintain diverse reasoning paths and discouraging error perseveration. Crucially, EDAS operates as a simple post-hoc adjustment that can be seamlessly integrated into any RLVR algorithm. We validate EDAS on top of several mainstream RLVR methods across a series of models and seven challenging math benchmarks, demonstrating consistent improvements. Notably, EDAS yields an average improvement of 6.29 points over DAPO on Qwen3-8B across seven benchmarks, confirming that exploiting the latent information in group rollouts is a broadly effective strategy for strengthening RLVR.

preprint2026arXiv

Magnetoelectric torque in polar magnetic bilayers

Energy-efficient fast switching of spin orientations or textures is a core issue of spintronics, which is highly demanded but remains challenging. Different from the mainstream routes based on spin-transfer torque or spin-orbit torque, here we propose another mechanism coined as magnetoelectric torque to switch the magnetization in polar magnetic bilayers via pure electric field. In some magnetic van der Waals bilayers, when the electrostatic energy of polarization can compensate the interlayer magnetic coupling, a magnetoelectric torque is generated to fastly flip spins within a few picoseconds, which is demonstrated by combining the first-principles calculations, analytic model, as well as atomistic simulations. Such a magnetoelectric torque doesn't rely on the spin-orbit coupling and is generally active in polar magnetic homostructures and heterostructures. Our work provides an alternative route to switch magnetization in nanoscale, which may benefit the energy-saving and fast-response spintronic devices.

preprint2026arXiv

Step-wise Rubric Rewards for LLM Reasoning

Reinforcement Learning with Verifiable Rewards (RLVR) is widely used to improve reasoning in large language models, but rewards only final-answer correctness with no supervision over intermediate steps. Rubric-based methods such as Rubrics as Rewards (RaR) introduce finer-grained supervision by scoring rollouts against structured criteria, yet the rubric scores are still aggregated into a single scalar applied to the entire response, causing three weaknesses: loss of multi-criterion structure, uniform supervision of correct and incorrect steps, and reward hacking through unbounded self-correction. On 1,000 problems, we find 18.2% of steps in correct-answer responses are wrong yet positively rewarded, while 49.9% of steps in incorrect-answer responses are correct yet penalized. We introduce Step-wise Rubrics as Rewards (SRaR), an RLVR framework that (i) uses an LLM judge to attribute each rubric item to a specific reasoning step, (ii) normalizes per-step rubric scores across rollouts so only steps whose quality varies produce a learning signal, and (iii) combines the per-step reward with the outcome reward through a decoupled advantage estimator that keeps the outcome baseline stable. We further build a 16K-problem rubric dataset by contrastively distilling rubric items from correct and flawed reasoning paths sampled from a strong model. Across six mathematical reasoning benchmarks, SRaR improves average accuracy over RaR by 3.57 points on Qwen3-8B and 2.75 points on Qwen3-32B, raises the Faithful Reasoning Rate on AIME 2025 from 34.5% to 46.7%, and reduces self-correction looping from 48.1% to 26.5%.

preprint2026arXiv

Unexpected type-II multiferroic phase in GdMnO3 under high magnetic fields

Perovskite manganites with small A-site ions, as the first and canonical branch of type-II multiferroics, are ideal systems to exhibit magnetism-induced ferroelectricity. Despite their established magnetoelectric phase diagrams under low magnetic fields, here an unidentified phase with a large magnetism-induced polarization (up to 1500 μC/m2) is revealed in GdMnO3 under high magnetic fields up to 60 T. Based on multiprobe experiments, a complete phase diagram is constructed with successive polar-nonpolar-polar-nonpolar transitions. Such a nonmonotonic evolution is well mimicked by model simulation, while the spin-lattice coupling is the key ingredient for the reentrant ferroelectric phase.

preprint2022arXiv

Evolution of magnetic phase in two dimensional van der Waals Mn$_{1-x}$Ni$_x$PS$_3$ single crystals

Metal thio(seleno)phosphates MPX$_3$ have attracted considerable attentions with wide spanned band gaps and rich magnetic properties. In this series, two neighboring members MnPS$_3$ and NiPS$_3$ differ in magnetic atoms, magnetic easy axes, spin anisotropy, as well as nearest-neighbor magnetic interactions. The competition between these components may cause intriguing physical phenomena. In this article, the evolution of magnetism of Mn$_{1-x}$Ni$_x$PS$_3$ series is reported. Despite the incompatible antiferromagnetic orders of two end members, the antiferromagnetism persists as the ground state in the whole substitution region. The magnetic ordering temperature $T_{\rm N}$ show nonmonotonic V-shape behavior, and the reentrant spin glass phase at x=0.5 is observed. In addition, abnormal bifurcation of $T_{\rm N}$ occurs at x=0.75, which may be due to the temperature-dependent spin reorientation or phase separation. The evolution of magnetism is further confirmed semi-quantitatively by our density functional theory calculations. Our study indicates that exotic magnetism can be intrigued when multi-degrees of freedom are involved in these low-dimensional systems, which call for more in-depth microscopic studies in future.

preprint2022arXiv

Gadolinium halide monolayers: a fertile family of two-dimensional 4f magnets

Two-dimensional (2D) magnets have great potentials for applications in next-generation information devices. Since the recent experimental discovery of intrinsic 2D magnetism in monolayer CrI$_3$ and few-layer Cr$_2$Ge$_2$Te$_6$, intensive studies have been stimulated in pursuing more 2D magnets and revealing their intriguing physical properties. In comparison to the magnetism based on $3d$ electrons, $4f$ electrons can provide larger magnetic moments and stronger spin-orbit coupling, but have been much less studied in the 2D forms. Only in very recent years, some exciting results have been obtained in this area. In this mini-review, we will introduce some recent progress in 2D Gd halides from a theoretical aspect. It is noteworthy that $4f$ and $5d$ orbitals of Gd both play key roles in these materials. For Gd$X_2$ ($X$=I, Br, Cl and F) monolayers and related Janus monolayers, robust ferromagnetism with large exchanges comes from the $4f^7$+$5d^1$ hybridization of Gd$^{2+}$. The spatially expanded $5d$ electrons act as a bridge to couple localized $4f$ spins. For Gd$X_3$ monolayers, the intercalation of metal atoms can dope electrons into Gd's $5d$ orbitals, which leads to numerous intriguing physical properties, such as ferroelasticity, ferromagnetism, and anisotropic conductance. In brief, Gd halides establish an effective strategy to take advantage of $f$-electron magnetism in 2D materials.

preprint2022arXiv

Magnetic phase transition induced ferroelectric polarization in BaFeF4 with room temperature weak ferromagnetism

BaMF4 (M=Fe, Co, Ni and Mn) family are typical multiferroic materials, having antiferromagnetism at around liquid nitrogen temperature. In this work, polycrystalline BaFeF4 has been prepared by solid state reaction. The slight deficiency of Fe leads to the coexistence of valence states of +2 and +3, facilitating the electrons to hop between the neighboring Fe2+ and Fe3+ ions through the middle F- ion, leading to the strong double exchange interaction with weak ferromagnetism above room temperature. A bifurcation at about 170 K between the zero-field-cooled and field-cooled temperature dependent magnetization curves indicates the onset of 2-dimensional antiferromagnetism, which is completed at about 125 K with the sudden drop of magnetization. Despite the fact of type-I multiferroic, its magnetoelectricity can be evidenced by the pyroelectric current, which shows a peak starting at about 170 K and finishing at about 125 K. The saturated ferroelectric polarization change of around 34 μC/m2 is observed, which is switchable by the reversed poling electric field and decreases to about 30 μC/m2 under a magnetic field of 90 kOe. This magnetoelectricity can be qualitatively reproduced by first-principles calculations. Our results represent substantial progress to search for high-temperature multiferroics in ferroelectric fluorides.

preprint2022arXiv

Stability and low-energy orientations of interphase boundaries in multiaxial ferroelectrics: Phase-field simulations

The coexistence of different ferroelectric phases enables the tunability of the macroscopic properties and extensive applications from piezoelectric transducers to nonvolatile memories. Here we develop a thermodynamic model to predict the stability and low-energy orientations of boundaries between different phases in ferroelectrics. Taking lead zirconate titanate and bismuth ferrite as two examples, we demonstrate that the low-energy orientations of interphase boundaries are largely determined by minimizing the electrostatic and elastic energies. Phase-field simulations are employed to analyze the competition between the interfacial energy and the electrostatic and elastic energies. Our simulation results demonstrate that the lowering of crystal symmetry could occur due to the electrical and mechanical incompatibilities between the two phases, which can be used to explain the experimentally observed low-symmetry phases near morphotropic phase boundaries. Our work provides theoretical foundations for understanding and controlling the interphase boundaries in ferroelectric materials for multifunctional applications.

preprint2022arXiv

Structural reconstruction and anisotropic conductance in $4f$-ferromagnetic monolayer

Two-dimensional magnets are promising for nanoscale spintronic applications. Currently, most available candidates are based on $3d$ transition metal compounds, with hexagonal or honeycomb lattice geometry. Here, a GdCl$_3$ monolayer with $4f$ moments is theoretically studied, which can be exfoliated from its existing bulk. Its orthorhombic structure and hendecahedral ion cages are unique in two-dimensional. Furthermore, a significant structural reconstruction is caused by the implantation of Li atoms into its interstitial position, which also lead to ferromagnetism via a double-exchange-like process. Its highly anisotropic conductance may be peculiarly useful for nanoelectronics.

preprint2022arXiv

Two-dimensional ferroelectricity induced by octahedral rotation distortion in perovskite oxides

Two-dimensional (2D) ferroelectricity has attracted extensive attention since its discovery in the monolayers of van der Waals materials. Here we show that 2D ferroelectricity induced by octahedral rotation distortion is widely present in the perovskite bilayer system through first-principles calculations. The perovskite tolerance factor plays a crucial role in the lattice dynamics and ground-state structure of the perovskite monolayers and bilayers, thus providing an important indicator for screening this hybrid improper ferroelectricity. Generally, the ferroelectric switching via an orthorhombic twin state has the lowest energy barrier. Epitaxial strain can effectively tune the ferroelectric polarization and ferroelectric switching by changing the amplitude of octahedral rotation and tilt distortion. The increasing compressive strain causes a polar to nonpolar phase transition by suppressing the tilt distortion. The cooperative effect of octahedral distortion at the interface with the substrate can reduce the energy barrier of the reversing rotation mode and can even change the lowest-energy ferroelectric switching path.

preprint2022arXiv

Vector vorticity of skyrmionic texture: an internal degree of freedom tunable by magnetic field

Different from the skyrmion driven by the Dzyaloshinskii-Moriya interaction in non-centrosymmetric materials, the skyrmionic texture in centrosymmetric magnet may possess the extra internal degrees of freedom, which greatly enrich its morphologies, imply the continuous deformation allowed by topological protection, and enable flexible tunability and potential functionality. To describe the internal degree of freedom to full extent, the conventional integer-valued scalar vorticity is extended into a vector vorticity with continuous rotation allowed. The further simplified vorticity angle, together with helicity angle, represents the whole rotation freedom of spin space. The centrosymmetric magnet with frustration provides a perfect platform for realizing a controllable manipulation on these internal degrees of freedom, where the magnetic field can be applied to tune vorticity continuously in both crystal and isolated forms of skyrmionic texture, and the helicity can be further controlled by electric field. Moreover, the simulation reveals the distinctive dynamic effects related to the vorticity modulation, namely, a straight motion can be generated by rotating magnetic field to tune vorticity, and the vorticity can also be controlled to modulate the dynamics induced by the spin polarized current.

preprint2021arXiv

Direct evidence for intermediate multiferroic phase in LiCuFe2(VO4)3

Magnetic susceptibility, specific heat, dielectric, and electric polarization of LiCuFe2(VO4)3 have been investigated. Two sequential antiferromagnetic transitions at TN1 ~ 9.95 K and TN2 ~ 8.17 K are observed under zero magnetic field. While a dielectric peak at TN1 is clearly identified, the measured pyroelectric current also exhibits a sharp peak at TN1, implying the magnetically relevant ferroelectricity. Interestingly, another pyroelectric peak around TN2 with opposite signal is observed, resulting in the disappearance of electric polarization below TN2. Besides, the electric polarization is significantly suppressed in response to external magnetic field, evidencing remarkable magnetoelectric effect. These results suggest the essential relevance of the magnetic structure with the ferroelectricity in LiCuFe2(VO4)3, deserving for further investigation of the underlying mechanism.

preprint2021arXiv

Incommensurate-commensurate magnetic phase transition in the double tungstate Li2Co(WO4)2

Magnetic susceptibility, specific heat, and neutron powder diffraction measurements have been performed on polycrystalline Li2Co(WO4)2 samples. Under zero magnetic field, two successive magnetic transitions at TN1 ~ 9.4 K and TN2 ~ 7.4 K are observed. The magnetic ordering temperatures gradually decrease as the magnetic field increases. Neutron diffraction reveals that Li2Co(WO4)2 enters an incommensurate magnetic state with a temperature dependent k between TN1 and TN2. The magnetic propagation vector locks-in to a commensurate value k = (1/2, 1/4, 1/4) below TN2. The antiferromagnetic structure is refined at 1.7 K with Co2+ magnetic moment 2.8(1) uB, consistent with our first-principles calculations.

preprint2020arXiv

Antiferromagnetism of Double Molybdate LiFe(MoO$_4$)$_2$

The magnetic properties of the spin-5/2 double molybdate LiFe(MoO$_4$)$_2$ have been characterized by heat capacity, magnetic susceptibility, and neutron powder diffraction techniques. Unlike the multiferroic system LiFe(MoO$_4$)$_2$ which exhibits two successive magnetic transitions, LiFe(MoO$_4$)$_2$ undergoes only one antiferromagnetic transition at $T_N$ ~ 23.8 K. Its antiferromagnetic magnetic structure with the commensurate propagation vector k = (0, 0.5, 0) has been determined. Density functional theory calculations confirm the antiferromagnetic ground state and provide a numerical estimate of the relevant exchange coupling constants.

preprint2020arXiv

Controlling the helicity of magnetic skyrmions by electrical field in frustrated magnets

The skyrmions generated by frustration in centrosymmetric structures host extra internal degrees of freedom: vorticity and helicity, resulting in distinctive properties and potential functionality, which are not shared by the skyrmions stemming from the Dzyaloshinskii-Moriya interaction in noncentrosymmetric structures. The present work indicates that the magnetism-driven electric polarization carried by skyrmions provides a direct handle for tuning helicity. Especially for the in-plane magnetized skyrmions, the helicity can be continuously rotated and exactly picked by applying an external electric field for both skyrmions and antiskyrmions. The in-plane uniaxial anisotropy is beneficial to this manipulation.

preprint2020arXiv

Direct Visualization of Irreducible Ferrielectricity in Crystals

In solids, charge polarity can one-to-one correspond to spin polarity phenomenologically, e.g. ferroelectricity/ferromagnetism, antiferroelectricity/antiferromagnetism, and even dipole-vortex/magnetic-vortex, but ferrielectricity/ferrimagnetism kept telling a disparate story in microscopic level. Since the definition of a charge dipole involves more than one ion, there may be multiple choices for a dipole unit, which makes most ferrielectric orders equivalent to ferroelectric ones, i.e. this ferrielectricity is not necessary to be a real independent branch of polarity. In this work, by using the spherical aberration-corrected scanning transmission electron microscope, we visualize a nontrivial ferrielectric structural evolution in BaFe2Se3, in which the development of two polar sub-lattices is out-of-sync, for which we term it as irreducible ferrielectricity. Such irreducible ferrielectricity leads to a non-monotonic behavior for the temperature-dependent polarization, and even a compensation point in the ordered state. Our finding unambiguously distinguishes ferrielectrics from ferroelectrics in solids.

preprint2020arXiv

First-principles study of the low-temperature charge density wave phase in the quasi-one-dimensional Weyl chiral compound (TaSe$_4$)$_2$I

Using {\it ab initio} density functional theory, we study the lattice phase transition of quasi-one-dimensional (TaSe$_4$)$_2$I. In the undistorted state, the strongly anisotropic semimetal band structure presents two non-equivalent Weyl points. In previous efforts, two possible Ta-tetramerization patterns were proposed to be associated with the low-temperature structure. Our phonon calculations indicate that the orthorhombic $F222$ CDW-I phase is the most likely ground state for this quasi-one-dimensional system. In addition, the monoclinic $C2$ CDW-II phase may also be stable according to the phonon dispersion spectrum. Since these two phases have very similar energies in our DFT calculations, both these Ta-tetramerization distortions likely compete or coexist at low temperatures. The semimetal to insulator transition is induced by a Fermi-surface-driven instability that supports the Peierls scenario, which affects the Weyl physics developed above $T_{\rm CDW}$. Furthermore, the spin-orbit coupling generates Rashba-like band splittings in the insulating CDW phases.

preprint2019arXiv

Iron telluride ladder compounds: Predicting the structural and magnetic properties of BaFe$_2$Te$_3$

Since the discovery of pressure-induced superconductivity in the two-leg ladder system BaFe$_2X_3$ ($X$=S, Se), with the 3$d$ iron electronic density $n = 6$, the quasi-one-dimensional iron-based ladders have attracted considerable attention. Here, we use Density Functional Theory (DFT) to predict that the novel $n = 6$ iron ladder BaFe$_2$Te$_3$ could be stable with a similar crystal structure as BaFe$_2$Se$_3$. Our results also indicate that BaFe$_2$Te$_3$ will display the complex 2$\times$2 Block-type magnetic order. Due to the magnetic striction effects of this Block order, BaFe$_2$Te$_3$ should be a magnetic noncollinear ferrielectric system with a net polarization $0.31$ $μ$C/cm$^2$. Compared with the S- or Se-based iron ladders, the electrons of the Te-based ladders are more localized, implying that the degree of electronic correlation is enhanced for the Te case which may induce additional interesting properties. The physical and structural similarity with BaFe$_2$Se$_3$ also suggests that BaFe$_2$Te$_3$ could become superconducting under high pressure.