Researcher profile

Si Wu

Si Wu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

DiLA: Disentangled Latent Action World Models

Latent Action Models (LAMs) enable the learning of world models from unlabeled video by inferring abstract actions between consecutive frames. However, LAMs face a fundamental trade-off between action abstraction and generation fidelity. Existing methods typically circumvent this issue by using two-stage training with pre-trained world models or by limiting predictions to optical flow. In this paper, we introduce DiLA, a novel Disentangled Latent Action world model that aims to resolve this trade-off via content-structure disentanglement. Our key insight is that disentanglement and latent action learning are co-evolving: the predictive bottleneck inherent in latent action learning serves as a driving force for disentanglement, compelling the model to distill spatial layouts into the structure pathway while offloading visual details to a separate content pathway for generation. This synergy yields a continuous, semantically structured latent action space without compromising generative quality. DiLA achieves superior results in video generation quality, action transfer, visual planning, and manifold interpretability. These findings establish DiLA as a unified framework that simultaneously achieves high-level action abstraction and high-fidelity generation, advancing the frontier of self-supervised world model learning.

preprint2023arXiv

AI of Brain and Cognitive Sciences: From the Perspective of First Principles

Nowadays, we have witnessed the great success of AI in various applications, including image classification, game playing, protein structure analysis, language translation, and content generation. Despite these powerful applications, there are still many tasks in our daily life that are rather simple to humans but pose great challenges to AI. These include image and language understanding, few-shot learning, abstract concepts, and low-energy cost computing. Thus, learning from the brain is still a promising way that can shed light on the development of next-generation AI. The brain is arguably the only known intelligent machine in the universe, which is the product of evolution for animals surviving in the natural environment. At the behavior level, psychology and cognitive sciences have demonstrated that human and animal brains can execute very intelligent high-level cognitive functions. At the structure level, cognitive and computational neurosciences have unveiled that the brain has extremely complicated but elegant network forms to support its functions. Over years, people are gathering knowledge about the structure and functions of the brain, and this process is accelerating recently along with the initiation of giant brain projects worldwide. Here, we argue that the general principles of brain functions are the most valuable things to inspire the development of AI. These general principles are the standard rules of the brain extracting, representing, manipulating, and retrieving information, and here we call them the first principles of the brain. This paper collects six such first principles. They are attractor network, criticality, random network, sparse coding, relational memory, and perceptual learning. On each topic, we review its biological background, fundamental property, potential application to AI, and future development.

preprint2022arXiv

Crystal growth engineering and origin of the weak ferromagnetism in antiferromagnetic matrix of orthochromates from $t$-$e$ orbital hybridization

We report a combined experimental and theoretical study on intriguing magnetic properties of quasiferroelectric orthochromates. Large single crystals of the family of RECrO$_3$ (RE = Y, Eu, Gd, Tb, Dy, Ho, Er, Tm, Yb, and Lu) compounds were successfully grown. Neutron Laue study indicates a good quality of the obtained single crystals. Applied magnetic-field and temperature dependent magnetization measurements reveal their intrinsic magnetic properties, especially the antiferromagnetic (AFM) transition temperatures. Density functional theory studies of the electronic structures were carried out using the Perdew-Burke-Ernzerhof functional plus Hubbard $U$ method. Crystallographic information and magnetism were theoretically optimized systematically. When RE$^{3+}$ cations vary from Y$^{3+}$ and Eu$^{3+}$ to Lu$^{3+}$ ions, the calculated $t$-$e$ orbital hybridization degree and Néel temperature behave similarly to the experimentally-determined AFM transition temperature with variation in cationic radius. We found that the $t$-$e$ hybridization is anisotropic, causing a magnetic anisotropy of Cr$^{3+}$ sublattices. This was evaluated with the nearest-neighbour $J_1$-$J_2$ model. Our research provides a picture of the electronic structures during the $t$-$e$ hybridization process while changing RE ions and sheds light on the nature of the weak ferromagnetism coexisting with predominated antiferromagnetism. The available large RECrO$_3$ single crystals build a platform for further studies of orthochromates.

preprint2022arXiv

Possible Dirac quantum spin liquid in a kagome quantum antiferromagnet YCu$_3$(OH)$_6$Br$_2$[Br$_{x}$(OH)$_{1-x}$]

We studied the magnetic properties of YCu$_3$(OH)$_6$Br$_2$[Br$_{1-x}$(OH)$_{x}$] ($x$ = 0.33), where Cu$^{2+}$ ions form two-dimensional kagome layers. There is no magnetic order down to 50 mK while the Curie-Weiss temperature is on the order of -100 K. At zero magnetic field, the low-temperature specific heat shows a $T^2$ dependence. Above 2 T, a linear temperature dependence term in specific heat emerges, and the value of $γ= C/T$ increases linearly with the field. Furthermore, the magnetic susceptibility tends to a constant value at $T = 0$. Our results suggest that the magnetic ground state of YCu$_3$(OH)$_6$Br$_2$[Br$_{1-x}$(OH)$_{x}$] is consistent with a Dirac quantum-spin-liquid state with a linearly dispersing spinon strongly coupled to an emergent gauge field, which has long been theoretically proposed as a candidate ground state in the two-dimensional kagome Heisenberg antiferromagnetic system.

preprint2022arXiv

Temperature-dependent structure of an intermetallic ErPd$_2$Si$_2$ single crystal: A combined synchrotron and in-house X-ray diffraction study

We have grown intermetallic ErPd$_2$Si$_2$ single crystals employing laser-diodes with the floating-zone method. The temperature-dependent crystallography was determined using synchrotron and in-house X-ray powder diffraction measurements from 20 to 500 K. The diffraction patterns fit well with the tetragonal $I$4/$mmm$ space group (No. 139) with two chemical formulas within one unit cell. Our synchrotron X-ray powder diffraction study shows that the refined lattice constants are $a$ = 4.10320(2) Å, $c$ = 9.88393(5) Å at 298 K and $a$ = 4.11737(2) Å, $c$ = 9.88143(5) Å at 500 K, resulting in the unit-cell volume $V$ = 166.408(1) Å$^3$ (298 K) and 167.517(2) Å$^3$ (500 K). In the whole studied temperature range, we did not find any structural phase transition. Upon cooling, the lattice constants a and c are shortened and elongated, respectively.

preprint2020arXiv

Colossal Negative Magnetoresistance Effect in A La$_{1.37}$Sr$_{1.63}$Mn$_2$O$_7$ Single Crystal Grown by Laser-Diode-Heated Floating-Zone Technique

We have grown La$_{1.37}$Sr$_{1.63}$Mn$_2$O$_7$ single crystals with a laser-diode-heated floating-zone furnace and studied the crystallinity, structure, and magnetoresistance (MR) effect by in-house X-ray Laue diffraction, X-ray powder diffraction, and resistance measurements. The La$_{1.37}$Sr$_{1.63}$Mn$_2$O$_7$ single crystal crystallizes into a tetragonal structure with space group \emph{I}4{/}\emph{mmm} at room temperature. At 0 T, the maximum resistance centers around $\sim$166.9 K. Below $\sim$35.8 K, it displays an insulating character with an increase in resistance upon cooling. An applied magnetic field of \emph{B}~=~7~T strongly suppresses the resistance indicative of a negative MR effect. The minimum MR value equals $-$91.23\% at 7 T and 128.7 K. The magnetic-field-dependent resistance shows distinct features at 1.67, 140, and 322 K, from which we calculated the corresponding MR values. At 14 T and 140 K, the colossal negative MR value is down to $-$94.04(5)\%. We schematically fit the MR values with different models for an ideal describing of the interesting features of the MR value versus \emph{B} curves.

preprint2020arXiv

Crystalline and magnetic structures, magnetization, heat capacity and anisotropic magnetostriction effect in a yttrium-chromium oxide

We have studied a nearly stoichiometric insulating Y$_{0.97(2)}$Cr$_{0.98(2)}$O$_{3.00(2)}$ single crystal by performing measurements of magnetization, heat capacity, and neutron diffraction. Albeit that the YCrO$_3$ compound behaviors like a soft ferromagnet with a coersive force of $\sim$ 0.05 T, there exist strong antiferromagnetic (AFM) interactions between Cr$^{3+}$ spins due to a strongly negative paramagnetic Curie-Weiss temperature, i.e., -433.2(6) K. The coexistence of ferromagnetism and antiferromagnetism may indicate a canted AFM structure. The AFM phase transition occurs at $T_\textrm{N} =$ 141.5(1) K, which increases to $T_\textrm{N}$(5T) = 144.5(1) K at 5 T. Within the accuracy of the present neuron-diffraction studies, we determine a G-type AFM structure with a propagation vector \textbf{k} = (1 1 0) and Cr$^{3+}$ spin directions along the crystallographic \emph{c} axis of the orthorhombic structure with space group \emph{Pnma} below $T_\textrm{N}$. At 12 K, the refined moment size is 2.45(6) $μ_\textrm{B}$, $\sim$ 82\% of the theoretical saturation value 3 $μ_\textrm{B}$. The Cr$^{3+}$ spin interactions are probably two-dimensional Ising like within the reciprocal (1 1 0) scattering plane. Below $T_\textrm{N}$, the lattice configuration (\emph{a}, \emph{b}, \emph{c}, and \emph{V}) deviates largely downward from the Gr$\ddot{\textrm{u}}$neisen law, displaying an anisotropic magnetostriction effect and a magnetoelastic effect. Especially, the sample contraction upon cooling is enhanced below the AFM transition temperature. There is evidence to suggest that the actual crystalline symmetry of YCrO$_3$ compound is probably lower than the currently assumed one. Additionally, we compared the $t_{2\textrm{g}}$ YCrO$_3$ and the $e_\textrm{g}$ La$_{7/8}$Sr$_{1/8}$MnO$_3$ single crystals for a further understanding of the reason for the possible symmetry lowering.

preprint2020arXiv

Super-Necking Crystal Growth and Structural and Magnetic Properties of SrTb$_2$O$_4$ Single Crystals

We report on single-crystal growths of the SrTb$_2$O$_4$ compound by a super-necking technique with a laser-floating-zone furnace and study the stoichiometry, growth mode, and structural and magnetic properties by scanning electronic microscopy, neutron Laue, X-ray powder diffraction, and the physical property measurement system. We optimized the growth parameters, mainly the growth speed, atmosphere, and the addition of a Tb$_4$O$_7$ raw material. Neutron Laue diffraction displays the characteristic feature of a single crystal. Our study reveals an atomic ratio of Sr:Tb $ = 0.97(2){:}2.00(1)$ and a possible layer by layer crystal growth mode. Our X-ray powder diffraction study determines the crystal structure, lattice constants and atomic positions. The paramagnetic (PM) Curie--Weiss (CW) temperature $θ_{\texttt{CW}} =$ 5.00(4) K, and the effective PM moment $M^{\texttt{eff}}_{\texttt{mea}} =$ 10.97(1) $μ_\texttt{B}$ per Tb$^{3+}$ ion. The data of magnetization versus temperature can be divided into three regimes, showing a coexistence of antiferromagnetic and ferromagnetic interactions. This probably leads to the magnetic frustration in the SrTb$_2$O$_4$ compound. The magnetization at 2 K and 14 T originates from both the Tb1 and Tb2 sites and is strongly frustrated with an expected saturation field at $\sim$41.5 T, displaying an intricate phase diagram with three ranges.

preprint2020arXiv

Vision at A Glance: Interplay between Fine and Coarse Information Processing Pathways

Object recognition is often viewed as a feedforward, bottom-up process in machine learning, but in real neural systems, object recognition is a complicated process which involves the interplay between two signal pathways. One is the parvocellular pathway (P-pathway), which is slow and extracts fine features of objects; the other is the magnocellular pathway (M-pathway), which is fast and extracts coarse features of objects. It has been suggested that the interplay between the two pathways endows the neural system with the capacity of processing visual information rapidly, adaptively, and robustly. However, the underlying computational mechanisms remain largely unknown. In this study, we build a computational model to elucidate the computational advantages associated with the interactions between two pathways. Our model consists of two convolution neural networks: one mimics the P-pathway, referred to as FineNet, which is deep, has small-size kernels, and receives detailed visual inputs; the other mimics the M-pathway, referred to as CoarseNet, which is shallow, has large-size kernels, and receives low-pass filtered or binarized visual inputs. The two pathways interact with each other via a Restricted Boltzmann Machine. We find that: 1) FineNet can teach CoarseNet through imitation and improve its performance considerably; 2) CoarseNet can improve the noise robustness of FineNet through association; 3) the output of CoarseNet can serve as a cognitive bias to improve the performance of FineNet. We hope that this study will provide insight into understanding visual information processing and inspire the development of new object recognition architectures.