Researcher profile

Zilong Chen

Zilong Chen contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

Unified 4D World Action Modeling from Video Priors with Asynchronous Denoising

We propose X-WAM, a Unified 4D World Model that unifies real-time robotic action execution and high-fidelity 4D world synthesis (video + 3D reconstruction) in a single framework, addressing the critical limitations of prior unified world models (e.g., UWM) that only model 2D pixel-space and fail to balance action efficiency and world modeling quality. To leverage the strong visual priors of pretrained video diffusion models, X-WAM imagines the future world by predicting multi-view RGB-D videos, and obtains spatial information efficiently through a lightweight structural adaptation: replicating the final few blocks of the pretrained Diffusion Transformer into a dedicated depth prediction branch for the reconstruction of future spatial information. Moreover, we propose Asynchronous Noise Sampling (ANS) to jointly optimize generation quality and action decoding efficiency. ANS applies a specialized asynchronous denoising schedule during inference, which rapidly decodes actions with fewer steps to enable efficient real-time execution, while dedicating the full sequence of steps to generate high-fidelity video. Rather than entirely decoupling the timesteps during training, ANS samples from their joint distribution to align with the inference distribution. Pretrained on over 5,800 hours of robotic data, X-WAM achieves 79.2% and 90.7% average success rate on RoboCasa and RoboTwin 2.0 benchmarks, while producing high-fidelity 4D reconstruction and generation surpassing existing methods in both visual and geometric metrics.

preprint2022arXiv

KCD: Knowledge Walks and Textual Cues Enhanced Political Perspective Detection in News Media

Political perspective detection has become an increasingly important task that can help combat echo chambers and political polarization. Previous approaches generally focus on leveraging textual content to identify stances, while they fail to reason with background knowledge or leverage the rich semantic and syntactic textual labels in news articles. In light of these limitations, we propose KCD, a political perspective detection approach to enable multi-hop knowledge reasoning and incorporate textual cues as paragraph-level labels. Specifically, we firstly generate random walks on external knowledge graphs and infuse them with news text representations. We then construct a heterogeneous information network to jointly model news content as well as semantic, syntactic and entity cues in news articles. Finally, we adopt relational graph neural networks for graph-level representation learning and conduct political perspective detection. Extensive experiments demonstrate that our approach outperforms state-of-the-art methods on two benchmark datasets. We further examine the effect of knowledge walks and textual cues and how they contribute to our approach's data efficiency.

preprint2022arXiv

KGAP: Knowledge Graph Augmented Political Perspective Detection in News Media

Identifying political perspectives in news media has become an important task due to the rapid growth of political commentary and the increasingly polarized political ideologies. Previous approaches focus on textual content and leave out the rich social and political context that is essential in the perspective detection process. To address this limitation, we propose KGAP, a political perspective detection method that incorporates external domain knowledge. Specifically, we construct a political knowledge graph to serve as domain-specific external knowledge. We then construct heterogeneous information networks to represent news documents, which jointly model news text and external knowledge. Finally, we adopt relational graph neural networks and conduct political perspective detection as graph-level classification. Extensive experiments demonstrate that our method consistently achieves the best performance on two real-world perspective detection benchmarks. Ablation studies further bear out the necessity of external knowledge and the effectiveness of our graph-based approach.

preprint2022arXiv

Legislator Representation Learning with Social Context and Expert Knowledge

Modeling the ideological perspectives of political actors is an essential task in computational political science with applications in many downstream tasks. Existing approaches are generally limited to textual data and voting records, while they neglect the rich social context and valuable expert knowledge for holistic evaluation. In this paper, we propose a representation learning framework of political actors that jointly leverages social context and expert knowledge. Specifically, we retrieve and extract factual statements about legislators to leverage social context information. We then construct a heterogeneous information network to incorporate social context and use relational graph neural networks to learn legislator representations. Finally, we train our model with three objectives to align representation learning with expert knowledge, model ideological stance consistency, and simulate the echo chamber phenomenon. Extensive experiments demonstrate that our learned representations successfully advance the state-of-the-art in three downstream tasks. Further analysis proves the correlation between learned legislator representations and various socio-political factors, as well as bearing out the necessity of social context and expert knowledge in modeling political actors.

preprint2021arXiv

Enhancing fiber atom interferometer by in-fiber laser cooling

We demonstrate an inertia sensitive atom interferometer optically guided inside a 22-cm-long negative curvature hollow-core photonic crystal fiber with an interferometer time of 20 ms. The result prolongs the previous fiber guided atom interferometer time by three orders of magnitude. The improvement arises from the realization of in-fiber Λ-enhanced gray molasses and delta-kick cooling to cool atoms from 32 μK to below 1 μK in 4 ms. The in-fiber cooling overcomes the inevitable heating during the atom loading process and allows a shallow guiding optical potential to minimize decoherence. Our results permit bringing atoms close to source fields for sensing and could lead to compact inertial quantum sensors with a sub-millimeter resolution.

preprint2020arXiv

Long Light Storage Time in an Optical Fiber

Light storage in an optical fiber is an attractive component in quantum optical delay line technologies. Although silica-core optical fibers are excellent in transmitting broadband optical signals, it is challenging to tailor their dispersive property to slow down a light pulse or store it in the silica-core for a long delay time. Coupling a dispersive and coherent medium with an optical fiber is promising in supporting long optical delay. Here, we load cold Rb atomic vapor into an optical trap inside a hollow-core photonic crystal fiber, and store the phase of the light in a long-lived spin-wave formed by atoms and retrieve it after a fully controllable delay time using electromagnetically-induced-transparency (EIT). We achieve over 50 ms of storage time and the result is equivalent to 8.7x10^-5 dB s^-1 of propagation loss in an optical fiber. Our demonstration could be used for buffering and regulating classical and quantum information flow between remote networks.

preprint2020arXiv

Quantum-Enhanced Velocimetry with Doppler-Broadened Atomic Vapor

Traditionally, measuring the center-of-mass (c.m.) velocity of an atomic ensemble relies on measuring the Doppler shift of the absorption spectrum of single atoms in the ensemble. Mapping out the velocity distribution of the ensemble is indispensable when determining the c.m. velocity using this technique. As a result, highly sensitive measurements require preparation of an ensemble with a narrow Doppler width. Here, we use a dispersive measurement of light passing through a moving room temperature atomic vapor cell to determine the velocity of the cell in a single shot with a short-term sensitivity of 5.5 $μ$m s$^{-1}$ Hz$^{-1/2}$. The dispersion of the medium is enhanced by creating quantum interference through an auxiliary transition for the probe light under electromagnetically induced transparency condition. In contrast to measurement of single atoms, this method is based on the collective motion of atoms and can sense the c.m. velocity of an ensemble without knowing its velocity distribution. Our results improve the previous measurements by 3 orders of magnitude and can be used to design a compact motional sensor based on thermal atoms.