Researcher profile

Feng Xiong

Feng Xiong contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

Ace-Skill: Bootstrapping Multimodal Agents with Prioritized and Clustered Evolution

Self-evolving agents present a promising path toward continual adaptation by distilling task interactions into reusable knowledge artifacts. In practice, this paradigm remains hindered by two coupled bottlenecks: data inefficiency, where costly rollout effort is disproportionately spent on low-value samples rather than informative ones, and knowledge interference, where heterogeneous knowledge stored in shared repositories leads to noisy retrieval and task-misaligned guidance. Together, these issues form a self-reinforcing failure loop in which uninformative rollouts yield noisy knowledge, which in turn degrades subsequent rollouts. In this work, we introduce Ace-Skill, a co-evolutionary framework that jointly optimizes rollout allocation and knowledge organization for self-evolving multimodal agents. Specifically, Ace-Skill combines aprioritized sampler with lazy-decay proficiency tracking to focus rollouts on informative and insufficiently mastered samples, and a clustered organizer that semantically clusters knowledge for cleaner retrieval and more reliable adaptation. By improving sampling and organization together, Ace-Skill turns self-evolution into a virtuous cycle in which more informative rollouts produce higher-quality knowledge that supports stronger subsequent rollouts. Across four multimodal tool-use benchmarks, Ace-Skill delivers strong gains (e.g., +35.46% relative improvement in Avg@4 accuracy), enabling an opensource 35B MoE model to match or surpass proprietary models. The acquired knowledge also transfers effectively in a zero-shot manner to smaller 9B and 4B models, allowing resource-constrained agents to inherit advanced capabilities without additional training. The code has been publicly available at https://github.com/AMAP-ML/Ace-Skill.

preprint2026arXiv

ALAM: Algebraically Consistent Latent Action Model for Vision-Language-Action Models

Vision-language-action (VLA) models remain constrained by the scarcity of action-labeled robot data, whereas action-free videos provide abundant evidence of how the physical world changes. Latent action models offer a promising way to extract such priors from videos, but reconstruction-trained latent codes are not necessarily suitable for policy generation: they may predict future observations while lacking the structure needed to be reused or generated coherently with robot actions. We introduce ALAM (Algebraic Latent Action Model), an Algebraically Consistent Latent Action Model that turns temporal relations in action-free video into structural supervision. Given frame triplets, ALAM learns latent transitions that are grounded by reconstruction while being regularized by composition and reversal consistency, encouraging a locally additive transition space. For downstream VLA learning, we freeze the pretrained encoder and use its latent transition sequences as auxiliary generative targets, co-generated with robot actions under a joint flow-matching objective. This couples structured latent transitions with flow-based policy generation, allowing the policy to exploit ALAM's locally consistent transition geometry without requiring latent-to-action decoding. Representation probes show that ALAM reduces additivity and reversibility errors by 25-85 times over unstructured latent-action baselines and improves long-horizon cumulative reconstruction. When transferred to VLA policies, ALAM raises the average success rate from 47.9% to 85.0% on MetaWorld MT50 and from 94.1% to 98.1% on LIBERO, with consistent gains on real-world manipulation tasks. Ablations further confirm that the strongest improvements arise from the synergy between algebraically structured latent transitions and joint flow matching.

preprint2026arXiv

Why Users Go There: World Knowledge-Augmented Generative Next POI Recommendation

Generative point-of-interest (POI) recommendation models based on large language models (LLMs) have shown promising results by formulating next POI prediction as a sequence generation task. However, the knowledge encoded in these models remains fixed after training, making them unable to perceive evolving real-world conditions that shape user mobility decisions, such as local events and cultural trends. To bridge this gap, we propose AWARE (Agent-based World knowledge Augmented REcommendation), which employs an LLM agent to generate location- and time-aware contextual narratives that capture regional cultural characteristics, seasonal trends, and ongoing events relevant to each user. Rather than introducing generic or noisy information, AWARE further anchors these narratives in each user's behavioral context, grounding external world knowledge in personalized spatial-temporal patterns. Extensive experiments on three real-world datasets demonstrate that AWARE consistently outperforms competitive baselines, achieving up to 12.4% relative improvement.

preprint2022arXiv

Understanding the three-dimensional quantum Hall effect in generic multi-Weyl semimetals

The quantum Hall effect in three-dimensional Weyl semimetal (WSM) receives significant attention for the emergence of the Fermi loop where the underlying two-dimensional Hall conductivity, namely, sheet Hall conductivity, shows quantized plateaus. Considering the tilted lattice models for multi Weyl semimetals (mWSMs), we systematically study the Landau levels (LLs) and magneto-Hall conductivity in the presence of parallel and perpendicular (with respect to the Weyl node's separation) magnetic field, i.e., $\mathbf{ B}\parallel z$ and $\mathbf{B}\parallel x$, to explore the impact of tilting and non-linearity in the dispersion. We make use of two (single) node low-energy models to qualitatively explain the emergence of mid-gap chiral (linear crossing of chiral) LLs on the lattice for $\mathbf{ B}\parallel z$ ($\mathbf{ B}\parallel x$). Remarkably, we find that the sheet Hall conductivity becomes quantized for $\mathbf{ B}\parallel z$ even when two Weyl nodes project onto a single Fermi point in two opposite surfaces, forming a Fermi loop with $k_z$ as the good quantum number. On the other hand, the Fermi loop, connecting two distinct Fermi points in two opposite surfaces, with $k_x$ being the good quantum number, causes the quantization in sheet Hall conductivity for $\mathbf{ B}\parallel x$. The quantization is almost lost (perfectly remained) in the type-II phase for $\mathbf{ B}\parallel x$ ($\mathbf{ B}\parallel z$). Interestingly, the jump profiles between the adjacent quantized plateaus change with the topological charge for both of the above cases. The momentum-integrated three-dimensional Hall conductivity is not quantized; however, it bears the signature of chiral LLs as resulting in the linear dependence on $μ$ for small $μ$. The linear zone (its slope) reduces (increases) as the tilt (topological charge) of the underlying WSM increases.

preprint2020arXiv

Enhanced principle component method for fringe removal in cold atom images

Many powerful imaging techniques for cold atoms are based on determining the optical density by comparing a beam image having passed through the atom cloud to a reference image taken under similar conditions with no atoms. In practice the beam profile typically contains interference fringes whose phase is not stable between camera exposures. To reduce the error of these fringes in the computed optical density, an algorithm based on principle component analysis (PCA) is often employed. However, PCA is general purpose and not tailored to the specific case of interference fringes. Here we demonstrate an algorithm that takes advantage of the Fourier-space structure of interference fringes to further reduce the residual fringe signatures in the optical density.

preprint2020arXiv

Temperature Dependent Thermal Boundary Conductance of Monolayer MoS$_2$ by Raman Thermometry

The electrical and thermal behavior of nanoscale devices based on two-dimensional (2D) materials is often limited by their contacts and interfaces. Here we report the temperature-dependent thermal boundary conductance (TBC) of monolayer MoS$_2$ with AlN and SiO$_2$, using Raman thermometry with laser-induced heating. The temperature-dependent optical absorption of the 2D material is crucial in such experiments, which we characterize here for the first time above room temperature. We obtain TBC ~ 15 MWm$^-$$^2$K$^-$$^1$ near room temperature, increasing as ~ T$^0$$^.$$^6$$^5$ in the range 300 - 600 K. The similar TBC of MoS$_2$ with the two substrates indicates that MoS$_2$ is the "softer" material with weaker phonon irradiance, and the relatively low TBC signifies that such interfaces present a key bottleneck in energy dissipation from 2D devices. Our approach is needed to correctly perform Raman thermometry of 2D materials, and our findings are key for understanding energy coupling at the nanoscale.

preprint2019arXiv

Spin and charge transport in topological nodal-line semimetals

We study transport properties of topological Weyl nodal-line semimetals(NLSs). Starting from a minimal lattice model with a single nodal loop, and by focusing on a normal-metal-NLS-normal-metal junction, we investigate the dependence of the novel transport behavior on the orientation of the nodal loop. When the loop is parallel to the junction interfaces, the transmitted current is found to be nearly fully spin-polarized. Correspondingly, there exists a spin orientation, along which the incident electrons would be totally reflected. An unusual resonance of half transmission with the participation of surface states also occurs for a pair of incident electrons with opposite spin orientations. All these phenomena have been shown to originate from the existence of a single forward-propagating mode in the NLS of the junction, and argued to survive in more generic multi-band Weyl NLSs.