Researcher profile

Fan Xu

Fan Xu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

RoboAlign-R1: Distilled Multimodal Reward Alignment for Robot Video World Models

Existing robot video world models are typically trained with low-level objectives such as reconstruction and perceptual similarity, which are poorly aligned with the capabilities that matter most for robot decision making, including instruction following, manipulation success, and physical plausibility. They also suffer from error accumulation in long-horizon autoregressive prediction. We present RoboAlign-R1, a framework that combines reward-aligned post-training with stabilized long-horizon inference for robot video world models. We construct RobotWorldBench, a benchmark of 10,000 annotated video-instruction pairs collected from four robot data sources, and train a multimodal teacher judge, RoboAlign-Judge, to provide fine-grained six-dimensional evaluation of generated videos. We then distill the teacher into a lightweight student reward model for efficient reinforcement-learning-based post-training. To reduce long-horizon rollout drift, we further introduce Sliding Window Re-encoding (SWR), a training-free inference strategy that periodically refreshes the generation context. Under our in-domain evaluation protocol, RoboAlign-R1 improves the aggregate six-dimension score by 10.1% over the strongest baseline, including gains of 7.5% on Manipulation Accuracy and 4.6% on Instruction Following; these ranking improvements are further supported by an external VLM-based cross-check and a blinded human study. Meanwhile, SWR improves long-horizon prediction quality with only about 1% additional latency, yielding a 2.8% gain in SSIM and a 9.8% reduction in LPIPS. Together, these results show that reward-aligned post-training and stabilized long-horizon decoding improve task consistency, physical realism, and long-horizon prediction quality in robot video world models.

preprint2024arXiv

Coordinating Multiple Intelligent Reflecting Surfaces without Channel Information

Conventional beamforming methods for intelligent reflecting surfaces (IRSs) or reconfigurable intelligent surfaces (RISs) typically entail the full channel state information (CSI). However, the computational cost of channel acquisition soars exponentially with the number of IRSs. To bypass this difficulty, we propose a novel strategy called blind beamforming that coordinates multiple IRSs by means of statistics without knowing CSI. Blind beamforming only requires measuring the received signal power at the user terminal for a sequence of randomly generated phase shifts across all IRSs. The main idea is to extract the key statistical quantity for beamforming by exploring only a small portion of the whole solution space of phase shifts. We show that blind beamforming guarantees a signal-to-noise ratio (SNR) boost of Theta(N^{2L}) under certain conditions, where L is the number of IRSs and N is the number of reflecting elements per IRS. The proposed conditions for achieving the optimal SNR boost of Theta(N^{4}) in a double-IRS system are much easier to satisfy than the existing ones in the literature. Most importantly, the proposed conditions can be extended to a fully general L-IRS system. The above result significantly improves upon the state of the art in the area of multi-IRS-assisted communication. Moreover, blind beamforming is justified via field tests and simulations. In particular, as shown in our field tests at 2.6 GHz, our method yields up to 17 dB SNR boost; to the best of our knowledge, this is the first time that the use of multiple IRSs gets verified in the real world.

preprint2022arXiv

Applications of spherical twist functors to Lie algebras associated to root categories of preprojective algebras

Let $Λ_Q$ be the preprojective algebra of a finite acyclic quiver $Q$ of non-Dynkin type and $D^b(\mathrm{rep}^n Λ_Q)$ be the bounded derived category of finite dimensional nilpotent $Λ_Q$-modules. We define spherical twist functors over the root category $\mathcal{R}_{Λ_Q}$ of $D^b(\mathrm{rep}^n Λ_Q)$ and then realize the Weyl group associated to $Q$ as certain subquotient of the automorphism group of the Ringel-Hall Lie algebra $\mathfrak{g}(\mathcal{R}_{Λ_Q})$ of $\mathcal{R}_{Λ_Q}$ induced by spherical twist functors. We also present a conjectural relation between certain Lie subalgebras of $\mathfrak{g}(\mathcal{R}_{Λ_Q})$ and $\mathfrak{g}(\mathcal{R}_Q)$, where $\mathfrak{g}(\mathcal{R}_Q)$ is the Ringe-Hall Lie algebra associated to the root category $\mathcal{R}_Q$ of $Q$.

preprint2022arXiv

Fundamental Limits of Communication Efficiency for Model Aggregation in Distributed Learning: A Rate-Distortion Approach

One of the main focuses in distributed learning is communication efficiency, since model aggregation at each round of training can consist of millions to billions of parameters. Several model compression methods, such as gradient quantization and sparsification, have been proposed to improve the communication efficiency of model aggregation. However, the information-theoretic minimum communication cost for a given distortion of gradient estimators is still unknown. In this paper, we study the fundamental limit of communication cost of model aggregation in distributed learning from a rate-distortion perspective. By formulating the model aggregation as a vector Gaussian CEO problem, we derive the rate region bound and sum-rate-distortion function for the model aggregation problem, which reveals the minimum communication rate at a particular gradient distortion upper bound. We also analyze the communication cost at each iteration and total communication cost based on the sum-rate-distortion function with the gradient statistics of real-world datasets. It is found that the communication gain by exploiting the correlation between worker nodes is significant for SignSGD, and a high distortion of gradient estimator can achieve low total communication cost in gradient compression.

preprint2022arXiv

Generalized quantum cluster algebras: the Laurent phenomenon and upper bounds

Generalized quantum cluster algebras introduced in [1] are quantum deformation of generalized cluster algebras of geometric types. In this paper, we prove that the Laurent phenomenon holds in these generalized quantum cluster algebras. We also show that upper bounds coincide with the corresponding generalized quantum upper cluster algebras under the "coprimality" condition.

preprint2022arXiv

Periodic repeating fast radio bursts: interaction between a magnetized neutron star and its planet in an eccentric orbit

Fast radio bursts (FRBs) are mysterious transient phenomena. The study of repeating FRBs may provide useful information about their nature due to their redetectability. The two most famous repeating sources are FRBs 121102 and 180916, with a period of 157 days and 16.35 days, respectively. Previous studies suggest that the periodicity of FRBs is likely associated with neutron star (NS) binary systems. Here we introduce a new model which proposes that periodic repeating FRBs are due to the interaction of a NS with its planet in a highly elliptical orbit. The periastron of the planet is very close to the NS so that it would be partially disrupted by tidal force every time it passes through the periastron. Fragments generated in the process could interact with the compact star through the Alfvén wing mechanism and produce FRBs. The model can naturally explain the repeatability of FRBs with a period ranging from a few days to several hundred days, but it generally requires that the eccentricity of the planet's orbit should be large enough. Taking FRBs 121102 and 180916 as examples, it is shown that the main features of the observed repeating behaviors can be satisfactorily accounted for.

preprint2022arXiv

PSP: Million-level Protein Sequence Dataset for Protein Structure Prediction

Proteins are essential component of human life and their structures are important for function and mechanism analysis. Recent work has shown the potential of AI-driven methods for protein structure prediction. However, the development of new models is restricted by the lack of dataset and benchmark training procedure. To the best of our knowledge, the existing open source datasets are far less to satisfy the needs of modern protein sequence-structure related research. To solve this problem, we present the first million-level protein structure prediction dataset with high coverage and diversity, named as PSP. This dataset consists of 570k true structure sequences (10TB) and 745k complementary distillation sequences (15TB). We provide in addition the benchmark training procedure for SOTA protein structure prediction model on this dataset. We validate the utility of this dataset for training by participating CAMEO contest in which our model won the first place. We hope our PSP dataset together with the training benchmark can enable a broader community of AI/biology researchers for AI-driven protein related research.

preprint2022arXiv

Quantum cluster algebras associated to weighted projective lines

Let $\mathbb{X}_{\boldsymbol{p},\boldsymbolλ}$ be a weighted projective line. We define the quantum cluster algebra of $\mathbb{X}_{\boldsymbol{p},\boldsymbolλ}$ and realize its specialized version as the subquotient of the Hall algebra of $\mathbb{X}_{\boldsymbol{p},\boldsymbolλ}$ via the quantum cluster character map. Inspired by \cite{Chen2021}, we prove an analogue cluster multiplication formula between quantum cluster characters. As an application, we obtain the polynomial property of the cardinalities of Grassmannian varieties of exceptional coherent sheaves on $\mathbb{X}_{\boldsymbol{p},\boldsymbolλ}$ . In the end, we construct several bar-invariant $\mathbb{Z}[ν^{\pm}]$-bases for the quantum cluster algebra of the projective line $\mathbb{P}^1$ and show how it coincides with the quantum cluster algebra of the Kronecker quiver.

preprint2022arXiv

Tidal Deformability of Strange Quark Planets and Strange Dwarfs

Strange quark matter, which is composed of u, d, and s quarks, could be the true ground of matter. According to this hypothesis, compact stars may actually be strange quark stars, and there may even be stable strange quark dwarfs and strange quark planets. The detection of the binary neutron star merger event GW170817 provides us new clues on the equation of state of compact stars. In this study, the tidal deformability of strange quark planets and strange quark dwarfs are calculated. It is found that the tidal deformability of strange quark objects is smaller than that of normal matter counterparts. For a typical 0.6 M$_\odot$ compact star, the tidal deformability of a strange dwarf is about 1.4 times less than that of a normal white dwarf. The difference is even more significant between strange quark planets and normal matter planets. Additionally, if the strange quark planet is a bare one (i.e., not covered by a normal matter curst), the tidal deformability will be extremely small, which means bare strange quark planets will hardly be distorted by tidal forces. Our study clearly proves the effectiveness of identifying strange quark objects via searching for strange quark planets through gravitational-wave observations.

preprint2020arXiv

Water affects morphogenesis of growing aquatic plant leaves

Lotus leaves floating on water usually experience short-wavelength edge wrinkling that decays toward the center, while the leaves growing above water normally morph into a global bending cone shape with long rippled waves near the edge. Observations suggest that the underlying water (liquid substrate) significantly affects the morphogenesis of leaves. To understand the biophysical mechanism under such phenomena, we develop mathematical models that can effectively account for inhomogeneous differential growth of floating and free-standing leaves, to quantitatively predict formation and evolution of their morphology. We find, both theoretically and experimentally, that the short-wavelength buckled configuration is energetically favorable for growing membranes lying on liquid, while the global buckling shape is more preferable for suspended ones. Other influencing factors such as stem/vein, heterogeneity and dimension are also investigated. Our results provide a fundamental insight into a variety of plant morphogenesis affected by water foundation and suggest that such surface instabilities can be harnessed for morphology control of biomimetic deployable structures using substrate or edge actuation.