Source author record

Xiang Cheng

Xiang Cheng appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Catalog footprint

What is connected

38 works

22 topics

4 close collaborators

preprint, 2026, arXiv

AMAP Agentic Planning Technical Report

We present STAgent, an agentic large language model tailored for spatio-temporal understanding, designed to solve complex tasks such as constrained point-of-interest discovery and itinerary planning. STAgent is a specialized model capable of interacting with ten distinct tools within spatio-temporal scenarios, enabling it to explore, verify, and refine intermediate steps during complex reasoning. Notably, STAgent effectively preserves its general capabilities. We empower STAgent with these capabilities through three key contributions: (1) a stable tool environment that supports over ten domain-specific tools, enabling asynchronous rollout and training; (2) a hierarchical data curation framework that identifies high-quality data like a needle in a haystack, curating high-quality queries by retaining less than 1% of the raw data, emphasizing both diversity and difficulty; and (3) a cascaded training recipe that starts with a seed SFT stage acting as a guardian to measure query difficulty, followed by a second SFT stage fine-tuned on queries with high certainty, and an ultimate RL stage that leverages data of low certainty. Initialized with Qwen3-30B-A3B to establish a strong SFT foundation and leverage insights into sample difficulty, STAgent yields promising performance on TravelBench while maintaining its general capabilities across a wide range of general benchmarks, thereby demonstrating the effectiveness of our proposed agentic model.

preprint, 2026, arXiv

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

On-policy self-distillation, where a student is pulled toward a copy of itself conditioned on privileged context (e.g., a verified solution or feedback), offers a promising direction for advancing reasoning capability without a stronger external teacher. Yet in math reasoning the gains are inconsistent, even when the same approach succeeds elsewhere. A pointwise mutual information analysis traces the failure to the privileged context itself: it inflates the teacher's confidence on tokens already implied by the solution (structural connectives, verifiable claims) and deflates it on deliberation tokens ("Wait", "Let", "Maybe") that drive multi-step search. We propose Anti-Self-Distillation (AntiSD), which ascends a divergence between student and teacher rather than descending it: this reverses the per-token sign and yields a naturally bounded advantage in one step. An entropy-triggered gate disables the term once the teacher entropy collapses, completing a drop-in replacement for default self-distillation. Across five models from 4B to 30B parameters on math reasoning benchmarks, AntiSD reaches the GRPO baseline's accuracy in 2 to 10x fewer training steps and improves final accuracy by up to 11.5 points. AntiSD opens a path to scalable self-improvement, where a language model bootstraps its own reasoning through its training signal.

preprint, 2026, arXiv

From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation

On-policy self-distillation has emerged as a promising paradigm for post-training language models, in which the model conditions on environment feedback to serve as its own teacher, providing dense token-level rewards without external teacher models or step-level annotations. Despite its empirical success, what this reward actually measures and what kind of credit it assigns remain unclear. Under a posterior-compatibility interpretation of feedback conditioning, standard in the implicit-reward literature, we show that the self-distillation token reward is a Bayesian filtering increment whose trajectory sum is exactly the pointwise mutual information between the response and the feedback given the input. This pMI can be raised by input-specific reasoning or by input-generic shortcuts, so we further decompose the teacher log-probability along the input axis. Based on this analysis, we propose CREDIT (Contrastive REward from DIsTillation), which isolates the input-specific component with a batch-contrastive baseline. At the sequence level, CREDIT is a teacher-side surrogate for a contrastive pMI objective that also penalizes responses remaining likely under unrelated inputs. Across coding, scientific reasoning, and tool-use benchmarks on two model families, CREDIT delivers the strongest aggregate performance at negligible additional compute.
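The telescoping step behind the trajectory-sum statement can be sketched as follows. The notation is an assumption introduced here for illustration and is not taken from the paper: $x$ is the input, $y = (y_1, \dots, y_T)$ the response, $f$ the feedback, $\pi$ the model, and the token reward $r_t$ is taken to be the log-ratio between the feedback-conditioned teacher and the unconditioned student. The middle equality holds only if the feedback-conditioned distribution is the Bayesian posterior of a joint $\pi(y, f \mid x)$, which is one reading of the posterior-compatibility assumption.

```latex
r_t = \log \pi(y_t \mid x, f, y_{<t}) - \log \pi(y_t \mid x, y_{<t}),
\qquad
\sum_{t=1}^{T} r_t
  = \log \frac{\pi(y \mid x, f)}{\pi(y \mid x)}
  = \log \frac{\pi(y, f \mid x)}{\pi(y \mid x)\,\pi(f \mid x)}
  = \mathrm{pMI}(y; f \mid x).
```

The first equality is the chain rule applied to both the conditioned and the unconditioned sequence probabilities; the per-token terms sum to the two sequence-level log-probabilities.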

preprint, 2026, arXiv

GP-DHT: A Dual-Head Transformer with Contrastive Learning for Predicting Gene Regulatory Relationships across Species from Single-Cell Data

Gene regulatory networks (GRNs) are essential for understanding cell fate decisions and disease mechanisms, yet cross-species GRN inference from single-cell RNA-seq data remains challenging due to noise, sparsity, and cross-species distribution shifts. We propose GP-DHT (GenePair DualHeadTransformer), a cross-species single-cell GRN inference framework that models genes and cells in a heterogeneous graph with multi-level expression relations and learns structured regulatory representations via multi-relational graph attention. A dual-head Transformer further captures local gene pair regulatory dependencies and global cross-cell interaction patterns. To improve robustness under sparse and cross-species settings, GP-DHT introduces gene pair level supervised contrastive learning. Experiments on seven BEELINE benchmark datasets show consistent gains over representative baselines, improving AUROC and AUPRC by approximately 5 to 7 percent on most datasets. GP-DHT also recovers known regulatory modules and helps distinguish conserved from species-specific regulations.

preprint, 2026, arXiv

Inertia-Dilatancy Interplay Governs Shear-Thickening Drop Impact

Combining high-speed photography with direct force measurements, we investigate the impact dynamics of drops of cornstarch-water mixtures, a premier example of shear-thickening fluids, across a wide range of impact conditions. Our study identifies three distinct impact regimes. In addition to the liquid-like and solid-like behaviors generally expected for the impact-induced response of shear-thickening fluids, we uncover a counterintuitive regime in which high-concentration cornstarch-water mixtures display a liquid-like response at the onset of impact when shear rates are high and only transition to a solid-like behavior at later times as shear rates reduce. By integrating the classic drop-impact theory with the Reynolds-Darcy mechanism for dilatancy, we develop a unified model that quantitatively describes the impact dynamics of shear-thickening drops across all regimes. Our work reveals the unexpected response of shear-thickening fluids to ultra-fast deformation and advances fundamental understanding of drop impact for complex fluids.

preprint, 2026, arXiv

Large Wireless Foundation Models: Stronger over Bigger

AI-communication integration is widely regarded as a core enabling technology for 6G. Most existing AI-based physical-layer designs rely on task-specific models that are separately tailored to individual modules, resulting in poor generalization. In contrast, communication systems are inherently general-purpose and should support broad applicability and robustness across diverse scenarios. Foundation models offer a promising solution through strong reasoning and generalization, yet wireless-system constraints hinder a direct transfer of large language model (LLM)-style success to the wireless domain. Therefore, we introduce the concept of large wireless foundation models (LWFMs) and present a novel framework for empowering the physical layer with foundation models under wireless constraints. Specifically, we propose two paradigms for realizing LWFMs, including leveraging existing general-purpose foundation models and building novel wireless foundation models. Based on recent progress, we distill two roadmaps for each paradigm and formulate design principles under wireless constraints. We further provide case studies of LWFM-empowered wireless systems to intuitively validate their advantages. Finally, we characterize the notion of "large" in LWFMs through a multidimensional analysis of existing work and outline promising directions for future research.

preprint, 2026, arXiv

One for All: A Non-Linear Transformer can Enable Cross-Domain Generalization for In-Context Reinforcement Learning

A central challenge in reinforcement learning (RL) is to learn models that generalize beyond the tasks on which they are trained, a goal traditionally pursued through multi-task and meta RL. Recently, transformer architectures have emerged as a promising approach, enabling adaptation to new tasks via in-context learning without explicit parameter updates. From a functional perspective, a transformer can be viewed as a functional operator that maps a context to a task-specific function. It is thus fundamental to understand and design this operator to support stronger generalization in RL. In this work, we address this resulting question of generalization from a kernel-based perspective by establishing a connection between non-linear transformers and kernel-based temporal difference learning. By interpreting the transformer as performing regression in a Reproducing Kernel Hilbert Space (RKHS), we show that value functions from different domains can be represented using a shared set of weights, provided they lie within the same RKHS. Experiments on multiple MetaWorld domains support this interpretation, demonstrating convergence of the temporal-difference objective.

preprint, 2026, arXiv

Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot

In-Context Learning (ICL) is an essential emergent ability of Large Language Models (LLMs), and recent studies introduce Chain-of-Thought (CoT) to exemplars of ICL to enhance the reasoning capability, especially in mathematics tasks. However, given the continuous advancement of model capabilities, it remains unclear whether CoT exemplars still benefit recent, stronger models in such tasks. Through systematic experiments, we find that for recent strong models such as the Qwen2.5 series, adding traditional CoT exemplars does not improve reasoning performance compared to Zero-Shot CoT. Instead, their primary function is to align the output format with human expectations. We further investigate the effectiveness of enhanced CoT exemplars, constructed using answers from advanced models such as Qwen2.5-Max and DeepSeek-R1. Experimental results indicate that these enhanced exemplars still fail to improve the model's reasoning performance. Further analysis reveals that models tend to ignore the exemplars and focus primarily on the instructions, leading to no observable gain in reasoning ability. Overall, our findings highlight the limitations of the current ICL+CoT framework in mathematical reasoning, calling for a re-examination of the ICL paradigm and the definition of exemplars.

preprint, 2026, arXiv

Synesthesia of Machines (SoM)-Aided Online FDD Precoding via Heterogeneous Multi-Modal Sensing: A Vertical Federated Learning Approach

This paper investigates a heterogeneous multi-vehicle, multi-modal sensing (H-MVMM) aided online precoding problem. The proposed H-MVMM scheme utilizes a vertical federated learning (VFL) framework to minimize pilot sequence length and optimize the sum rate. This offers a promising solution for reducing latency in frequency division duplexing systems. To achieve this, three preprocessing modules are designed to transform raw sensory data into informative representations relevant to precoding. The approach effectively addresses local data heterogeneity arising from diverse on-board sensor configurations through a well-structured VFL training procedure. Additionally, a label-free online model updating strategy is introduced, enabling the H-MVMM scheme to adapt its weights flexibly. This strategy features a pseudo downlink channel state information label simulator (PCSI-Simulator), which is trained using a semi-supervised learning (SSL) approach alongside an online loss function. Numerical results show that the proposed method can closely approximate the performance of traditional optimization techniques with perfect channel state information, achieving a significant 90.6% reduction in pilot sequence length.

preprint, 2026, arXiv

Tiny-WiFo: A Lightweight Wireless Foundation Model for Channel Prediction via Multi-Component Adaptive Knowledge Distillation

The massive scale of Wireless Foundation Models (FMs) hinders their real-time deployment on edge devices. This letter moves beyond standard knowledge distillation by introducing a novel Multi-Component Adaptive Knowledge Distillation (MCAKD) framework. Key innovations include a Cross-Attention-Based Knowledge Selection (CA-KS) module that selectively identifies critical features from the teacher model, and an Autonomous Learning-Passive Learning (AL-PL) strategy that balances knowledge transfer with independent learning to achieve high training efficiency at a manageable computational cost. When applied to the WiFo FM, the distilled Tiny-WiFo model, with only 5.5M parameters, achieves a 1.6 ms inference time while retaining over 98% of WiFo's performance and its crucial zero-shot generalization capability, making real-time FM deployment viable.

preprint, 2026, arXiv

WiFo-E: A Scalable Wireless Foundation Model for End-to-End FDD Precoding in Communication Networks

Accurate precoding in massive multiple-input multiple-output (MIMO) frequency-division duplexing (FDD) systems relies on efficient channel state information (CSI) acquisition. End-to-end learning frameworks improve performance by jointly optimizing this process, but they lack scalability and fail to generalize across different system configurations, such as varying numbers of antennas and users. To overcome this limitation, we introduce WiFo-E, a wireless foundation model designed for scalable end-to-end precoding. WiFo-E employs multi-task pretraining on a diverse set of configurations to learn transferable representations of underlying wireless principles. Central to the model is a sparse Mixture-of-Experts (MoE) Transformer architecture, which mitigates task interference and enhances training efficiency by activating specialized parameter subsets adaptively. Extensive simulations demonstrate that WiFo-E outperforms conventional per-configuration training and shows strong generalization to unseen system configurations, providing a flexible and efficient foundation for adaptive massive MIMO precoding.

preprint, 2026, arXiv

WiFo-MUD: Wireless Foundation Model for Heterogeneous Multi-User Demodulator

Multi-user signal demodulation is critical to wireless communications, directly impacting transmission reliability and efficiency. However, existing demodulators underperform in generic multi-user environments: classical demodulators struggle to balance accuracy and complexity, while deep learning-based methods lack adaptability under heterogeneous configurations. Although diffusion models have been introduced for demodulation, their flexibility remains limited for practical use. To address these issues, this work proposes WiFo-MUD, a universal diffusion-based foundation model for multi-user demodulation. The model aligns inter-user signal-to-noise ratio imbalance and performs conditional denoising via a customized backbone. Furthermore, a communication-aware consistency distillation method and a dynamic user-grouping strategy are devised to enhance inference. WiFo-MUD achieves state-of-the-art results on large-scale heterogeneous datasets, demonstrating efficient inference and strong generalization across varying system configurations.

preprint, 2025, arXiv

LLMs for Explainable Business Decision-Making: A Reinforcement Learning Fine-Tuning Approach

Artificial Intelligence (AI) models increasingly drive high-stakes consumer interactions, yet their decision logic often remains opaque. Prevailing explainable AI techniques rely on post hoc numerical feature attributions, which fail to provide coherent narratives behind model decisions. Large language models (LLMs) present an opportunity to generate natural-language explanations, but three design challenges remain unresolved: explanations must be both decision-correct and faithful to the factors that drive the prediction; they should be able to serve multiple audiences without shifting the underlying decision rule; and they should be trained in a label-efficient way that does not depend on large corpora of human-scored explanations. To address these challenges, we introduce LEXMA (LLM-based EXplanations for Multi-Audience decisions), a reinforcement-learning-based fine-tuning framework that produces narrative-driven, audience-appropriate explanations. LEXMA combines reflection-augmented supervised fine-tuning with two stages of Group Relative Policy Optimization (GRPO). Specifically, it fine-tunes two separate parameter sets to improve decision correctness and satisfy stylistic requirements for different audiences, using reward signals that do not rely on human-annotated explanations. We instantiate LEXMA in the context of mortgage approval decisions. Results demonstrate that LEXMA yields significant improvements in predictive performance compared with other LLM baselines. Moreover, human evaluations show that expert-facing explanations generated by our approach are more risk-focused, and consumer-facing explanations are clearer, more actionable, and more polite. Our study contributes a cost-efficient, systematic LLM fine-tuning approach to enhance explanation quality for business decisions, offering strong potential for scalable deployment of transparent AI systems.

preprint, 2022, arXiv

Stress distribution and surface shock wave of drop impact

Drop impact causes severe surface erosion, dictating many important natural, environmental and engineering processes and calling for substantial prevention and preservation efforts. Nevertheless, despite extensive studies on the kinematic features of impacting drops over the last two decades, the dynamic process that leads to the drop-impact erosion is still far from clear. Here, we develop a method of high-speed stress microscopy, which measures the key dynamic properties of drop impact responsible for erosion, i.e., the shear stress and pressure distributions of impacting drops, with unprecedented spatiotemporal resolutions. Our experiments reveal the fast propagation of self-similar noncentral stress maxima underneath impacting drops and quantify the shear force on impacted substrates. Moreover, we examine the deformation of elastic substrates under impact and uncover impact-induced surface shock waves. Our study opens the door for quantitative measurements of the impact stress of liquid drops and sheds light on the origin of low-speed drop-impact erosion.

preprint, 2022, arXiv

The colloidal nature of complex fluids leads to enhanced motility of flagellated bacteria

The natural habitats of microorganisms in the human microbiome and ocean and soil ecosystems are full of colloids and macromolecules, which impart non-Newtonian flow properties drastically affecting the locomotion of swimming microorganisms. Although the low-Reynolds-number hydrodynamics of the swimming of flagellated bacteria in simple Newtonian fluids has been well developed, our understanding of bacterial motility in complex non-Newtonian fluids is still primitive. Even after six decades of research, fundamental questions about the nature and origin of bacterial motility enhancement in polymer solutions are still under debate. Here, we study the motility of flagellated bacteria in colloidal suspensions of varying sizes and volume fractions. We find that bacteria in dilute colloidal suspensions display quantitatively the same motile behaviors as those in dilute polymer solutions, where a universal particle-size-dependent motility enhancement up to 80% is uncovered, accompanied by strong suppression of bacterial wobbling. By virtue of the well-controlled size and the hard-sphere nature of colloids, the finding not only resolves the long-standing controversy over bacterial motility enhancement in complex fluids but also challenges all the existing theories using polymer dynamics to address the swimming of flagellated bacteria in dilute polymer solutions. We further develop a simple physical model incorporating the colloidal nature of complex fluids, which quantitatively explains bacterial wobbling dynamics and mobility enhancement in both colloidal and polymeric fluids. Our study sheds light on the puzzling motile behaviors of bacteria in complex fluids relevant to a wide range of microbiological processes and provides a cornerstone in engineering bacterial swimming in complex environments.

preprint, 2022, arXiv

Tuning the rheology and microstructure of particle-laden fluid interfaces with Janus particles

Hypothesis: Particle-laden fluid interfaces are the central component of many natural and engineering systems. Understanding the mechanical properties and improving the stability of such interfaces are of great practical importance. Janus particles, a special class of heterogeneous colloids, might be utilized as an effective surface-active agent to control the assembly and interfacial rheology of particle-laden fluid interfaces. Experiments: Using a custom-built interfacial stress rheometer, we explore the effect of Janus particle additives on the interfacial rheology and microscopic structure of particle-laden fluid interfaces. Findings: We find that the addition of a small amount of platinum-polystyrene (Pt-PS) Janus particles within a monolayer of PS colloids (1:40 number ratio) can lead to more than an order-of-magnitude increase in surface moduli with enhanced elasticity, which greatly improves the stability of the interface. This drastic change in interfacial rheology is associated with the formation of local particle clusters surrounding each Janus particle. We further explain the origin of local particle clusters by considering the interparticle interactions at the interface. Our experiments reveal the effect of local particle structures on the macroscopic rheological behaviors of particle monolayers and demonstrate a new way to tune the microstructure and mechanical properties of particle-laden fluid interfaces.

preprint, 2021, arXiv

Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples

Fuzzy similarity join is an important database operator widely used in practice. So far the research community has focused exclusively on optimizing fuzzy-join scalability. However, practitioners today also struggle to optimize fuzzy-join quality, because they face a daunting space of parameters (e.g., distance functions, distance thresholds, tokenization options), and often have to resort to a manual trial-and-error approach to program these parameters in order to optimize fuzzy-join quality. This key challenge of automatically generating high-quality fuzzy-join programs has received surprisingly little attention thus far. In this work, we study the problem of "auto-program" fuzzy-joins. Leveraging a geometric interpretation of distance functions, we develop an unsupervised Auto-FuzzyJoin framework that can infer suitable fuzzy-join programs on given input tables, without requiring explicit human input such as labeled training data. Using Auto-FuzzyJoin, users only need to provide two input tables $L$ and $R$, and a desired precision target $\tau$ (say 0.9). Auto-FuzzyJoin leverages the fact that one of the inputs is a reference table to automatically program fuzzy-joins that meet the precision target $\tau$ in expectation, while maximizing fuzzy-join recall (defined as the number of correctly joined records). Experiments on both existing benchmarks and a new benchmark with 50 fuzzy-join tasks created from Wikipedia data suggest that the proposed Auto-FuzzyJoin significantly outperforms existing unsupervised approaches, and is surprisingly competitive even against supervised approaches (e.g., Magellan and DeepMatcher) when 50% of ground-truth labels are used as training data.
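To make the parameter space concrete, here is a minimal sketch of a single hand-written fuzzy-join program: one tokenization, one distance function, and one threshold. It is not the Auto-FuzzyJoin algorithm, which chooses such parameters automatically. All names (`tokenize`, `jaccard_distance`, `fuzzy_join`) and the two small tables are hypothetical and introduced only for illustration.

```python
def tokenize(text):
    """Tokenization option: lowercase, whitespace-separated word set."""
    return set(text.lower().split())


def jaccard_distance(a, b):
    """Distance function: 1 minus the Jaccard similarity of the token sets."""
    ta, tb = tokenize(a), tokenize(b)
    if not ta and not tb:
        return 0.0
    return 1.0 - len(ta & tb) / len(ta | tb)


def fuzzy_join(left, reference, threshold):
    """Join each left record to its closest reference record.

    A pair is kept only if its distance is at most `threshold`.
    """
    if not reference:
        return []
    matches = []
    for record in left:
        best = min(reference, key=lambda r: jaccard_distance(record, r))
        if jaccard_distance(record, best) <= threshold:
            matches.append((record, best))
    return matches


# Hypothetical input tables; the second plays the role of the reference table.
left_table = ["Univ of Chicago", "MIT", "Stanford Univ"]
reference_table = [
    "University of Chicago",
    "Stanford University",
    "Massachusetts Institute of Technology",
]
```

With these tables, a threshold of 0.6 keeps only the Chicago pair (distance 0.5), while raising it to 0.7 also admits the Stanford pair (distance 2/3). "MIT" shares no tokens with any reference record and is never joined under this tokenization, which illustrates why the choice of tokenization, distance function, and threshold together determines join quality.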

preprint, 2021, arXiv

Imaging the emergence of bacterial turbulence: phase diagram and transition kinetics

We experimentally study the emergence of collective bacterial swimming, a phenomenon often referred to as bacterial turbulence. A phase diagram of the flow of 3D E. coli suspensions spanned by bacterial concentration, the swimming speed of bacteria and the number fraction of active swimmers is systematically mapped, which shows quantitative agreement with kinetic theories and demonstrates the dominant role of hydrodynamic interactions in bacterial collective swimming. More importantly, we trigger bacterial turbulence by suddenly increasing the swimming speed of light-powered bacteria and image the transition to the turbulence in real time. Our experiments identify two unusual kinetic pathways, i.e., the one-step transition with long incubation periods near the phase boundary and the two-step transition driven by long-wavelength instabilities deep inside the turbulent phase. Our study provides not only a quantitative verification of existing theories, but also new insights into interparticle interactions and transition kinetics of bacterial turbulence.

preprint, 2020, arXiv

Answering Multi-Dimensional Range Queries under Local Differential Privacy

In this paper, we tackle the problem of answering multi-dimensional range queries under local differential privacy. There are three key technical challenges: capturing the correlations among attributes, avoiding the curse of dimensionality, and dealing with the large domains of attributes. None of the existing approaches satisfactorily deals with all three challenges. Overcoming these three challenges, we first propose an approach called Two-Dimensional Grids (TDG). Its main idea is to carefully use binning to partition the two-dimensional (2-D) domains of all attribute pairs into 2-D grids that can answer all 2-D range queries and then estimate the answer of a higher dimensional range query from the answers of the associated 2-D range queries. However, in order to reduce errors due to noise, coarse granularities are needed for each attribute in 2-D grids, losing fine-grained distribution information for individual attributes. To correct this deficiency, we further propose Hybrid-Dimensional Grids (HDG), which also introduces 1-D grids to capture finer-grained information on the distribution of each individual attribute and combines information from 1-D and 2-D grids to answer range queries. To make HDG consistently effective, we provide a guideline for properly choosing granularities of grids based on an analysis of how different sources of errors are impacted by these choices. Extensive experiments conducted on real and synthetic datasets show that HDG can give a significant improvement over the existing approaches.

preprint, 2020, arXiv

Sharp convergence rates for Langevin dynamics in the nonconvex setting

We study the problem of sampling from a distribution $p^*(x) \propto \exp\left(-U(x)\right)$, where the function $U$ is $L$-smooth everywhere and $m$-strongly convex outside a ball of radius $R$, but potentially nonconvex inside this ball. We study both overdamped and underdamped Langevin MCMC and establish upper bounds on the number of steps required to obtain a sample from a distribution that is within $\epsilon$ of $p^*$ in $1$-Wasserstein distance. For the first-order method (overdamped Langevin MCMC), the iteration complexity is $\tilde{\mathcal{O}}\left(e^{cLR^2}d/\epsilon^2\right)$, where $d$ is the dimension of the underlying space. For the second-order method (underdamped Langevin MCMC), the iteration complexity is $\tilde{\mathcal{O}}\left(e^{cLR^2}\sqrt{d}/\epsilon\right)$ for an explicit positive constant $c$. Surprisingly, the iteration complexity for both these algorithms is only polynomial in the dimension $d$ and the target accuracy $\epsilon$. It is exponential, however, in the problem parameter $LR^2$, which is a measure of non-log-concavity of the target distribution.
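As a rough illustration of the first-order method, the sketch below runs the standard overdamped Langevin update $x_{k+1} = x_k - \eta \nabla U(x_k) + \sqrt{2\eta}\,\xi_k$ in one dimension. It is not the paper's analysis or code. The potential $U(x) = x^2/2 - \log\cosh(ax)$ is an assumption chosen here because it matches the stated setting: it is the negative log-density of an equal-weight mixture of $N(a, 1)$ and $N(-a, 1)$, has bounded second derivative $1 - a^2\,\mathrm{sech}^2(ax)$, is nonconvex near the origin when $a > 1$, and is strongly convex far from it. The names `grad_U` and `overdamped_langevin` and the step size are likewise illustrative.

```python
import math
import random


def grad_U(x, a=2.0):
    """Gradient of the assumed potential U(x) = x^2/2 - log(cosh(a*x))."""
    return x - a * math.tanh(a * x)


def overdamped_langevin(x0, step, n_steps, rng, a=2.0):
    """Run the overdamped Langevin update and return all iterates.

    Each step applies: x <- x - step * grad_U(x) + sqrt(2 * step) * N(0, 1).
    """
    x = x0
    samples = []
    for _ in range(n_steps):
        noise = rng.gauss(0.0, 1.0)
        x = x - step * grad_U(x, a) + math.sqrt(2.0 * step) * noise
        samples.append(x)
    return samples
```

A call such as `overdamped_langevin(0.0, 0.01, 20000, random.Random(0))` produces a chain that hops between the two wells near $\pm a$. For this target the exact second moment is $1 + a^2$, so the average of the squared iterates over a long run should land near 5 when $a = 2$, up to discretization and Monte Carlo error. The underdamped (second-order) variant adds a velocity variable and is not shown.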

preprint, 2016, arXiv

A multi-axis confocal rheoscope for studying shear flow of structured fluids

We present a new design for a confocal rheoscope that enables uniform uniaxial or biaxial shear. The design consists of two precisely-positioned parallel plates with a gap that can be adjusted down to $2 \pm 0.1$ μm, allowing for the exploration of confinement effects. By using our shear cell in conjunction with a biaxial force measurement device and a high-speed confocal microscope, we are able to measure the real-time biaxial stress while simultaneously imaging the material 3D structure. We illustrate the importance of the instrument capabilities by discussing the applications of this instrument in current and future research topics in colloidal suspensions.

preprint, 2016, arXiv

Asymptotic behavior of $\ell_p$-based Laplacian regularization in semi-supervised learning

Given a weighted graph with $N$ vertices, consider a real-valued regression problem in a semi-supervised setting, where one observes $n$ labeled vertices, and the task is to label the remaining ones. We present a theoretical study of $\ell_p$-based Laplacian regularization under a $d$-dimensional geometric random graph model. We provide a variational characterization of the performance of this regularized learner as $N$ grows to infinity while $n$ stays constant; the associated optimality conditions lead to a partial differential equation that must be satisfied by the associated function estimate $\hat{f}$. From this formulation we derive several predictions on the limiting behavior of the $d$-dimensional function $\hat{f}$, including (a) a phase transition in its smoothness at the threshold $p = d + 1$, and (b) a tradeoff between smoothness and sensitivity to the underlying unlabeled data distribution $P$. Thus, over the range $p \leq d$, the function estimate $\hat{f}$ is degenerate and "spiky," whereas for $p \geq d+1$, the function estimate $\hat{f}$ is smooth. We show that the effect of the underlying density vanishes monotonically with $p$, such that in the limit $p = \infty$, corresponding to the so-called Absolutely Minimal Lipschitz Extension, the estimate $\hat{f}$ is independent of the distribution $P$. Under the assumption of semi-supervised smoothness, ignoring $P$ can lead to poor statistical performance; in particular, we construct a specific example for $d=1$ to demonstrate that $p=2$ has lower risk than $p=\infty$ due to the former penalty adapting to $P$ and the latter ignoring it. We also provide simulations that verify the accuracy of our predictions for finite sample sizes. Together, these properties show that $p = d+1$ is an optimal choice, yielding a function estimate $\hat{f}$ that is both smooth and non-degenerate, while remaining maximally sensitive to $P$.

preprint, 2016, arXiv

Diffusion of an ellipsoid in bacterial suspensions

Active matter such as swarming bacteria and motile colloids exhibits exotic properties different from conventional equilibrium materials. Among these properties, the enhanced diffusion of tracer particles is generally deemed a hallmark of active matter. Here, rather than spherical tracers, we investigate the diffusion of isolated ellipsoids in a quasi-two-dimensional bacterial bath. Our study reveals a nonlinear enhancement of both translational and rotational diffusions. More importantly, we uncover an anomalous coupling between translation and rotation that is strictly prohibited in the classic Brownian diffusion. Combining experiments with theoretical modeling, we show that such an anomaly arises from generic stretching flows induced by swimming bacteria. Our work illustrates a universal organizing principle of active matter and sheds new light on fundamental transport processes in microbiological systems.

preprint2016arXiv

Dynamics of ellipsoidal tracers in swimming algal suspensions

Enhanced diffusion of passive tracers immersed in active fluids is a universal feature of active fluids and has been extensively studied in recent years. Similar to microrheology for equilibrium complex fluids, the unusual enhanced particle dynamics reveal intrinsic properties of active fluids. Nevertheless, previous studies have shown that the translational dynamics of spherical tracers are qualitatively similar, independent of whether active particles are pushers or pullers---the two fundamental classes of active fluids. Is it possible to distinguish pushers from pullers by simply imaging the dynamics of passive tracers? Here, we investigated the diffusion of isolated ellipsoids in algal C. reinhardtii suspensions---a model for puller-type active fluids. In combination with our previous results on pusher-type E. coli suspensions [Peng et al., Phys. Rev. Lett. 116, 068303 (2016)], we showed that the dynamics of asymmetric tracers show a profound difference in pushers and pullers due to their rotational degree of freedom. Although the laboratory-frame translation and rotation of ellipsoids are enhanced in both pushers and pullers, similar to spherical tracers, the anisotropic diffusion in the body frame of ellipsoids shows opposite trends in the two classes of active fluids. An ellipsoid diffuses fastest along its major axis when immersed in pullers, whereas it diffuses slowest along the major axis in pushers. This striking difference can be qualitatively explained using a simple hydrodynamic model. In addition, our study on algal suspensions reveals that the influence of the near-field advection of algal swimming flows on the translation and rotation of ellipsoids shows different ranges and strengths. Our work provides not only new insights into universal organizing principles of active fluids, but also a convenient tool for detecting the class of active particles.

preprint2016arXiv

Firebird: Predicting Fire Risk and Prioritizing Fire Inspections in Atlanta

The Atlanta Fire Rescue Department (AFRD), like many municipal fire departments, actively works to reduce fire risk by inspecting commercial properties for potential hazards and fire code violations. However, AFRD's fire inspection practices relied on tradition and intuition, with no existing data-driven process for prioritizing fire inspections or identifying new properties requiring inspection. In collaboration with AFRD, we developed the Firebird framework to help municipal fire departments identify and prioritize commercial property fire inspections, using machine learning, geocoding, and information visualization. Firebird computes fire risk scores for over 5,000 buildings in the city, with true positive rates of up to 71% in predicting fires. It has identified 6,096 new potential commercial properties to inspect, based on AFRD's criteria for inspection. Furthermore, through an interactive map, Firebird integrates and visualizes fire incidents, property information and risk scores to help AFRD make informed decisions about fire inspections. Firebird has already begun to make a positive impact at both local and national levels. It is improving AFRD's inspection processes and Atlanta residents' safety, and was highlighted by the National Fire Protection Association (NFPA) as a best practice for using data to inform fire inspections.

preprint2016arXiv

Structures and dynamics of glass-forming colloidal liquids under spherical confinement

Recent theories predict that when a supercooled liquid approaches the glass transition, particle clusters with a special "amorphous order" nucleate within the liquid, which lead to static correlations dictating the dramatic slowdown of liquid relaxation. The prediction, however, has yet to be verified in 3D experiments. Here, we design a colloidal system, where particles are confined inside spherical cavities with an amorphous layer of particles pinned at the boundary. Using this novel system, we capture the amorphous-order particle clusters and demonstrate the development of a static correlation. Moreover, by investigating the dynamics of spherically confined samples, we reveal a profound influence of the static correlation on the relaxation of colloidal liquids. In analogy to glass-forming liquids with randomly pinned particles, we propose a simple relation for the change of the configurational entropy of confined colloidal liquids, which quantitatively explains our experimental findings and illustrates a divergent static length scale during the colloidal glass transition.

preprint2015arXiv

Interfacial silica nanoparticles stabilize cocontinuous polymer blends

We investigated the formation of cocontinuous structures in polymer blends. These polymeric bijels (bicontinuous interfacially jammed emulsion gels) were composed of polystyrene oligomer, polybutene and fluorescent hydrophobic silica nanoparticles. A micron-sized cocontinuous morphology was stabilized by a monolayer of silica nanoparticles at the interface. Real-time observation of coalescence dynamics in cocontinuous polymer blends stabilized by interfacial particles was achieved for the first time via laser scanning confocal microscopy. We demonstrated that suppression of coalescence arises from coverage of interfaces by nanoparticles. Furthermore, by combining confocal microscopy with rheology, we correlated the rheological response of a cocontinuous structure with its morphology change. We found that the rheological behavior can be attributed to competition between interface shrinkage and particle network formation. In addition, we showed that a particle scaffold is maintained even after the remixing of two polymer phases above the spinodal point. Finally, we also discussed differences between the shear response of the particle-stabilized cocontinuous structure and normal colloidal gels: the former is more fragile than the latter under shear.

preprint2015arXiv

Jamming in Hierarchical Networks

We study the Biroli-Mezard model for lattice glasses on a number of hierarchical networks. These networks combine certain lattice-like features with a recursive structure that makes them suitable for exact renormalization group studies and provide an alternative to the mean-field approach. In our numerical simulations here, we first explore their equilibrium properties with the Wang-Landau algorithm. Then, we investigate their dynamical behavior using a grand-canonical annealing algorithm. We find that the dynamics readily falls out of equilibrium and jams in many of our networks with certain constraints on the neighborhood occupation imposed by the Biroli-Mezard model, even in cases where exact results indicate that no ideal glass transition exists. But while we find that time-scales for the jams diverge, our simulations cannot ascertain such a divergence for a packing fraction distinctly above random close packing. In cases where we allow hopping in our dynamical simulations, the jams on these networks generally disappear.

preprint2015arXiv

Scaling of liquid-drop impact craters in wet granular media

Combining high-speed photography with laser profilometry, we study the dynamics and the morphology of liquid-drop impact cratering in wet granular media---a ubiquitous phenomenon relevant to many important geological, agricultural, and industrial processes. By systematically investigating important variables such as impact energy, the size of impinging drops and the degree of liquid saturation in granular beds, we uncover a novel scaling for the size of impact craters. We show that this scaling can be explained by considering the balance between the inertia of impinging drops and the strength of the impacted surface. Such a theoretical understanding confirms that the unique energy partition originally proposed for liquid-drop impact cratering in dry granular media also applies to impact cratering in wet granular media. Moreover, we demonstrate that compressive stresses, instead of shear stresses, control granular impact cratering. Our study enriches the picture of generic granular impact cratering and sheds light on the familiar phenomena of raindrop impacts in granular media.

preprint2014arXiv

Granular impact cratering by liquid drops: Understanding raindrop imprints through an analogy to asteroid strikes

When a granular material is impacted by a sphere, its surface deforms like a liquid yet it preserves a circular crater like a solid. Although the mechanism of granular impact cratering by solid spheres is well explored, our knowledge of granular impact cratering by liquid drops is still very limited. Here, by combining high-speed photography with high-precision laser profilometry, we investigate liquid-drop impact dynamics on a granular surface and monitor the morphology of the resulting impact craters. Surprisingly, we find that, despite the enormous energy and length difference, granular impact cratering by liquid drops follows the same energy scaling and reproduces the same crater morphology as that of asteroid impact craters. Inspired by this similarity, we integrate the physical insight from planetary sciences, the liquid marble model from fluid mechanics and the concept of jamming transition from granular physics into a simple theoretical framework that quantitatively describes all the main features of liquid-drop imprints in granular media. Our study sheds light on the mechanisms governing raindrop impacts on granular surfaces and reveals a remarkable analogy between familiar phenomena of raining and catastrophic asteroid strikes.

preprint2013arXiv

Bi-axial shear of confined colloidal suspensions: the structure and rheology of the vorticity-aligned string phase

Using a novel bi-axial confocal rheoscope, we investigate the structure and rheology of sheared colloidal suspensions under confinement. Consistent with previous work [X. Cheng et al., Proc. Natl. Acad. Sci. U. S. A., 2011, 109, 63], we observe a vorticity-aligned string phase in moderately concentrated colloidal suspensions under uniaxial shear. Using bi-axial shear protocols, we directly manipulate the orientation and morphology of the string structures. Simultaneously, we measure the suspension rheology along both the flow and vorticity directions with a bi-axial force measurement device. Our results demonstrate that despite the highly anisotropic microstructure, the suspension viscosity remains isotropic and constant over the shear rates explored. These results suggest that hydrodynamic contributions dominate the suspension response. In addition, they highlight the capabilities of bi-axial confocal rheoscopes for elucidating the relationship between microstructure and rheology in complex fluids.

preprint2013arXiv

Far-from-equilibrium Sheared Colloidal Liquids: Disentangling Relaxation, Advection, and Shear-induced Diffusion

Using high-speed confocal microscopy, we measure the particle positions in a colloidal suspension under large amplitude oscillatory shear. Using the particle positions we quantify the in situ anisotropy of the pair-correlation function -- a measure of the Brownian stress. From these data, we find two distinct types of responses as the system crosses over from equilibrium to far-from-equilibrium states. The first is a nonlinear amplitude saturation that arises from shear-induced advection, while the second is a linear frequency saturation due to competition between suspension relaxation and shear rate. In spite of their different underlying mechanisms, we show that all the data can be scaled onto a master curve that spans the equilibrium and far-from-equilibrium regimes, linking small amplitude oscillatory to continuous shear. This observation illustrates a colloidal analog of the Cox-Merz rule and its microscopic underpinning. Brownian Dynamics simulations show that interparticle interactions are sufficient for generating both experimentally observed saturations.

preprint2012arXiv

Efficiency Resource Allocation for Device-to-Device Underlay Communication Systems: A Reverse Iterative Combinatorial Auction Based Approach

Peer-to-peer communication has recently been considered a popular topic for local area services. An innovative resource allocation scheme is proposed to improve the performance of mobile peer-to-peer, i.e., device-to-device (D2D), communications as an underlay in downlink (DL) cellular networks. To optimize the system sum rate over the resource sharing of both D2D and cellular modes, we introduce a reverse iterative combinatorial auction as the allocation mechanism. In the auction, all the spectrum resources are considered as a set of resource units, which, as bidders, compete to obtain business, while the packages of the D2D pairs are auctioned off as goods in each auction round. We first formulate the valuation of each resource unit as a basis for the proposed auction. We then explain a detailed non-monotonic descending price auction algorithm that depends on the utility function accounting for the channel gain from D2D and the costs for the system. Further, we prove that the proposed auction-based scheme is cheat-proof and converges in a finite number of iteration rounds. We explain non-monotonicity in the price update process and show lower complexity compared to a traditional combinatorial allocation. The simulation results demonstrate that the algorithm efficiently leads to good performance on the system sum rate.

preprint2012arXiv

Large number of receptors may reduce cellular response time variation

Cells often have tens of thousands of receptors, even though only a few activated receptors can trigger full cellular responses. Reasons for the overabundance of receptors remain unclear. We suggest that, in certain conditions, the large number of receptors results in a competition among receptors to be the first to activate the cell. The competition decreases the variability of the time to cellular activation, and hence results in a more synchronous activation of cells. We argue that, in simple models, this variability reduction does not necessarily interfere with the receptor specificity to ligands achieved by the kinetic proofreading mechanism. Thus cells can be activated accurately in time and specifically to certain signals. We predict the minimum number of receptors needed to reduce the coefficient of variation for the time to activation following binding of a specific ligand. Further, we predict the maximum number of receptors so that the kinetic proofreading mechanism still can improve the specificity of the activation. These predictions fall in line with experimentally reported receptor numbers for multiple systems.

preprint2010arXiv

Experimental study of the jamming transition at zero temperature

We experimentally investigate jamming in a quasi-two-dimensional granular system of automatically swelling particles and show that a maximum in the height of the first peak of the pair correlation function is a structural signature of the jamming transition at zero temperature. The same signature is also found in the second peak of the pair correlation function, but not in the third peak, reflecting the underlying singularity of the jamming transition. We also study the development of clusters in this system. A static length scale extracted from the cluster structure reaches the size of the system when the system approaches the jamming point. Finally, we show that in a highly inhomogeneous system, friction causes the system to jam in a series of steps. In this case, jamming may be obtained through successive buckling of force chains.

preprint2010arXiv

Packing structure of a two-dimensional granular system through the jamming transition

We have performed a novel experiment on granular packs composed of automatically swelling particles. By analyzing the Voronoi structure of packs going through the jamming transition, we show that the local configuration of a jamming pack is strikingly similar to that of a glass-forming liquid, both in terms of their universal area distribution and the process of defect annealing. Furthermore, we demonstrate that an unambiguous structural signature of the jamming transition can be obtained from the pair correlation functions of a pack. Our study provides insights into the structural properties of general jamming systems.

preprint2005arXiv

The Effect of Air on Granular Size Separation in a Vibrated Granular Bed

Using high-speed video and magnetic resonance imaging (MRI), we study the motion of a large sphere in a vertically vibrated bed of smaller grains. As previously reported, we find a non-monotonic density dependence of the rise and sink time of the large sphere. We find that this density dependence is solely due to air drag. We investigate in detail how the motion of the intruder sphere is influenced by the size of the background particles, initial vertical position in the bed, ambient pressure and convection. We explain our results in the framework of a simple model and find quantitative agreement in key aspects with numerical simulations of the model equations.

preprint2005arXiv

Three-dimensional shear in granular flow

The evolution of granular shear flow is investigated as a function of height in a split-bottom Couette cell. Using particle tracking, magnetic-resonance imaging, and large-scale simulations we find a transition in the nature of the shear as a characteristic height $H^*$ is exceeded. Below $H^*$ there is a central stationary core; above $H^*$ we observe the onset of additional axial shear associated with torsional failure. Radial and axial shear profiles are qualitatively different: the radial extent is wide and increases with height while the axial width remains narrow and fixed.

38 published item(s)

AMAP Agentic Planning Technical Report

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation

GP-DHT: A Dual-Head Transformer with Contrastive Learning for Predicting Gene Regulatory Relationships across Species from Single-Cell Data

Inertia-Dilatancy Interplay Governs Shear-Thickening Drop Impact

Large Wireless Foundation Models: Stronger over Bigger

One for All: A Non-Linear Transformer can Enable Cross-Domain Generalization for In-Context Reinforcement Learning

Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot

Synesthesia of Machines (SoM)-Aided Online FDD Precoding via Heterogeneous Multi-Modal Sensing: A Vertical Federated Learning Approach

Tiny-WiFo: A Lightweight Wireless Foundation Model for Channel Prediction via Multi-Component Adaptive Knowledge Distillation

WiFo-E: A Scalable Wireless Foundation Model for End-to-End FDD Precoding in Communication Networks

WiFo-MUD: Wireless Foundation Model for Heterogeneous Multi-User Demodulator

LLMs for Explainable Business Decision-Making: A Reinforcement Learning Fine-Tuning Approach

Stress distribution and surface shock wave of drop impact

The colloidal nature of complex fluids leads to enhanced motility of flagellated bacteria

Tuning the rheology and microstructure of particle-laden fluid interfaces with Janus particles

Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples

Imaging the emergence of bacterial turbulence: phase diagram and transition kinetics

Answering Multi-Dimensional Range Queries under Local Differential Privacy

Sharp convergence rates for Langevin dynamics in the nonconvex setting

A multi-axis confocal rheoscope for studying shear flow of structured fluids

Asymptotic behavior of $\ell_p$-based Laplacian regularization in semi-supervised learning

Diffusion of an ellipsoid in bacterial suspensions

Dynamics of ellipsoidal tracers in swimming algal suspensions

Firebird: Predicting Fire Risk and Prioritizing Fire Inspections in Atlanta

Structures and dynamics of glass-forming colloidal liquids under spherical confinement

Interfacial silica nanoparticles stabilize cocontinuous polymer blends

Jamming in Hierarchical Networks

Scaling of liquid-drop impact craters in wet granular media

Granular impact cratering by liquid drops: Understanding raindrop imprints through an analogy to asteroid strikes

Bi-axial shear of confined colloidal suspensions: the structure and rheology of the vorticity-aligned string phase

Far-from-equilibrium Sheared Colloidal Liquids: Disentangling Relaxation, Advection, and Shear-induced Diffusion

Efficiency Resource Allocation for Device-to-Device Underlay Communication Systems: A Reverse Iterative Combinatorial Auction Based Approach

Large number of receptors may reduce cellular response time variation

Experimental study of the jamming transition at zero temperature

Packing structure of a two-dimensional granular system through the jamming transition

The Effect of Air on Granular Size Separation in a Vibrated Granular Bed

Three-dimensional shear in granular flow