Researcher profile

Cheng Xu

Cheng Xu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

VisMMOE: Exploiting Visual-Expert Affinity for Efficient Visual-Language MoE Offloading

Large-scale vision-language mixture-of-experts (VL-MoE) models provide strong multimodal capability, but efficient deployment on memory-constrained platforms remains difficult. Existing MoE offloading systems are largely designed for text-centric workloads and become much less effective for visual-heavy inputs, where large numbers of visual tokens induce broader and less predictable expert accesses. We present VisMMoE, a VL-MoE offloading system built on a single systems insight: pruning redundant visual tokens can improve offloading not only by reducing computation, but also by reshaping expert demand. We refer to this effect as \textit{visual-expert affinity}: token pruning makes expert accesses more concentrated within layers and more stable across layers, producing a smaller and more predictable expert working set. Guided by this insight, VisMMoE combines affinity-aware token compression, lookahead expert prediction, and cache/pipeline orchestration to improve expert locality and prefetch effectiveness under tight memory budgets. We implement VisMMoE on multiple frameworks and evaluate it on representative VL-MoE models and benchmarks. VisMMoE improves end-to-end inference performance by up to 2.68x and 1.61x, respectively, over strong baselines for today's VL-MoE deployments while maintaining competitive accuracy.

preprint2025arXiv

Chiral superconductivity from spin polarized Chern band in twisted MoTe$_2$

Superconductivity has been observed in twisted MoTe2 within the anomalous Hall metal parent state. Key signatures-including a fully spin/valley polarized normal state, anomalous Hall resistivity hysteresis, superconducting phase adjacent to the fractional Chern insulating state and a narrow superconducting dome at zero gating field-collectively indicate chiral superconductivity driven by intravalley pairing of electrons. Within the Kohn-Luttinger mechanism, we compute the superconducting phase diagram via random phase approximation, incorporating Coulomb repulsion in a realistic continuum model. Our results identify a dominant intravalley pairing with a narrow superconducting dome of p+ip type at zero gate field. This chiral phase contrasts sharply with the much weaker time-reversal-symmetric intervalley pairing at finite gating field. Our work highlights the role of band topology in achieving robust topological superconductivity, and supports the chiral and topological nature of the superconductivity observed in twisted MoTe2.

preprint2024arXiv

Multiple Chern bands in twisted MoTe$_2$ and possible non-Abelian states

We investigate the moiré band structures and possible even denominator fractional quantum Hall state in small angle twisted bilayer MoTe$_2$, using combined large-scale local basis density functional theory calculation and continuum model exact diagonalization. Via large-scale first principles calculations at $θ=1.89^{\circ}$, we find a sequence of $C=1$(Chern number in K valley)moiré Chern bands, in analogy to Landau levels. By constructing the continuum model with multiple Chern bands, we undertake band-projected exact diagonalization using unscreened Coulomb repulsion to identify possible non-Abelian states near twist angle $θ=1.89^{\circ}$ at the half filling of second moiré band.

preprint2022arXiv

Interface inter-atomic electron-transition induced photon emission in contact-electrification

Contact electrification (CE) (or triboelectrification) is the phenomena where charges are produced through physical contact between two materials. Here we report the atomic featured photon emission spectra during CE between two solid materials. Photon emission provides the evidence that electron transfer takes place at the interface from an atom in one material to another atom in the other material during CE. This process is the contact electrification induced interface photon emission spectroscopy (CEIIPES). It naturally paves a way to a spectroscopy corresponding to the CE at an interface, which might impact our understanding of the interaction between solids, liquids, and gases. The physics presented here could be expanded to Auger electron excitation, x-ray emission, and electron emission in CE for general cases, which remain to be explored. This could lead to a general field that may be termed as contact electrification induced interface spectroscopy (CEIIS).

preprint2022arXiv

Mass Testing and Characterization of 20-inch PMTs for JUNO

Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).

preprint2022arXiv

On Shapley Value in Data Assemblage Under Independent Utility

In many applications, an organization may want to acquire data from many data owners. Data marketplaces allow data owners to produce data assemblage needed by data buyers through coalition. To encourage coalitions to produce data, it is critical to allocate revenue to data owners in a fair manner according to their contributions. Although in literature Shapley fairness and alternatives have been well explored to facilitate revenue allocation in data assemblage, computing exact Shapley value for many data owners and large assembled data sets through coalition remains challenging due to the combinatoric nature of Shapley value. In this paper, we explore the decomposability of utility in data assemblage by formulating the independent utility assumption. We argue that independent utility enjoys many applications. Moreover, we identify interesting properties of independent utility and develop fast computation techniques for exact Shapley value under independent utility. Our experimental results on a series of benchmark data sets show that our new approach not only guarantees the exactness of Shapley value, but also achieves faster computation by orders of magnitudes.

preprint2022arXiv

Toward Minimal Misalignment at Minimal Cost in One-Stage and Anchor-Free Object Detection

Common object detection models consist of classification and regression branches, due to different task drivers, these two branches have different sensibility to the features from the same scale level and the same spatial location. The point-based prediction method, which is based on the assumption that the high classification confidence point has the high regression quality, leads to the misalignment problem. Our analysis shows, the problem is further composed of scale misalignment and spatial misalignment specifically. We aim to resolve the phenomenon at minimal cost: a minor adjustment of the head network and a new label assignment method replacing the rigid one. Our experiments show that, compared to the baseline FCOS, a one-stage and anchor-free object detection model, our model consistently get around 3 AP improvement with different backbones, demonstrating both simplicity and efficiency of our method.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2021arXiv

Photon-counting CT with silicon detectors: feasibility for pediatric imaging

X-ray detectors made of crystalline silicon have several advantages including low dark currents, fast charge collection and high energy resolution. For high-energy x-rays, however, silicon suffers from its low atomic number, which might result in low detection efficiency, as well as low energy and spatial resolution due to Compton scattering. We have used a monte-carlo model to investigate the feasibility of a detector for pediatric CT with 30 to 40 mm of silicon using x-ray spectra ranging from 80 to 140 kVp. A detection efficiency of 0.74 was found at 80 kVp, provided the noise threshold could be set low. Scattered photons were efficiently blocked by a thin metal shielding between the detector units, and Compton scattering in the detector could be well separated from photo absorption at 80 kVp. Hence, the detector is feasible at low acceleration voltages, which is also suitable for pediatric imaging. We conclude that silicon detectors may be an alternative to other designs for this special case.

preprint2020arXiv

Cost-Effective Data Feeds to Blockchains via Workload-Adaptive Data Replication

Feeding external data to a blockchain, a.k.a. data feed, is an essential task to enable blockchain interoperability and support emerging cross-domain applications, notably stablecoins. Given the data-intensive feeds in real life (e.g., high-frequency price updates) and the high cost in using blockchain, namely Gas, it is imperative to reduce the Gas cost of data feeds. Motivated by the constant-changing workloads in finance and other applications, this work focuses on designing a dynamic, workload-aware approach for cost effectiveness in Gas. This design space is understudied in the existing blockchain research which has so far focused on static data placement. This work presents GRuB, a cost-effective data feed that dynamically replicates data between the blockchain and an off-chain cloud storage. GRuB&#39;s data replication is workload-adaptive by monitoring the current workload and making online decisions w.r.t. data replication. A series of online algorithms are proposed that achieve the bounded worst-case cost in blockchain&#39;s Gas. GRuB runs the decision-making components on the untrusted cloud off-chain for lower Gas costs, and employs a security protocol to authenticate the data transferred between the blockchain and cloud. The overall GRuB system can autonomously achieve low Gas costs with changing workloads. We built a GRuB prototype functional with Ethereum and Google LevelDB, and supported real applications in stablecoins. Under real workloads collected from the Ethereum contract-call history and mixed workloads of YCSB, we systematically evaluate GRuB&#39;s cost which shows a saving of Gas by 10% ~ 74%, with comparison to the baselines of static data-placement.