Researcher profile

Yongsheng Zhang

Yongsheng Zhang contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
6 works
0 followers
8 topics
4 close collaborators

Published work

6 published items

preprint, 2026, arXiv

RelayGR: Scaling Long-Sequence Generative Recommendation via Cross-Stage Relay-Race Inference

Real-time recommender systems execute multi-stage cascades (retrieval, pre-processing, fine-grained ranking) under strict tail-latency SLOs, leaving only tens of milliseconds for ranking. Generative recommendation (GR) models can improve quality by consuming long user-behavior sequences, but in production their online sequence length is tightly capped by the ranking-stage P99 budget. We observe that the majority of GR tokens encode user behaviors that are independent of the item candidates, suggesting an opportunity to pre-infer a user-behavior prefix once and reuse it during ranking rather than recomputing it on the critical path. Realizing this idea at industrial scale is non-trivial: the prefix cache must survive across multiple pipeline stages before the final ranking instance is determined, the user population implies cache footprints far beyond a single device, and indiscriminate pre-inference would overload shared resources under high QPS. We present RelayGR, a production system that enables in-HBM relay-race inference for GR. RelayGR selectively pre-infers long-term user prefixes, keeps their KV caches resident in HBM over the request lifecycle, and ensures the subsequent ranking can consume them without remote fetches. RelayGR combines three techniques: 1) a sequence-aware trigger that admits only at-risk requests under a bounded cache footprint and pre-inference load, 2) an affinity-aware router that co-locates cache production and consumption by routing both the auxiliary pre-infer signal and the ranking request to the same instance, and 3) a memory-aware expander that uses server-local DRAM to capture short-term cross-request reuse while avoiding redundant reloads. We implement RelayGR on Huawei Ascend NPUs and evaluate it with real queries. Under a fixed P99 SLO, RelayGR supports up to 1.5× longer sequences and improves SLO-compliant throughput by up to 3.6×.

preprint, 2022, arXiv

Chemical trends in the high thermoelectric performance of the pyrite-type dichalcogenides: ZnS2, CdS2 and CdSe2

The thermoelectric properties of the three pyrite-type IIB-VIA2 dichalcogenides (ZnS2, CdS2 and CdSe2) are systematically investigated and compared with those of the prototype ZnSe2 in order to optimize their thermoelectric properties. Using the phonon Boltzmann transport equation, we find that they all have ultralow lattice thermal conductivities. By analyzing their vibrational properties, these are attributed to soft phonon modes derived from the loosely bound rattling-like metal atoms and to strong anharmonicities caused by the vibrations of all atoms perpendicular to the strongly bound nonmetallic dimers. Additionally, by correlating those properties along the series, we elucidate a number of chemical trends. We find that heavier atom masses, larger atomic displacement parameters and longer bond lengths between metal and nonmetal atoms can be beneficial to the looser rattling of the metal atoms and therefore lead to softer phonon modes, and that stronger nonmetallic dimer bonds can boost the anharmonicities, both leading to lower thermal conductivities. Furthermore, we find that all three compounds have complex energy isosurfaces at valence and conduction band edges that simultaneously allow for large density-of-states effective masses and small conductivity effective masses for both p-type and n-type carriers. Consequently, the calculated thermoelectric figures of merit (ZT) can reach large values for both p-type and n-type doping. Our study illustrates the effects of rattling-like metal atoms and localized nonmetallic dimers on the thermal transport properties and the importance of different carrier effective masses to electrical transport properties in these pyrite-type dichalcogenides, which can be used to predict and optimize the thermoelectric properties of other thermoelectric compounds in the future.

preprint, 2022, arXiv

Faraday patterns in spin-orbit coupled Bose-Einstein condensates

We study the Faraday patterns generated by spin-orbit-coupling induced parametric resonance in a spinor Bose-Einstein condensate with repulsive interaction. The collective elementary excitations of the Bose-Einstein condensate, including density waves and spin waves, are coupled as the result of the Raman-induced spin-orbit coupling and a quench of the relative phase of two Raman lasers without the modulation of any of the system's parameters. We observed several higher parametric resonance tongues at integer multiples of the driving frequency and investigated the interplay between Faraday instabilities and modulation instabilities when we quench the spin-orbit-coupled Bose-Einstein condensate from zero-momentum phase to plane-wave phase. If the detuning is equal to zero, the wave number of combination resonance barely changes as the strength of spin-orbit coupling increases. If the detuning is not equal to zero after a quench, a single combination resonance tongue will split into two parts.

preprint, 2020, arXiv

Characterization of rattling in relation to thermal conductivity: ordered half-Heusler semiconductors

The factors that affect the thermal conductivity of semiconductors are a topic of great scientific interest, especially in relation to thermoelectrics. Key developments have been the concept of the phonon-glass-electron-crystal (PGEC) and the related idea of rattling to achieve this. We use first principles phonon and thermal conductivity calculations in order to explore the concept of rattling for stoichiometric ordered half-Heusler compounds. These compounds can be regarded as filled zinc blende materials, and the filling atom could be viewed as a rattler if it is weakly bound. We use two simple metrics, one related to the frequency and the other to bond frustration and anharmonicity. We find that both measures correlate with thermal conductivity. This suggests that both may be useful in screening materials for low thermal conductivity.

preprint, 2020, arXiv

Experimental demonstration of one-sided device-independent self-testing of any pure two-qubit entangled state

We demonstrate one-sided device-independent self-testing of any pure entangled two-qubit state based on a fine-grained steering inequality. The maximum violation of a fine-grained steering inequality can be used to witness certain steerable correlations, which certify all pure two-qubit entangled states. Our experimental results identify which particular pure two-qubit entangled state has been self-tested and which measurement operators are used on the untrusted side. Furthermore, we analytically derive the robustness bound of our protocol, enabling our subsequent experimental verification of robustness through state tomography. Finally, we ensure that the requisite no-signalling constraints are maintained in the experiment.

preprint, 2020, arXiv

On instability of Type (II) Lawson-Osserman Cones

We establish the instability of Type (II) Lawson-Osserman cones in Euclidean spaces, and thus provide a family of uncountably many unstable singular solutions to the Dirichlet problem for minimal graphs of high codimension, in contrast to the smooth unstable ones obtained by Lawson-Osserman through a min-max technique. To our knowledge, these are the first examples of non-smooth unstable minimal graphs, and they are unlikely to be detectable through the mean curvature flow or min-max theory.