Researcher profile

Yaping Sun

Yaping Sun contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2026arXiv

CR^2: Cost-Aware Risk-Controlled Routing for Wireless Device-Edge LLM Inference

As large language models (LLMs) move from centralized clouds to mobile edge environments, efficient serving must balance latency, energy consumption, and accuracy under constrained device-edge resources. Query-level routing between lightweight on-device models and stronger edge models provides a flexible mechanism to navigate this trade-off. However, existing routers are designed for centralized cloud settings and optimize token-level costs, failing to capture the dynamic latency and energy overheads in wireless edge deployments. In this paper, we formulate mobile edge LLM routing as a deployment-constrained, cost-aware decision problem, and propose CR^2, a two-stage device-edge routing framework. CR^2 decouples a lightweight on-device margin gate from an edge-side utility selector for deferred queries. The margin gate operates on frozen query embeddings and a user-specified cost weight to predict whether local execution is utility-optimal relative to the best edge alternative under the target operating point. We further introduce a conformal risk control (CRC) calibration procedure that maps each operating point to an acceptance threshold, enabling explicit control of the marginal false-acceptance risk under the full-information utility reference. Experiments on the routing task show that CR^2 closely matches a full-information reference router using only device-side signals before deferral. Compared with strong query-level baselines, CR^2 consistently improves the deployable accuracy-cost Pareto frontier and reduces normalized deployment cost by up to 16.9% at matched accuracy.

preprint2021arXiv

Enhancing WiFi Multiple Access Performance with Federated Deep Reinforcement Learning

Carrier sensing multiple access/collision avoidance (CSMA/CA) is the backbone MAC protocol for IEEE 802.11 networks. However, tuning the binary exponential back-off (BEB) mechanism of CSMA/CA in user-dense scenarios so as to maximize aggregate throughput still remains a practically essential and challenging problem. In this paper, we propose a new and enhanced multiple access mechanism based on the application of deep reinforcement learning (DRL) and Federated learning (FL). A new Monte Carlo (MC) reward updating method for DRL training is proposed and the access history of each station is used to derive a DRL-based MAC protocol that improves the network throughput vis-a-vis the traditional distributed coordination function (DCF). Further, federated learning (FL) is applied to achieve fairness among users. The simulation results showcase that the proposed federated reinforcement multiple access (FRMA) performs better than basic DCF by 20% and DCF with request-to-send/clear-to-send (RTS/CTS) by 5% while guaranteeing the fairness in user-dense scenarios.

preprint2020arXiv

Communications-Caching-Computing Tradeoff Analysis for Bidirectional Data Computation in Mobile Edge Networks

With the advent of the modern mobile traffic, e.g., online gaming, augmented reality delivery and etc., a novel bidirectional computation task model where the input data of each task consists of two parts, one generated at the mobile device in real-time and the other originated from the Internet proactively, is emerging as an important use case of 5G. In this paper, for ease of analytical analysis, we consider the homogeneous bidirectional computation task model in a mobile edge network which consists of one mobile edge computing (MEC) server and one mobile device, both enabled with computing and caching capabilities. Each task can be served via three mechanisms, i.e., local computing with local caching, local computing without local caching and computing at the MEC server. To minimize the average bandwidth, we formulate the joint caching and computing optimization problem under the latency, cache size and average power constraints. We derive the closed-form expressions for the optimal policy and the minimum bandwidth. The tradeoff among communications, computing and caching is illustrated both analytically and numerically, which provides insightful guideline for the network designers.