Source author record

Linghe Kong

Linghe Kong appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher

Unclaimed source record

Information Theory; math.IT; Distributed, Parallel, and Cluster Computing; Networking and Internet Architecture

Catalog footprint

What is connected

3 works

4 topics

4 close collaborators

preprint, 2026, arXiv

FlexSpec: Frozen Drafts Meet Evolving Targets in Edge-Cloud Collaborative LLM Speculative Decoding

Deploying large language models (LLMs) in mobile and edge computing environments is constrained by limited on-device resources, scarce wireless bandwidth, and frequent model evolution. Although edge-cloud collaborative inference with speculative decoding (SD) can reduce end-to-end latency by executing a lightweight draft model at the edge and verifying its output with a cloud-side target model, existing frameworks fundamentally rely on tight coupling between the two models. Consequently, repeated model synchronization introduces excessive communication overhead, which increases end-to-end latency and ultimately limits the scalability of SD in edge environments. To address these limitations, we propose FlexSpec, a communication-efficient collaborative inference framework tailored for evolving edge-cloud systems. The core design of FlexSpec is a shared-backbone architecture that allows a single, static edge-side draft model to remain compatible with a large family of evolving cloud-side target models. By decoupling edge deployment from cloud-side model updates, FlexSpec eliminates the need for edge-side retraining or repeated model downloads, substantially reducing communication and maintenance costs. Furthermore, to accommodate time-varying wireless conditions and heterogeneous device constraints, we develop a channel-aware adaptive speculation mechanism that dynamically adjusts the speculative draft length based on real-time channel state information and device energy budgets. Extensive experiments demonstrate that FlexSpec outperforms conventional SD approaches in inference efficiency.
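As a rough illustration of how a channel-aware draft length could be chosen, the sketch below picks the length that maximizes expected tokens per second under an energy cap. It is not FlexSpec's algorithm: the latency model, the independent per-token acceptance rate `alpha`, and every name and default value (`draft_s`, `verify_s`, `bits_per_token`, `energy_per_token_j`, `max_gamma`) are assumptions made for illustration. The expected-token formula is the usual one for speculative decoding when each drafted token is accepted independently with probability `alpha`.

```python
def expected_tokens(alpha: float, gamma: int) -> float:
    """Expected tokens produced per verification round.

    Assumes each drafted token is accepted independently with probability alpha.
    """
    if alpha >= 1.0:
        return gamma + 1.0
    return (1.0 - alpha ** (gamma + 1)) / (1.0 - alpha)


def round_latency(gamma: int, rate_bps: float, draft_s: float = 0.01,
                  verify_s: float = 0.05, bits_per_token: int = 4096) -> float:
    """Hypothetical latency model: edge drafting + uplink transfer + cloud verification."""
    return gamma * draft_s + gamma * bits_per_token / rate_bps + verify_s


def choose_draft_length(alpha: float, rate_bps: float, energy_budget_j: float,
                        energy_per_token_j: float, max_gamma: int = 16) -> int:
    """Pick the draft length with the best expected tokens per second.

    The energy budget caps how many tokens the device can afford to draft.
    Returns 0 when the budget does not cover a single drafted token.
    """
    affordable = int(energy_budget_j // energy_per_token_j)
    limit = min(max_gamma, affordable)
    if limit < 1:
        return 0
    return max(
        range(1, limit + 1),
        key=lambda g: expected_tokens(alpha, g) / round_latency(g, rate_bps),
    )


# Same device and acceptance rate, two hypothetical uplink rates.
good_channel = choose_draft_length(0.8, rate_bps=1e6, energy_budget_j=10.0, energy_per_token_j=0.5)
bad_channel = choose_draft_length(0.8, rate_bps=1e5, energy_budget_j=10.0, energy_per_token_j=0.5)
```

Under this toy model a slower uplink raises the per-token cost of each round, so shorter drafts win; a tight energy budget simply truncates the search range.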

preprint, 2015, arXiv

An LS-Decomposition Approach for Robust Data Recovery in Wireless Sensor Networks

Wireless sensor networks are widely adopted in military, civilian and commercial applications, which fuels an exponential explosion of sensory data. However, a major challenge in deploying effective sensing systems is the presence of massive missing entries, measurement noise, and anomaly readings. Existing works assume that sensory data matrices have low-rank structures. This does not hold in reality due to anomaly readings, causing serious performance degradation. In this paper, we introduce an LS-Decomposition approach for robust sensory data recovery, which decomposes a corrupted data matrix as the superposition of a low-rank matrix and a sparse anomaly matrix. First, we prove that LS-Decomposition solves a convex program with bounded approximation error. Second, using data sets from the IntelLab, GreenOrbs, and NBDC-CTD projects, we find that sensory data matrices contain anomaly readings. Third, we propose an accelerated proximal gradient algorithm and prove that it approximates the optimal solution with convergence rate $O(1/k^2)$, where $k$ is the number of iterations. Evaluations on real data sets show that our scheme achieves recovery error $\leq 5\%$ for sampling rate $\geq 50\%$ and almost exact recovery for sampling rate $\geq 60\%$, while state-of-the-art methods have error $10\% \sim 15\%$ at sampling rate $90\%$.
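The low-rank plus sparse idea in this abstract can be sketched with a generic accelerated proximal gradient (FISTA-style) loop in NumPy. This is a minimal sketch, not the paper's formulation: it assumes a penalized objective, 0.5 * ||mask * (L + S - M)||_F^2 + mu * (||L||_* + lam * ||S||_1), and the function names, the weights `mu` and `lam`, and the iteration count are all illustrative assumptions.

```python
import numpy as np


def soft_threshold(x, tau):
    """Proximal operator of tau * ||x||_1 (entrywise shrinkage)."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)


def svd_threshold(x, tau):
    """Proximal operator of tau * nuclear norm (shrink the singular values)."""
    u, s, vt = np.linalg.svd(x, full_matrices=False)
    return (u * soft_threshold(s, tau)) @ vt


def objective(low, sparse, m, mask, mu, lam):
    """Assumed penalized objective: data fit on observed entries + regularizers."""
    fit = 0.5 * np.sum((mask * (low + sparse - m)) ** 2)
    return fit + mu * (np.linalg.norm(low, "nuc") + lam * np.abs(sparse).sum())


def ls_decompose(m, mask, mu=0.1, lam=None, iters=300):
    """Split observed entries of m into a low-rank part and a sparse anomaly part."""
    m = np.asarray(m, dtype=float)
    if lam is None:
        lam = 1.0 / np.sqrt(max(m.shape))  # common heuristic, an assumption here
    low = np.zeros_like(m)
    sparse = np.zeros_like(m)
    y_low, y_sparse = low.copy(), sparse.copy()
    t = 1.0
    step = 0.5  # 1 / Lipschitz constant; the joint gradient is 2-Lipschitz
    for _ in range(iters):
        grad = mask * (y_low + y_sparse - m)  # same gradient for both blocks
        new_low = svd_threshold(y_low - step * grad, step * mu)
        new_sparse = soft_threshold(y_sparse - step * grad, step * mu * lam)
        new_t = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
        momentum = (t - 1.0) / new_t
        # Nesterov extrapolation gives the accelerated O(1/k^2) behavior.
        y_low = new_low + momentum * (new_low - low)
        y_sparse = new_sparse + momentum * (new_sparse - sparse)
        low, sparse, t = new_low, new_sparse, new_t
    return low, sparse
```

Here `mask` marks the sampled entries (1 or True where a reading exists), so missing readings contribute nothing to the data-fit term and are filled in only through the low-rank component.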

preprint, 2013, arXiv

Two Design Issues in Cognitive Sub-Small Cell for Sojourners

In this paper, we propose a solution named Cognitive Sub-Small Cell for Sojourners (CSCS) for a broadly representative small cell scenario in which users fall into two groups: sojourners and inhabitants. CSCS helps save energy, increase the number of concurrently supportable users, and shield inhabitants. We consider two design issues in CSCS: i) determining the number of transmit antennas on sub-small cell APs; ii) controlling downlink inter-sub-small cell interference. For issue i), we devise an algorithm based on the probability distribution of the number of concurrent sojourners. For issue ii), we propose an interference control scheme named BDBF: Block Diagonalization (BD) precoding based on uncertain channel state information, in conjunction with an auxiliary optimal beamformer (BF). In the simulation, we examine how the relevant factors affect the number of transmit antennas on sub-small cell APs. Moreover, we verify a significant conclusion: using BDBF yields more capacity than using the optimal BF alone within a tolerably large radius of the uncertainty region.
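For readers unfamiliar with block diagonalization, the sketch below shows the textbook version of the idea: each user's precoder is drawn from the null space of the other users' stacked channel matrices, so its signal causes no interference at them. It assumes perfect channel knowledge and therefore does not reproduce BDBF, which works with uncertain channel state information and an auxiliary beamformer; the function name `bd_precoders` and the tolerance `tol` are illustrative.

```python
import numpy as np


def bd_precoders(channels, tol=1e-10):
    """Textbook block-diagonalization precoders (perfect CSI assumed).

    channels: list of per-user matrices, each of shape (n_rx_k, n_tx).
    Returns one matrix per user whose columns span the null space of the
    other users' stacked channels.
    """
    if len(channels) < 2:
        raise ValueError("block diagonalization needs at least two users")
    precoders = []
    for k in range(len(channels)):
        others = np.vstack([h for j, h in enumerate(channels) if j != k])
        # Rows of vh beyond the numerical rank span the null space of `others`.
        _, s, vh = np.linalg.svd(others, full_matrices=True)
        rank = int(np.sum(s > tol * s[0]))
        precoders.append(vh[rank:].conj().T)
    return precoders


# Three users with 2 receive antennas each, 6 transmit antennas.
rng = np.random.default_rng(1)
channels = [rng.standard_normal((2, 6)) + 1j * rng.standard_normal((2, 6)) for _ in range(3)]
precoders = bd_precoders(channels)
```

A non-empty null space requires more transmit antennas than the combined receive antennas of the other users, which is one reason the number of transmit antennas is a design question in its own right.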