Source author record

Zhuo Sun

Zhuo Sun appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher | Unclaimed source record

Machine Learning, Computer Vision, Information Theory, math.IT, Artificial Intelligence, Cryptography and Security, eess.SP, Networking and Internet Architecture

Catalog footprint

What is connected

6 works

8 topics

4 close collaborators


preprint, 2026, arXiv

Saliency-Aware Regularized Quantization Calibration for Large Language Models

Post-training quantization (PTQ) is an effective approach for deploying large language models (LLMs) under memory and latency constraints. Most existing PTQ methods determine quantization parameters by minimizing a layer-wise reconstruction error on a predetermined calibration dataset, typically optimized via either scale search or Gram-based methods. However, from the perspective of generalization risk, existing PTQ calibration objectives based solely on empirical reconstruction error over limited or unrepresentative calibration data may move the quantized weights away from the original floating-point weights, potentially degrading downstream performance. To address this issue, we propose Regularized Quantization Calibration (RQC), a unified framework that augments standard PTQ objectives with a regularizer that explicitly controls weight deviation from the original weights. We further generalize this framework to incorporate a saliency-aware regularizer, resulting in Saliency-Aware Regularized Quantization Calibration (SARQC). The proposed regularization encourages quantized weights to remain close to the original weights during calibration, leading to improved generalization at inference time. SARQC integrates seamlessly into existing PTQ pipelines and enhances both scale-search-based and Gram-based methods under a unified formulation. Extensive experiments on dense and Mixture-of-Experts LLMs demonstrate consistent improvements in perplexity and zero-shot accuracy, without introducing additional inference overhead.
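The idea in this abstract, a standard reconstruction objective plus a penalty on weight deviation, can be sketched for the scale-search case. This is a minimal illustration and not the authors' implementation: the abstract does not give the exact objective, so the quadratic penalty, the per-weight `saliency` factors, the trade-off `lam`, the candidate grid, and all function names below are assumptions.

```python
def quantize(w, scale, bits=4):
    """Symmetric round-to-nearest quantization, returned in dequantized form."""
    qmax = 2 ** (bits - 1) - 1
    qmin = -qmax - 1
    return [scale * max(qmin, min(qmax, round(x / scale))) for x in w]


def objective(w, w_q, calib, saliency, lam):
    """Mean squared output error on calibration inputs plus an
    assumed saliency-weighted quadratic penalty on weight deviation."""
    recon = 0.0
    for x in calib:
        out_fp = sum(a * b for a, b in zip(x, w))
        out_q = sum(a * b for a, b in zip(x, w_q))
        recon += (out_fp - out_q) ** 2
    recon /= len(calib)
    penalty = sum(s * (a - b) ** 2 for s, a, b in zip(saliency, w, w_q))
    return recon + lam * penalty


def search_scale(w, calib, saliency, lam, bits=4, steps=50):
    """Grid-search a quantization scale (hypothetical grid: 50% to 100%
    of the max-abs scale). Assumes w has at least one nonzero entry."""
    qmax = 2 ** (bits - 1) - 1
    base = max(abs(x) for x in w) / qmax
    best_scale, best_loss = None, float("inf")
    for i in range(steps // 2, steps + 1):
        scale = base * i / steps
        loss = objective(w, quantize(w, scale, bits), calib, saliency, lam)
        if loss < best_loss:
            best_scale, best_loss = scale, loss
    return best_scale, best_loss
```

With `lam = 0` the search reduces to plain reconstruction-error scale search. For any `lam > 0`, the selected scale cannot have a larger weighted deviation than the `lam = 0` choice over the same candidate grid, which is the behavior the regularizer is meant to encourage.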

preprint, 2021, arXiv

Dual MINE-based Neural Secure Communications under Gaussian Wiretap Channel

Recent work has studied end-to-end learning of autoencoder-based physical-layer secure communication systems under the Gaussian wiretap channel. In those works, however, the reliability and security of the encoder model were learned from the decoding outputs of both the legitimate receiver and the eavesdropper. In practice, assuming that the eavesdropper's decoder or its output is known is unrealistic. To address this issue, we propose a neural secure communications model based on dual mutual information neural estimation (MINE). The security constraints of this method are constructed only from the input and output signal samples of the legitimate and eavesdropper channels, with the benefit that training the encoder is completely independent of the decoder. Moreover, since the design of the secure coding does not rely on the eavesdropper's decoding results, the security performance is not affected by the eavesdropper's decoding method. Numerical results show that the performance of our model is guaranteed whether the eavesdropper learns its own decoder or uses the legitimate decoder.
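MINE estimates mutual information from samples alone by maximizing the Donsker-Varadhan lower bound over a learned statistic. The sketch below evaluates that bound for a fixed, hand-picked statistic; in MINE the statistic would be a trained neural network. It shows only the estimator, not the paper's dual training procedure, and the function and variable names are assumptions.

```python
import math


def dv_lower_bound(xs, zs, statistic):
    """Donsker-Varadhan bound: E_joint[T] - log E_marginals[exp(T)].

    xs, zs are paired samples; the product of marginals is approximated
    by all n*n pairings of the observed xs and zs.
    """
    n = len(xs)
    joint_term = sum(statistic(x, z) for x, z in zip(xs, zs)) / n
    marginal_term = sum(
        math.exp(statistic(x, z)) for x in xs for z in zs
    ) / (n * n)
    return joint_term - math.log(marginal_term)


# Toy example: z is a perfect copy of a uniform bit, so the empirical
# mutual information is log(2) nats.
xs = [0, 1, 0, 1]
zs = [0, 1, 0, 1]
agree = lambda x, z: 2.0 if x == z else -2.0  # hypothetical statistic
bound = dv_lower_bound(xs, zs, agree)
```

Any choice of statistic gives a valid lower bound on the empirical mutual information, so a better-fitting statistic can only tighten the estimate. Because the bound needs nothing but channel input and output samples, no decoder appears anywhere in the computation.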

preprint, 2020, arXiv

A Concise Review of Recent Few-shot Meta-learning Methods

Few-shot meta-learning has recently seen a revival, with the expectation that it can mimic humans' fast adaptation to new concepts based on prior knowledge. In this short communication, we give a concise review of recent representative methods in few-shot meta-learning, which are categorized into four branches according to their technical characteristics. We conclude this review with some vital current challenges and future prospects in few-shot meta-learning.

preprint, 2020, arXiv

BSNet: Bi-Similarity Network for Few-shot Fine-grained Image Classification

Few-shot learning for fine-grained image classification has gained recent attention in computer vision. Among the approaches to few-shot learning, metric-based methods are state-of-the-art on many tasks owing to their simplicity and effectiveness. Most metric-based methods assume a single similarity measure and thus obtain a single feature space. However, if samples can simultaneously be well classified via two distinct similarity measures, the samples within a class can be distributed more compactly in a smaller feature space, producing more discriminative feature maps. Motivated by this, we propose the Bi-Similarity Network (BSNet), which consists of a single embedding module and a bi-similarity module of two similarity measures. After the support images and the query images pass through the convolution-based embedding module, the bi-similarity module learns feature maps according to two similarity measures of diverse characteristics. In this way, the model is able to learn more discriminative and less similarity-biased features from few shots of fine-grained images, such that the model's generalization ability can be significantly improved. Through extensive experiments that slightly modify established metric/similarity-based networks, we show that the proposed approach produces a substantial improvement on several fine-grained image benchmark datasets. Code is available at: https://github.com/spraise/BSNet

preprint, 2020, arXiv

Coverage Analysis for 3D Terahertz Communication Systems with Blockage and Directional Antennas

The scarcity of spectrum resources in current wireless communication systems has sparked enormous research interest in the terahertz (THz) frequency band. This band is characterized by fundamentally different propagation properties, resulting in interference structures that differ from those observed so far at lower frequencies. In this paper, we derive a new expression for the coverage probability of downlink transmission in THz communication systems within a three-dimensional (3D) environment. First, we establish a 3D propagation model which considers the molecular absorption loss, 3D directional antennas at both access points (APs) and user equipments (UEs), interference from nearby APs, and dynamic blockages caused by moving humans. Then, we develop a novel easy-to-use analytical framework based on the dominant interferer analysis to evaluate the coverage probability, the novelty of which lies in the incorporation of the instantaneous interference and the vertical height of THz devices. Our numerical results demonstrate the accuracy of our analysis and reveal that the coverage probability significantly decreases when the transmission distance increases. We also show that increasing the blocker density and increasing the AP density have different impacts on the coverage performance when the UE-AP link of interest is in line-of-sight. We further show that the coverage performance improvement brought by increasing the antenna directivity at APs is higher than that brought by increasing the antenna directivity at UEs.

preprint, 2010, arXiv

Buffer Management Algorithm Design and Implementation Based on Network Processors

To address the parameter sensitivity of the traditional RED (random early detection) algorithm, an adaptive buffer management algorithm called PAFD (packet adaptive fair dropping) is proposed. This algorithm supports the DiffServ (differentiated services) model of QoS (quality of service) and considers both fairness and throughput. A smooth buffer occupancy rate function is adopted to adjust the parameters. By implementing buffer management and packet scheduling on the Intel IXP2400, the viability of QoS mechanisms on NPs (network processors) is verified. The simulation shows that PAFD smooths the flow curve and achieves a better balance between fairness and network throughput. It also demonstrates that this algorithm meets the requirements of fast data packet processing, and that the hardware resource utilization of NPs is higher.
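For context on the parameter sensitivity mentioned in this abstract, the baseline RED drop decision can be sketched as follows. This is classic RED only, not PAFD, whose details the abstract does not give. The function names and the default averaging weight are illustrative assumptions, and the count-based probability adjustment of the original RED algorithm is omitted for brevity.

```python
def update_average(avg, queue_len, weight=0.002):
    """Exponentially weighted moving average of the queue length."""
    return (1.0 - weight) * avg + weight * queue_len


def red_drop_probability(avg, min_th, max_th, max_p):
    """Classic RED: no drops below min_th, certain drop at or above
    max_th, and a linear ramp up to max_p in between."""
    if avg < min_th:
        return 0.0
    if avg >= max_th:
        return 1.0
    return max_p * (avg - min_th) / (max_th - min_th)
```

The behavior depends on four hand-tuned values (`weight`, `min_th`, `max_th`, `max_p`), which is the sensitivity that adaptive schemes aim to reduce.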
