Source author record

Xinping Yi

Xinping Yi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Information Theory math.IT Machine Learning eess.SP Artificial Intelligence Computer Vision

Catalog footprint

What is connected

22works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Learning to Unfold Fractional Programming for Multi-Cell MU-MIMO Beamforming with Graph Neural Networks

In the multi-cell multiuser multi-input multi-output (MU-MIMO) systems, fractional programming (FP) has demonstrated considerable effectiveness in optimizing beamforming vectors, yet it suffers from high computational complexity. Recent improvements demonstrate reduced complexity by avoiding large-dimension matrix inversions (i.e., FastFP) and faster convergence by learning to unfold the FastFP algorithm (i.e., DeepFP).

preprint2026arXiv

Towards A Unified PAC-Bayesian Framework for Norm-based Generalization Bounds

Understanding the generalization behavior of deep neural networks remains a fundamental challenge in modern statistical learning theory. Among existing approaches, PAC-Bayesian norm-based bounds have demonstrated particular promise due to their data-dependent nature and their ability to capture algorithmic and geometric properties of learned models. However, most existing results rely on isotropic Gaussian posteriors, heavy use of spectral-norm concentration for weight perturbations, and largely architecture-agnostic analyses, which together limit both the tightness and practical relevance of the resulting bounds. To address these limitations, in this work, we propose a unified framework for PAC-Bayesian norm-based generalization by reformulating the derivation of generalization bounds as a stochastic optimization problem over anisotropic Gaussian posteriors. The key to our approach is a sensitivity matrix that quantifies the network outputs with respect to structured weight perturbations, enabling the explicit incorporation of heterogeneous parameter sensitivities and architectural structures. By imposing different structural assumptions on this sensitivity matrix, we derive a family of generalization bounds that recover several existing PAC-Bayesian results as special cases, while yielding bounds that are comparable to or tighter than state-of-the-art approaches. Such a unified framework provides a principled and flexible way for geometry-/structure-aware and interpretable generalization analysis in deep learning.

preprint2022arXiv

A Little Energy Goes a Long Way: Build an Energy-Efficient, Accurate Spiking Neural Network from Convolutional Neural Network

Spiking neural networks (SNNs) offer an inherent ability to process spatial-temporal data, or in other words, realworld sensory data, but suffer from the difficulty of training high accuracy models. A major thread of research on SNNs is on converting a pre-trained convolutional neural network (CNN) to an SNN of the same structure. State-of-the-art conversion methods are approaching the accuracy limit, i.e., the near-zero accuracy loss of SNN against the original CNN. However, we note that this is made possible only when significantly more energy is consumed to process an input. In this paper, we argue that this trend of "energy for accuracy" is not necessary -- a little energy can go a long way to achieve the near-zero accuracy loss. Specifically, we propose a novel CNN-to-SNN conversion method that is able to use a reasonably short spike train (e.g., 256 timesteps for CIFAR10 images) to achieve the near-zero accuracy loss. The new conversion method, named as explicit current control (ECC), contains three techniques (current normalisation, thresholding for residual elimination, and consistency maintenance for batch-normalisation), in order to explicitly control the currents flowing through the SNN when processing inputs. We implement ECC into a tool nicknamed SpKeras, which can conveniently import Keras CNN models and convert them into SNNs. We conduct an extensive set of experiments with the tool -- working with VGG16 and various datasets such as CIFAR10 and CIFAR100 -- and compare with state-of-the-art conversion methods. Results show that ECC is a promising method that can optimise over energy consumption and accuracy loss simultaneously.

preprint2022arXiv

Enhancing Adversarial Training with Second-Order Statistics of Weights

Adversarial training has been shown to be one of the most effective approaches to improve the robustness of deep neural networks. It is formalized as a min-max optimization over model weights and adversarial perturbations, where the weights can be optimized through gradient descent methods like SGD. In this paper, we show that treating model weights as random variables allows for enhancing adversarial training through \textbf{S}econd-Order \textbf{S}tatistics \textbf{O}ptimization (S$^2$O) with respect to the weights. By relaxing a common (but unrealistic) assumption of previous PAC-Bayesian frameworks that all weights are statistically independent, we derive an improved PAC-Bayesian adversarial generalization bound, which suggests that optimizing second-order statistics of weights can effectively tighten the bound. In addition to this theoretical insight, we conduct an extensive set of experiments, which show that S$^2$O not only improves the robustness and generalization of the trained neural networks when used in isolation, but also integrates easily in state-of-the-art adversarial training techniques like TRADES, AWP, MART, and AVMixup, leading to a measurable improvement of these techniques. The code is available at \url{https://github.com/Alexkael/S2O}.

preprint2022arXiv

Neuronal Correlation: a Central Concept in Neural Network

This paper proposes to study neural networks through neuronal correlation, a statistical measure of correlated neuronal activity on the penultimate layer. We show that neuronal correlation can be efficiently estimated via weight matrix, can be effectively enforced through layer structure, and is a strong indicator of generalisation ability of the network. More importantly, we show that neuronal correlation significantly impacts on the accuracy of entropy estimation in high-dimensional hidden spaces. While previous estimation methods may be subject to significant inaccuracy due to implicit assumption on neuronal independence, we present a novel computational method to have an efficient and authentic computation of entropy, by taking into consideration the neuronal correlation. In doing so, we install neuronal correlation as a central concept of neural network.

preprint2022arXiv

Topological Interference Management with Adversarial Topology Perturbation: An Algorithmic Perspective

In this paper, we consider the topological interference management (TIM) problem in a dynamic setting, where an adversary perturbs network topology to prevent the exploitation of sophisticated coding opportunities (e.g., interference alignment). Focusing on a special class of network topology - chordal networks - we investigate algorithmic aspects of the TIM problem under adversarial topology perturbation. In particular, given the adversarial perturbation with respect to edge insertion/deletion, we propose a dynamic graph coloring algorithm that allows for a constant number of re-coloring updates against each inserted/deleted edge to achieve the information-theoretic optimality. This is a sharp reduction of the general graph re-coloring, whose optimal number of updates scales as the size of the network, thanks to the delicate exploitation of the structural properties of chordal graph classes.

preprint2022arXiv

Weight Expansion: A New Perspective on Dropout and Generalization

While dropout is known to be a successful regularization technique, insights into the mechanisms that lead to this success are still lacking. We introduce the concept of \emph{weight expansion}, an increase in the signed volume of a parallelotope spanned by the column or row vectors of the weight covariance matrix, and show that weight expansion is an effective means of increasing the generalization in a PAC-Bayesian setting. We provide a theoretical argument that dropout leads to weight expansion and extensive empirical support for the correlation between dropout and weight expansion. To support our hypothesis that weight expansion can be regarded as an \emph{indicator} of the enhanced generalization capability endowed by dropout, and not just as a mere by-product, we have studied other methods that achieve weight expansion (resp.\ contraction), and found that they generally lead to an increased (resp.\ decreased) generalization ability. This suggests that dropout is an attractive regularizer, because it is a computationally cheap method for obtaining weight expansion. This insight justifies the role of dropout as a regularizer, while paving the way for identifying regularizers that promise improved generalization through weight expansion.

preprint2020arXiv

A Survey of Safety and Trustworthiness of Deep Neural Networks: Verification, Testing, Adversarial Attack and Defence, and Interpretability

In the past few years, significant progress has been made on deep neural networks (DNNs) in achieving human-level performance on several long-standing tasks. With the broader deployment of DNNs on various applications, the concerns over their safety and trustworthiness have been raised in public, especially after the widely reported fatal incidents involving self-driving cars. Research to address these concerns is particularly active, with a significant number of papers released in the past few years. This survey paper conducts a review of the current research effort into making DNNs safe and trustworthy, by focusing on four aspects: verification, testing, adversarial attack and defence, and interpretability. In total, we survey 202 papers, most of which were published after 2017.

preprint2020arXiv

Asymptotic Singular Value Distribution of Linear Convolutional Layers

In convolutional neural networks, the linear transformation of multi-channel two-dimensional convolutional layers with linear convolution is a block matrix with doubly Toeplitz blocks. Although a "wrapping around" operation can transform linear convolution to a circular one, by which the singular values can be approximated with reduced computational complexity by those of a block matrix with doubly circulant blocks, the accuracy of such an approximation is not guaranteed. In this paper, we propose to inspect such a linear transformation matrix through its asymptotic spectral representation - the spectral density matrix - by which we develop a simple singular value approximation method with improved accuracy over the circular approximation, as well as upper bounds for spectral norm with reduced computational complexity. Compared with the circular approximation, we obtain moderate improvement with a subtle adjustment of the singular value distribution. We also demonstrate that the spectral norm upper bounds are effective spectral regularizers for improving generalization performance in ResNets.

preprint2020arXiv

Dual-Polarized FDD Massive MIMO: A Comprehensive Framework

We propose a comprehensive scheme for realizing a massive multiple-input multiple-output (MIMO) system with dual-polarized antennas in frequency division duplexing (FDD) mode. Employing dual-polarized elements in a massive MIMO array has been common practice recently and can, in principle, double the number of spatial degrees of freedom with a less-than-proportional increase in array size. However, processing a dual-polarized channel is demanding due to the high channel dimension and the lack of Uplink-Downlink (UL-DL) channel reciprocity in FDD mode. In particular, the difficulty arises in channel covariance acquisition for both UL and DL transmissions and in common training of DL channels in a multi-user setup. To overcome these challenges, we develop a unified framework consisting of three steps: (1) a covariance estimation method to efficiently estimate the UL covariance from noisy, orthogonal UL pilots; (2) a UL-DL covariance transformation method that obtains the DL covariance from the estimated UL covariance in the previous step; (3) a multi-user common DL channel training with limited DL pilot dimension method, which enables the BS to estimate effective user DL channels and use them for interference-free DL beamforming and data transmission. We provide extensive empirical results to prove the applicability and merits of our scheme.

preprint2020arXiv

Energy Efficiency Optimization for Downlink Massive MIMO With Statistical CSIT

We investigate energy efficiency (EE) optimization for single-cell massive multiple-input multiple-output (MIMO) downlink transmission with only statistical channel state information (CSI) available at the base station. We first show that beam domain transmission is favorable for energy efficiency in the massive MIMO downlink, by deriving a closed-form solution for the eigenvectors of the optimal transmit covariance matrix. With this conclusion, the EE optimization problem is reduced to a real-valued power allocation problem, which is much easier to tackle than the original large-dimensional complex matrix-valued precoding design problem. We further propose an iterative water-filling-structured beam domain power allocation algorithm with low complexity and guaranteed convergence, exploiting the techniques from sequential optimization, fractional optimization, and random matrix theory. Numerical results demonstrate the near-optimal performance of our proposed statistical CSI aided EE optimization approach.

preprint2020arXiv

Robust Precoding in Massive MIMO: A Deep Learning Approach

In this paper, we consider massive multiple-input-multiple-output (MIMO) communication systems with a uniform planar array (UPA) at the base station (BS) and investigate the downlink precoding with imperfect channel state information (CSI). By exploiting both instantaneous and statistical CSI, we aim to design precoding vectors to maximize the ergodic rate (e.g., sum rate, minimum rate and etc.) subject to a total transmit power constraint. To maximize an upper bound of the ergodic rate, we leverage the corresponding Lagrangian formulation and identify the structural characteristics of the optimal precoder as the solution to a generalized eigenvalue problem. As such, the high-dimensional precoder design problem turns into a low-dimensional power control problem. The Lagrange multipliers play a crucial role in determining both precoder directions and power parameters, yet are challenging to be solved directly. To figure out the Lagrange multipliers, we develop a general framework underpinned by a properly designed neural network that learns directly from CSI. To further relieve the computational burden, we obtain a low-complexity framework by decomposing the original problem into computationally efficient subproblems with instantaneous and statistical CSI handled separately. With the off-line pretrained neural network, the online computational complexity of precoding is substantially reduced compared with the existing iterative algorithm while maintaining nearly the same performance.

preprint2015arXiv

Optimality of Treating Interference as Noise: A Combinatorial Perspective

For single-antenna Gaussian interference channels, we re-formulate the problem of determining the Generalized Degrees of Freedom (GDoF) region achievable by treating interference as Gaussian noise (TIN) derived in [3] from a combinatorial perspective. We show that the TIN power control problem can be cast into an assignment problem, such that the globally optimal power allocation variables can be obtained by well-known polynomial time algorithms. Furthermore, the expression of the TIN-Achievable GDoF region (TINA region) can be substantially simplified with the aid of maximum weighted matchings. We also provide conditions under which the TINA region is a convex polytope that relax those in [3]. For these new conditions, together with a channel connectivity (i.e., interference topology) condition, we show TIN optimality for a new class of interference networks that is not included, nor includes, the class found in [3]. Building on the above insights, we consider the problem of joint link scheduling and power control in wireless networks, which has been widely studied as a basic physical layer mechanism for device-to-device (D2D) communications. Inspired by the relaxed TIN channel strength condition as well as the assignment-based power allocation, we propose a low-complexity GDoF-based distributed link scheduling and power control mechanism (ITLinQ+) that improves upon the ITLinQ scheme proposed in [4] and further improves over the heuristic approach known as FlashLinQ. It is demonstrated by simulation that ITLinQ+ provides significant average network throughput gains over both ITLinQ and FlashLinQ, and yet still maintains the same level of implementation complexity. More notably, the energy efficiency of the newly proposed ITLinQ+ is substantially larger than that of ITLinQ and FlashLinQ, which is desirable for D2D networks formed by battery-powered devices.

preprint2014arXiv

Topological Interference Management with Transmitter Cooperation

Interference networks with no channel state information at the transmitter (CSIT) except for the knowledge of the connectivity graph have been recently studied under the topological interference management (TIM) framework. In this paper, we consider a similar problem with topological knowledge but in a distributed broadcast channel setting, i.e. a network where transmitter cooperation is enabled. We show that the topological information can also be exploited in this case to strictly improve the degrees of freedom (DoF) as long as the network is not fully connected, which is a reasonable assumption in practice. Achievability schemes based on selective graph coloring, interference alignment, and hypergraph covering, are proposed. Together with outer bounds built upon generator sequence, the concept of compound channel settings, and the relation to index coding, we characterize the symmetric DoF for so-called regular networks with constant number of interfering links, and identify the sufficient and/or necessary conditions for the arbitrary network topologies to achieve a certain amount of symmetric DoF.

preprint2013arXiv

On the Degrees of Freedom of the K-User Time Correlated Broadcast Channel with Delayed CSIT

The Degrees of Freedom (DoF) of a K-User MISO Broadcast Channel (BC) is studied when the Transmitter (TX) has access to a delayed channel estimate in addition to an imperfect estimate of the current channel. The current estimate could be for example obtained from prediction applied on past estimates, in the case where feedback delay is within the coherence time. Building on previous recent works on this setting with two users, the estimation error of the current channel is characterized by its scaling as P at the exponent α, where α=1 (resp. α=0) corresponds to an estimate being essentially perfect (resp. useless) in terms of DoF. In this work, we contribute to the characterization of the DoF region in such a setting by deriving an outerbound for the DoF region and by providing an achievable DoF region. The achievable DoF is obtained by developing a new alignment scheme, called the Kα-MAT scheme, which builds upon both the principle of the MAT alignment scheme from Maddah-Ali and Tse and Zero-Forcing to achieve a larger DoF when the delayed CSIT received is correlated with the instantaneous channel state.

preprint2013arXiv

The Degrees of Freedom Region of Temporally-Correlated MIMO Networks with Delayed CSIT

We consider the temporally-correlated Multiple-Input Multiple-Output (MIMO) broadcast channels (BC) and interference channels (IC) where the transmitter(s) has/have (i) delayed channel state information (CSI) obtained from a latency-prone feedback channel as well as (ii) imperfect current CSIT, obtained, e.g., from prediction on the basis of these past channel samples based on the temporal correlation. The degrees of freedom (DoF) regions for the two-user broadcast and interference MIMO networks with general antenna configuration under such conditions are fully characterized, as a function of the prediction quality indicator. Specifically, a simple unified framework is proposed, allowing to attain optimal DoF region for the general antenna configurations and current CSIT qualities. Such a framework builds upon block-Markov encoding with interference quantization, optimally combining the use of both outdated and instantaneous CSIT. A striking feature of our work is that, by varying the power allocation, every point in the DoF region can be achieved with one single scheme. As a result, instead of checking the achievability of every corner point of the outer bound region, as typically done in the literature, we propose a new systematic way to prove the achievability.

preprint2012arXiv

Degrees of Freedom of Time Correlated MISO Broadcast Channel with Delayed CSIT

We consider the time correlated multiple-input single-output (MISO) broadcast channel where the transmitter has imperfect knowledge on the current channel state, in addition to delayed channel state information. By representing the quality of the current channel state information as P^-α for the signal-to-noise ratio P and some constant α \geq 0, we characterize the optimal degree of freedom region for this more general two-user MISO broadcast correlated channel. The essential ingredients of the proposed scheme lie in the quantization and multicasting of the overheard interferences, while broadcasting new private messages. Our proposed scheme smoothly bridges between the scheme recently proposed by Maddah-Ali and Tse with no current state information and a simple zero-forcing beamforming with perfect current state information.

preprint2012arXiv

On the Degrees of Freedom of time correlated MISO broadcast channel with delayed CSIT

We consider the time correlated MISO broadcast channel where the transmitter has partial knowledge on the current channel state, in addition to delayed channel state information (CSI). Rather than exploiting only the current CSI, as the zero-forcing precoding, or only the delayed CSI, as the Maddah-Ali-Tse (MAT) scheme, we propose a seamless strategy that takes advantage of both. The achievable degrees of freedom of the proposed scheme is characterized in terms of the quality of the current channel knowledge.

preprint2012arXiv

Precoding Methods for MISO Broadcast Channel with Delayed CSIT

Recent information theoretic results suggest that precoding on the multi-user downlink MIMO channel with delayed channel state information at the transmitter (CSIT) could lead to data rates much beyond the ones obtained without any CSIT, even in extreme situations when the delayed channel feedback is made totally obsolete by a feedback delay exceeding the channel coherence time. This surprising result is based on the ideas of interference repetition and alignment which allow the receivers to reconstruct information symbols which canceling out the interference completely, making it an optimal scheme in the infinite SNR regime. In this paper, we formulate a similar problem, yet at finite SNR. We propose a first construction for the precoder which matches the previous results at infinite SNR yet reaches a useful trade-off between interference alignment and signal enhancement at finite SNR, allowing for significant performance improvements in practical settings. We present two general precoding methods with arbitrary number of users by means of virtual MMSE and mutual information optimization, achieving good compromise between signal enhancement and interference alignment. Simulation results show substantial improvement due to the compromise between those two aspects.

preprint2012arXiv

The DoF of Network MIMO with Backhaul Delays

We consider the problem of downlink precoding for Network (multi-cell) MIMO networks where Transmitters (TXs) are provided with imperfect Channel State Information (CSI). Specifically, each TX receives a delayed channel estimate with the delay being specific to each channel component. This model is particularly adapted to the scenarios where a user feeds back its CSI to its serving base only as it is envisioned in future LTE networks. We analyze the impact of the delay during the backhaul-based CSI exchange on the rate performance achieved by Network MIMO. We highlight how delay can dramatically degrade system performance if existing precoding methods are to be used. We propose an alternative robust beamforming strategy which achieves the maximal performance, in DoF sense. We verify by simulations that the theoretical DoF improvement translates into a performance increase at finite Signal-to-Noise Ratio (SNR) as well.

preprint2012arXiv

The DoF Region of the Multiple-Antenna Time Correlated Interference Channel with Delayed CSIT

We consider the time-correlated multiple-antenna interference channel where the transmitters have (i) delayed channel state information (CSI) obtained from a latency-prone feedback channel as well as (ii) imperfect current CSIT, obtained e.g. from prediction on the basis of these past channel samples. We derive the degrees of freedom (DoF) region for the two-user multiple-antenna interference channel under such conditions. The proposed DoF achieving scheme exploits a particular combination of the space-time alignment protocol designed for fully outdated CSIT feedback channels (initially developed for the broadcast channel by Maddah-Ali et al, later extended to the interference channel by Vaze et al. and Ghasemi et al.) together with the use of simple zero-forcing (ZF) precoders. The essential ingredient lies in the quantization and feedback of the residual interference left after the application of the initial imperfect ZF precoder. Our focus is on the MISO setting albeit extensions to certain MIMO cases are also considered.

preprint2011arXiv

User Scheduling for Heterogeneous Multiuser MIMO Systems: A Subspace Viewpoint

In downlink multiuser multiple-input multiple-output (MU-MIMO) systems, users are practically heterogeneous in nature. However, most of the existing user scheduling algorithms are designed with an implicit assumption that the users are homogeneous. In this paper, we revisit the problem by exploring the characteristics of heterogeneous users from a subspace point of view. With an objective of minimizing interference non-orthogonality among users, three new angular-based user scheduling criteria that can be applied in various user scheduling algorithms are proposed. While the first criterion is heuristically determined by identifying the incapability of largest principal angle to characterize the subspace correlation and hence the interference non-orthogonality between users, the second and third ones are derived by using, respectively, the sum rate capacity bounds with block diagonalization and the change in capacity by adding a new user into an existing user subset. Aiming at capturing fairness among heterogeneous users while maintaining multiuser diversity gain, two new hybrid user scheduling algorithms are also proposed whose computational complexities are only linearly proportional to the number of users. We show by simulations that the effectiveness of our proposed user scheduling criteria and algorithms with respect to those commonly used in homogeneous environment.

Xinping Yi

What is connected

Connect this record

See the researcher in context

Building this map preview

22 published item(s)

Learning to Unfold Fractional Programming for Multi-Cell MU-MIMO Beamforming with Graph Neural Networks

Towards A Unified PAC-Bayesian Framework for Norm-based Generalization Bounds

A Little Energy Goes a Long Way: Build an Energy-Efficient, Accurate Spiking Neural Network from Convolutional Neural Network

Enhancing Adversarial Training with Second-Order Statistics of Weights

Neuronal Correlation: a Central Concept in Neural Network

Topological Interference Management with Adversarial Topology Perturbation: An Algorithmic Perspective

Weight Expansion: A New Perspective on Dropout and Generalization

A Survey of Safety and Trustworthiness of Deep Neural Networks: Verification, Testing, Adversarial Attack and Defence, and Interpretability

Asymptotic Singular Value Distribution of Linear Convolutional Layers

Dual-Polarized FDD Massive MIMO: A Comprehensive Framework

Energy Efficiency Optimization for Downlink Massive MIMO With Statistical CSIT

Robust Precoding in Massive MIMO: A Deep Learning Approach

Optimality of Treating Interference as Noise: A Combinatorial Perspective

Topological Interference Management with Transmitter Cooperation

On the Degrees of Freedom of the K-User Time Correlated Broadcast Channel with Delayed CSIT

The Degrees of Freedom Region of Temporally-Correlated MIMO Networks with Delayed CSIT

Degrees of Freedom of Time Correlated MISO Broadcast Channel with Delayed CSIT

On the Degrees of Freedom of time correlated MISO broadcast channel with delayed CSIT

Precoding Methods for MISO Broadcast Channel with Delayed CSIT

The DoF of Network MIMO with Backhaul Delays

The DoF Region of the Multiple-Antenna Time Correlated Interference Channel with Delayed CSIT

User Scheduling for Heterogeneous Multiuser MIMO Systems: A Subspace Viewpoint