Source author record

Shiyun Xu

Shiyun Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Machine Learning, eess.SY, math.ST, Statistics Theory, Systems and Control

Catalog footprint

What is connected

3 works, 5 topics, 4 close collaborators

Preprint, 2022, arXiv

Sparse Neural Additive Model: Interpretable Deep Learning with Feature Selection via Group Sparsity

Interpretable machine learning has demonstrated impressive performance while preserving explainability. In particular, neural additive models (NAM) bring interpretability to black-box deep learning and achieve state-of-the-art accuracy among the large family of generalized additive models. To equip NAM with feature selection and improve generalization, we propose sparse neural additive models (SNAM), which employ group sparsity regularization (e.g. Group LASSO): each feature is learned by a sub-network whose trainable parameters are clustered as a group. We study the theoretical properties of SNAM with novel techniques that handle a non-parametric truth, thus extending classical sparse linear models such as the LASSO, which only applies to a parametric truth. Specifically, we show that SNAM with subgradient and proximal gradient descent provably converges to zero training loss as $t\to\infty$, and that the estimation error of SNAM vanishes asymptotically as $n\to\infty$. We also prove that SNAM, like LASSO, can achieve exact support recovery, i.e. perfect feature selection, with appropriate regularization. Moreover, we show that SNAM can generalize well and preserve 'identifiability', recovering each feature's effect. We validate our theory via extensive experiments that also demonstrate the good accuracy and efficiency of SNAM.
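The group-sparsity mechanism mentioned in this abstract can be illustrated with the proximal operator of the Group LASSO penalty, which shrinks a whole parameter group toward zero and sets it exactly to zero when its norm is small. The following is a minimal pure-Python sketch, not the authors' implementation: the function names `group_soft_threshold` and `proximal_step`, the flat-list representation of each sub-network's parameters, and the values of `step_size` and `lam` are all assumptions made for illustration.

```python
import math
from typing import List


def group_soft_threshold(group: List[float], threshold: float) -> List[float]:
    """Proximal operator of threshold * ||group||_2 (block soft-thresholding)."""
    norm = math.sqrt(sum(w * w for w in group))
    if norm <= threshold:
        # The whole group is zeroed: the corresponding feature is dropped.
        return [0.0] * len(group)
    scale = 1.0 - threshold / norm
    return [scale * w for w in group]


def proximal_step(
    groups: List[List[float]],
    grads: List[List[float]],
    step_size: float,
    lam: float,
) -> List[List[float]]:
    """One proximal gradient step: gradient move, then group shrinkage."""
    updated = []
    for weights, grad in zip(groups, grads):
        moved = [w - step_size * g for w, g in zip(weights, grad)]
        updated.append(group_soft_threshold(moved, step_size * lam))
    return updated


# Hypothetical example: two feature groups with zero gradients.
# The small-norm second group is removed entirely.
new_groups = proximal_step(
    groups=[[3.0, 4.0], [0.1, 0.1]],
    grads=[[0.0, 0.0], [0.0, 0.0]],
    step_size=0.5,
    lam=2.0,
)
```

In this sketch each inner list stands in for all trainable parameters of one feature's sub-network, so a group that is zeroed corresponds to a feature that has been deselected.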

Preprint, 2022, arXiv

Transient Stability of Low-Inertia Power Systems with Inverter-Based Generation

This study examines the transient stability of low-inertia power systems with inverter-based generation (IBG) and proposes a sufficient stability criterion. In low-inertia grids, a fault induces transient interactions between the electromagnetic dynamics of the IBG and the electromechanical dynamics of the synchronous generator (SG). To study these interactions, a hybrid IBG-SG system is established and a delta-power-frequency model is developed. Based on this model, new mechanisms of transient instability, different from those of conventional power systems, are identified from the energy perspective. First, two loss-of-synchronization (LOS) types are identified based on the relative power imbalance caused by the mismatch between the inertia of the IBG and that of the SG under a fault. Second, the relative angle and frequency jump at the moment of a fault, thus affecting the system energy. Third, the cosine damping coefficient induces positive energy dissipation, thereby contributing to system stability. A unified criterion for identifying the two LOS types is proposed using the energy function method. This criterion is proved to be a sufficient stability condition that accounts for the effects of the jumps and the cosine damping coefficient on system stability. The new mechanisms and the effectiveness of the criterion are verified by simulation results.

Preprint, 2021, arXiv

DebiNet: Debiasing Linear Models with Nonlinear Overparameterized Neural Networks

Recent years have witnessed strong empirical performance of over-parameterized neural networks on various tasks and many advances in the theory, e.g. universal approximation and provable convergence to a global minimum. In this paper, we incorporate over-parameterized neural networks into semi-parametric models to bridge the gap between inference and prediction, especially in the high-dimensional linear problem. By doing so, we can exploit a wide class of networks to approximate the nuisance functions and to estimate the parameters of interest consistently. We may therefore offer the best of both worlds: the universal approximation ability of neural networks and the interpretability of the classical linear model, leading to both valid inference and accurate prediction. We present the theoretical foundations that make this possible and demonstrate them with numerical experiments. Furthermore, we propose a framework, DebiNet, in which arbitrary feature selection methods can be plugged into our semi-parametric neural network. DebiNet can debias regularized estimators (e.g. Lasso) and performs well in terms of post-selection inference and generalization error.