Source author record

Xiaoyu Ma

Xiaoyu Ma appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Methodology Computer Science and Game Theory cond-mat.mes-hall cond-mat.soft cond-mat.stat-mech Hardware Architecture math.OC quant-ph

Catalog footprint

What is connected

7works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks

To fit sparse linear associations, a LASSO sparsity inducing penalty with a single hyperparameter provably allows to recover the important features (needles) with high probability in certain regimes even if the sample size is smaller than the dimension of the input vector (haystack). More recently learners known as artificial neural networks (ANN) have shown great successes in many machine learning tasks, in particular fitting nonlinear associations. Small learning rate, stochastic gradient descent algorithm and large training set help to cope with the explosion in the number of parameters present in deep neural networks. Yet few ANN learners have been developed and studied to find needles in nonlinear haystacks. Driven by a single hyperparameter, our ANN learner, like for sparse linear associations, exhibits a phase transition in the probability of retrieving the needles, which we do not observe with other ANN learners. To select our penalty parameter, we generalize the universal threshold of Donoho and Johnstone (1994) which is a better rule than the conservative (too many false detections) and expensive cross-validation. In the spirit of simulated annealing, we propose a warm-start sparsity inducing algorithm to solve the high-dimensional, non-convex and non-differentiable optimization problem. We perform precise Monte Carlo simulations to show the effectiveness of our approach.

preprint2022arXiv

Distributed coordination for seeking the optimal Nash equilibrium of aggregative games

This paper aims to design a distributed coordination algorithm for solving a multi-agent decision problem with a hierarchical structure. The primary goal is to search the Nash equilibrium of a noncooperative game such that each player has no incentive to deviate from the equilibrium under its private objective. Meanwhile, the agents can coordinate to optimize the social cost within the set of Nash equilibria of the underlying game. Such an optimal Nash equilibrium problem can be modeled as a distributed optimization problem with variational inequality constraints. We consider the scenario where the objective functions of both the underlying game and social cost optimization problem have a special aggregation structure. Since each player only has access to its local objectives while cannot know all players' decisions, a distributed algorithm is highly desirable. By utilizing the Tikhonov regularization and dynamical averaging tracking technique, we propose a distributed coordination algorithm by introducing an incentive term in addition to the gradient-based Nash equilibrium seeking, so as to intervene players' decisions to improve the system efficiency. We prove its convergence to the optimal Nash equilibrium of a monotone aggregative game with simulation studies.

preprint2022arXiv

Product and sum uncertainty relations based on metric-adjusted skew information

The metric-adjusted skew information establishes a connection between the geometrical formulation of quantum statistics and the measures of quantum information. We study uncertainty relations in product and summation forms of metric-adjusted skew information. We present lower bounds on product and summation uncertainty inequalities based on metric-adjusted skew information via operator representation of observables. Explicit examples are provided to back our claims.

preprint2022arXiv

Sparse additive models in high dimensions with wavelets

In multivariate regression, when covariates are numerous, it is often reasonable to assume that only a small number of them has predictive information. In some medical applications for instance, it is believed that only a few genes out of thousands are responsible for cancers. In that case, the aim is not only to propose a good fit, but also to select the relevant covariates (genes). We propose to perform model selection with additive models in high dimensions (sample size and number of covariates). Our approach is computationally efficient thanks to fast wavelet transforms, it does not rely on cross validation, and it solves a convex optimization problem for a prescribed penalty parameter, called the quantile universal threshold. We also propose a second rule based on Stein unbiased risk estimation geared towards prediction. We use Monte Carlo simulations and real data to compare various methods based on false discovery rate (FDR), true positive rate (TPR) and mean squared error. Our approach is the only one to handle high dimensions, and has the best FDR--TPR trade-off.

preprint2021arXiv

A General Framework of Online Updating Variable Selection for Generalized Linear Models with Streaming Datasets

In the research field of big data, one of important issues is how to recover the sequentially changing sets of true features when the data sets arrive sequentially. The paper presents a general framework for online updating variable selection and parameter estimation in generalized linear models with streaming datasets. This is a type of online updating penalty likelihoods with differentiable or non-differentiable penalty function. The online updating coordinate descent algorithm is proposed to solve the online updating optimization problem. Moreover, a tuning parameter selection is suggested in an online updating way. The selection and estimation consistencies, and the oracle property are established, theoretically. Our methods are further examined and illustrated by various numerical examples from both simulation experiments and a real data analysis.

preprint2021arXiv

Mitigating Edge Machine Learning Inference Bottlenecks: An Empirical Study on Accelerating Google Edge Models

As the need for edge computing grows, many modern consumer devices now contain edge machine learning (ML) accelerators that can compute a wide range of neural network (NN) models while still fitting within tight resource constraints. We analyze a commercial Edge TPU using 24 Google edge NN models (including CNNs, LSTMs, transducers, and RCNNs), and find that the accelerator suffers from three shortcomings, in terms of computational throughput, energy efficiency, and memory access handling. We comprehensively study the characteristics of each NN layer in all of the Google edge models, and find that these shortcomings arise from the one-size-fits-all approach of the accelerator, as there is a high amount of heterogeneity in key layer characteristics both across different models and across different layers in the same model. We propose a new acceleration framework called Mensa. Mensa incorporates multiple heterogeneous ML edge accelerators (including both on-chip and near-data accelerators), each of which caters to the characteristics of a particular subset of models. At runtime, Mensa schedules each layer to run on the best-suited accelerator, accounting for both efficiency and inter-layer dependencies. As we analyze the Google edge NN models, we discover that all of the layers naturally group into a small number of clusters, which allows us to design an efficient implementation of Mensa for these models with only three specialized accelerators. Averaged across all 24 Google edge models, Mensa improves energy efficiency and throughput by 3.0x and 3.1x over the Edge TPU, and by 2.4x and 4.3x over Eyeriss v2, a state-of-the-art accelerator.

preprint2020arXiv

Nanometer scale resolution, multi-channel separation of spherical particles in a rocking ratchet with increasing barrier heights

We present a nanoparticle size-separation device based on a nanofluidic rocking Brownian motor. It features a ratchet-shaped electrostatic particle potential with increasing barrier heights along the particle transport direction. The sharp drop of the particle current with barrier height is exploited to separate a particle suspension into multiple sub-populations. By solving the Fokker--Planck equation, we show that the physics of the separation mechanism is governed by the energy landscape under forward tilt of the ratchet. For a given device geometry and sorting duration, the applied force is thus the only tunable parameter to increase the separation resolution. For the experimental conditions of 3.5 V applied voltage and 20 s sorting, we predict a separation resolution of $\sim 2$ nm, supported by experimental data for separating spherical gold particles of nominal 80 and 100 nm diameters.

Xiaoyu Ma

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

A phase transition for finding needles in nonlinear haystacks with LASSO artificial neural networks

Distributed coordination for seeking the optimal Nash equilibrium of aggregative games

Product and sum uncertainty relations based on metric-adjusted skew information

Sparse additive models in high dimensions with wavelets

A General Framework of Online Updating Variable Selection for Generalized Linear Models with Streaming Datasets

Mitigating Edge Machine Learning Inference Bottlenecks: An Empirical Study on Accelerating Google Edge Models

Nanometer scale resolution, multi-channel separation of spherical particles in a rocking ratchet with increasing barrier heights