Source author record

Zhan Gao

Zhan Gao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

eess.SP Machine Learning econ.EM hep-ph

Catalog footprint

What is connected

5works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

Momentum and angular correlations in \texorpdfstring{$Z/γ$}{Z/gamma}-hadron production in relativistic heavy-ion collisions

We carry out a detailed study of medium modifications on momentum and angular correlations between a large transverse momentum hadron and a $Z/γ$ trigger in relativistic heavy-ion collisions within a perturbative QCD parton model improved by the Sudakov resummation technique. The total energy loss of a hard parton propagating inside the medium is employed to modify the fragmentation function, while the medium-induced transverse momentum broadening is included in the resummation approach, and both of them are related to the jet transport parameter and obtained by the high-twist formalism. We obtain good agreements with the existing data on transverse momentum and azimuthal angular correlations for the $Z/γ$-hadron pairs in $pp$ and $AA$ collisions, and predict the correlations for the $γ$-hadron in central $PbPb$ collisions at 5.02 TeV. The numerical analyses for the $Z/γ$-hadron in central $PbPb$ collisions show that the normalized angular distribution is decorrelated due to the medium-induced transverse momentum broadening, however, the angular correlation is enhanced due to the parton energy loss, namely anti-broadening. The observed modification of the angular correlation is a result of the competition between the broadening and the anti-broadening. This work provides a reliable theoretical tool for a comprehensive and precise study of jet quenching in relativistic heavy-ion collisions.

preprint2022arXiv

Wide and Deep Graph Neural Network with Distributed Online Learning

Graph neural networks (GNNs) are naturally distributed architectures for learning representations from network data. This renders them suitable candidates for decentralized tasks. In these scenarios, the underlying graph often changes with time due to link failures or topology variations, creating a mismatch between the graphs on which GNNs were trained and the ones on which they are tested. Online learning can be leveraged to retrain GNNs at testing time to overcome this issue. However, most online algorithms are centralized and usually offer guarantees only on convex problems, which GNNs rarely lead to. This paper develops the Wide and Deep GNN (WD-GNN), a novel architecture that can be updated with distributed online learning mechanisms. The WD-GNN consists of two components: the wide part is a linear graph filter and the deep part is a nonlinear GNN. At training time, the joint wide and deep architecture learns nonlinear representations from data. At testing time, the wide, linear part is retrained, while the deep, nonlinear one remains fixed. This often leads to a convex formulation. We further propose a distributed online learning algorithm that can be implemented in a decentralized setting. We also show the stability of the WD-GNN to changes of the underlying graph and analyze the convergence of the proposed online learning procedure. Experiments on movie recommendation, source localization and robot swarm control corroborate theoretical findings and show the potential of the WD-GNN for distributed online learning.

preprint2021arXiv

On LASSO for Predictive Regression

Explanatory variables in a predictive regression typically exhibit low signal strength and various degrees of persistence. Variable selection in such a context is of great importance. In this paper, we explore the pitfalls and possibilities of the LASSO methods in this predictive regression framework. In the presence of stationary, local unit root, and cointegrated predictors, we show that the adaptive LASSO cannot asymptotically eliminate all cointegrating variables with zero regression coefficients. This new finding motivates a novel post-selection adaptive LASSO, which we call the twin adaptive LASSO (TAlasso), to restore variable selection consistency. Accommodating the system of heterogeneous regressors, TAlasso achieves the well-known oracle property. In contrast, conventional LASSO fails to attain coefficient estimation consistency and variable screening in all components simultaneously. We apply these LASSO methods to evaluate the short- and long-horizon predictability of S\&P 500 excess returns.

preprint2020arXiv

Balancing Rates and Variance via Adaptive Batch-Size for Stochastic Optimization Problems

Stochastic gradient descent is a canonical tool for addressing stochastic optimization problems, and forms the bedrock of modern machine learning and statistics. In this work, we seek to balance the fact that attenuating step-size is required for exact asymptotic convergence with the fact that constant step-size learns faster in finite time up to an error. To do so, rather than fixing the mini-batch and the step-size at the outset, we propose a strategy to allow parameters to evolve adaptively. Specifically, the batch-size is set to be a piecewise-constant increasing sequence where the increase occurs when a suitable error criterion is satisfied. Moreover, the step-size is selected as that which yields the fastest convergence. The overall algorithm, two scale adaptive (TSA) scheme, is developed for both convex and non-convex stochastic optimization problems. It inherits the exact asymptotic convergence of stochastic gradient method. More importantly, the optimal error decreasing rate is achieved theoretically, as well as an overall reduction in computational cost. Experimentally, we observe that TSA attains a favorable tradeoff relative to standard SGD that fixes the mini-batch and the step-size, or simply allowing one to increase or decrease respectively.

preprint2020arXiv

Resource Allocation via Graph Neural Networks in Free Space Optical Fronthaul Networks

This paper investigates the optimal resource allocation in free space optical (FSO) fronthaul networks. The optimal allocation maximizes an average weighted sum-capacity subject to power limitation and data congestion constraints. Both adaptive power assignment and node selection are considered based on the instantaneous channel state information (CSI) of the links. By parameterizing the resource allocation policy, we formulate the problem as an unsupervised statistical learning problem. We consider the graph neural network (GNN) for the policy parameterization to exploit the FSO network structure with small-scale training parameters. The GNN is shown to retain the permutation equivariance that matches with the permutation equivariance of resource allocation policy in networks. The primal-dual learning algorithm is developed to train the GNN in a model-free manner, where the knowledge of system models is not required. Numerical simulations present the strong performance of the GNN relative to a baseline policy with equal power assignment and random node selection.

Zhan Gao

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

Momentum and angular correlations in \texorpdfstring{$Z/γ$}{Z/gamma}-hadron production in relativistic heavy-ion collisions

Wide and Deep Graph Neural Network with Distributed Online Learning

On LASSO for Predictive Regression

Balancing Rates and Variance via Adaptive Batch-Size for Stochastic Optimization Problems

Resource Allocation via Graph Neural Networks in Free Space Optical Fronthaul Networks