Source author record

Yu Guang Wang

Yu Guang Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning eess.IV math.NA Artificial Intelligence math.PR Numerical Analysis physics.data-an astro-ph.CO Computer Vision cond-mat.dis-nn Information Theory math.CA math.IT Methodology Networking and Internet Architecture

Catalog footprint

What is connected

17works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Approximate Equivariance SO(3) Needlet Convolution

This paper develops a rotation-invariant needlet convolution for rotation group SO(3) to distill multiscale information of spherical signals. The spherical needlet transform is generalized from $\mathbb{S}^2$ onto the SO(3) group, which decomposes a spherical signal to approximate and detailed spectral coefficients by a set of tight framelet operators. The spherical signal during the decomposition and reconstruction achieves rotation invariance. Based on needlet transforms, we form a Needlet approximate Equivariance Spherical CNN (NES) with multiple SO(3) needlet convolutional layers. The network establishes a powerful tool to extract geometric-invariant features of spherical signals. The model allows sufficient network scalability with multi-resolution representation. A robust signal embedding is learned with wavelet shrinkage activation function, which filters out redundant high-pass representation while maintaining approximate rotation invariance. The NES achieves state-of-the-art performance for quantum chemistry regression and Cosmic Microwave Background (CMB) delensing reconstruction, which shows great potential for solving scientific challenges with high-resolution and multi-scale spherical signal representation.

preprint2022arXiv

Embedding Graphs on Grassmann Manifold

Learning efficient graph representation is the key to favorably addressing downstream tasks on graphs, such as node or graph property prediction. Given the non-Euclidean structural property of graphs, preserving the original graph data's similarity relationship in the embedded space needs specific tools and a similarity metric. This paper develops a new graph representation learning scheme, namely EGG, which embeds approximated second-order graph characteristics into a Grassmann manifold. The proposed strategy leverages graph convolutions to learn hidden representations of the corresponding subspace of the graph, which is then mapped to a Grassmann point of a low dimensional manifold through truncated singular value decomposition (SVD). The established graph embedding approximates denoised correlationship of node attributes, as implemented in the form of a symmetric matrix space for Euclidean calculation. The effectiveness of EGG is demonstrated using both clustering and classification tasks at the node level and graph level. It outperforms baseline models on various benchmarks.

preprint2022arXiv

How GNNs Facilitate CNNs in Mining Geometric Information from Large-Scale Medical Images

Gigapixel medical images provide massive data, both morphological textures and spatial information, to be mined. Due to the large data scale in histology, deep learning methods play an increasingly significant role as feature extractors. Existing solutions heavily rely on convolutional neural networks (CNNs) for global pixel-level analysis, leaving the underlying local geometric structure such as the interaction between cells in the tumor microenvironment unexplored. The topological structure in medical images, as proven to be closely related to tumor evolution, can be well characterized by graphs. To obtain a more comprehensive representation for downstream oncology tasks, we propose a fusion framework for enhancing the global image-level representation captured by CNNs with the geometry of cell-level spatial information learned by graph neural networks (GNN). The fusion layer optimizes an integration between collaborative features of global images and cell graphs. Two fusion strategies have been developed: one with MLP which is simple but turns out efficient through fine-tuning, and the other with Transformer gains a champion in fusing multiple networks. We evaluate our fusion strategies on histology datasets curated from large patient cohorts of colorectal and gastric cancers for three biomarker prediction tasks. Both two models outperform plain CNNs or GNNs, reaching a consistent AUC improvement of more than 5% on various network backbones. The experimental results yield the necessity for combining image-level morphological features with cell spatial relations in medical image analysis. Codes are available at https://github.com/yiqings/HEGnnEnhanceCnn.

preprint2022arXiv

Lower and Upper Bounds for Numbers of Linear Regions of Graph Convolutional Networks

The research for characterizing GNN expressiveness attracts much attention as graph neural networks achieve a champion in the last five years. The number of linear regions has been considered a good measure for the expressivity of neural networks with piecewise linear activation. In this paper, we present some estimates for the number of linear regions of the classic graph convolutional networks (GCNs) with one layer and multiple-layer scenarios. In particular, we obtain an optimal upper bound for the maximum number of linear regions for one-layer GCNs, and the upper and lower bounds for multi-layer GCNs. The simulated estimate shows that the true maximum number of linear regions is possibly closer to our estimated lower bound. These results imply that the number of linear regions of multi-layer GCNs is exponentially greater than one-layer GCNs per parameter in general. This suggests that deeper GCNs have more expressivity than shallow GCNs.

preprint2022arXiv

Oversquashing in GNNs through the lens of information contraction and graph expansion

The quality of signal propagation in message-passing graph neural networks (GNNs) strongly influences their expressivity as has been observed in recent works. In particular, for prediction tasks relying on long-range interactions, recursive aggregation of node features can lead to an undesired phenomenon called "oversquashing". We present a framework for analyzing oversquashing based on information contraction. Our analysis is guided by a model of reliable computation due to von Neumann that lends a new insight into oversquashing as signal quenching in noisy computation graphs. Building on this, we propose a graph rewiring algorithm aimed at alleviating oversquashing. Our algorithm employs a random local edge flip primitive motivated by an expander graph construction. We compare the spectral expansion properties of our algorithm with that of an existing curvature-based non-local rewiring strategy. Synthetic experiments show that while our algorithm in general has a slower rate of expansion, it is overall computationally cheaper, preserves the node degrees exactly and never disconnects the graph.

preprint2022arXiv

Weisfeiler and Lehman Go Cellular: CW Networks

Graph Neural Networks (GNNs) are limited in their expressive power, struggle with long-range interactions and lack a principled way to model higher-order structures. These problems can be attributed to the strong coupling between the computational graph and the input graph structure. The recently proposed Message Passing Simplicial Networks naturally decouple these elements by performing message passing on the clique complex of the graph. Nevertheless, these models can be severely constrained by the rigid combinatorial structure of Simplicial Complexes (SCs). In this work, we extend recent theoretical results on SCs to regular Cell Complexes, topological objects that flexibly subsume SCs and graphs. We show that this generalisation provides a powerful set of graph "lifting" transformations, each leading to a unique hierarchical message passing procedure. The resulting methods, which we collectively call CW Networks (CWNs), are strictly more powerful than the WL test and not less powerful than the 3-WL test. In particular, we demonstrate the effectiveness of one such scheme, based on rings, when applied to molecular graph problems. The proposed architecture benefits from provably larger expressivity than commonly used GNNs, principled modelling of higher-order signals and from compressing the distances between nodes. We demonstrate that our model achieves state-of-the-art results on a variety of molecular datasets.

preprint2021arXiv

MathNet: Haar-Like Wavelet Multiresolution-Analysis for Graph Representation and Learning

Graph Neural Networks (GNNs) have recently caught great attention and achieved significant progress in graph-level applications. In this paper, we propose a framework for graph neural networks with multiresolution Haar-like wavelets, or MathNet, with interrelated convolution and pooling strategies. The underlying method takes graphs in different structures as input and assembles consistent graph representations for readout layers, which then accomplishes label prediction. To achieve this, the multiresolution graph representations are first constructed and fed into graph convolutional layers for processing. The hierarchical graph pooling layers are then involved to downsample graph resolution while simultaneously remove redundancy within graph signals. The whole workflow could be formed with a multi-level graph analysis, which not only helps embed the intrinsic topological information of each graph into the GNN, but also supports fast computation of forward and adjoint graph transforms. We show by extensive experiments that the proposed framework obtains notable accuracy gains on graph classification and regression tasks with performance stability. The proposed MathNet outperforms various existing GNN models, especially on big data sets.

preprint2020arXiv

CosmoVAE: Variational Autoencoder for CMB Image Inpainting

Cosmic microwave background radiation (CMB) is critical to the understanding of the early universe and precise estimation of cosmological constants. Due to the contamination of thermal dust noise in the galaxy, the CMB map that is an image on the two-dimensional sphere has missing observations, mainly concentrated on the equatorial region. The noise of the CMB map has a significant impact on the estimation precision for cosmological parameters. Inpainting the CMB map can effectively reduce the uncertainty of parametric estimation. In this paper, we propose a deep learning-based variational autoencoder --- CosmoVAE, to restoring the missing observations of the CMB map. The input and output of CosmoVAE are square images. To generate training, validation, and test data sets, we segment the full-sky CMB map into many small images by Cartesian projection. CosmoVAE assigns physical quantities to the parameters of the VAE network by using the angular power spectrum of the Gaussian random field as latent variables. CosmoVAE adopts a new loss function to improve the learning performance of the model, which consists of $\ell_1$ reconstruction loss, Kullback-Leibler divergence between the posterior distribution of encoder network and the prior distribution of latent variables, perceptual loss, and total-variation regularizer. The proposed model achieves state of the art performance for Planck \texttt{Commander} 2018 CMB map inpainting.

preprint2020arXiv

Deep Learning Based Unsupervised and Semi-supervised Classification for Keratoconus

The transparent cornea is the window of the eye, facilitating the entry of light rays and controlling focusing the movement of the light within the eye. The cornea is critical, contributing to 75% of the refractive power of the eye. Keratoconus is a progressive and multifactorial corneal degenerative disease affecting 1 in 2000 individuals worldwide. Currently, there is no cure for keratoconus other than corneal transplantation for advanced stage keratoconus or corneal cross-linking, which can only halt KC progression. The ability to accurately identify subtle KC or KC progression is of vital clinical significance. To date, there has been little consensus on a useful model to classify KC patients, which therefore inhibits the ability to predict disease progression accurately. In this paper, we utilised machine learning to analyse data from 124 KC patients, including topographical and clinical variables. Both supervised multilayer perceptron and unsupervised variational autoencoder models were used to classify KC patients with reference to the existing Amsler-Krumeich (A-K) classification system. Both methods result in high accuracy, with the unsupervised method showing better performance. The result showed that the unsupervised method with a selection of 29 variables could be a powerful tool to provide an automatic classification tool for clinicians. These outcomes provide a platform for additional analysis for the progression and treatment of keratoconus.

preprint2020arXiv

Distributed Learning via Filtered Hyperinterpolation on Manifolds

Learning mappings of data on manifolds is an important topic in contemporary machine learning, with applications in astrophysics, geophysics, statistical physics, medical diagnosis, biochemistry, 3D object analysis. This paper studies the problem of learning real-valued functions on manifolds through filtered hyperinterpolation of input-output data pairs where the inputs may be sampled deterministically or at random and the outputs may be clean or noisy. Motivated by the problem of handling large data sets, it presents a parallel data processing approach which distributes the data-fitting task among multiple servers and synthesizes the fitted sub-models into a global estimator. We prove quantitative relations between the approximation quality of the learned function over the entire manifold, the type of target function, the number of servers, and the number and type of available samples. We obtain the approximation rates of convergence for distributed and non-distributed approaches. For the non-distributed case, the approximation order is optimal.

preprint2020arXiv

Haar Graph Pooling

Deep Graph Neural Networks (GNNs) are useful models for graph classification and graph-based regression tasks. In these tasks, graph pooling is a critical ingredient by which GNNs adapt to input graphs of varying size and structure. We propose a new graph pooling operation based on compressive Haar transforms -- HaarPooling. HaarPooling implements a cascade of pooling operations; it is computed by following a sequence of clusterings of the input graph. A HaarPooling layer transforms a given input graph to an output graph with a smaller node number and the same feature dimension; the compressive Haar transform filters out fine detail information in the Haar wavelet domain. In this way, all the HaarPooling layers together synthesize the features of any given input graph into a feature vector of uniform size. Such transforms provide a sparse characterization of the data and preserve the structure information of the input graph. GNNs implemented with standard graph convolution layers and HaarPooling layers achieve state of the art performance on diverse graph classification and regression problems.

preprint2020arXiv

Numerical computation of triangular complex spherical designs with small mesh ratio

This paper provides triangular spherical designs for the complex unit sphere $Ω^d$ by exploiting the natural correspondence between the complex unit sphere in $d$ dimensions and the real unit sphere in $2d-1$. The existence of triangular and square complex spherical $t$-designs with the optimal order number of points is established. A variational characterization of triangular complex designs is provided, with particular emphasis on numerical computation of efficient triangular complex designs with good geometric properties as measured by their mesh ratio. We give numerical examples of triangular spherical $t$-designs on complex unit spheres of dimension $d=2$ to $6$.

preprint2020arXiv

Path Integral Based Convolution and Pooling for Graph Neural Networks

Graph neural networks (GNNs) extends the functionality of traditional neural networks to graph-structured data. Similar to CNNs, an optimized design of graph convolution and pooling is key to success. Borrowing ideas from physics, we propose a path integral based graph neural networks (PAN) for classification and regression tasks on graphs. Specifically, we consider a convolution operation that involves every path linking the message sender and receiver with learnable weights depending on the path length, which corresponds to the maximal entropy random walk. It generalizes the graph Laplacian to a new transition matrix we call maximal entropy transition (MET) matrix derived from a path integral formalism. Importantly, the diagonal entries of the MET matrix are directly related to the subgraph centrality, thus providing a natural and adaptive pooling mechanism. PAN provides a versatile framework that can be tailored for different graph data with varying sizes and structures. We can view most existing GNN architectures as special cases of PAN. Experimental results show that PAN achieves state-of-the-art performance on various graph classification/regression tasks, including a new benchmark dataset from statistical mechanics we propose to boost applications of GNN in physical sciences.

preprint2016arXiv

Fully discrete needlet approximation on the sphere

Spherical needlets are highly localized radial polynomials on the sphere $\mathbb{S}^{d}\subset \mathbb{R}^{d+1}$, $d\ge 2$, with centers at the nodes of a suitable cubature rule. The original semidiscrete spherical needlet approximation of Narcowich, Petrushev and Ward is not computable, in that the needlet coefficients depend on inner product integrals. In this work we approximate these integrals by a second quadrature rule with an appropriate degree of precision, to construct a fully discrete needlet approximation. We prove that the resulting approximation is equivalent to filtered hyperinterpolation, that is to a filtered Fourier-Laplace series partial sum with inner products replaced by appropriate cubature sums. It follows that the $\mathbb{L}_{p}$-error of discrete needlet approximation of order $J$ for $1 \le p \le \infty$ and $s > d/p$ has for a function $f$ in the Sobolev space $\mathbb{W}_{p}^{s}(\mathbb{S}^{d})$ the optimal rate of convergence in the sense of optimal recovery, namely $\mathcal{O}\bigl(2^{-J s}\bigr)$. Moreover, this is achieved with a filter function that is of smoothness class $C^{\lfloor \frac{d+3}{2}\rfloor}$, in contrast to the usually assumed $C^{\infty}$. A numerical experiment for a class of functions in known Sobolev smoothness classes gives $\mathbb{L}_2$ errors for the fully discrete needlet approximation that are almost identical to those for the original semidiscrete needlet approximation. Another experiment uses needlets over the whole sphere for the lower levels together with high-level needlets with centers restricted to a local region. The resulting errors are reduced in the local region away from the boundary, indicating that local refinement in special regions is a promising strategy.

preprint2016arXiv

Needlet approximation for isotropic random fields on the sphere

In this paper we establish a multiscale approximation for random fields on the sphere using spherical needlets --- a class of spherical wavelets. We prove that the semidiscrete needlet decomposition converges in mean and pointwise senses for weakly isotropic random fields on $\mathbb{S}^{d}$, $d\ge2$. For numerical implementation, we construct a fully discrete needlet approximation of a smooth $2$-weakly isotropic random field on $\mathbb{S}^{d}$ and prove that the approximation error for fully discrete needlets has the same convergence order as that for semidiscrete needlets. Numerical examples are carried out for fully discrete needlet approximations of Gaussian random fields and compared to a discrete version of the truncated Fourier expansion.

preprint2016arXiv

Random Point Sets on the Sphere --- Hole Radii, Covering, and Separation

Geometric properties of $N$ random points distributed independently and uniformly on the unit sphere $\mathbb{S}^{d}\subset\mathbb{R}^{d+1}$ with respect to surface area measure are obtained and several related conjectures are posed. In particular, we derive asymptotics (as $N \to \infty$) for the expected moments of the radii of spherical caps associated with the facets of the convex hull of $N$ random points on $\mathbb{S}^{d}$. We provide conjectures for the asymptotic distribution of the scaled radii of these spherical caps and the expected value of the largest of these radii (the covering radius). Numerical evidence is included to support these conjectures. Furthermore, utilizing the extreme law for pairwise angles of Cai et al., we derive precise asymptotics for the expected separation of random points on $\mathbb{S}^{d}$.

preprint2016arXiv

Riemann localisation on the sphere

This paper first shows that the Riemann localisation property holds for the Fourier-Laplace series partial sum for sufficiently smooth functions on the two-dimensional sphere, but does not hold for spheres of higher dimension. By Riemann localisation on the sphere $\mathbb{S}^{d}\subset\mathbb{R}^{d+1}$, $d\ge2$, we mean that for a suitable subset $X$ of $\mathbb{L}_{p}(\mathbb{S}^{d})$, $1\le p\le \infty$, the $\mathbb{L}_{p}$-norm of the Fourier local convolution of $f\in X$ converges to zero as the degree goes to infinity. The Fourier local convolution of $f$ at $\boldsymbol{x}\in\mathbb{S}^{d}$ is the Fourier convolution with a modified version of $f$ obtained by replacing values of $f$ by zero on a neighbourhood of $\boldsymbol{x}$. The failure of Riemann localisation for $d>2$ can be overcome by considering a filtered version: we prove that for a sphere of any dimension and sufficiently smooth filter the corresponding local convolution always has the Riemann localisation property. Key tools are asymptotic estimates of the Fourier and filtered kernels.

Yu Guang Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

17 published item(s)

Approximate Equivariance SO(3) Needlet Convolution

Embedding Graphs on Grassmann Manifold

How GNNs Facilitate CNNs in Mining Geometric Information from Large-Scale Medical Images

Lower and Upper Bounds for Numbers of Linear Regions of Graph Convolutional Networks

Oversquashing in GNNs through the lens of information contraction and graph expansion

Weisfeiler and Lehman Go Cellular: CW Networks

MathNet: Haar-Like Wavelet Multiresolution-Analysis for Graph Representation and Learning

CosmoVAE: Variational Autoencoder for CMB Image Inpainting

Deep Learning Based Unsupervised and Semi-supervised Classification for Keratoconus

Distributed Learning via Filtered Hyperinterpolation on Manifolds

Haar Graph Pooling

Numerical computation of triangular complex spherical designs with small mesh ratio

Path Integral Based Convolution and Pooling for Graph Neural Networks

Fully discrete needlet approximation on the sphere

Needlet approximation for isotropic random fields on the sphere

Random Point Sets on the Sphere --- Hole Radii, Covering, and Separation

Riemann localisation on the sphere