Source author record

Long Jin

Long Jin appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning math.AP math.SP Computer Vision math-ph math.DS math.MP Artificial Intelligence math.CA Neural and Evolutionary Computing nlin.CD quant-ph

Catalog footprint

What is connected

16works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Counting Pollicott--Ruelle resonances for Axiom A flows

In this paper, we count the number of Pollicott--Ruelle resonances for open hyperbolic systems and Axiom A flows. In particular, we prove polynomial upper bounds and sublinear lower bounds on the number of resonances with modulus less than $r$ in strips for open hyperbolic systems and Axiom A flows with a transversality condition.

preprint2022arXiv

Decoupling the Depth and Scope of Graph Neural Networks

State-of-the-art Graph Neural Networks (GNNs) have limited scalability with respect to the graph and model sizes. On large graphs, increasing the model depth often means exponential expansion of the scope (i.e., receptive field). Beyond just a few layers, two fundamental challenges emerge: 1. degraded expressivity due to oversmoothing, and 2. expensive computation due to neighborhood explosion. We propose a design principle to decouple the depth and scope of GNNs -- to generate representation of a target entity (i.e., a node or an edge), we first extract a localized subgraph as the bounded-size scope, and then apply a GNN of arbitrary depth on top of the subgraph. A properly extracted subgraph consists of a small number of critical neighbors, while excluding irrelevant ones. The GNN, no matter how deep it is, smooths the local neighborhood into informative representation rather than oversmoothing the global graph into "white noise". Theoretically, decoupling improves the GNN expressive power from the perspectives of graph signal processing (GCN), function approximation (GraphSAGE) and topological learning (GIN). Empirically, on seven graphs (with up to 110M nodes) and six backbone GNN architectures, our design achieves significant accuracy improvement with orders of magnitude reduction in computation and hardware cost.

preprint2022arXiv

Deep Graph Neural Networks with Shallow Subgraph Samplers

While Graph Neural Networks (GNNs) are powerful models for learning representations on graphs, most state-of-the-art models do not have significant accuracy gain beyond two to three layers. Deep GNNs fundamentally need to address: 1). expressivity challenge due to oversmoothing, and 2). computation challenge due to neighborhood explosion. We propose a simple "deep GNN, shallow sampler" design principle to improve both the GNN accuracy and efficiency -- to generate representation of a target node, we use a deep GNN to pass messages only within a shallow, localized subgraph. A properly sampled subgraph may exclude irrelevant or even noisy nodes, and still preserve the critical neighbor features and graph structures. The deep GNN then smooths the informative local signals to enhance feature learning, rather than oversmoothing the global graph signals into just "white noise". We theoretically justify why the combination of deep GNNs with shallow samplers yields the best learning performance. We then propose various sampling algorithms and neural architecture extensions to achieve good empirical results. On the largest public graph dataset, ogbn-papers100M, we achieve state-of-the-art accuracy with an order of magnitude reduction in hardware cost.

preprint2022arXiv

Experimental secure quantum key distribution in presence of polarization-dependent loss

Quantum key distribution (QKD) is theoretically secure using the principle of quantum mechanics; therefore, QKD is a promising solution for the future of secure communication. Although several experimental demonstrations of QKD have been reported, they have not considered the polarization-dependent loss in state preparation in the key-rate estimation. In this study, we experimentally characterized polarization-dependent loss in realistic state-preparation devices and verified that a considerable PDL exists in fiber- and silicon-based polarization modulators. Hence, the security of such QKD systems is compromised because of the secure key rate overestimation. Furthermore, we report a decoy-state BB84 QKD experiment considering polarization-dependent loss. Finally, we achieved rigorous finite-key security bound over up to 75 km fiber links by applying a recently proposed security proof. This study considers more realistic source flaws than most previous experiments; thus, it is crucial toward a secure QKD with imperfect practical devices.

preprint2022arXiv

Flat trace estimates for Anosov flows

We prove a high energy flat trace estimate for the modified resolvent of the generator of an Anosov flow. This fills a gap in the proof of the local trace formula in [Jin-Zworski '17] and is a by-product of the authors' ongoing project of its generalization to Axiom A flows.

preprint2022arXiv

Labeling Trick: A Theory of Using Graph Neural Networks for Multi-Node Representation Learning

In this paper, we provide a theory of using graph neural networks (GNNs) for multi-node representation learning (where we are interested in learning a representation for a set of more than one node, such as link). We know that GNN is designed to learn single-node representations. When we want to learn a node set representation involving multiple nodes, a common practice in previous works is to directly aggregate the single-node representations obtained by a GNN into a joint node set representation. In this paper, we show a fundamental constraint of such an approach, namely the inability to capture the dependence between nodes in the node set, and argue that directly aggregating individual node representations does not lead to an effective joint representation for multiple nodes. Then, we notice that a few previous successful works for multi-node representation learning, including SEAL, Distance Encoding, and ID-GNN, all used node labeling. These methods first label nodes in the graph according to their relationships with the target node set before applying a GNN. Then, the node representations obtained in the labeled graph are aggregated into a node set representation. By investigating their inner mechanisms, we unify these node labeling techniques into a single and most general form -- labeling trick. We prove that with labeling trick a sufficiently expressive GNN learns the most expressive node set representations, thus in principle solves any joint learning tasks over node sets. Experiments on one important two-node representation learning task, link prediction, verified our theory. Our work explains the superior performance of previous node-labeling-based methods, and establishes a theoretical foundation of using GNNs for multi-node representation learning.

preprint2022arXiv

Zero Stability Well Predicts Performance of Convolutional Neural Networks

The question of what kind of convolutional neural network (CNN) structure performs well is fascinating. In this work, we move toward the answer with one more step by connecting zero stability and model performance. Specifically, we found that if a discrete solver of an ordinary differential equation is zero stable, the CNN corresponding to that solver performs well. We first give the interpretation of zero stability in the context of deep learning and then investigate the performance of existing first- and second-order CNNs under different zero-stable circumstances. Based on the preliminary observation, we provide a higher-order discretization to construct CNNs and then propose a zero-stable network (ZeroSNet). To guarantee zero stability of the ZeroSNet, we first deduce a structure that meets consistency conditions and then give a zero stable region of a training-free parameter. By analyzing the roots of a characteristic equation, we theoretically obtain the optimal coefficients of feature maps. Empirically, we present our results from three aspects: We provide extensive empirical evidence of different depth on different datasets to show that the moduli of the characteristic equation's roots are the keys for the performance of CNNs that require historical features; Our experiments show that ZeroSNet outperforms existing CNNs which is based on high-order discretization; ZeroSNets show better robustness against noises on the input. The source code is available at \url{https://github.com/LongJin-lab/ZeroSNet}.

preprint2021arXiv

Control of eigenfunctions on surfaces of variable curvature

We prove a microlocal lower bound on the mass of high energy eigenfunctions of the Laplacian on compact surfaces of negative curvature, and more generally on surfaces with Anosov geodesic flows. This implies controllability for the Schrödinger equation by any nonempty open set, and shows that every semiclassical measure has full support. We also prove exponential energy decay for solutions to the damped wave equation on such surfaces, for any nontrivial damping coefficient. These results extend previous works [arXiv:1705.05019], [arXiv:1712.02692], which considered the setting of surfaces of constant negative curvature. The proofs use the strategy of [arXiv:1705.05019], [arXiv:1712.02692] and rely on the fractal uncertainty principle of [arXiv:1612.09040]. However, in the variable curvature case the stable/unstable foliations are not smooth, so we can no longer associate to these foliations a pseudodifferential calculus of the type used in [arXiv:1504.06589]. Instead, our argument uses Egorov's Theorem up to local Ehrenfest time and the hyperbolic parametrix of [arXiv:0706.3242], together with the $C^{1+}$ regularity of the stable/unstable foliations.

preprint2020arXiv

Deforming the Loss Surface

In deep learning, it is usually assumed that the shape of the loss surface is fixed. Differently, a novel concept of deformation operator is first proposed in this paper to deform the loss surface, thereby improving the optimization. Deformation function, as a type of deformation operator, can improve the generalization performance. Moreover, various deformation functions are designed, and their contributions to the loss surface are further provided. Then, the original stochastic gradient descent optimizer is theoretically proved to be a flat minima filter that owns the talent to filter out the sharp minima. Furthermore, the flatter minima could be obtained by exploiting the proposed deformation functions, which is verified on CIFAR-100, with visualizations of loss landscapes near the critical points obtained by both the original optimizer and optimizer enhanced by deformation functions. The experimental results show that deformation functions do find flatter regions. Moreover, on ImageNet, CIFAR-10, and CIFAR-100, popular convolutional neural networks enhanced by deformation functions are compared with the corresponding original models, where significant improvements are observed on all of the involved models equipped with deformation functions. For example, the top-1 test accuracy of ResNet-20 on CIFAR-100 increases by 1.46%, with insignificant additional computational overhead.

preprint2020arXiv

Deforming the Loss Surface to Affect the Behaviour of the Optimizer

In deep learning, it is usually assumed that the optimization process is conducted on a shape-fixed loss surface. Differently, we first propose a novel concept of deformation mapping in this paper to affect the behaviour of the optimizer. Vertical deformation mapping (VDM), as a type of deformation mapping, can make the optimizer enter a flat region, which often implies better generalization performance. Moreover, we design various VDMs, and further provide their contributions to the loss surface. After defining the local M region, theoretical analyses show that deforming the loss surface can enhance the gradient descent optimizer's ability to filter out sharp minima. With visualizations of loss landscapes, we evaluate the flatnesses of minima obtained by both the original optimizer and optimizers enhanced by VDMs on CIFAR-100. The experimental results show that VDMs do find flatter regions. Moreover, we compare popular convolutional neural networks enhanced by VDMs with the corresponding original ones on ImageNet, CIFAR-10, and CIFAR-100. The results are surprising: there are significant improvements on all of the involved models equipped with VDMs. For example, the top-1 test accuracy of ResNet-20 on CIFAR-100 increases by 1.46%, with insignificant additional computational overhead.

preprint2020arXiv

Exponential lower resolvent bounds far away from trapped sets

We give examples of semiclassical Schrödinger operators with exponentially large cutoff resolvent norms, even when the supports of the cutoff and potential are very far apart. The examples are radial, which allows us to analyze the resolvent kernel in detail using ordinary differential equation techniques. In particular, we identify a threshold spatial radius where the resolvent behavior changes. We apply these results to wave equations with radial wavespeed, identifying a corresponding threshold radius at which wave decay properties change.

preprint2016arXiv

A local trace formula for Anosov flows (with an appendix by Frédéric Naud)

We prove a local trace formula for Anosov flows. It relates Pollicott--Ruelle resonances to the periods of closed orbits. As an application, we show that the counting function for resonances in a sufficiently wide strip cannot have a sublinear growth. In particular, for any Anosov flow there exist strips with infinitely many resonances.

preprint2016arXiv

Object Detection Free Instance Segmentation With Labeling Transformations

Instance segmentation has attracted recent attention in computer vision and existing methods in this domain mostly have an object detection stage. In this paper, we study the intrinsic challenge of the instance segmentation problem, the presence of a quotient space (swapping the labels of different instances leads to the same result), and propose new methods that are object proposal- and object detection- free. We propose three alternative methods, namely pixel-based affinity mapping, superpixel-based affinity learning, and boundary-based component segmentation, all focusing on performing labeling transformations to cope with the quotient space problem. By adopting fully convolutional neural networks (FCN) like models, our framework attains competitive results on both the PASCAL dataset (object-centric) and the Gland dataset (texture-centric), which the existing methods are not able to do. Our work also has the advantages in its transparency, simplicity, and being all segmentation based.

preprint2014arXiv

Scattering Resonances of Convex Obstacles for general boundary conditions

We study the distribution of resonances for smooth strictly convex obstacles under general boundary conditions. We show that under a pinched curvature condition for the boundary of the obstacle, the resonances are separated into cubic bands and the distribution in each bands satisfies Weyl's law.

preprint2013arXiv

Semiclassical Cauchy Estimates and Applications

In this note, we study solutions to semiclassical Schrodinger equations on a real analytic manifold with a real analytic potential and prove the semiclassical version of Cauchy estimates on derivatives. As an application, we use Donnelly and Fefferman's method to prove the upper and lower bounds for (n-1)-dimensional Hausdorff measure of the nodal sets of the solutions to semiclassical Schrodinger equations.

preprint2012arXiv

Resonance-free Region in scattering by a strictly convex obstacle

We prove the existence of a resonance free region in scattering by a strictly convex obstacle with the Robin boundary condition. More precisely, we show that the scattering resonances lie below a cubic curve which is the same as in the case of the Neumann boundary condition. This generalizes earlier results on cubic poles free regions obtained for the Dirichlet boundary condition.

Long Jin

What is connected

Connect this record

See the researcher in context

Building this map preview

16 published item(s)

Counting Pollicott--Ruelle resonances for Axiom A flows

Decoupling the Depth and Scope of Graph Neural Networks

Deep Graph Neural Networks with Shallow Subgraph Samplers

Experimental secure quantum key distribution in presence of polarization-dependent loss

Flat trace estimates for Anosov flows

Labeling Trick: A Theory of Using Graph Neural Networks for Multi-Node Representation Learning

Zero Stability Well Predicts Performance of Convolutional Neural Networks

Control of eigenfunctions on surfaces of variable curvature

Deforming the Loss Surface

Deforming the Loss Surface to Affect the Behaviour of the Optimizer

Exponential lower resolvent bounds far away from trapped sets

A local trace formula for Anosov flows (with an appendix by Frédéric Naud)

Object Detection Free Instance Segmentation With Labeling Transformations

Scattering Resonances of Convex Obstacles for general boundary conditions

Semiclassical Cauchy Estimates and Applications

Resonance-free Region in scattering by a strictly convex obstacle