Source author record

Li Jing

Li Jing appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

quant-ph, Machine Learning, Computer Vision, Artificial Intelligence, Neural and Evolutionary Computing, physics.app-ph, eess.IV, Emerging Technologies, physics.data-an, physics.optics

Catalog footprint

What is connected

14 works

10 topics

4 close collaborators


preprint, 2022, arXiv

Equivariant Contrastive Learning

In state-of-the-art self-supervised learning (SSL), pre-training produces semantically good representations by encouraging them to be invariant under meaningful transformations prescribed from human knowledge. In fact, the property of invariance is a trivial instance of a broader class called equivariance, which can be intuitively understood as the property that representations transform according to the way the inputs transform. Here, we show that rather than using only invariance, pre-training that encourages non-trivial equivariance to some transformations, while maintaining invariance to other transformations, can be used to improve the semantic quality of representations. Specifically, we extend popular SSL methods to a more general framework which we name Equivariant Self-Supervised Learning (E-SSL). In E-SSL, a simple additional pre-training objective encourages equivariance by predicting the transformations applied to the input. We demonstrate E-SSL's effectiveness empirically on several popular computer vision benchmarks, e.g. improving SimCLR to 72.5% linear probe accuracy on ImageNet. Furthermore, we demonstrate the usefulness of E-SSL for applications beyond computer vision; in particular, we show its utility on regression problems in photonics science. Our code, datasets and pre-trained models are available at https://github.com/rdangovs/essl to aid further research in E-SSL.
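
As a rough illustration of the transformation-prediction objective described above, the sketch below builds the inputs and labels for such a task. It assumes four-fold rotations as the transformation to be predicted, and the function name `make_rotation_batch` is hypothetical; none of this is taken from the linked repository.

```python
import numpy as np

def make_rotation_batch(images):
    """Return rotated copies of each image and the rotation index used.

    images: array of shape (n, h, w) with h == w.
    The label k means the image was rotated by k * 90 degrees.
    """
    rotated, labels = [], []
    for img in images:
        for k in range(4):
            rotated.append(np.rot90(img, k))
            labels.append(k)
    return np.stack(rotated), np.array(labels)

# Toy batch of two 4x4 "images".
images = np.arange(2 * 4 * 4, dtype=float).reshape(2, 4, 4)
rotated, labels = make_rotation_batch(images)
# A small predictor head trained with cross-entropy on `labels` would push
# the encoder to retain rotation information (equivariance), alongside the
# usual invariance loss applied to the other augmentations.
```

In a sketch like this, the total pre-training loss would be the usual invariance loss plus a weighted rotation-prediction loss; the weighting is a tunable hyperparameter.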

preprint, 2022, arXiv

Masked Siamese ConvNets

Self-supervised learning has shown superior performance over supervised methods on various vision benchmarks. The siamese network, which encourages embeddings to be invariant to distortions, is one of the most successful self-supervised visual representation learning approaches. Among all the augmentation methods, masking is the most general and straightforward method that has the potential to be applied to all kinds of input and requires the least amount of domain knowledge. However, masked siamese networks require a particular inductive bias and in practice only work well with Vision Transformers. This work empirically studies the problems behind masked siamese networks with ConvNets. We propose several empirical designs to overcome these problems gradually. Our method performs competitively on low-shot image classification and outperforms previous methods on object detection benchmarks. We discuss several remaining issues and hope this work can provide useful data points for future general-purpose self-supervised learning.

preprint, 2022, arXiv

Understanding Dimensional Collapse in Contrastive Self-supervised Learning

Self-supervised visual representation learning aims to learn useful representations without relying on human annotations. The joint embedding approach is based on maximizing the agreement between embedding vectors from different views of the same image. Various methods have been proposed to solve the collapsing problem, where all embedding vectors collapse to a trivial constant solution. Among these methods, contrastive learning prevents collapse via negative sample pairs. It has been shown that non-contrastive methods suffer from a lesser collapse problem of a different nature: dimensional collapse, whereby the embedding vectors end up spanning a lower-dimensional subspace instead of the entire available embedding space. Here, we show that dimensional collapse also happens in contrastive learning. In this paper, we shed light on the dynamics at play in contrastive learning that lead to dimensional collapse. Inspired by our theory, we propose a novel contrastive learning method, called DirectCLR, which directly optimizes the representation space without relying on an explicit trainable projector. Experiments show that DirectCLR outperforms SimCLR with a trainable linear projector on ImageNet.
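
One way to make "spanning a lower-dimensional subspace" concrete is to look at the singular value spectrum of the embedding covariance matrix. The sketch below does this on synthetic data; the helper names, the tolerance and the toy dimensions are assumptions for illustration, not the paper's code or experimental setup.

```python
import numpy as np

def embedding_spectrum(embeddings):
    """Singular values of the embedding covariance matrix, in descending order."""
    centered = embeddings - embeddings.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / len(embeddings)
    return np.linalg.svd(cov, compute_uv=False)

def effective_rank(singular_values, rel_tol=1e-8):
    """Count singular values that are non-negligible relative to the largest."""
    return int(np.sum(singular_values > rel_tol * singular_values[0]))

rng = np.random.default_rng(0)
n, d, k = 512, 16, 4
# Synthetic "collapsed" embeddings: d-dimensional vectors confined to a
# k-dimensional subspace.
collapsed = rng.normal(size=(n, k)) @ rng.normal(size=(k, d))
# Synthetic embeddings that use all d dimensions.
full = rng.normal(size=(n, d))

collapsed_rank = effective_rank(embedding_spectrum(collapsed))
full_rank = effective_rank(embedding_spectrum(full))
```

In this toy setting, the collapsed embeddings have only `k` non-negligible singular values even though each vector has `d` coordinates.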

preprint, 2020, arXiv

Contextualizing Enhances Gradient Based Meta Learning

Meta learning methods have found success when applied to few shot classification problems, in which they quickly adapt to a small number of labeled examples. Prototypical representations, each representing a particular class, have been of particular importance in this setting, as they provide a compact form to convey information learned from the labeled examples. However, these prototypes are just one method of representing this information, and they are narrow in their scope and ability to classify unseen examples. We propose the implementation of contextualizers, which are generalizable prototypes that adapt to given examples and play a larger role in classification for gradient-based models. We demonstrate how to equip meta learning methods with contextualizers and show that their use can significantly boost performance on a range of few shot learning datasets. We also present figures of merit demonstrating the potential benefits of contextualizers, along with analysis of how models make use of them. Our approach is particularly apt for low-data environments where it is difficult to update parameters without overfitting. Our implementation and instructions to reproduce the experiments are available at https://github.com/naveace/proto-context.

preprint, 2020, arXiv

Integration of Neural Network-Based Symbolic Regression in Deep Learning for Scientific Discovery

Symbolic regression is a powerful technique that can discover analytical equations that describe data, which can lead to explainable models and generalizability outside of the training data set. In contrast, neural networks have achieved amazing levels of accuracy on image recognition and natural language processing tasks, but are often seen as black-box models that are difficult to interpret and typically extrapolate poorly. Here we use a neural network-based architecture for symbolic regression called the Equation Learner (EQL) network and integrate it with other deep learning architectures such that the whole system can be trained end-to-end through backpropagation. To demonstrate the power of such systems, we study their performance on several substantially different tasks. First, we show that the neural network can perform symbolic regression and learn the form of several functions. Next, we present an MNIST arithmetic task where a separate part of the neural network extracts the digits. Finally, we demonstrate prediction of dynamical systems where an unknown parameter is extracted through an encoder. We find that the EQL-based architecture can extrapolate quite well outside of the training data set compared to a standard neural network-based architecture, paving the way for deep learning to be applied in scientific exploration and discovery.
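
To give a feel for how a network can represent an analytical expression, the sketch below implements a single layer whose units apply different primitive functions to a linear map of the input. The choice of primitives (identity, sine, product), the layer layout and the name `eql_layer` are assumptions for illustration, not the architecture used in the paper.

```python
import numpy as np

def eql_layer(x, W, b):
    """One layer whose units apply different primitive functions.

    The linear map produces 4 pre-activations per sample:
    unit 0 -> identity, unit 1 -> sin, units 2 and 3 -> multiplied together.
    """
    h = x @ W + b
    return np.stack([h[:, 0], np.sin(h[:, 1]), h[:, 2] * h[:, 3]], axis=1)

# Hand-picked weights so that the outputs are x0, sin(x1) and x0 * x1.
W = np.array([[1.0, 0.0, 1.0, 0.0],
              [0.0, 1.0, 0.0, 1.0]])
b = np.zeros(4)
x = np.array([[2.0, 0.5], [-1.0, 3.0]])
out = eql_layer(x, W, b)
```

In a trained network the weights would be learned by backpropagation, and sparse weights could then be read off as a symbolic formula.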

preprint, 2019, arXiv

Heuristic Recurrent Algorithms for Photonic Ising Machines

The inability of conventional electronic architectures to efficiently solve large combinatorial problems motivates the development of novel computational hardware. There has been much effort recently toward developing novel, application-specific hardware, across many different fields of engineering, such as integrated circuits, memristors, and photonics. However, unleashing the true potential of such novel architectures requires the development of featured algorithms which optimally exploit their fundamental properties. We here present the Photonic Recurrent Ising Sampler (PRIS), a heuristic method tailored for parallel architectures that allows for fast and efficient sampling from distributions of combinatorially hard Ising problems. Since the PRIS relies essentially on vector-to-fixed matrix multiplications, we suggest the implementation of the PRIS in photonic parallel networks, which realize these operations at an unprecedented speed. The PRIS provides sample solutions to the ground state of arbitrary Ising models, by converging in probability to their associated Gibbs distribution. By running the PRIS at various noise levels, we probe the critical behavior of universality classes and their critical exponents. In addition to the attractive features of photonic networks, the PRIS relies on intrinsic dynamic noise and eigenvalue dropout to find ground states more efficiently. Our work suggests speedups in heuristic methods via photonic implementations of the PRIS. We also hint at a broader class of (meta)heuristic algorithms derived from the PRIS, such as combined simulated annealing on the noise and eigenvalue dropout levels. Our algorithm can also be implemented in a competitive manner on fast parallel electronic hardware, such as FPGAs and ASICs.
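
For readers unfamiliar with the problem being sampled, the sketch below defines an Ising energy and finds ground states by exhaustive search. It is a brute-force baseline, not the PRIS; the sign convention, the factor of 1/2 and the absence of an external field are assumptions, and the function names are hypothetical.

```python
import itertools
import numpy as np

def ising_energy(J, s):
    """Energy H(s) = -1/2 * s^T J s for spins s in {-1, +1}^N (no external field)."""
    return -0.5 * s @ J @ s

def brute_force_ground_states(J):
    """Enumerate all 2^N spin configurations; only feasible for small N."""
    n = J.shape[0]
    best_energy, best_states = None, []
    for bits in itertools.product([-1, 1], repeat=n):
        s = np.array(bits, dtype=float)
        e = ising_energy(J, s)
        if best_energy is None or e < best_energy - 1e-12:
            best_energy, best_states = e, [bits]
        elif abs(e - best_energy) <= 1e-12:
            best_states.append(bits)
    return best_energy, best_states

# Three ferromagnetically coupled spins: aligned configurations minimize H.
J = np.ones((3, 3)) - np.eye(3)
energy, states = brute_force_ground_states(J)
```

The exponential cost of this enumeration is what motivates heuristic samplers for larger problems.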

preprint, 2014, arXiv

Fitting magnetic field gradient with Heisenberg-scaling accuracy

We propose a quantum fitting scheme to estimate the magnetic field gradient with $N$ atomic spins prepared in a W state, which attains Heisenberg-scaling accuracy. Our scheme combines quantum multi-parameter estimation with the least-squares linear fitting method to achieve the quantum Cramér-Rao bound (QCRB). We show that the estimated quantity achieves Heisenberg-scaling accuracy. In single-parameter estimation, under the assumption that the magnetic field is strictly linear, two optimal measurements can achieve the identical Heisenberg-scaling accuracy. A proper interpretation of the super-Heisenberg-scaling accuracy is presented. The scheme of quantum metrology combined with data fitting provides a new method for fast, high-precision measurements.
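
For context, the scaling terms used above can be written out as follows. This is the standard textbook form of the quantum Cramér-Rao bound for a single parameter $\theta$, not an equation reproduced from the paper; $\nu$ (the number of repetitions) and $F_Q$ (the quantum Fisher information) are symbols introduced here.

```latex
\delta\theta \;\ge\; \frac{1}{\sqrt{\nu\, F_Q(\theta)}}, \qquad
F_Q \propto N \;\Rightarrow\; \delta\theta \propto \frac{1}{\sqrt{N}} \quad \text{(standard quantum limit)}, \qquad
F_Q \propto N^2 \;\Rightarrow\; \delta\theta \propto \frac{1}{N} \quad \text{(Heisenberg scaling)}.
```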

preprint, 2014, arXiv

Quantum Cloning Machines and the Applications

The no-cloning theorem, which states that an unknown quantum state cannot be cloned perfectly, is fundamental to quantum mechanics and quantum information science. However, we can try to clone a quantum state approximately with the optimal fidelity, or instead, we can try to clone it perfectly with the largest probability. Thus various quantum cloning machines have been designed for different quantum information protocols. Specifically, quantum cloning machines can be designed to analyze the security of quantum key distribution protocols such as the BB84 protocol, the six-state protocol, the B92 protocol and their generalizations. Some well-known quantum cloning machines include the universal quantum cloning machine, the phase-covariant cloning machine, the asymmetric quantum cloning machine and the probabilistic quantum cloning machine. In the past years, much progress has been made in studying quantum cloning machines and their applications and implementations, both theoretically and experimentally. In this review, we give a complete description of those important developments in quantum cloning and some related topics. This review is also self-consistent, and in particular, we try to present some detailed formulations so that further study can build on those results.
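
As a small worked example of "optimal fidelity", the sketch below evaluates the well-known closed-form expression for the single-copy fidelity of the optimal symmetric universal cloner taking n identical qubits to m copies. The formula is quoted from the general literature rather than from this page, and the function name is hypothetical.

```python
from fractions import Fraction

def universal_qubit_cloning_fidelity(n, m):
    """Optimal single-copy fidelity for cloning n identical qubits into m >= n copies.

    Uses F = (m*n + m + n) / (m * (n + 2)), the standard result for
    symmetric universal cloning of qubits.
    """
    if not (1 <= n <= m):
        raise ValueError("require 1 <= n <= m")
    return Fraction(m * n + m + n, m * (n + 2))

one_to_two = universal_qubit_cloning_fidelity(1, 2)       # the familiar 5/6
trivial_case = universal_qubit_cloning_fidelity(3, 3)     # no cloning needed: 1
many_copies = universal_qubit_cloning_fidelity(1, 10**6)  # approaches 2/3
```

As m grows with n fixed, the fidelity tends to (n + 1) / (n + 2), which matches the optimal fidelity of estimating the state from n copies.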

preprint, 2013, arXiv

Quantum key distribution based on a quantum retrodiction protocol

This paper is withdrawn. We study the quantum key distribution (QKD) protocol based on a quantum retrodiction protocol, namely the so-called mean king problem. The security is analyzed by considering eavesdropping on both the preparation of the entangled pair and the transmission of the quantum state. This QKD protocol can efficiently generate one bit of raw key in every single run. We find that, for qubit systems, it is more secure than the Bennett-Brassard 1984 (BB84) QKD protocol and the six-state protocol. This QKD protocol also works for higher-dimensional systems.

preprint, 2012, arXiv

General Quantum Key Distribution in Higher Dimension

We study a general quantum key distribution protocol in higher dimension. In this protocol, quantum states in arbitrary $g+1$ ($1\le g\le d$) out of all $d+1$ mutually unbiased bases in a $d$-dimensional system can be used for the key encoding. This provides a natural generalization of quantum key distribution in higher dimension and recovers the previously known results for $g=1$ and $g=d$. In our investigation, we study Eve's attack by two slightly different approaches. One considers the optimal cloner for Eve, and the other, defined as the optimal attack, maximizes Eve's information. We derive results for both approaches and show the deviation of the optimal cloner from the optimal attack. With our systematic investigation of the quantum key distribution protocols in higher dimension, one may balance the security gain and the implementation cost by changing the number of bases in the key encoding. As a side product, we also prove the equivalence between the optimal phase-covariant quantum cloning machine and the optimal cloner for the $g=d-1$ quantum key distribution.
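
To illustrate what "mutually unbiased bases" means in the smallest case, the sketch below checks the $d+1=3$ standard bases of a qubit ($d=2$). The helper name and tolerance are assumptions for illustration; the paper's general $d$-dimensional construction is not reproduced here.

```python
import itertools
import numpy as np

def are_mutually_unbiased(basis_a, basis_b, tol=1e-12):
    """Columns are basis vectors; check |<a_i|b_j>|^2 == 1/d for all i, j."""
    d = basis_a.shape[0]
    overlaps = np.abs(basis_a.conj().T @ basis_b) ** 2
    return bool(np.allclose(overlaps, 1.0 / d, atol=tol))

# Eigenbases of the Pauli Z, X and Y operators.
z_basis = np.eye(2, dtype=complex)
x_basis = np.array([[1, 1], [1, -1]], dtype=complex) / np.sqrt(2)
y_basis = np.array([[1, 1], [1j, -1j]], dtype=complex) / np.sqrt(2)
bases = [z_basis, x_basis, y_basis]

all_unbiased = all(
    are_mutually_unbiased(a, b) for a, b in itertools.combinations(bases, 2)
)
```

For a qubit, encoding in two of these bases corresponds to a BB84-style protocol and encoding in all three to a six-state-style protocol.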

preprint, 2012, arXiv

Minimal sets determining universal and phase-covariant quantum cloning

We study the minimal input sets which can determine completely the universal and the phase-covariant quantum cloning machines. We find that the universal quantum cloning machine, which copies an arbitrary input qubit equally well, can nevertheless be determined completely by only four input states located at the four vertices of a tetrahedron. The phase-covariant quantum cloning machine, which can copy all qubits located on the equator of the Bloch sphere, can be determined by three equatorial qubits with equal angular distance. These results further sharpen the well-known results that the BB84 states and the six states used in quantum cryptography can determine completely the phase-covariant and universal quantum cloning machines. This concludes the study of the power of universal and phase-covariant quantum cloning, i.e., from the necessary minimal input sets to the full input sets given by definition. This can dramatically simplify testing whether quantum cloning machines are successful: we only need to check that the minimal input sets are cloned optimally.
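
The two minimal input sets described above can be written down as Bloch vectors. The sketch below uses one common orientation of the tetrahedron and places the first equatorial state on the x axis; both choices are assumptions for illustration, and the paper may use a different orientation.

```python
import numpy as np

# Four Bloch vectors at the vertices of a regular tetrahedron.
tetrahedron = np.array(
    [[1, 1, 1], [1, -1, -1], [-1, 1, -1], [-1, -1, 1]], dtype=float
) / np.sqrt(3)

# Three equatorial Bloch vectors separated by 120 degrees.
angles = 2 * np.pi * np.arange(3) / 3
equatorial = np.stack([np.cos(angles), np.sin(angles), np.zeros(3)], axis=1)

def pairwise_dots(vectors):
    """Dot products between all distinct pairs of vectors."""
    gram = vectors @ vectors.T
    i, j = np.triu_indices(len(vectors), k=1)
    return gram[i, j]

tetra_dots = pairwise_dots(tetrahedron)       # every pair has dot product -1/3
equatorial_dots = pairwise_dots(equatorial)   # every pair has dot product -1/2
```

In both sets the vectors are unit length and sum to zero, which reflects how evenly they are spread over the sphere and the equator respectively.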

preprint, 2012, arXiv

Quantum network teleportation for quantum information distribution and concentration

We investigate schemes of quantum network teleportation for quantum information distribution and concentration, which are essential in quantum cloud computation and the quantum internet. In those schemes, the cloud can simultaneously send identical unknown quantum states to clients located in different places by a network-like teleportation with a previously shared multipartite entangled state resource. The cloud first performs the quantum operation; each client can then recover their quantum state locally by using the classical information announced by the cloud about the measurement result. The number of clients can exceed the number of identical quantum states intentionally being sent; this quantum network teleportation ensures that the retrieved quantum state is optimal. Furthermore, we present a scheme to realize its reverse process, which concentrates the states from the clients to reconstruct the original state of the cloud. These schemes facilitate quantum information distribution and concentration in quantum networks in the framework of quantum cloud computation. Potential applications in time synchronization are discussed.

preprint, 2011, arXiv

No-compressing of quantum phase information

We raise a general question in quantum information theory: whether quantum phase information can be compressed and retrieved. A general qubit contains both amplitude and phase information, while an equatorial qubit contains only phase information. We study whether it is possible to compress the phase information of n equatorial qubits into m general qubits, with m less than n, such that the information can still be retrieved perfectly. We prove that this process is not allowed by quantum mechanics.

preprint, 2011, arXiv

Unified Universal Quantum Cloning Machine and Fidelities

We present a unified universal quantum cloning machine, which combines several different existing universal cloning machines, including the asymmetric case. In this unified framework, the identical pure states are projected equally into each copy, initially constituted by the input and one half of the maximally entangled states. We show explicitly that the output states of those universal cloning machines are the same. One important feature of this unified cloning machine is that the cloning procedure is always the symmetric projection, which dramatically reduces the difficulty of implementation. It is also found that this unified cloning machine can be directly modified to the general asymmetric case. In addition to the global fidelity and the single-copy fidelity, we also present all possible arbitrary-copy fidelities.
