Source author record

Jie Lu

Jie Lu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning hep-ph cond-mat.mes-hall math.OC Artificial Intelligence Computer Vision Distributed, Parallel, and Cluster Computing hep-lat Information Retrieval Systems and Control Methodology Social and Information Networks Computation and Language cond-mat.mtrl-sci hep-ex physics.app-ph physics.optics quant-ph

Catalog footprint

What is connected

47works

18topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Autonomous Drift Learning in Data Streams: A Unified Perspective

In the pursuit of autonomous learning systems, the foundational assumption of stationarity, the premise that data distributions and model behaviors remain constant, is fundamentally untenable. Historically, the research community has addressed non-stationary environments almost exclusively under the scope of concept drift, focusing primarily on temporal shifts in streams. However, as learning systems become increasingly autonomous and complex, merely adapting to temporal non-stationarity is no longer sufficient. Evolving beyond this traditional perspective, we propose a novel, three-dimensional taxonomy that systematizes the field based on the operational state of the system. First, time stream drift distinguishes between stochastic arbitrary patterns and structural rhythmic dynamics. Second, data stream drift disentangles shifts in feature representations, identified as representation drift, from changes in underlying semantics, recognized as semantic drift. Third, model stream drift characterizes the internal endogenous divergence of learning systems through the lenses of sequential plasticity, decentralized heterogeneity, and policy instability. Based on this framework, we systematically review 193 representative studies and identify key open challenges. By bridging the fragmented paradigms of drift adaptation, continual learning, and temporal generalization, this survey outlines a roadmap for building self-evolving intelligent systems capable of learning autonomously through continuous change.

preprint2024arXiv

A Deep Reinforcement Learning Approach to Efficient Distributed Optimization

In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a given specific problem. In this paper, we propose a learning-based method to achieve efficient distributed optimization over networked systems. Specifically, a deep reinforcement learning (DRL) framework is developed for adaptive configuration within a parameterized unifying algorithmic form, which incorporates an abundance of decentralized first-order and second-order optimization algorithms. We exploit the local consensus and objective information to represent the regularities of problem instances and trace the solving progress, which constitute the states observed by a DRL agent. The framework is trained using Proximal Policy Optimization (PPO) on a number of practical problem instances of similar structures yet different problem data. Experiments on various smooth and non-smooth classes of objective functions demonstrate that our proposed learning-based method outperforms several state-of-the-art distributed optimization algorithms in terms of convergence speed and solution accuracy.

preprint2024arXiv

Online Boosting Adaptive Learning under Concept Drift for Multistream Classification

Multistream classification poses significant challenges due to the necessity for rapid adaptation in dynamic streaming processes with concept drift. Despite the growing research outcomes in this area, there has been a notable oversight regarding the temporal dynamic relationships between these streams, leading to the issue of negative transfer arising from irrelevant data. In this paper, we propose a novel Online Boosting Adaptive Learning (OBAL) method that effectively addresses this limitation by adaptively learning the dynamic correlation among different streams. Specifically, OBAL operates in a dual-phase mechanism, in the first of which we design an Adaptive COvariate Shift Adaptation (AdaCOSA) algorithm to construct an initialized ensemble model using archived data from various source streams, thus mitigating the covariate shift while learning the dynamic correlations via an adaptive re-weighting strategy. During the online process, we employ a Gaussian Mixture Model-based weighting mechanism, which is seamlessly integrated with the acquired correlations via AdaCOSA to effectively handle asynchronous drift. This approach significantly improves the predictive performance and stability of the target stream. We conduct comprehensive experiments on several synthetic and real-world data streams, encompassing various drifting scenarios and types. The results clearly demonstrate that OBAL achieves remarkable advancements in addressing multistream classification problems by effectively leveraging positive knowledge derived from multiple sources.

preprint2022arXiv

Fast Evolution of Single Qubit Gate in Non-Adiabatic Geometric Quantum Computing

We implemented arbitrary single qubit gates of geometric quantum computing for a three-level system in a single-shot manner. The evolution time of the gate has been minimized by considering the shortest trajectory of the state on the Bloch sphere. The duration of gates grows from zero with the rotation angle $γ$, and the tested T gate time can be reduced to $\sim$40\% of those in the traditional orange-sliced-shaped path non-adiabatic holonomic quantum computing (NHQC) scheme by the parametrization of Rabi frequency. We also demonstrated that certain pulses are robust against static detuning errors and Rabi errors. The time-dependent detuning and Rabi frequency are found to be proportional to each other by a constant which is determined by the geometric phase. In this way, some previous NHQC schemes can be treated as special cases in our generalized model.

preprint2022arXiv

Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data

Modern kernel-based two-sample tests have shown great success in distinguishing complex, high-dimensional distributions with appropriate learned kernels. Previous work has demonstrated that this kernel learning procedure succeeds, assuming a considerable number of observed samples from each distribution. In realistic scenarios with very limited numbers of data samples, however, it can be challenging to identify a kernel powerful enough to distinguish complex distributions. We address this issue by introducing the problem of meta two-sample testing (M2ST), which aims to exploit (abundant) auxiliary data on related tasks to find an algorithm that can quickly identify a powerful test on new target tasks. We propose two specific algorithms for this task: a generic scheme which improves over baselines and a more tailored approach which performs even better. We provide both theoretical justification and empirical evidence that our proposed meta-testing schemes out-perform learning kernel-based tests directly from scarce observations, and identify when such schemes will be successful.

preprint2022arXiv

Multi-class Classification with Fuzzy-feature Observations: Theory and Algorithms

The theoretical analysis of multi-class classification has proved that the existing multi-class classification methods can train a classifier with high classification accuracy on the test set, when the instances are precise in the training and test sets with same distribution and enough instances can be collected in the training set. However, one limitation with multi-class classification has not been solved: how to improve the classification accuracy of multi-class classification problems when only imprecise observations are available. Hence, in this paper, we propose a novel framework to address a new realistic problem called multi-class classification with imprecise observations (MCIMO), where we need to train a classifier with fuzzy-feature observations. Firstly, we give the theoretical analysis of the MCIMO problem based on fuzzy Rademacher complexity. Then, two practical algorithms based on support vector machine and neural networks are constructed to solve the proposed new problem. Experiments on both synthetic and real-world datasets verify the rationality of our theoretical analysis and the efficacy of the proposed algorithms.

preprint2022arXiv

Stepping beyond your comfort zone: Diffusion-based network analytics for knowledge trajectory recommendation

Interest in tracing the research interests of scientific researchers is rising, and particularly that of predicting a researcher's knowledge trajectories beyond their current foci into potential inter-/cross-/multi-disciplinary interactions. Hence, in this study, we present a method of diffusion-based network analytics for knowledge trajectory recommendation. The method begins by constructing a heterogeneous bibliometric network consisting of a co-topic layer and a co-authorship layer. A novel link prediction approach with a diffusion strategy is then used to reflect real-world academic activity, such as knowledge sharing between co-authors or diffusing between similar research topics. This strategy differentiates the interactions occurring between homogeneous and heterogeneous nodes and weights the strengths of these interactions. Two sets of experiments - one with a local dataset and another with a global dataset - demonstrate that the proposed method is prior to selected baselines. In addition, to further examine the reliability of our method, we conducted a case study on recommending knowledge trajectories of selected information scientists and their research groups. The results demonstrate the empirical insights our method yields for individual researchers, communities, and research institutions in the information science discipline.

preprint2021arXiv

Butterfly: One-step Approach towards Wildly Unsupervised Domain Adaptation

In unsupervised domain adaptation (UDA), classifiers for the target domain (TD) are trained with clean labeled data from the source domain (SD) and unlabeled data from TD. However, in the wild, it is difficult to acquire a large amount of perfectly clean labeled data in SD given limited budget. Hence, we consider a new, more realistic and more challenging problem setting, where classifiers have to be trained with noisy labeled data from SD and unlabeled data from TD -- we name it wildly UDA (WUDA). We show that WUDA ruins all UDA methods if taking no care of label noise in SD, and to this end, we propose a Butterfly framework, a powerful and efficient solution to WUDA. Butterfly maintains four deep networks simultaneously, where two take care of all adaptations (i.e., noisy-to-clean, labeled-to-unlabeled, and SD-to-TD-distributional) and then the other two can focus on classification in TD. As a consequence, Butterfly possesses all the conceptually necessary components for solving WUDA. Experiments demonstrate that, under WUDA, Butterfly significantly outperforms existing baseline methods.

preprint2021arXiv

Chiral domain wall dynamics in magnetic heterostructures with bulk Dzyaloshinskii-Moriya interactions

In this work, dynamics of chiral domain walls in long and narrow magnetic heterostructures based on non-centrosymmetric chiral magnets with bulk Dzyaloshinskii-Moriya interactions (DMI) and perpendicular magnetic anisotropy is systematically investigated. The driving forces can be out-of-plane magnetic fields and in-plane currents, correspondingly both steady and precessional flows are considered. Their dividing points (the Walker critical field and current density) are obtained as functions of bulk DMI strength ($D_{\mathrm{b}}$) and the ratio ($κ$) of total (crystalline plus shape) anisotropy in the hard axis over that in the easy one. When far beyond Walker breakdown, the dependence curve of wall velocity on external in-plane bias field takes parabolic shape around the compensation point where the total in-plane field disappears. The center shift is determined by $D_{\mathrm{b}}$, $κ$, and the wall's topological charge, thus can be used to measure the bulk DMI strength in chiral magnets.

preprint2021arXiv

Clarinet: A One-step Approach Towards Budget-friendly Unsupervised Domain Adaptation

In unsupervised domain adaptation (UDA), classifiers for the target domain are trained with massive true-label data from the source domain and unlabeled data from the target domain. However, it may be difficult to collect fully-true-label data in a source domain given a limited budget. To mitigate this problem, we consider a novel problem setting where the classifier for the target domain has to be trained with complementary-label data from the source domain and unlabeled data from the target domain named budget-friendly UDA (BFUDA). The key benefit is that it is much less costly to collect complementary-label source data (required by BFUDA) than collecting the true-label source data (required by ordinary UDA). To this end, the complementary label adversarial network (CLARINET) is proposed to solve the BFUDA problem. CLARINET maintains two deep networks simultaneously, where one focuses on classifying complementary-label source data and the other takes care of the source-to-target distributional adaptation. Experiments show that CLARINET significantly outperforms a series of competent baselines.

preprint2021arXiv

Distributed Optimization with Coupling Constraints

In this paper, we develop a novel distributed algorithm for addressing convex optimization with both nonlinear inequality and linear equality constraints, where the objective function can be a general nonsmooth convex function and all the constraints can be fully coupled. Specifically, we first separate the constraints into three groups, and design two primal-dual methods and utilize a virtual-queue-based method to handle each group of the constraints independently. Then, we integrate these three methods in a strategic way, leading to an integrated primal-dual proximal (IPLUX) algorithm, and enable the distributed implementation of IPLUX. We show that IPLUX achieves an $O(1/k)$ rate of convergence in terms of optimality and feasibility, which is stronger than the convergence results of the state-of-the-art distributed algorithms for convex optimization with coupling nonlinear constraints. Finally, IPLUX exhibits competitive practical performance in the simulations.

preprint2021arXiv

Learning Deep Kernels for Non-Parametric Two-Sample Tests

We propose a class of kernel-based two-sample tests, which aim to determine whether two sets of samples are drawn from the same distribution. Our tests are constructed from kernels parameterized by deep neural nets, trained to maximize test power. These tests adapt to variations in distribution smoothness and shape over space, and are especially suited to high dimensions and complex data. By contrast, the simpler kernels used in prior kernel testing work are spatially homogeneous, and adaptive only in lengthscale. We explain how this scheme includes popular classifier-based two-sample tests as a special case, but improves on them in general. We provide the first proof of consistency for the proposed adaptation method, which applies both to kernels on deep features and to simpler radial basis kernels or multiple kernel learning. In experiments, we establish the superior performance of our deep kernels in hypothesis testing on benchmark and real-world data. The code of our deep-kernel-based two sample tests is available at https://github.com/fengliu90/DK-for-TST.

preprint2021arXiv

Multiplexed structured illumination super-resolution imaging with time-domain upconversion nanoparticles

The emerging optical multiplexing within nanoscale shows super-capacity in encoding information by using the time-domain fingerprints from uniform nanoparticles. However, the optical diffraction limit compromises the decoding throughput and accuracy of the nanoparticles during wide-field imaging. This, in turn, challenges the quality of nanoparticles to afford the modulated excitation condition, and further to retain the multiplexed optical fingerprints for super-resolution multiplexing. Here we report a tailor-made time-domain super-resolution method with the lifetime-engineered upconversion nanoparticles for multiplexing. We demonstrate that the nanoparticles are bright, uniform, and stable under structured illumination, which supports a lateral resolution of 186 nm, less than 1/4th of the excitation wavelength. We further develop a deep learning algorithm to coordinate with super-resolution images for more accurate decoding compared to a numeric algorithm. We demonstrate a three-channel sub-diffraction-limit imaging-based optical multiplexing with decoding accuracies above 93% for each channel, and larger than 60% accuracies for potential seven-channel multiplexing. The improved resolution provides high throughput by resolving the particles within the optical limit, which enables higher multiplexing capacity in space. This time-domain super-resolution multiplexing opens a new horizon for handling the growing amount of information content, diseases source, and security risk in modern society

preprint2021arXiv

PAC-Bayes Bounds for Meta-learning with Data-Dependent Prior

By leveraging experience from previous tasks, meta-learning algorithms can achieve effective fast adaptation ability when encountering new tasks. However it is unclear how the generalization property applies to new tasks. Probably approximately correct (PAC) Bayes bound theory provides a theoretical framework to analyze the generalization performance for meta-learning. We derive three novel generalisation error bounds for meta-learning based on PAC-Bayes relative entropy bound. Furthermore, using the empirical risk minimization (ERM) method, a PAC-Bayes bound for meta-learning with data-dependent prior is developed. Experiments illustrate that the proposed three PAC-Bayes bounds for meta-learning guarantee a competitive generalization performance guarantee, and the extended PAC-Bayes bound with data-dependent prior can achieve rapid convergence ability.

preprint2020arXiv

Bridging the Theoretical Bound and Deep Algorithms for Open Set Domain Adaptation

In the unsupervised open set domain adaptation (UOSDA), the target domain contains unknown classes that are not observed in the source domain. Researchers in this area aim to train a classifier to accurately: 1) recognize unknown target data (data with unknown classes) and, 2) classify other target data. To achieve this aim, a previous study has proven an upper bound of the target-domain risk, and the open set difference, as an important term in the upper bound, is used to measure the risk on unknown target data. By minimizing the upper bound, a shallow classifier can be trained to achieve the aim. However, if the classifier is very flexible (e.g., deep neural networks (DNNs)), the open set difference will converge to a negative value when minimizing the upper bound, which causes an issue where most target data are recognized as unknown data. To address this issue, we propose a new upper bound of target-domain risk for UOSDA, which includes four terms: source-domain risk, $ε$-open set difference ($Δ_ε$), a distributional discrepancy between domains, and a constant. Compared to the open set difference, $Δ_ε$ is more robust against the issue when it is being minimized, and thus we are able to use very flexible classifiers (i.e., DNNs). Then, we propose a new principle-guided deep UOSDA method that trains DNNs via minimizing the new upper bound. Specifically, source-domain risk and $Δ_ε$ are minimized by gradient descent, and the distributional discrepancy is minimized via a novel open-set conditional adversarial training strategy. Finally, compared to existing shallow and deep UOSDA methods, our method shows the state-of-the-art performance on several benchmark datasets, including digit recognition (MNIST, SVHN, USPS), object recognition (Office-31, Office-Home), and face recognition (PIE).

preprint2020arXiv

Concept Drift Detection via Equal Intensity k-means Space Partitioning

Data stream poses additional challenges to statistical classification tasks because distributions of the training and target samples may differ as time passes. Such distribution change in streaming data is called concept drift. Numerous histogram-based distribution change detection methods have been proposed to detect drift. Most histograms are developed on grid-based or tree-based space partitioning algorithms which makes the space partitions arbitrary, unexplainable, and may cause drift blind-spots. There is a need to improve the drift detection accuracy for histogram-based methods with the unsupervised setting. To address this problem, we propose a cluster-based histogram, called equal intensity k-means space partitioning (EI-kMeans). In addition, a heuristic method to improve the sensitivity of drift detection is introduced. The fundamental idea of improving the sensitivity is to minimize the risk of creating partitions in distribution offset regions. Pearson's chi-square test is used as the statistical hypothesis test so that the test statistics remain independent of the sample distribution. The number of bins and their shapes, which strongly influence the ability to detect drift, are determined dynamically from the sample based on an asymptotic constraint in the chi-square test. Accordingly, three algorithms are developed to implement concept drift detection, including a greedy centroids initialization algorithm, a cluster amplify-shrink algorithm, and a drift detection algorithm. For drift adaptation, we recommend retraining the learner if a drift is detected. The results of experiments on synthetic and real-world datasets demonstrate the advantages of EI-kMeans and show its efficacy in detecting concept drift.

preprint2020arXiv

Concept Drift Detection: Dealing with MissingValues via Fuzzy Distance Estimations

In data streams, the data distribution of arriving observations at different time points may change - a phenomenon called concept drift. While detecting concept drift is a relatively mature area of study, solutions to the uncertainty introduced by observations with missing values have only been studied in isolation. No one has yet explored whether or how these solutions might impact drift detection performance. We, however, believe that data imputation methods may actually increase uncertainty in the data rather than reducing it. We also conjecture that imputation can introduce bias into the process of estimating distribution changes during drift detection, which can make it more difficult to train a learning model. Our idea is to focus on estimating the distance between observations rather than estimating the missing values, and to define membership functions that allocate observations to histogram bins according to the estimation errors. Our solution comprises a novel masked distance learning (MDL) algorithm to reduce the cumulative errors caused by iteratively estimating each missing value in an observation and a fuzzy-weighted frequency (FWF) method for identifying discrepancies in the data distribution. The concept drift detection algorithm proposed in this paper is a singular and unified algorithm that can handle missing values, but not an imputation algorithm combined with a concept drift detection algorithm. Experiments on both synthetic and real-world data sets demonstrate the advantages of this method and show its robustness in detecting drift in data with missing values. These findings reveal that missing values exert a profound impact on concept drift detection, but using fuzzy set theory to model observations can produce more reliable results than imputation.

preprint2020arXiv

Diverse Instances-Weighting Ensemble based on Region Drift Disagreement for Concept Drift Adaptation

Concept drift refers to changes in the distribution of underlying data and is an inherent property of evolving data streams. Ensemble learning, with dynamic classifiers, has proved to be an efficient method of handling concept drift. However, the best way to create and maintain ensemble diversity with evolving streams is still a challenging problem. In contrast to estimating diversity via inputs, outputs, or classifier parameters, we propose a diversity measurement based on whether the ensemble members agree on the probability of a regional distribution change. In our method, estimations over regional distribution changes are used as instance weights. Constructing different region sets through different schemes will lead to different drift estimation results, thereby creating diversity. The classifiers that disagree the most are selected to maximize diversity. Accordingly, an instance-based ensemble learning algorithm, called the diverse instance weighting ensemble (DiwE), is developed to address concept drift for data stream classification problems. Evaluations of various synthetic and real-world data stream benchmarks show the effectiveness and advantages of the proposed algorithm.

preprint2020arXiv

Heterogeneous domain adaptation: An unsupervised approach

Domain adaptation leverages the knowledge in one domain - the source domain - to improve learning efficiency in another domain - the target domain. Existing heterogeneous domain adaptation research is relatively well-progressed, but only in situations where the target domain contains at least a few labeled instances. In contrast, heterogeneous domain adaptation with an unlabeled target domain has not been well-studied. To contribute to the research in this emerging field, this paper presents: (1) an unsupervised knowledge transfer theorem that guarantees the correctness of transferring knowledge; and (2) a principal angle-based metric to measure the distance between two pairs of domains: one pair comprises the original source and target domains and the other pair comprises two homogeneous representations of two domains. The theorem and the metric have been implemented in an innovative transfer model, called a Grassmann-Linear monotonic maps-geodesic flow kernel (GLG), that is specifically designed for heterogeneous unsupervised domain adaptation (HeUDA). The linear monotonic maps meet the conditions of the theorem and are used to construct homogeneous representations of the heterogeneous domains. The metric shows the extent to which the homogeneous representations have preserved the information in the original source and target domains. By minimizing the proposed metric, the GLG model learns the homogeneous representations of heterogeneous domains and transfers knowledge through these learned representations via a geodesic flow kernel. To evaluate the model, five public datasets were reorganized into ten HeUDA tasks across three applications: cancer detection, credit assessment, and text classification. The experiments demonstrate that the proposed model delivers superior performance over the existing baselines.

preprint2020arXiv

How does the Combined Risk Affect the Performance of Unsupervised Domain Adaptation Approaches?

Unsupervised domain adaptation (UDA) aims to train a target classifier with labeled samples from the source domain and unlabeled samples from the target domain. Classical UDA learning bounds show that target risk is upper bounded by three terms: source risk, distribution discrepancy, and combined risk. Based on the assumption that the combined risk is a small fixed value, methods based on this bound train a target classifier by only minimizing estimators of the source risk and the distribution discrepancy. However, the combined risk may increase when minimizing both estimators, which makes the target risk uncontrollable. Hence the target classifier cannot achieve ideal performance if we fail to control the combined risk. To control the combined risk, the key challenge takes root in the unavailability of the labeled samples in the target domain. To address this key challenge, we propose a method named E-MixNet. E-MixNet employs enhanced mixup, a generic vicinal distribution, on the labeled source samples and pseudo-labeled target samples to calculate a proxy of the combined risk. Experiments show that the proxy can effectively curb the increase of the combined risk when minimizing the source risk and distribution discrepancy. Furthermore, we show that if the proxy of the combined risk is added into loss functions of four representative UDA methods, their performance is also improved.

preprint2020arXiv

Interactive Steering of Hierarchical Clustering

Hierarchical clustering is an important technique to organize big data for exploratory data analysis. However, existing one-size-fits-all hierarchical clustering methods often fail to meet the diverse needs of different users. To address this challenge, we present an interactive steering method to visually supervise constrained hierarchical clustering by utilizing both public knowledge (e.g., Wikipedia) and private knowledge from users. The novelty of our approach includes 1) automatically constructing constraints for hierarchical clustering using knowledge (knowledge-driven) and intrinsic data distribution (data-driven), and 2) enabling the interactive steering of clustering through a visual interface (user-driven). Our method first maps each data item to the most relevant items in a knowledge base. An initial constraint tree is then extracted using the ant colony optimization algorithm. The algorithm balances the tree width and depth and covers the data items with high confidence. Given the constraint tree, the data items are hierarchically clustered using evolutionary Bayesian rose tree. To clearly convey the hierarchical clustering results, an uncertainty-aware tree visualization has been developed to enable users to quickly locate the most uncertain sub-hierarchies and interactively improve them. The quantitative evaluation and case study demonstrate that the proposed approach facilitates the building of customized clustering trees in an efficient and effective manner.

preprint2020arXiv

Learning from a Complementary-label Source Domain: Theory and Algorithms

In unsupervised domain adaptation (UDA), a classifier for the target domain is trained with massive true-label data from the source domain and unlabeled data from the target domain. However, collecting fully-true-label data in the source domain is high-cost and sometimes impossible. Compared to the true labels, a complementary label specifies a class that a pattern does not belong to, hence collecting complementary labels would be less laborious than collecting true labels. Thus, in this paper, we propose a novel setting that the source domain is composed of complementary-label data, and a theoretical bound for it is first proved. We consider two cases of this setting, one is that the source domain only contains complementary-label data (completely complementary unsupervised domain adaptation, CC-UDA), and the other is that the source domain has plenty of complementary-label data and a small amount of true-label data (partly complementary unsupervised domain adaptation, PC-UDA). To this end, a complementary label adversarial network} (CLARINET) is proposed to solve CC-UDA and PC-UDA problems. CLARINET maintains two deep networks simultaneously, where one focuses on classifying complementary-label source data and the other takes care of source-to-target distributional adaptation. Experiments show that CLARINET significantly outperforms a series of competent baselines on handwritten-digits-recognition and objects-recognition tasks.

preprint2020arXiv

Learning under Concept Drift: A Review

Concept drift describes unforeseeable changes in the underlying distribution of streaming data over time. Concept drift research involves the development of methodologies and techniques for drift detection, understanding and adaptation. Data analysis has revealed that machine learning in a concept drift environment will result in poor learning results if the drift is not addressed. To help researchers identify which research topics are significant and how to apply related techniques in data analysis tasks, it is necessary that a high quality, instructive review of current research developments and trends in the concept drift field is conducted. In addition, due to the rapid development of concept drift in recent years, the methodologies of learning under concept drift have become noticeably systematic, unveiling a framework which has not been mentioned in literature. This paper reviews over 130 high quality publications in concept drift related research areas, analyzes up-to-date developments in methodologies and techniques, and establishes a framework of learning under concept drift including three main components: concept drift detection, concept drift understanding, and concept drift adaptation. This paper lists and discusses 10 popular synthetic datasets and 14 publicly available benchmark datasets used for evaluating the performance of learning algorithms aiming at handling concept drift. Also, concept drift related research directions are covered and discussed. By providing state-of-the-art knowledge, this survey will directly support researchers in their understanding of research developments in the field of learning under concept drift.

preprint2020arXiv

Survey of 360$^{\circ}$ domain walls in magnetic heterostructures: topology, chirality and current-driven dynamics

Chirality and current-driven dynamics of topologically nontrivial 360$^{\circ}$ domain walls (360DWs) in magnetic heterostructures (MHs) are systematically investigated. For MHs with normal substrates, the static 360DWs are Néel-type with no chirality. While for those with heavy-metal substrates, the interfacial Dzyaloshinskii-Moriya interaction (iDMI) therein makes 360DWs prefer specific chirality. Under in-plane driving charge currents, as the direct result of "full-circle" topology a certain 360DW does not undergo the "Walker breakdown"-type process like a well-studied 180$^{\circ}$ domain wall as the current density increases. Alternatively, it keeps a fixed propagating mode (either steady-flow or precessional-flow, depending on the effective damping constant of the MH) until it collapses or changes to other types of solition when the current density becomes too high. Similarly, the field-like spin-orbit torque (SOT) has no effects on the dynamics of 360DWs, while the anti-damping SOT has. For both modes, modifications to the mobility of 360DWs by iDMI and anti-damping SOT are provided.

preprint2020arXiv

Symmetry breaking induced magnon-magnon coupling in synthetic antiferromagnets

We propose a general theory of microwave absorption spectroscopy for symmetry-breaking synthetic antiferromagnets (SAFs). Generally, inhomogeneity or different thickness of the two ferromagnetic sublayers of a SAF results in the intrinsic symmetry breaking, while out-of-plane components of dc magnetic fields lead to the extrinsic one. The broken symmetry of SAFs excludes the original symmetry-protected crossing between pure in-phase and out-of-phase resonance modes with opposite parity. Alternatively, new frequency branches become hybridization of original bare modes in terms of symmetry-breaking-induced magnon-magnon coupling, which results in an indirect gap in ferromagnetic resonance frequencies. Also, the dependence of gap width on the degree of symmetry breaking for several typical cases are presented and compared with existing experiments. Our theory provides a simple but physical understanding on the rich structure of ferromagnetic resonance spectra for asymmetric SAFs.

preprint2019arXiv

Determination of the Dzyaloshinskii-Moriya interactions

Using in-plane field dependence of the precessional flow of chiral domain walls (DWs) to simultaneously determine bulk and interfacial Dzyaloshinskii-Moriya interactions (DMIs) is proposed. It is found that effective fields of bulk and interfacial DMIs have respectively transverse and longitudinal components that affect differently the motion of chiral DWs in magnetic narrow heterostructure strips. The in-plane field dependence of DW velocity has a dome-shape or a canyon-shape, depending on whether the driving force is an in-plane current or an out-of-plane magnetic field. The responses of their center shifts to the reversal of topological wall charge and current/field direction uniquely determine the nature and strength of DMI therein. Operable procedures are proposed and applied to explain existing experimental data.

preprint2016arXiv

Statics and field-driven dynamics of transverse domain walls in biaxial nanowires under uniform transverse magnetic fields

In this work, we report analytical results on transverse domain wall (TDW) statics and field-driven dynamics in quasi one-dimensional biaxial nanowires under arbitrary uniform transverse magnetic fields (TMFs) based on the Landau-Lifshitz-Gilbert equation. Without axial driving fields, the static TDW should be symmetric about its center meanwhile twisted in its azimuthal angle distribution. By decoupling of polar and azimuthal degrees of freedom, an approximate solution is provided which reproduces these features to a great extent. When an axial driving field is applied, the dynamical behavior of a TDW is viewed as the response of its static profile to external excitations. By means of the asymptotic expansion method, the TDW velocity in traveling-wave mode is obtained, which provides the extent and boundary of the "velocity-enhancement" effect of TMFs to TDWs in biaxial nanowires. Finally numerical simulations are performed and strongly support our analytics.

preprint2015arXiv

Convergence analysis of approximate primal solutions in dual first-order methods

Dual first-order methods are powerful techniques for large-scale convex optimization. Although an extensive research effort has been devoted to studying their convergence properties, explicit convergence rates for the primal iterates have only been established under global Lipschitz continuity of the dual gradient. This is a rather restrictive assumption that does not hold for several important classes of problems. In this paper, we demonstrate that primal convergence rate guarantees can also be obtained when the dual gradient is only locally Lipschitz. The class of problems that we analyze admits general convex constraints including nonlinear inequality, linear equality, and set constraints. As an approximate primal solution, we take the minimizer of the Lagrangian, computed when evaluating the dual gradient. We derive error bounds for this approximate primal solution in terms of the errors of the dual variables, and establish convergence rates of the dual variables when the dual problem is solved using a projected gradient or fast gradient method. By combining these results, we show that the suboptimality and infeasibility of the approximate primal solution at iteration $k$ are no worse than $O(1/\sqrt{k})$ when the dual problem is solved using a projected gradient method, and $O(1/k)$ when a fast dual gradient method is used.

preprint2015arXiv

Dependent Indian Buffet Process-based Sparse Nonparametric Nonnegative Matrix Factorization

Nonnegative Matrix Factorization (NMF) aims to factorize a matrix into two optimized nonnegative matrices appropriate for the intended applications. The method has been widely used for unsupervised learning tasks, including recommender systems (rating matrix of users by items) and document clustering (weighting matrix of papers by keywords). However, traditional NMF methods typically assume the number of latent factors (i.e., dimensionality of the loading matrices) to be fixed. This assumption makes them inflexible for many applications. In this paper, we propose a nonparametric NMF framework to mitigate this issue by using dependent Indian Buffet Processes (dIBP). In a nutshell, we apply a correlation function for the generation of two stick weights associated with each pair of columns of loading matrices, while still maintaining their respective marginal distribution specified by IBP. As a consequence, the generation of two loading matrices will be column-wise (indirectly) correlated. Under this same framework, two classes of correlation function are proposed (1) using Bivariate beta distribution and (2) using Copula function. Both methods allow us to adopt our work for various applications by flexibly choosing an appropriate parameter settings. Compared with the other state-of-the art approaches in this area, such as using Gaussian Process (GP)-based dIBP, our work is seen to be much more flexible in terms of allowing the two corresponding binary matrix columns to have greater variations in their non-zero entries. Our experiments on the real-world and synthetic datasets show that three proposed models perform well on the document clustering task comparing standard NMF without predefining the dimension for the factor matrices, and the Bivariate beta distribution-based and Copula-based models have better flexibility than the GP-based model.

preprint2015arXiv

Flavour-changing top decays in the aligned two-Higgs-doublet model

We perform a complete one-loop computation of the two-body flavour-changing top decays $t \rightarrow c h$ and $t \rightarrow c V$ ($V= γ, Z$), within the aligned two-Higgs-doublet model. We evaluate the impact of the model parameters on the associated branching ratios, taking into account constraints from flavour data and measurements of the Higgs properties. Assuming that the $125$~GeV Higgs corresponds to the lightest CP-even scalar of the CP-conserving aligned two-Higgs-doublet model, we find that the rates for such flavour-changing top decays lie below the expected sensitivity of the future high-luminosity phase of the LHC. Measurements of the Higgs signal strength in the di-photon channel are found to play an important role in limiting the size of the $t \rightarrow c h$ decay rate when the charged scalar of the model is light.

preprint2015arXiv

Infinite Author Topic Model based on Mixed Gamma-Negative Binomial Process

Incorporating the side information of text corpus, i.e., authors, time stamps, and emotional tags, into the traditional text mining models has gained significant interests in the area of information retrieval, statistical natural language processing, and machine learning. One branch of these works is the so-called Author Topic Model (ATM), which incorporates the authors's interests as side information into the classical topic model. However, the existing ATM needs to predefine the number of topics, which is difficult and inappropriate in many real-world settings. In this paper, we propose an Infinite Author Topic (IAT) model to resolve this issue. Instead of assigning a discrete probability on fixed number of topics, we use a stochastic process to determine the number of topics from the data itself. To be specific, we extend a gamma-negative binomial process to three levels in order to capture the author-document-keyword hierarchical structure. Furthermore, each document is assigned a mixed gamma process that accounts for the multi-author's contribution towards this document. An efficient Gibbs sampling inference algorithm with each conditional distribution being closed-form is developed for the IAT model. Experiments on several real-world datasets show the capabilities of our IAT model to learn the hidden topics, authors' interests on these topics and the number of topics simultaneously.

preprint2015arXiv

Nonnegative Multi-level Network Factorization for Latent Factor Analysis

Nonnegative Matrix Factorization (NMF) aims to factorize a matrix into two optimized nonnegative matrices and has been widely used for unsupervised learning tasks such as product recommendation based on a rating matrix. However, although networks between nodes with the same nature exist, standard NMF overlooks them, e.g., the social network between users. This problem leads to comparatively low recommendation accuracy because these networks are also reflections of the nature of the nodes, such as the preferences of users in a social network. Also, social networks, as complex networks, have many different structures. Each structure is a composition of links between nodes and reflects the nature of nodes, so retaining the different network structures will lead to differences in recommendation performance. To investigate the impact of these network structures on the factorization, this paper proposes four multi-level network factorization algorithms based on the standard NMF, which integrates the vertical network (e.g., rating matrix) with the structures of horizontal network (e.g., user social network). These algorithms are carefully designed with corresponding convergence proofs to retain four desired network structures. Experiments on synthetic data show that the proposed algorithms are able to preserve the desired network structures as designed. Experiments on real-world data show that considering the horizontal networks improves the accuracy of document clustering and recommendation with standard NMF, and various structures show their differences in performance on these two tasks. These results can be directly used in document clustering and recommendation systems.

preprint2015arXiv

Nonparametric Relational Topic Models through Dependent Gamma Processes

Traditional Relational Topic Models provide a way to discover the hidden topics from a document network. Many theoretical and practical tasks, such as dimensional reduction, document clustering, link prediction, benefit from this revealed knowledge. However, existing relational topic models are based on an assumption that the number of hidden topics is known in advance, and this is impractical in many real-world applications. Therefore, in order to relax this assumption, we propose a nonparametric relational topic model in this paper. Instead of using fixed-dimensional probability distributions in its generative model, we use stochastic processes. Specifically, a gamma process is assigned to each document, which represents the topic interest of this document. Although this method provides an elegant solution, it brings additional challenges when mathematically modeling the inherent network structure of typical document network, i.e., two spatially closer documents tend to have more similar topics. Furthermore, we require that the topics are shared by all the documents. In order to resolve these challenges, we use a subsampling strategy to assign each document a different gamma process from the global gamma process, and the subsampling probabilities of documents are assigned with a Markov Random Field constraint that inherits the document network structure. Through the designed posterior inference algorithm, we can discover the hidden topics and its number simultaneously. Experimental results on both synthetic and real-world network datasets demonstrate the capabilities of learning the hidden topics and, more importantly, the number of topics.

preprint2014arXiv

$B_{s,d}^0 \to \ell^+\ell^-$ Decays in the Aligned Two-Higgs-Doublet Model

The rare decays $B_{s,d}^0 \to \ell^+\ell^-$ are analyzed within the general framework of the aligned two-Higgs doublet model. We present a complete one-loop calculation of the relevant short-distance Wilson coefficients, giving a detailed technical summary of our results and comparing them with previous calculations performed in particular limits or approximations. We investigate the impact of various model parameters on the branching ratios and study the phenomenological constraints imposed by present data.

preprint2014arXiv

$B_{s,d}^0 \to \ell^+\ell^-$ Decays in Two-Higgs Doublet Models

We study the rare leptonic decays $B_{s,d}^0 \to \ell^+\ell^-$ within the general framework of the aligned two-Higgs doublet model. A complete one-loop calculation of the relevant short-distance Wilson coefficients is presented, with a detailed technical summary of the results. The phenomenological constraints imposed by present data on the model parameters are also investigated.

preprint2014arXiv

Constraining General 2HDM by the Evolution of Yukawa Couplings

We analysis the constraints of the general two Higgs doublet models via evolving the Yukawa coupling constants to high energy under renormalization group. We consider the appearance of a Landau pole or large off-diagonal Yukawa couplings which cause tree level flavour changing neutral currents. Our study shows the latter condition can be used to answer that how much $Z_2$ symmetry breaking can be allowed in a given 2HDM model.

preprint2014arXiv

Effective Theories for QCD-like at TeV Scale

We study the Effective Field Theory of three QCD-like theories, which can be classified by having quarks in a complex, real or pseudo-real representations of the gauge group. The Lagrangians are written in a very similar way so that the calculations can be done using techniques from Chiral Perturbation Theory (ChPT). We calculated the vacuum-expectation-value, the mass and the decay constant of pseudo-Goldstone Bosons up to next-to-next-to leading order (NNLO) \cite{Bijnens:2009qm}. The various channels of general $n$ flavour meson-meson scattering of the three theories are systematically studied and calculated up to NNLO \cite{Bijnens:2011fm}. We also calculated the vector, axial-vector, scalar, pseudo-scalar two-point functions and pseudo-scalar decay constant up NNLO order \cite{Bijnens:2011xt}. The analytic expressions of the S parameter for the three different QCD-like theories are obtained at TeV scale. Our results are useful for chiral extrapolation in lattice calculation on theory of strong dynamical and finite baryon density.

preprint2012arXiv

Constraining General Two Higgs Doublet Models by the Evolution of Yukawa Couplings

We study how general two Higgs doublet models can be constrained by considering their properties under renormalization group evolution of the Yukawa couplings. We take into account both the appearance of a Landau pole as well as off-diagonal Yukawa couplings leading to flavour changing neutral currents in violation with experimental constraints at the electroweak scale. We find that the latter condition can be used to limit the amount of Z2 symmetry breaking allowed in a given model.

preprint2012arXiv

Quantum percolation in quantum spin Hall antidot systems

We study the influences of antidot-induced bound states on transport properties of two- dimensional quantum spin Hall insulators. The bound statesare found able to induce quantum percolation in the originally insulating bulk. At some critical antidot densities, the quantum spin Hall phase can be completely destroyed due to the maximum quantum percolation. For systems with periodic boundaries, the maximum quantum percolationbetween the bound states creates intermediate extended states in the bulk which is originally gapped and insulating. The antidot in- duced bound states plays the same role as the magnetic field inthe quantum Hall effect, both makes electrons go into cyclotron motions. We also draw an analogy between the quantum percolation phenomena in this system and that in the network models of quantum Hall effect.

preprint2011arXiv

Gossip Algorithms for Convex Consensus Optimization over Networks

In many applications, nodes in a network desire not only a consensus, but an optimal one. To date, a family of subgradient algorithms have been proposed to solve this problem under general convexity assumptions. This paper shows that, for the scalar case and by assuming a bit more, novel non-gradient-based algorithms with appealing features can be constructed. Specifically, we develop Pairwise Equalizing (PE) and Pairwise Bisectioning (PB), two gossip algorithms that solve unconstrained, separable, convex consensus optimization problems over undirected networks with time-varying topologies, where each local function is strictly convex, continuously differentiable, and has a minimizer. We show that PE and PB are easy to implement, bypass limitations of the subgradient algorithms, and produce switched, nonlinear, networked dynamical systems that admit a common Lyapunov function and asymptotically converge. Moreover, PE generalizes the well-known Pairwise Averaging and Randomized Gossip Algorithm, while PB relaxes a requirement of PE, allowing nodes to never share their local functions.

preprint2011arXiv

Meson-meson Scattering in QCD-like Theories

We discuss meson-meson scattering at next-to-next-to-leading order in the chiral expansion for QCD-like theories with general $n$ degenerate flavours for the cases with a complex, real and pseudo-real representation. I.e. with global symmetry and breaking pattern $SU(n)_L\times SU(n)_R\to SU(n)_V$, $SU(2n)\to SO(2n)$ and $SU(2n)\to Sp(2n)$. We obtain fully analytical expressions for all these cases. We discuss the general structure of the amplitude and the structure of the possible intermediate channels for all three cases. We derive the expressions for the lowest partial wave scattering length in each channel and present some representative numerical results. We also show various relations between the different cases in the limit of large $n$.

preprint2011arXiv

Surface and Edge States in Topological Semi-metals

We study the topologically non-trivial semi-metals by means of the 6-band Kane model. Existence of surface states is explicitly demonstrated by calculating the LDOS on the material surface. In the strain free condition, surface states are divided into two parts in the energy spectrum, one part is in the direct gap, the other part including the crossing point of surface state Dirac cone is submerged in the valence band. We also show how uni-axial strain induces an insulating band gap and raises the crossing point from the valence band into the band gap, making the system a true topological insulator. We predict existence of helical edge states and spin Hall effect in the thin film topological semi-metals, which could be tested with future experiment. Disorder is found to significantly enhance the spin Hall effect in the valence band of the thin films.

preprint2011arXiv

Two-Point Functions and S-Parameter in QCD-like Theories

We calculated the vector, axial-vector, scalar and pseudo-scalar two-point functions up to two-loop level in the low-energy effective field theory for three different QCD-like theories. In addition we also calculated the pseudo-scalar decay constant $G_M$. The QCD-like theories we used are those with fermions in a complex, real or pseudo-real representation with in general n flavours. These case correspond to global symmetry breaking pattern of $SU(n)_L\times SU(n)_R\to SU(n)_V$, $SU(2n)\to SO(2n)$ or $SU(2n)\to Sp(2n)$. We also estimated the S parameter for those different theories.

preprint2011arXiv

Zero-Gradient-Sum Algorithms for Distributed Convex Optimization: The Continuous-Time Case

This paper presents a set of continuous-time distributed algorithms that solve unconstrained, separable, convex optimization problems over undirected networks with fixed topologies. The algorithms are developed using a Lyapunov function candidate that exploits convexity, and are called Zero-Gradient-Sum (ZGS) algorithms as they yield nonlinear networked dynamical systems that evolve invariantly on a zero-gradient-sum manifold and converge asymptotically to the unknown optimizer. We also describe a systematic way to construct ZGS algorithms, show that a subset of them actually converge exponentially, and obtain lower and upper bounds on their convergence rates in terms of the network topologies, problem characteristics, and algorithm parameters, including the algebraic connectivity, Laplacian spectral radius, and function curvatures. The findings of this paper may be regarded as a natural generalization of several well-known algorithms and results for distributed consensus, to distributed convex optimization.

preprint2010arXiv

Controlled Hopwise Averaging: Bandwidth/Energy-Efficient Asynchronous Distributed Averaging for Wireless Networks

This paper addresses the problem of averaging numbers across a wireless network from an important, but largely neglected, viewpoint: bandwidth/energy efficiency. We show that existing distributed averaging schemes have several drawbacks and are inefficient, producing networked dynamical systems that evolve with wasteful communications. Motivated by this, we develop Controlled Hopwise Averaging (CHA), a distributed asynchronous algorithm that attempts to "make the most" out of each iteration by fully exploiting the broadcast nature of wireless medium and enabling control of when to initiate an iteration. We show that CHA admits a common quadratic Lyapunov function for analysis, derive bounds on its exponential convergence rate, and show that they outperform the convergence rate of Pairwise Averaging for some common graphs. We also introduce a new way to apply Lyapunov stability theory, using the Lyapunov function to perform greedy, decentralized, feedback iteration control. Finally, through extensive simulation on random geometric graphs, we show that CHA is substantially more efficient than several existing schemes, requiring far fewer transmissions to complete an averaging task.

preprint2010arXiv

Z2 invariant protected bound states in topological insulators

We present an exact solution of a modifed Dirac equation for topological insulator in the presence of a hole or vacancy to demonstrate that vacancies may induce bound states in the band gap of topological insulators. They arise due to the Z_2 classification of time-reversal invariant insulators, thus are also topologically-protected like the edge states in the quantum spin Hall effect and the surface states in three-dimensional topological insulators. Coexistence of the in-gap bound states and the edge or surface states in topological insulators suggests that imperfections may affect transport properties of topological insulators via additional bound states near the system boundary.

preprint2009arXiv

Technicolor and other QCD-like theories at next-to-next-to-leading order

We calculate the vacuum-expectation-value, the meson mass and the meson decay constant to next-to-next-to-leading-order in the chiral expansion for QCD-like theories with general $N_F$ degenerate flavours for the cases with a complex representation, a real and a pseudoreal representation, i.e. with global symmetry and breaking patters $SU(N_F)_L\times SU(N_F)_R\to SU(N_F)_V$, $SU(2N_F)\to SO(2N)$ and $SU(2N_F)\to Sp(2N_F)$. These calculations should be useful for lattice calculations for dynamical electroweak symmetry breaking and related cases.

Jie Lu

What is connected

Connect this record

See the researcher in context

Building this map preview

47 published item(s)

Autonomous Drift Learning in Data Streams: A Unified Perspective

A Deep Reinforcement Learning Approach to Efficient Distributed Optimization

Online Boosting Adaptive Learning under Concept Drift for Multistream Classification

Fast Evolution of Single Qubit Gate in Non-Adiabatic Geometric Quantum Computing

Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data

Multi-class Classification with Fuzzy-feature Observations: Theory and Algorithms

Stepping beyond your comfort zone: Diffusion-based network analytics for knowledge trajectory recommendation

Butterfly: One-step Approach towards Wildly Unsupervised Domain Adaptation

Chiral domain wall dynamics in magnetic heterostructures with bulk Dzyaloshinskii-Moriya interactions

Clarinet: A One-step Approach Towards Budget-friendly Unsupervised Domain Adaptation

Distributed Optimization with Coupling Constraints

Learning Deep Kernels for Non-Parametric Two-Sample Tests

Multiplexed structured illumination super-resolution imaging with time-domain upconversion nanoparticles

PAC-Bayes Bounds for Meta-learning with Data-Dependent Prior

Bridging the Theoretical Bound and Deep Algorithms for Open Set Domain Adaptation

Concept Drift Detection via Equal Intensity k-means Space Partitioning

Concept Drift Detection: Dealing with MissingValues via Fuzzy Distance Estimations

Diverse Instances-Weighting Ensemble based on Region Drift Disagreement for Concept Drift Adaptation

Heterogeneous domain adaptation: An unsupervised approach

How does the Combined Risk Affect the Performance of Unsupervised Domain Adaptation Approaches?

Interactive Steering of Hierarchical Clustering

Learning from a Complementary-label Source Domain: Theory and Algorithms

Learning under Concept Drift: A Review

Survey of 360$^{\circ}$ domain walls in magnetic heterostructures: topology, chirality and current-driven dynamics

Symmetry breaking induced magnon-magnon coupling in synthetic antiferromagnets

Determination of the Dzyaloshinskii-Moriya interactions

Statics and field-driven dynamics of transverse domain walls in biaxial nanowires under uniform transverse magnetic fields

Convergence analysis of approximate primal solutions in dual first-order methods

Dependent Indian Buffet Process-based Sparse Nonparametric Nonnegative Matrix Factorization

Flavour-changing top decays in the aligned two-Higgs-doublet model

Infinite Author Topic Model based on Mixed Gamma-Negative Binomial Process

Nonnegative Multi-level Network Factorization for Latent Factor Analysis

Nonparametric Relational Topic Models through Dependent Gamma Processes

$B_{s,d}^0 \to \ell^+\ell^-$ Decays in the Aligned Two-Higgs-Doublet Model

$B_{s,d}^0 \to \ell^+\ell^-$ Decays in Two-Higgs Doublet Models

Constraining General 2HDM by the Evolution of Yukawa Couplings

Effective Theories for QCD-like at TeV Scale

Constraining General Two Higgs Doublet Models by the Evolution of Yukawa Couplings

Quantum percolation in quantum spin Hall antidot systems

Gossip Algorithms for Convex Consensus Optimization over Networks

Meson-meson Scattering in QCD-like Theories

Surface and Edge States in Topological Semi-metals

Two-Point Functions and S-Parameter in QCD-like Theories

Zero-Gradient-Sum Algorithms for Distributed Convex Optimization: The Continuous-Time Case

Controlled Hopwise Averaging: Bandwidth/Energy-Efficient Asynchronous Distributed Averaging for Wireless Networks

Z2 invariant protected bound states in topological insulators

Technicolor and other QCD-like theories at next-to-next-to-leading order