Source author record

Yi Zhou

Yi Zhou appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

131works

51topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

SemEval-2026 Task 7: Everyday Knowledge Across Diverse Languages and Cultures

We present our shared task on evaluating the adaptability of LLMs and NLP systems across multiple languages and cultures. The task data consist of an extended version of our manually constructed BLEnD benchmark (Myung et al. 2024), covering more than 30 language-culture pairs, predominantly representing low-resource languages spoken across multiple continents. As the task is designed strictly for evaluation, participants were not permitted to use the data for training, fine-tuning, few-shot learning, or any other form of model modification. Our task includes two tracks: (a) Short-Answer Questions (SAQ) and (b) Multiple-Choice Questions (MCQ). Participants were required to predict labels and were allowed to submit any NLP system and adopt diverse modelling strategies, provided that the benchmark was used solely for evaluation. The task attracted more than 140 registered participants, and we received final submissions from 62 teams, along with 19 system description papers. We report the results and present an analysis of the best-performing systems and the most commonly adopted approaches. Furthermore, we discuss shared insights into open questions and challenges related to evaluation, misalignment, and methodological perspectives on model behaviour in low-resource languages and for under-represented cultures.

preprint2024arXiv

A Pure Integral-Type PLL with a Damping Branch to Enhance the Stability of Grid-Tied Inverter under Weak Grids

In a phase-locked loop (PLL) synchronized inverter, due to the strong nonlinear coupling between the PLL's parame-ters and the operation power angle, the equivalent damping coefficient will quickly deteriorate while the power angle is close to 90° under an ultra-weak grid, which causes the synchronous instability. To address this issue, in this letter, a pure integral-type phase-locked loop (IPLL) with a damping branch is proposed to replace the traditional PI-type PLL. The equivalent damping coefficient of an IPLL-synchronized inverter is decoupled with the steady-state power angle. As a result, the IPLL-synchronized inverter can stably operate under an ultra-weak grid when the equilibrium point exists. Finally, time-domain simulation results verify the effectiveness and correctness of the proposed IPLL.

preprint2024arXiv

On Unbalanced Optimal Transport: Gradient Methods, Sparsity and Approximation Error

We study the Unbalanced Optimal Transport (UOT) between two measures of possibly different masses with at most $n$ components, where the marginal constraints of standard Optimal Transport (OT) are relaxed via Kullback-Leibler divergence with regularization factor $τ$. Although only Sinkhorn-based UOT solvers have been analyzed in the literature with the iteration complexity of ${O}\big(\tfrac{τ\log(n)}{\varepsilon} \log\big(\tfrac{\log(n)}{\varepsilon}\big)\big)$ and per-iteration cost of $O(n^2)$ for achieving the desired error $\varepsilon$, their positively dense output transportation plans strongly hinder the practicality. On the other hand, while being vastly used as heuristics for computing UOT in modern deep learning applications and having shown success in sparse OT problem, gradient methods applied to UOT have not been formally studied. In this paper, we propose a novel algorithm based on Gradient Extrapolation Method (GEM-UOT) to find an $\varepsilon$-approximate solution to the UOT problem in $O\big( κ\log\big(\frac{τn}{\varepsilon}\big) \big)$ iterations with $\widetilde{O}(n^2)$ per-iteration cost, where $κ$ is the condition number depending on only the two input measures. Our proof technique is based on a novel dual formulation of the squared $\ell_2$-norm UOT objective, which fills the lack of sparse UOT literature and also leads to a new characterization of approximation error between UOT and OT. To this end, we further present a novel approach of OT retrieval from UOT, which is based on GEM-UOT with fine tuned $τ$ and a post-process projection step. Extensive experiments on synthetic and real datasets validate our theories and demonstrate the favorable performance of our methods in practice.

preprint2023arXiv

Extended Load Flexibility of Utility-Scale P2H Plants: Optimal Production Scheduling Considering Dynamic Thermal and HTO Impurity Effects

In the conversion toward a clear and sustainable energy system, the flexibility of power-to-hydrogen (P2H) production enables the admittance of volatile renewable energies on a utility scale and provides the connected electrical power system with ancillary services. To extend the load flexibility and thus improve the profitability of green hydrogen production, this paper presents an optimal production scheduling approach for utility-scale P2H plants composed of multiple alkaline electrolyzers. Unlike existing works, this work discards the conservative constant steady-state constraints and first leverages the dynamic thermal and hydrogen-to-oxygen (HTO) impurity crossover processes of electrolyzers. Doing this optimizes their effects on the loading range and energy conversion efficiency, therefore improving the load flexibility of P2H production. The proposed multiphysics-aware scheduling model is formulated as mixed-integer linear programming (MILP). It coordinates the electrolyzers' operation state transitions and load allocation subject to comprehensive thermodynamic and mass transfer constraints. A decomposition-based solution method, SDM-GS-ALM, is followingly adopted to address the scalability issue for scheduling large-scale P2H plants composed of tens of electrolyzers. With an experiment-verified dynamic electrolyzer model, case studies up to 22 electrolyzers show that the proposed method remarkably improves the hydrogen output and profit of P2H production powered by either solar or wind energy compared to the existing scheduling approach.

preprint2022arXiv

A Fast and Convergent Proximal Algorithm for Regularized Nonconvex and Nonsmooth Bi-level Optimization

Many important machine learning applications involve regularized nonconvex bi-level optimization. However, the existing gradient-based bi-level optimization algorithms cannot handle nonconvex or nonsmooth regularizers, and they suffer from a high computation complexity in nonconvex bi-level optimization. In this work, we study a proximal gradient-type algorithm that adopts the approximate implicit differentiation (AID) scheme for nonconvex bi-level optimization with possibly nonconvex and nonsmooth regularizers. In particular, the algorithm applies the Nesterov's momentum to accelerate the computation of the implicit gradient involved in AID. We provide a comprehensive analysis of the global convergence properties of this algorithm through identifying its intrinsic potential function. In particular, we formally establish the convergence of the model parameters to a critical point of the bi-level problem, and obtain an improved computation complexity $\mathcal{O}(κ^{3.5}ε^{-2})$ over the state-of-the-art result. Moreover, we analyze the asymptotic convergence rates of this algorithm under a class of local nonconvex geometries characterized by a Łojasiewicz-type gradient inequality. Experiment on hyper-parameter optimization demonstrates the effectiveness of our algorithm.

preprint2022arXiv

A likelihood based sensitivity analysis for publication bias on summary ROC in meta-analysis of diagnostic test accuracy

In meta-analysis of diagnostic test accuracy, summary receiver operating characteristic (SROC) is a recommended method to summarize the discriminant capacity of a diagnostic test in the presence of study-specific cutoff values and the area under the SROC (SAUC) gives the aggregate measure of test accuracy. SROC or SAUC can be estimated by bivariate modelling of pairs of sensitivity and specificity over the primary diagnostic studies. However, publication bias is a major threat to the validity of estimates in meta-analysis. To address this issue, we propose to adopt sensitivity analysis to make an objective inference for the impact of publication bias on SROC or SAUC. We extend Copas likelihood based sensitivity analysis to the bivariate normal model used for meta-analysis of diagnostic test accuracy to evaluate how much SROC or SAUC would change with different selection probabilities under several selective publication mechanisms dependent on sensitivity and/or specificity. The selection probability is modelled by a selection function on $t$-type statistic for the linear combination of logit-transformed sensitivity and specificity, allowing the selective publication of each study to be influenced by the cutoff-dependent $p$-value for sensitivity, specificity, or diagnostic odds ratio. By embedding the selection function into the bivariate normal model, the conditional likelihood is proposed and the bias-corrected SROC or SAUC can be estimated by maximizing the likelihood. We illustrate the proposed sensitivity analysis by reanalyzing a meta-analysis of test accuracy for intravascular device related infection. Simulation studies are conducted to investigate the performance of proposed methods.

preprint2022arXiv

Accelerated Proximal Alternating Gradient-Descent-Ascent for Nonconvex Minimax Machine Learning

Alternating gradient-descent-ascent (AltGDA) is an optimization algorithm that has been widely used for model training in various machine learning applications, which aims to solve a nonconvex minimax optimization problem. However, the existing studies show that it suffers from a high computation complexity in nonconvex minimax optimization. In this paper, we develop a single-loop and fast AltGDA-type algorithm that leverages proximal gradient updates and momentum acceleration to solve regularized nonconvex minimax optimization problems. By leveraging the momentum acceleration technique, we prove that the algorithm converges to a critical point in nonconvex minimax optimization and achieves a computation complexity in the order of $\mathcal{O}(κ^{\frac{11}{6}}ε^{-2})$, where $ε$ is the desired level of accuracy and $κ$ is the problem's condition number. {Such a computation complexity improves the state-of-the-art complexities of single-loop GDA and AltGDA algorithms (see the summary of comparison in \Cref{table1})}. We demonstrate the effectiveness of our algorithm via an experiment on adversarial deep learning.

preprint2022arXiv

Coordinated Frequency Control through Safe Reinforcement Learning

With widespread deployment of renewables, the electric power grids are experiencing increasing dynamics and uncertainties, with its secure operation being threatened. Existing frequency control schemes based on day-ahead offline analysis and minute-level online sensitivity calculations are difficult to adapt to rapidly changing system states. In particular, they are unable to facilitate coordinated control of system frequency and power flows. A refined approach and tools are urgently needed to assist system operators to make timely decisions. This paper proposes a novel model-free coordinated frequency control framework based on safe reinforcement learning, with multiple control objectives considered. The load frequency control problem is modeled as a constrained Markov decision process, which can be solved by an AI agent continuously interacting with the grid to achieve sub-second decision making. Extensive numerical experiments conducted at East China Power Grid demonstrate the effectiveness and promise of the proposed method.

preprint2022arXiv

Data Sampling Affects the Complexity of Online SGD over Dependent Data

Conventional machine learning applications typically assume that data samples are independently and identically distributed (i.i.d.). However, practical scenarios often involve a data-generating process that produces highly dependent data samples, which are known to heavily bias the stochastic optimization process and slow down the convergence of learning. In this paper, we conduct a fundamental study on how different stochastic data sampling schemes affect the sample complexity of online stochastic gradient descent (SGD) over highly dependent data. Specifically, with a $ϕ$-mixing model of data dependence, we show that online SGD with proper periodic data-subsampling achieves an improved sample complexity over the standard online SGD in the full spectrum of the data dependence level. Interestingly, even subsampling a subset of data samples can accelerate the convergence of online SGD over highly dependent data. Moreover, we show that online SGD with mini-batch sampling can further substantially improve the sample complexity over online SGD with periodic data-subsampling over highly dependent data. Numerical experiments validate our theoretical results.

preprint2022arXiv

DDDM: a Brain-Inspired Framework for Robust Classification

Despite their outstanding performance in a broad spectrum of real-world tasks, deep artificial neural networks are sensitive to input noises, particularly adversarial perturbations. On the contrary, human and animal brains are much less vulnerable. In contrast to the one-shot inference performed by most deep neural networks, the brain often solves decision-making with an evidence accumulation mechanism that may trade time for accuracy when facing noisy inputs. The mechanism is well described by the Drift-Diffusion Model (DDM). In the DDM, decision-making is modeled as a process in which noisy evidence is accumulated toward a threshold. Drawing inspiration from the DDM, we propose the Dropout-based Drift-Diffusion Model (DDDM) that combines test-phase dropout and the DDM for improving the robustness for arbitrary neural networks. The dropouts create temporally uncorrelated noises in the network that counter perturbations, while the evidence accumulation mechanism guarantees a reasonable decision accuracy. Neural networks enhanced with the DDDM tested in image, speech, and text classification tasks all significantly outperform their native counterparts, demonstrating the DDDM as a task-agnostic defense against adversarial attacks.

preprint2022arXiv

Delving into the Estimation Shift of Batch Normalization in a Network

Batch normalization (BN) is a milestone technique in deep learning. It normalizes the activation using mini-batch statistics during training but the estimated population statistics during inference. This paper focuses on investigating the estimation of population statistics. We define the estimation shift magnitude of BN to quantitatively measure the difference between its estimated population statistics and expected ones. Our primary observation is that the estimation shift can be accumulated due to the stack of BN in a network, which has detriment effects for the test performance. We further find a batch-free normalization (BFN) can block such an accumulation of estimation shift. These observations motivate our design of XBNBlock that replace one BN with BFN in the bottleneck block of residual-style networks. Experiments on the ImageNet and COCO benchmarks show that XBNBlock consistently improves the performance of different architectures, including ResNet and ResNeXt, by a significant margin and seems to be more robust to distribution shift.

preprint2022arXiv

Desingularization and p-Curvature of Recurrence Operators

Linear recurrence operators in characteristic $p$ are classified by their $p$-curvature. For a recurrence operator $L$, denote by $χ(L)$ the characteristic polynomial of its $p$-curvature. We can obtain information about the factorization of $L$ by factoring $χ(L)$. The main theorem of this paper gives an unexpected relation between $χ(L)$ and the true singularities of $L$. An application is to speed up a fast algorithm for computing $χ(L)$ by desingularizing $L$ first. Another contribution of this paper is faster desingularization.

preprint2022arXiv

DeTrust-FL: Privacy-Preserving Federated Learning in Decentralized Trust Setting

Federated learning has emerged as a privacy-preserving machine learning approach where multiple parties can train a single model without sharing their raw training data. Federated learning typically requires the utilization of multi-party computation techniques to provide strong privacy guarantees by ensuring that an untrusted or curious aggregator cannot obtain isolated replies from parties involved in the training process, thereby preventing potential inference attacks. Until recently, it was thought that some of these secure aggregation techniques were sufficient to fully protect against inference attacks coming from a curious aggregator. However, recent research has demonstrated that a curious aggregator can successfully launch a disaggregation attack to learn information about model updates of a target party. This paper presents DeTrust-FL, an efficient privacy-preserving federated learning framework for addressing the lack of transparency that enables isolation attacks, such as disaggregation attacks, during secure aggregation by assuring that parties' model updates are included in the aggregated model in a private and secure manner. DeTrust-FL proposes a decentralized trust consensus mechanism and incorporates a recently proposed decentralized functional encryption (FE) scheme in which all parties agree on a participation matrix before collaboratively generating decryption key fragments, thereby gaining control and trust over the secure aggregation process in a decentralized setting. Our experimental evaluation demonstrates that DeTrust-FL outperforms state-of-the-art FE-based secure multi-party aggregation solutions in terms of training time and reduces the volume of data transferred. In contrast to existing approaches, this is achieved without creating any trust dependency on external trusted entities.

preprint2022arXiv

Extended Load Flexibility of Industrial P2H Plants: A Process Constraint-Aware Scheduling Approach

The operational flexibility of industrial power-to-hydrogen (P2H) plants enables admittance of volatile renewable power and provides auxiliary regulatory services for the power grid. Aiming to extend the flexibility of the P2H plant further, this work presents a scheduling method by considering detailed process constraints of the alkaline electrolyzers. Unlike existing works that assume constant load range, the presented scheduling framework fully exploits the dynamic processes of the electrolyzer, including temperature and hydrogen-to-oxygen (HTO) crossover, to improve operational flexibility. Varying energy conversion efficiency under different load levels and temperature is also considered. The scheduling model is solved by proper mathematical transformation as a mixed-integer linear programming (MILP), which determines the on-off-standby states and power levels of different electrolyzers in the P2H plant for daily operation. With experiment-verified constraints, a case study show that compared to the existing scheduling approach, the improved flexibility leads to a 1.627% profit increase when the P2H plant is directly coupled to the photovoltaic power.

preprint2022arXiv

Extracting Densest Sub-hypergraph with Convex Edge-weight Functions

The densest subgraph problem (DSG) aiming at finding an induced subgraph such that the average edge-weights of the subgraph is maximized, is a well-studied problem. However, when the input graph is a hypergraph, the existing notion of DSG fails to capture the fact that a hyperedge partially belonging to an induced sub-hypergraph is also a part of the sub-hypergraph. To resolve the issue, we suggest a function $f_e:\mathbb{Z}_{\ge0}\rightarrow \mathbb{R}_{\ge 0}$ to represent the partial edge-weight of a hyperedge $e$ in the input hypergraph $\mathcal{H}=(V,\mathcal{E},f)$ and formulate a generalized densest sub-hypergraph problem (GDSH) as $\max_{S\subseteq V}\frac{\sum_{e\in \mathcal{E}}{f_e(|e\cap S|)}}{|S|}$. We demonstrate that, when all the edge-weight functions are non-decreasing convex, GDSH can be solved in polynomial-time by the linear program-based algorithm, the network flow-based algorithm and the greedy $\frac{1}{r}$-approximation algorithm where $r$ is the rank of the input hypergraph. Finally, we investigate the computational tractability of GDSH where some edge-weight functions are non-convex.

preprint2022arXiv

Generalized persistence of entropy weak solutions for system of hyperbolic conservation laws

Let $u(t,x)$ be the solution to the Cauchy problem of a scalar conservation law in one space dimension. It is well known that even for smooth initial data the solution can become discontinuous in finite time and global entropy weak solution can best lie in the space of bounded total variations. It is impossible that the solutions belong to ,for example ,$H^1$ because by Sobolev embedding theorem $H^1$ functions are H$\mathrm{\ddot{o}}$lder continuous. However, we note that from any point $(t,x)$ we can draw a generalized characteristic downward which meets the initial axis at $y=α(t,x)$. if we regard $u$ as a function of $(t,y)$, it indeed belongs to $H^1$ as a function of $y$ if the initial data belongs to $H^1$. We may call this generalized persistence (of high regularity) of the entropy weak solutions. The main purpose of this paper is to prove some kinds of generalized persistence (of high regularity) for the scalar and $2\times 2$ Temple system of hyperbolic conservation laws in one space dimension .

preprint2022arXiv

Inhomogeneous superconducting states in two weakly linked superconducting ultra thin films

A sufficiently large parallel magnetic field will generate staggered supercurrent loops and superfluid density wave in two weakly linked superconducting (SC) ultrathin films, resulting in an inhomogeneous Fulde-Ferrell-Larkin-Ovchinnikov (FFLO) state. The SC order parameter of such an FFLO state is characterized by Bloch wave functions, called the "Bloch SC state". The staggered supercurrent loops form an array of Josephson vortex-antivortex pairs, instead of the usual Josephson vortex lattice. Enclosing a unit cell of the array, the London's fluxoid is quantized as $Φ^{\prime}=Φ_0=hc/2e$, while the net orbital magnetization caused by the staggered supercurrent is zero. Meanwhile, a small parallel magnetic field gives rise to an Fulde-Ferrell (FF) state that has uniform superfluid density. The phase transition between the Bloch SC state and the FF state belongs to the universality class of two-dimensional commensurate-incommensurate transitions. An analytical solution in terms of Jacobian elliptic functions is found to be an excellent approximation to the Bloch SC order parameter.

preprint2022arXiv

Learning Visibility for Robust Dense Human Body Estimation

Estimating 3D human pose and shape from 2D images is a crucial yet challenging task. While prior methods with model-based representations can perform reasonably well on whole-body images, they often fail when parts of the body are occluded or outside the frame. Moreover, these results usually do not faithfully capture the human silhouettes due to their limited representation power of deformable models (e.g., representing only the naked body). An alternative approach is to estimate dense vertices of a predefined template body in the image space. Such representations are effective in localizing vertices within an image but cannot handle out-of-frame body parts. In this work, we learn dense human body estimation that is robust to partial observations. We explicitly model the visibility of human joints and vertices in the x, y, and z axes separately. The visibility in x and y axes help distinguishing out-of-frame cases, and the visibility in depth axis corresponds to occlusions (either self-occlusions or occlusions by other objects). We obtain pseudo ground-truths of visibility labels from dense UV correspondences and train a neural network to predict visibility along with 3D coordinates. We show that visibility can serve as 1) an additional signal to resolve depth ordering ambiguities of self-occluded vertices and 2) a regularization term when fitting a human body model to the predictions. Extensive experiments on multiple 3D human datasets demonstrate that visibility modeling significantly improves the accuracy of human body estimation, especially for partial-body cases. Our project page with code is at: https://github.com/chhankyao/visdb.

preprint2022arXiv

Matrix product states for Hartree-Fock-Bogoliubov wave functions

We provide an efficient and accurate method for converting Hartree-Fock-Bogoliubov wave functions into matrix product states (MPSs). These wave functions, also known as Bogoliubov vacua, exhibit a peculiar entanglement structure that the eigenvectors of the reduced density matrix are also Bogoliubov vacua. We exploit this important feature to obtain their optimal MPS approximation and derive an explicit formula for corresponding MPS matrices. The performance of our method is benchmarked with the Kitaev chain and the Majorana-Hubbard model on the honeycomb lattice. The approach facilitates the applications of Hartree-Fock-Bogoliubov wave functions and is ideally suited for combining with the density-matrix renormalization group method.

preprint2022arXiv

Plasma Image Classification Using Cosine Similarity Constrained CNN

Plasma jets are widely investigated both in the laboratory and in nature. Astrophysical objects such as black holes, active galactic nuclei, and young stellar objects commonly emit plasma jets in various forms. With the availability of data from plasma jet experiments resembling astrophysical plasma jets, classification of such data would potentially aid in investigating not only the underlying physics of the experiments but the study of astrophysical jets. In this work we use deep learning to process all of the laboratory plasma images from the Caltech Spheromak Experiment spanning two decades. We found that cosine similarity can aid in feature selection, classify images through comparison of feature vector direction, and be used as a loss function for the training of AlexNet for plasma image classification. We also develop a simple vector direction comparison algorithm for binary and multi-class classification. Using our algorithm we demonstrate 93% accurate binary classification to distinguish unstable columns from stable columns and 92% accurate five-way classification of a small, labeled data set which includes three classes corresponding to varying levels of kink instability.

preprint2022arXiv

Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis

Actor-critic (AC) algorithms have been widely adopted in decentralized multi-agent systems to learn the optimal joint control policy. However, existing decentralized AC algorithms either do not preserve the privacy of agents or are not sample and communication-efficient. In this work, we develop two decentralized AC and natural AC (NAC) algorithms that are private, and sample and communication-efficient. In both algorithms, agents share noisy information to preserve privacy and adopt mini-batch updates to improve sample and communication efficiency. Particularly for decentralized NAC, we develop a decentralized Markovian SGD algorithm with an adaptive mini-batch size to efficiently compute the natural policy gradient. Under Markovian sampling and linear function approximation, we prove the proposed decentralized AC and NAC algorithms achieve the state-of-the-art sample complexities $\mathcal{O}\big(ε^{-2}\ln(ε^{-1})\big)$ and $\mathcal{O}\big(ε^{-3}\ln(ε^{-1})\big)$, respectively, and the same small communication complexity $\mathcal{O}\big(ε^{-1}\ln(ε^{-1})\big)$. Numerical experiments demonstrate that the proposed algorithms achieve lower sample and communication complexities than the existing decentralized AC algorithm.

preprint2022arXiv

Sense Embeddings are also Biased--Evaluating Social Biases in Static and Contextualised Sense Embeddings

Sense embedding learning methods learn different embeddings for the different senses of an ambiguous word. One sense of an ambiguous word might be socially biased while its other senses remain unbiased. In comparison to the numerous prior work evaluating the social biases in pretrained word embeddings, the biases in sense embeddings have been relatively understudied. We create a benchmark dataset for evaluating the social biases in sense embeddings and propose novel sense-specific bias evaluation measures. We conduct an extensive evaluation of multiple static and contextualised sense embeddings for various types of social biases using the proposed measures. Our experimental results show that even in cases where no biases are found at word-level, there still exist worrying levels of social biases at sense-level, which are often ignored by the word-level bias evaluation measures.

preprint2022arXiv

Single-shot Hyper-parameter Optimization for Federated Learning: A General Algorithm & Analysis

We address the relatively unexplored problem of hyper-parameter optimization (HPO) for federated learning (FL-HPO). We introduce Federated Loss SuRface Aggregation (FLoRA), a general FL-HPO solution framework that can address use cases of tabular data and any Machine Learning (ML) model including gradient boosting training algorithms and therefore further expands the scope of FL-HPO. FLoRA enables single-shot FL-HPO: identifying a single set of good hyper-parameters that are subsequently used in a single FL training. Thus, it enables FL-HPO solutions with minimal additional communication overhead compared to FL training without HPO. We theoretically characterize the optimality gap of FL-HPO, which explicitly accounts for the heterogeneous non-IID nature of the parties' local data distributions, a dominant characteristic of FL systems. Our empirical evaluation of FLoRA for multiple ML algorithms on seven OpenML datasets demonstrates significant model accuracy improvements over the considered baseline, and robustness to increasing number of parties involved in FL-HPO training.

preprint2022arXiv

Specificity-preserving RGB-D Saliency Detection

Salient object detection (SOD) on RGB and depth images has attracted more and more research interests, due to its effectiveness and the fact that depth cues can now be conveniently captured. Existing RGB-D SOD models usually adopt different fusion strategies to learn a shared representation from the two modalities (\ie, RGB and depth), while few methods explicitly consider how to preserve modality-specific characteristics. In this study, we propose a novel framework, termed SPNet} (Specificity-preserving network), which benefits SOD performance by exploring both the shared information and modality-specific properties (\eg, specificity). Specifically, we propose to adopt two modality-specific networks and a shared learning network to generate individual and shared saliency prediction maps, respectively. To effectively fuse cross-modal features in the shared learning network, we propose a cross-enhanced integration module (CIM) and then propagate the fused feature to the next layer for integrating cross-level information. Moreover, to capture rich complementary multi-modal information for boosting the SOD performance, we propose a multi-modal feature aggregation (MFA) module to integrate the modality-specific features from each individual decoder into the shared decoder. By using a skip connection, the hierarchical features between the encoder and decoder layers can be fully combined. Extensive experiments demonstrate that our~\ours~outperforms cutting-edge approaches on six popular RGB-D SOD and three camouflaged object detection benchmarks. The project is publicly available at: https://github.com/taozh2017/SPNet.

preprint2022arXiv

Two-dimensional superconductivity at the surfaces of KTaO3 gated with ionic liquid

The recent observation of superconductivity at the interfaces between KTaO3 and EuO (or LaAlO3) offers a new example of emergent phenomena at oxide interfaces. This superconductivity exhibits an unusual strong dependence on the crystalline orientation of KTaO3 and its superconducting transition temperature Tc is nearly one order of magnitude higher than that of the seminal LaAlO3/SrTiO3 interface. To understand its mechanism, it is crucial to address if the formation of oxide interfaces is indispensable for the presence of superconductivity. Here, by exploiting ionic liquid (IL) gating, we obtain superconductivity at KTaO3 (111) and (110) surfaces with Tc up to 2.0 K and 1.0 K, respectively. This oxide-interface-free superconductivity gives a clear experimental evidence that the essential physics of KTaO3 interface superconductivity lies in the KTaO3 surfaces doped with electrons. Moreover, the ability to control superconductivity at surfaces with IL provides a simple way to study the intrinsic superconductivity in KTaO3.

preprint2022arXiv

UNISON: Unpaired Cross-lingual Image Captioning

Image captioning has emerged as an interesting research field in recent years due to its broad application scenarios. The traditional paradigm of image captioning relies on paired image-caption datasets to train the model in a supervised manner. However, creating such paired datasets for every target language is prohibitively expensive, which hinders the extensibility of captioning technology and deprives a large part of the world population of its benefit. In this work, we present a novel unpaired cross-lingual method to generate image captions without relying on any caption corpus in the source or the target language. Specifically, our method consists of two phases: (i) a cross-lingual auto-encoding process, which utilizing a sentence parallel (bitext) corpus to learn the mapping from the source to the target language in the scene graph encoding space and decode sentences in the target language, and (ii) a cross-modal unsupervised feature mapping, which seeks to map the encoded scene graph features from image modality to language modality. We verify the effectiveness of our proposed method on the Chinese image caption generation task. The comparisons against several existing methods demonstrate the effectiveness of our approach.

preprint2022arXiv

Unveiling a critical stripy state in the triangular-lattice SU(4) spin-orbital model

The simplest spin-orbital model can host a nematic spin-orbital liquid state on the triangular lattice. We provide clear evidence that the ground state of the SU(4) Kugel-Khomskii model on the triangular lattice can be well described by a "single" Gutzwiller projected wave function with an emergent parton Fermi surface, despite it exhibits strong finite-size effect in quasi-one-dimensional cylinders. The finite-size effect can be resolved by the fact that the parton Fermi surface consists of open orbits in the reciprocal space. Thereby, a stripy liquid state is expected in the two-dimensional limit, which preserves the SU(4) symmetry while breaks the translational symmetry by doubling the unit cell along one of the lattice vector directions. It is indicative that these stripes are critical and the central charge is $c=3$, in agreement with the SU(4)$_1$ Wess-Zumino-Witten conformal field theory. All these results are consistent with the Lieb-Schultz-Mattis-Oshikawa-Hastings theorem.

preprint2021arXiv

Curse or Redemption? How Data Heterogeneity Affects the Robustness of Federated Learning

Data heterogeneity has been identified as one of the key features in federated learning but often overlooked in the lens of robustness to adversarial attacks. This paper focuses on characterizing and understanding its impact on backdooring attacks in federated learning through comprehensive experiments using synthetic and the LEAF benchmarks. The initial impression driven by our experimental results suggests that data heterogeneity is the dominant factor in the effectiveness of attacks and it may be a redemption for defending against backdooring as it makes the attack less efficient, more challenging to design effective attack strategies, and the attack result also becomes less predictable. However, with further investigations, we found data heterogeneity is more of a curse than a redemption as the attack effectiveness can be significantly boosted by simply adjusting the client-side backdooring timing. More importantly,data heterogeneity may result in overfitting at the local training of benign clients, which can be utilized by attackers to disguise themselves and fool skewed-feature based defenses. In addition, effective attack strategies can be made by adjusting attack data distribution. Finally, we discuss the potential directions of defending the curses brought by data heterogeneity. The results and lessons learned from our extensive experiments and analysis offer new insights for designing robust federated learning methods and systems

preprint2021arXiv

Emergence of high-temperature superconductivity at the interface of two Mott insulators

Interfacial superconductivity has manifested itself in various types of heterostructures: band insulator-band insulator, band insulator-Mott insulator, and Mott insulator-metal. We report the observation of high-temperature superconductivity (HTS) in a complementary and long expected type of heterostructures, which consists of two Mott insulators, La2CuO4 (LCO) and PrBa2Cu3O7 (PBCO). By carefully controlling oxidization condition and selectively doping CuO2 planes with Fe atoms, which suppress superconductivity, we found that the superconductivity arises at the LCO side and is confined within no more than two unit cells (about 2.6 nm) near the interface. A phenomenon of overcome the Fe barrier will show up if excess oxygen is present during growth. Some possible mechanisms for the interfacial HTS have been discussed, and we attribute it to the redistribution of oxygen.

preprint2021arXiv

Event-based Motion Segmentation with Spatio-Temporal Graph Cuts

Identifying independently moving objects is an essential task for dynamic scene understanding. However, traditional cameras used in dynamic scenes may suffer from motion blur or exposure artifacts due to their sampling principle. By contrast, event-based cameras are novel bio-inspired sensors that offer advantages to overcome such limitations. They report pixelwise intensity changes asynchronously, which enables them to acquire visual information at exactly the same rate as the scene dynamics. We develop a method to identify independently moving objects acquired with an event-based camera, i.e., to solve the event-based motion segmentation problem. We cast the problem as an energy minimization one involving the fitting of multiple motion models. We jointly solve two subproblems, namely event cluster assignment (labeling) and motion model fitting, in an iterative manner by exploiting the structure of the input event data in the form of a spatio-temporal graph. Experiments on available datasets demonstrate the versatility of the method in scenes with different motion patterns and number of moving objects. The evaluation shows state-of-the-art results without having to predetermine the number of expected moving objects. We release the software and dataset under an open source licence to foster research in the emerging topic of event-based motion segmentation.

preprint2021arXiv

Global existence for semilinear wave equations with scaling invariant damping in 3-D

Global existence for small data Cauchy problem of semilinear wave equations with scaling invariant damping in 3-D is established in this work, assuming that the data are radial and the constant in front of the damping belongs to $[1.5, 2)$. The proof is based on a weighted $L^2-L^2$ estimate for inhomogeneous wave equation, which is established by interpolating between energy estimate and Morawetz type estimate.

preprint2021arXiv

Global Existence of Ideal Invicid Compressible and Heat Conductive Fluids with Radial Symmetry

In this paper, we study the global existence of classical solutions to the three dimensional ideal invicid compressible and heat conductive fluids with radial symmetrical data in $H^s(\mathbb{R}^3)$. Our proof is based on the symmetric hyperbolic structure of the system.

preprint2021arXiv

Graph topology invariant gradient and sampling complexity for decentralized and stochastic optimization

One fundamental problem in decentralized multi-agent optimization is the trade-off between gradient/sampling complexity and communication complexity. We propose new algorithms whose gradient and sampling complexities are graph topology invariant while their communication complexities remain optimal. For convex smooth deterministic problems, we propose a primal dual sliding (PDS) algorithm that computes an $ε$-solution with $O((\tilde{L}/ε)^{1/2})$ gradient and $O((\tilde{L}/ε)^{1/2}+\|\mathcal{A}\|/ε)$ communication complexities, where $\tilde{L}$ is the smoothness parameter of the objective and $\mathcal{A}$ is related to either the graph Laplacian or the transpose of the oriented incidence matrix of the communication network. The results can be improved to $O((\tilde{L}/μ)^{1/2}\log(1/ε))$ and $O((\tilde{L}/μ)^{1/2}\log(1/ε) + \|\mathcal{A}\|/ε^{1/2})$ respectively with $μ$-strong convexity. We also propose a stochastic variant, the primal dual sliding (SPDS) algorithm for problems with stochastic gradients. The SPDS algorithm utilizes the mini-batch technique and enables the agents to perform sampling and communication simultaneously. It computes a stochastic $ε$-solution with $O((\tilde{L}/ε)^{1/2} + (σ/ε)^2)$ sampling complexity, which can be improved to $O((\tilde{L}/μ)^{1/2}\log(1/ε) + σ^2/ε)$ with strong convexity. Here $σ^2$ is the variance. The communication complexities of SPDS remain the same as that of the deterministic case. All the aforementioned gradient and sampling complexities match the lower complexity bounds for centralized convex smooth optimization and are independent of the network structure. To the best of our knowledge, these gradient and sampling complexities have not been obtained before for decentralized optimization over a constraint feasible set.

preprint2021arXiv

Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for Thoracic Disease Identification

Chest X-rays are an important and accessible clinical imaging tool for the detection of many thoracic diseases. Over the past decade, deep learning, with a focus on the convolutional neural network (CNN), has become the most powerful computer-aided diagnosis technology for improving disease identification performance. However, training an effective and robust deep CNN usually requires a large amount of data with high annotation quality. For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming. Thus, existing public chest X-ray datasets usually adopt language pattern based methods to automatically mine labels from reports. However, this results in label uncertainty and inconsistency. In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods from two perspectives to improve a single model's disease identification performance, rather than focusing on an ensemble of models. MODL integrates multiple models to obtain a soft label distribution for optimizing the single target model, which can reduce the effects of original label uncertainty. Moreover, KNNS aims to enhance the robustness of the target model to provide consistent predictions on images with similar medical findings. Extensive experiments on the public NIH Chest X-ray and CheXpert datasets show that our model achieves consistent improvements over the state-of-the-art methods.

preprint2021arXiv

Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry

The gradient descent-ascent (GDA) algorithm has been widely applied to solve minimax optimization problems. In order to achieve convergent policy parameters for minimax optimization, it is important that GDA generates convergent variable sequences rather than convergent sequences of function values or gradient norms. However, the variable convergence of GDA has been proved only under convexity geometries, and there lacks understanding for general nonconvex minimax optimization. This paper fills such a gap by studying the convergence of a more general proximal-GDA for regularized nonconvex-strongly-concave minimax optimization. Specifically, we show that proximal-GDA admits a novel Lyapunov function, which monotonically decreases in the minimax optimization process and drives the variable sequence to a critical point. By leveraging this Lyapunov function and the KŁ geometry that parameterizes the local geometries of general nonconvex functions, we formally establish the variable convergence of proximal-GDA to a critical point $x^*$, i.e., $x_t\to x^*, y_t\to y^*(x^*)$. Furthermore, over the full spectrum of the KŁ-parameterized geometry, we show that proximal-GDA achieves different types of convergence rates ranging from sublinear convergence up to finite-step convergence, depending on the geometry associated with the KŁ parameter. This is the first theoretical result on the variable convergence for nonconvex minimax optimization.

preprint2021arXiv

Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data

This paper presents a novel framework to build a voice conversion (VC) system by learning from a text-to-speech (TTS) synthesis system, that is called TTS-VC transfer learning. We first develop a multi-speaker speech synthesis system with sequence-to-sequence encoder-decoder architecture, where the encoder extracts robust linguistic representations of text, and the decoder, conditioned on target speaker embedding, takes the context vectors and the attention recurrent network cell output to generate target acoustic features. We take advantage of the fact that TTS system maps input text to speaker independent context vectors, and reuse such a mapping to supervise the training of latent representations of an encoder-decoder voice conversion system. In the voice conversion system, the encoder takes speech instead of text as input, while the decoder is functionally similar to TTS decoder. As we condition the decoder on speaker embedding, the system can be trained on non-parallel data for any-to-any voice conversion. During voice conversion training, we present both text and speech to speech synthesis and voice conversion networks respectively. At run-time, the voice conversion network uses its own encoder-decoder architecture. Experiments show that the proposed approach outperforms two competitive voice conversion baselines consistently, namely phonetic posteriorgram and variational autoencoder methods, in terms of speech quality, naturalness, and speaker similarity.

preprint2020arXiv

Accelerating Power Methods for Higher-order Markov Chains

Higher-order Markov chains play a very important role in many fields, ranging from multilinear PageRank to financial modeling. In this paper, we propose three accelerated higher-order power methods for computing the limiting probability distribution of higher-order Markov chains, namely higher-order power method with momentum and higher-order quadratic extrapolation method. The convergence results are established, and numerical experiments are reported to show the efficiency of the proposed algorithms. In particular, the non-parametric quadratic extrapolation method is very competitive, and outperforms state-of-the-art competitions.

preprint2020arXiv

An Investigation into the Stochasticity of Batch Whitening

Batch Normalization (BN) is extensively employed in various network architectures by performing standardization within mini-batches. A full understanding of the process has been a central target in the deep learning communities. Unlike existing works, which usually only analyze the standardization operation, this paper investigates the more general Batch Whitening (BW). Our work originates from the observation that while various whitening transformations equivalently improve the conditioning, they show significantly different behaviors in discriminative scenarios and training Generative Adversarial Networks (GANs). We attribute this phenomenon to the stochasticity that BW introduces. We quantitatively investigate the stochasticity of different whitening transformations and show that it correlates well with the optimization behaviors during training. We also investigate how stochasticity relates to the estimation of population statistics during inference. Based on our analysis, we provide a framework for designing and comparing BW algorithms in different scenarios. Our proposed BW algorithm improves the residual networks by a significant margin on ImageNet classification. Besides, we show that the stochasticity of BW can improve the GAN's performance with, however, the sacrifice of the training stability.

preprint2020arXiv

Chinese Named Entity Recognition Augmented with Lexicon Memory

Inspired by a concept of content-addressable retrieval from cognitive science, we propose a novel fragment-based model augmented with a lexicon-based memory for Chinese NER, in which both the character-level and word-level features are combined to generate better feature representations for possible name candidates. It is observed that locating the boundary information of entity names is useful in order to classify them into pre-defined categories. Position-dependent features, including prefix and suffix are introduced for NER in the form of distributed representation. The lexicon-based memory is used to help generate such position-dependent features and deal with the problem of out-of-vocabulary words. Experimental results showed that the proposed model, called LEMON, achieved state-of-the-art on four datasets.

preprint2020arXiv

Classical and quantum order in hyperkagome antiferromagnets

Motivated by recent experiments and density functional theory calculations on choloalite PbCuTe$_2$O$_6$, which possesses a Cu-based three-dimensional hyperkagome lattice, we propose and study a $J_1$-$J_2$-$J_3$ antiferromagnetic Heisenberg model on a hyperkagome lattice. In the classical limit, possible ground states are analyzed by two triangle rules, i.e., the "hyperkagome triangle rule" and the "isolated triangle rule," and classical Monte Carlo simulations are exploited to identify possible classical magnetic ordering and explore the phase diagram. In the quantum regime, Schwinger boson theory is applied to study possible quantum spin liquid states and long-range magnetically ordered states on an equal footing. These quantum states with bosonic partons are classified and analyzed by using projective symmetry groups (PSGs). It is found that there are only four types of algebraic PSGs allowed by the space group $P4_{1}32$ on a hyperkagome lattice. Moreover, there are only two types of PSGs that are compatible with the $J_1$-$J_2$-$J_3$ Heisenberg model. These two types of $Z_2$ bosonic states are distinguished by the gauge-invariant flux on the elementary ten-site loops on the hyperkagome network, called zero-flux state and $π$-flux state respectively. Both the zero-flux state and the $π$-flux state are able to give rise to quantum spin liquid states as well as magnetically ordered states, and the zero-flux states and the $π$-flux states can be distinguished by the lower and upper edges of the spectral function $S(\bm{q},ω)$, which can be measured by inelastic neutron scattering experiments.

preprint2020arXiv

Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble

Despite neural networks have achieved prominent performance on many natural language processing (NLP) tasks, they are vulnerable to adversarial examples. In this paper, we propose Dirichlet Neighborhood Ensemble (DNE), a randomized smoothing method for training a robust model to defense substitution-based attacks. During training, DNE forms virtual sentences by sampling embedding vectors for each word in an input sentence from a convex hull spanned by the word and its synonyms, and it augments them with the training data. In such a way, the model is robust to adversarial attacks while maintaining the performance on the original clean data. DNE is agnostic to the network architectures and scales to large models for NLP applications. We demonstrate through extensive experimentation that our method consistently outperforms recently proposed defense methods by a significant margin across different network architectures and multiple data sets.

preprint2020arXiv

Efficient tensor network representation for Gutzwiller projected states of paired fermions

Recent work by Wu {\em et al.} [arXiv:1910.11011] proposed a numerical method, so-called matrix product operator-matrix product state (MPO-MPS) method, by which several types of quantum many-body wave functions, in particular, the projected Fermi sea state, can be efficiently represented as a tensor network. In this paper, we generalize the MPO-MPS method to study Gutzwiller projected paired states of fermions, where the maximally localized Wannier orbitals for Bogoliubov quasiparticles/quasiholes have been adapted to improve the computational performance. The study of $SO(3)$-symmetric spin-1 chains reveals that this new method has better performance than variational Monte Carlo for gapped states and similar performance for gapless states. Moreover, we demonstrate that dynamic correlation functions can be easily evaluated by this method cooperating with other MPS-based accurate approaches, such as the Chebyshev MPS method.

preprint2020arXiv

Exploring the Hierarchy in Relation Labels for Scene Graph Generation

By assigning each relationship a single label, current approaches formulate the relationship detection as a classification problem. Under this formulation, predicate categories are treated as completely different classes. However, different from the object labels where different classes have explicit boundaries, predicates usually have overlaps in their semantic meanings. For example, sit\_on and stand\_on have common meanings in vertical relationships but different details of how these two objects are vertically placed. In order to leverage the inherent structures of the predicate categories, we propose to first build the language hierarchy and then utilize the Hierarchy Guided Feature Learning (HGFL) strategy to learn better region features of both the coarse-grained level and the fine-grained level. Besides, we also propose the Hierarchy Guided Module (HGM) to utilize the coarse-grained level to guide the learning of fine-grained level features. Experiments show that the proposed simple yet effective method can improve several state-of-the-art baselines by a large margin (up to $33\%$ relative gain) in terms of Recall@50 on the task of Scene Graph Generation in different datasets.

preprint2020arXiv

Generative Tweening: Long-term Inbetweening of 3D Human Motions

The ability to generate complex and realistic human body animations at scale, while following specific artistic constraints, has been a fundamental goal for the game and animation industry for decades. Popular techniques include key-framing, physics-based simulation, and database methods via motion graphs. Recently, motion generators based on deep learning have been introduced. Although these learning models can automatically generate highly intricate stylized motions of arbitrary length, they still lack user control. To this end, we introduce the problem of long-term inbetweening, which involves automatically synthesizing complex motions over a long time interval given very sparse keyframes by users. We identify a number of challenges related to this problem, including maintaining biomechanical and keyframe constraints, preserving natural motions, and designing the entire motion sequence holistically while considering all constraints. We introduce a biomechanically constrained generative adversarial network that performs long-term inbetweening of human motions, conditioned on keyframe constraints. This network uses a novel two-stage approach where it first predicts local motion in the form of joint angles, and then predicts global motion, i.e. the global path that the character follows. Since there are typically a number of possible motions that could satisfy the given user constraints, we also enable our network to generate a variety of outputs with a scheme that we call Motion DNA. This approach allows the user to manipulate and influence the output content by feeding seed motions (DNA) to the network. Trained with 79 classes of captured motion data, our network performs robustly on a variety of highly complex motion styles.

preprint2020arXiv

GFTE: Graph-based Financial Table Extraction

Tabular data is a crucial form of information expression, which can organize data in a standard structure for easy information retrieval and comparison. However, in financial industry and many other fields tables are often disclosed in unstructured digital files, e.g. Portable Document Format (PDF) and images, which are difficult to be extracted directly. In this paper, to facilitate deep learning based table extraction from unstructured digital files, we publish a standard Chinese dataset named FinTab, which contains more than 1,600 financial tables of diverse kinds and their corresponding structure representation in JSON. In addition, we propose a novel graph-based convolutional neural network model named GFTE as a baseline for future comparison. GFTE integrates image feature, position feature and textual feature together for precise edge prediction and reaches overall good results.

preprint2020arXiv

History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

Variance-reduced algorithms, although achieve great theoretical performance, can run slowly in practice due to the periodic gradient estimation with a large batch of data. Batch-size adaptation thus arises as a promising approach to accelerate such algorithms. However, existing schemes either apply prescribed batch-size adaption rule or exploit the information along optimization path via additional backtracking and condition verification steps. In this paper, we propose a novel scheme, which eliminates backtracking line search but still exploits the information along optimization path by adapting the batch size via history stochastic gradients. We further theoretically show that such a scheme substantially reduces the overall complexity for popular variance-reduced algorithms SVRG and SARAH/SPIDER for both conventional nonconvex optimization and reinforcement learning problems. To this end, we develop a new convergence analysis framework to handle the dependence of the batch size on history stochastic gradients. Extensive experiments validate the effectiveness of the proposed batch-size adaptation scheme.

preprint2020arXiv

IBM Federated Learning: an Enterprise Framework White Paper V0.1

Federated Learning (FL) is an approach to conduct machine learning without centralizing training data in a single place, for reasons of privacy, confidentiality or data volume. However, solving federated machine learning problems raises issues above and beyond those of centralized machine learning. These issues include setting up communication infrastructure between parties, coordinating the learning process, integrating party results, understanding the characteristics of the training data sets of different participating parties, handling data heterogeneity, and operating with the absence of a verification data set. IBM Federated Learning provides infrastructure and coordination for federated learning. Data scientists can design and run federated learning jobs based on existing, centralized machine learning models and can provide high-level instructions on how to run the federation. The framework applies to both Deep Neural Networks as well as ``traditional'' approaches for the most common machine learning libraries. {\proj} enables data scientists to expand their scope from centralized to federated machine learning, minimizing the learning curve at the outset while also providing the flexibility to deploy to different compute environments and design custom fusion algorithms.

preprint2020arXiv

Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images

Coronavirus Disease 2019 (COVID-19) spread globally in early 2020, causing the world to face an existential health crisis. Automated detection of lung infections from computed tomography (CT) images offers a great potential to augment the traditional healthcare strategy for tackling COVID-19. However, segmenting infected regions from CT slices faces several challenges, including high variation in infection characteristics, and low intensity contrast between infections and normal tissues. Further, collecting a large amount of data is impractical within a short time period, inhibiting the training of a deep model. To address these challenges, a novel COVID-19 Lung Infection Segmentation Deep Network (Inf-Net) is proposed to automatically identify infected regions from chest CT slices. In our Inf-Net, a parallel partial decoder is used to aggregate the high-level features and generate a global map. Then, the implicit reverse attention and explicit edge-attention are utilized to model the boundaries and enhance the representations. Moreover, to alleviate the shortage of labeled data, we present a semi-supervised segmentation framework based on a randomly selected propagation strategy, which only requires a few labeled images and leverages primarily unlabeled data. Our semi-supervised framework can improve the learning ability and achieve a higher performance. Extensive experiments on our COVID-SemiSeg and real CT volumes demonstrate that the proposed Inf-Net outperforms most cutting-edge segmentation models and advances the state-of-the-art performance.

preprint2020arXiv

Learning to Generate Diverse Dance Motions with Transformer

With the ongoing pandemic, virtual concerts and live events using digitized performances of musicians are getting traction on massive multiplayer online worlds. However, well choreographed dance movements are extremely complex to animate and would involve an expensive and tedious production process. In addition to the use of complex motion capture systems, it typically requires a collaborative effort between animators, dancers, and choreographers. We introduce a complete system for dance motion synthesis, which can generate complex and highly diverse dance sequences given an input music sequence. As motion capture data is limited for the range of dance motions and styles, we introduce a massive dance motion data set that is created from YouTube videos. We also present a novel two-stream motion transformer generative model, which can generate motion sequences with high flexibility. We also introduce new evaluation metrics for the quality of synthesized dance motions, and demonstrate that our system can outperform state-of-the-art methods. Our system provides high-quality animations suitable for large crowds for virtual concerts and can also be used as reference for professional animation pipelines. Most importantly, we show that vast online videos can be effective in training dance motion models.

preprint2020arXiv

Measurement of the neutron beam profile of the Back-n white neutron facility at CSNS with a Micromegas detector

The Back-n white neutron beam line, which uses back-streaming white neutrons from the spallation target of the China Spallation Neutron Source, is used for nuclear data measurements. A Micromegas-based neutron detector with two variants was specially developed to measure the beam spot distribution for this beam line. In this article, the design, fabrication, and characterization of the detector are described. The results of the detector performance tests are presented, which include the relative electron transparency, the gain and the gain uniformity, and the neutron beam profile reconstruction capability. The result of the first measurement of the Back-n neutron beam spot distribution is also presented.

preprint2020arXiv

Momentum with Variance Reduction for Nonconvex Composition Optimization

Composition optimization is widely-applied in nonconvex machine learning. Various advanced stochastic algorithms that adopt momentum and variance reduction techniques have been developed for composition optimization. However, these algorithms do not fully exploit both techniques to accelerate the convergence and are lack of convergence guarantee in nonconvex optimization. This paper complements the existing literature by developing various momentum schemes with SPIDER-based variance reduction for non-convex composition optimization. In particular, our momentum design requires less number of proximal mapping evaluations per-iteration than that required by the existing Katyusha momentum. Furthermore, our algorithm achieves near-optimal sample complexity results in both non-convex finite-sum and online composition optimization and achieves a linear convergence rate under the gradient dominant condition. Numerical experiments demonstrate that our algorithm converges significantly faster than existing algorithms in nonconvex composition optimization.

preprint2020arXiv

Nanoscale structure detection and monitoring of tumour growth with optical coherence tomography

Approximately 90% of cancers have their origins in epithelial tissues and this leads to epithelial thickening, but the ultrastructural changes and underlying architecture is less well known. Depth resolved label free visualization of nanoscale tissue morphology is required to reveal the extent and distribution of ultrastructural changes in underlying tissue, but is difficult to achieve with existing imaging modalities. We developed a nanosensitive optical coherence tomography (nsOCT) approach to provide such imaging based on dominant axial structure with a few nanometre detection accuracy. nsOCT maps the distribution of axial structural sizes an order of magnitude smaller than the axial resolution of the system. We validated nsOCT methodology by detecting synthetic axial structure via numerical simulations. Subsequently, we validated the nsOCT technique experimentally by detecting known structures from a commercially fabricated sample. nsOCT reveals scaling with different depth of dominant submicron structural changes associated with carcinoma which may inform the origins of the disease, its progression and improve diagnosis.

preprint2020arXiv

Nanosensitive optical coherence tomography to assess wound healing within the cornea

Optical Coherence Tomography (OCT) is a non-invasive depth resolved optical imaging modality, that enables high resolution, cross-sectional imaging in biological tissues and materials at clinically relevant depths. Though OCT offers high resolution imaging, the best ultra-high-resolution OCT systems are limited to imaging structural changes with a resolution of one micron on a single B-scan within very limited depth. Nanosensitive OCT (nsOCT) is a recently developed technique that is capable of providing enhanced sensitivity of OCT to structural changes. Improving the sensitivity of OCT to detect structural changes at the nanoscale level, to a depth typical for conventional OCT, could potentially improve the diagnostic capability of OCT in medical applications. In this paper, we demonstrate the capability of nsOCT to detect structural changes deep in the rat cornea following superficial corneal injury.

preprint2020arXiv

Non-invasive detection of nanoscale structural changes in cornea associated with cross-linking treatment

Corneal cross-linking (CXL) using UVA irradiation with a riboflavin photosensitizer has grown from an interesting concept to a practical clinical treatment for corneal ectatic diseases globally, such as keratoconus. To characterize the corneal structural changes, existing methods such as X-ray microscopy, transmission electron microscopy (TEM), histology and optical coherence tomography have been used. However, these methods have various drawbacks such as invasive detection, the impossibility for in vivo measurement, or limited resolution and sensitivity to structural alterations. Here, we report the application of over-sampling nano-sensitive optical coherence tomography (nsOCT) method for probing the corneal structural alterations. The results indicate that the spatial period increases slightly after 30 minutes riboflavin instillation but decreases significantly after 30 min UVA irradiation following the Dresden protocol. The proposed non-invasive method can be implemented using existing OCT system, without any additional components, for detecting nanoscale changes with the potential to assist diagnostic assessment during CXL treatment, and possibly to be a real-time monitoring tool in clinics.

preprint2020arXiv

On the Continuity of Rotation Representations in Neural Networks

In neural networks, it is often desirable to work with various representations of the same space. For example, 3D rotations can be represented with quaternions or Euler angles. In this paper, we advance a definition of a continuous representation, which can be helpful for training deep neural networks. We relate this to topological concepts such as homeomorphism and embedding. We then investigate what are continuous and discontinuous representations for 2D, 3D, and n-dimensional rotations. We demonstrate that for 3D rotations, all representations are discontinuous in the real Euclidean spaces of four or fewer dimensions. Thus, widely used representations such as quaternions and Euler angles are discontinuous and difficult for neural networks to learn. We show that the 3D rotations have continuous representations in 5D and 6D, which are more suitable for learning. We also present continuous representations for the general case of the n-dimensional rotation group SO(n). While our main focus is on rotations, we also show that our constructions apply to other groups such as the orthogonal group and similarity transforms. We finally present empirical results, which show that our continuous rotation representations outperform discontinuous ones for several practical problems in graphics and vision, including a simple autoencoder sanity test, a rotation estimator for 3D point clouds, and an inverse kinematics solver for 3D human poses.

preprint2020arXiv

Proximal Gradient Algorithm with Momentum and Flexible Parameter Restart for Nonconvex Optimization

Various types of parameter restart schemes have been proposed for accelerated gradient algorithms to facilitate their practical convergence in convex optimization. However, the convergence properties of accelerated gradient algorithms under parameter restart remain obscure in nonconvex optimization. In this paper, we propose a novel accelerated proximal gradient algorithm with parameter restart (named APG-restart) for solving nonconvex and nonsmooth problems. Our APG-restart is designed to 1) allow for adopting flexible parameter restart schemes that cover many existing ones; 2) have a global sub-linear convergence rate in nonconvex and nonsmooth optimization; and 3) have guaranteed convergence to a critical point and have various types of asymptotic convergence rates depending on the parameterization of local geometry in nonconvex and nonsmooth optimization. Numerical experiments demonstrate the effectiveness of our proposed algorithm.

preprint2020arXiv

Reanalysis of Variance Reduced Temporal Difference Learning

Temporal difference (TD) learning is a popular algorithm for policy evaluation in reinforcement learning, but the vanilla TD can substantially suffer from the inherent optimization variance. A variance reduced TD (VRTD) algorithm was proposed by Korda and La (2015), which applies the variance reduction technique directly to the online TD learning with Markovian samples. In this work, we first point out the technical errors in the analysis of VRTD in Korda and La (2015), and then provide a mathematically solid analysis of the non-asymptotic convergence of VRTD and its variance reduction performance. We show that VRTD is guaranteed to converge to a neighborhood of the fixed-point solution of TD at a linear convergence rate. Furthermore, the variance error (for both i.i.d.\ and Markovian sampling) and the bias error (for Markovian sampling) of VRTD are significantly reduced by the batch size of variance reduction in comparison to those of vanilla TD. As a result, the overall computational complexity of VRTD to attain a given accurate solution outperforms that of TD under Markov sampling and outperforms that of TD under i.i.d.\ sampling for a sufficiently small conditional number.

preprint2020arXiv

Small-floating Target Detection in Sea Clutter via Visual Feature Classifying in the Time-Doppler Spectra

It is challenging to detect small-floating object in the sea clutter for a surface radar. In this paper, we have observed that the backscatters from the target brake the continuity of the underlying motion of the sea surface in the time-Doppler spectra (TDS) images. Following this visual clue, we exploit the local binary pattern (LBP) to measure the variations of texture in the TDS images. It is shown that the radar returns containing target and those only having clutter are separable in the feature space of LBP. An unsupervised one-class support vector machine (SVM) is then utilized to detect the deviation of the LBP histogram of the clutter. The outiler of the detector is classified as the target. In the real-life IPIX radar data sets, our visual feature based detector shows favorable detection rate compared to other three existing approaches.

preprint2020arXiv

Spatio-temporal Attention Model for Tactile Texture Recognition

Recently, tactile sensing has attracted great interest in robotics, especially for facilitating exploration of unstructured environments and effective manipulation. A detailed understanding of the surface textures via tactile sensing is essential for many of these tasks. Previous works on texture recognition using camera based tactile sensors have been limited to treating all regions in one tactile image or all samples in one tactile sequence equally, which includes much irrelevant or redundant information. In this paper, we propose a novel Spatio-Temporal Attention Model (STAM) for tactile texture recognition, which is the very first of its kind to our best knowledge. The proposed STAM pays attention to both spatial focus of each single tactile texture and the temporal correlation of a tactile sequence. In the experiments to discriminate 100 different fabric textures, the spatially and temporally selective attention has resulted in a significant improvement of the recognition accuracy, by up to 18.8%, compared to the non-attention based models. Specifically, after introducing noisy data that is collected before the contact happens, our proposed STAM can learn the salient features efficiently and the accuracy can increase by 15.23% on average compared with the CNN based baseline approach. The improved tactile texture perception can be applied to facilitate robot tasks like grasping and manipulation.

preprint2020arXiv

SpiderBoost and Momentum: Faster Stochastic Variance Reduction Algorithms

SARAH and SPIDER are two recently developed stochastic variance-reduced algorithms, and SPIDER has been shown to achieve a near-optimal first-order oracle complexity in smooth nonconvex optimization. However, SPIDER uses an accuracy-dependent stepsize that slows down the convergence in practice, and cannot handle objective functions that involve nonsmooth regularizers. In this paper, we propose SpiderBoost as an improved scheme, which allows to use a much larger constant-level stepsize while maintaining the same near-optimal oracle complexity, and can be extended with proximal mapping to handle composite optimization (which is nonsmooth and nonconvex) with provable convergence guarantee. In particular, we show that proximal SpiderBoost achieves an oracle complexity of $\mathcal{O}(\min\{n^{1/2}ε^{-2},ε^{-3}\})$ in composite nonconvex optimization, improving the state-of-the-art result by a factor of $\mathcal{O}(\min\{n^{1/6},ε^{-1/3}\})$. We further develop a novel momentum scheme to accelerate SpiderBoost for composite optimization, which achieves the near-optimal oracle complexity in theory and substantial improvement in experiments.

preprint2020arXiv

The Complexity of the Partition Coloring Problem

Given a simple undirected graph $G=(V,E)$ and a partition of the vertex set $V$ into $p$ parts, the \textsc{Partition Coloring Problem} asks if we can select one vertex from each part of the partition such that the chromatic number of the subgraph induced on the $p$ selected vertices is bounded by $k$. PCP is a generalized problem of the classical \textsc{Vertex Coloring Problem} and has applications in many areas, such as scheduling and encoding etc. In this paper, we show the complexity status of the \textsc{Partition Coloring Problem} with three parameters: the number of colors, the number of parts of the partition, and the maximum size of each part of the partition. Furthermore, we give a new exact algorithm for this problem.

preprint2020arXiv

TiFL: A Tier-based Federated Learning System

Federated Learning (FL) enables learning a shared model across many clients without violating the privacy requirements. One of the key attributes in FL is the heterogeneity that exists in both resource and data due to the differences in computation and communication capacity, as well as the quantity and content of data among different clients. We conduct a case study to show that heterogeneity in resource and data has a significant impact on training time and model accuracy in conventional FL systems. To this end, we propose TiFL, a Tier-based Federated Learning System, which divides clients into tiers based on their training performance and selects clients from the same tier in each training round to mitigate the straggler problem caused by heterogeneity in resource and data quantity. To further tame the heterogeneity caused by non-IID (Independent and Identical Distribution) data and resources, TiFL employs an adaptive tier selection approach to update the tiering on-the-fly based on the observed training performance and accuracy overtime. We prototype TiFL in a FL testbed following Google's FL architecture and evaluate it using popular benchmarks and the state-of-the-art FL benchmark LEAF. Experimental evaluation shows that TiFL outperforms the conventional FL in various heterogeneous conditions. With the proposed adaptive tier selection policy, we demonstrate that TiFL achieves much faster training performance while keeping the same (and in some cases - better) test accuracy across the board.

preprint2020arXiv

Timing Performance of a Micro-Channel-Plate Photomultiplier Tube

The spatial dependence of the timing performance of the R3809U-50 Micro-Channel-Plate PMT (MCP-PMT) by Hamamatsu was studied in high energy muon beams. Particle position information is provided by a GEM tracker telescope, while timing is measured relative to a second MCP-PMT, identical in construction. In the inner part of the circular active area (radius r$<$5.5\,mm) the time resolution of the two MCP-PMTs combined is better than 10~ps. The signal amplitude decreases in the outer region due to less light reaching the photocathode, resulting in a worse time resolution. The observed radial dependence is in quantitative agreement with a dedicated simulation. With this characterization, the suitability of MCP-PMTs as $\text{t}_\text{0}$ reference detectors has been validated.

preprint2020arXiv

Understanding the Impact of Model Incoherence on Convergence of Incremental SGD with Random Reshuffle

Although SGD with random reshuffle has been widely-used in machine learning applications, there is a limited understanding of how model characteristics affect the convergence of the algorithm. In this work, we introduce model incoherence to characterize the diversity of model characteristics and study its impact on convergence of SGD with random reshuffle under weak strong convexity. Specifically, minimizer incoherence measures the discrepancy between the global minimizers of a sample loss and those of the total loss and affects the convergence error of SGD with random reshuffle. In particular, we show that the variable sequence generated by SGD with random reshuffle converges to a certain global minimizer of the total loss under full minimizer coherence. The other curvature incoherence measures the quality of condition numbers of the sample losses and determines the convergence rate of SGD. With model incoherence, our results show that SGD has a faster convergence rate and smaller convergence error under random reshuffle than those under random sampling, and hence provide justifications to the superior practical performance of SGD with random reshuffle.

preprint2019arXiv

Evidence for nematic superconductivity of topological surface states in PbTaSe2

Spontaneous symmetry breaking has been a paradigm to describe the phase transitions in condensed matter physics. In addition to the continuous electromagnetic gauge symmetry, an unconventional superconductor can break discrete symmetries simultaneously, such as time reversal and lattice rotational symmetry. In this work we report a characteristic in-plane 2-fold behaviour of the resistive upper critical field and point-contact spectra on the superconducting semimetal PbTaSe2 with topological nodal-rings, despite its hexagonal lattice symmetry (or D_3h in bulk while C_3v on surface, to be precise). However, we do not observe any lattice rotational symmetry breaking signal from field-angle-dependent specific heat. It is worth noting that such surface-only electronic nematicity is in sharp contrast to the observation in the topological superconductor candidate, CuxBi2Se3, where the nematicity occurs in various bulk measurements. In combination with theory, superconducting nematicity is likely to emerge from the topological surface states of PbTaSe2, rather than the proximity effect. The issue of time reversal symmetry breaking is also addressed. Thus, our results on PbTaSe2 shed new light on possible routes to realize nematic superconductivity with nontrivial topology.

preprint2019arXiv

Extensive beam test study of prototype MRPCs for the T0 detector at the CSR external-target experiment

The CSR External-target Experiment (CEE) will be the first large-scale nuclear physics experiment device at the Cooling Storage Ring (CSR) of the Heavy-Ion Research Facility in Lanzhou (HIRFL) in China. A new T0 detector has been proposed to measure the multiplicity, angular distribution and timing information of charged particles produced in heavy-ion collisions at the target region. Multi-gap resistive plate chamber (MRPC) technology was chosen as part of the construction of the T0 detector, which provides precision event collision times (T0) and collision geometry information. The prototype was tested with hadron and heavy-ion beams to study its performance. By comparing the experimental results with a Monte Carlo simulation, the time resolution of the MRPCs are found to be $\sim$ 50 ps or better. The timing performance of the T0 detector, including both detector and readout electronics, we found to fulfil the requirements of the CEE.

preprint2019arXiv

Formation of finite-time singularities for nonlinear elastodynamics with small initial disturbances

This article concerns the formation of finite-time singularities in solutions to quasilinear hyperbolic systems with small initial data. By constructing a special test function, we first present a simpler proof of the main result in Sideris' "Formation of singularities in three-dimensional compressible fluids": the global classical solution is non-existent for compressible Euler equation even for some small initial data. Then we apply this approach to nonlinear elastodynamics and magnetohydrodynamics, showing that the classical solutions to these equations can still blow up in finite time even if the initial data is small enough.

preprint2019arXiv

High Speed Mid-Infrared Interband Cascade Photodetector Based on InAs/GaSb Type-II Superlattice

High speed mid-wave infrared (MWIR) photodetectors have applications in the areas such as free space optical communication and frequency comb spectroscopy. However, most of the research on the MWIR photodetectors is focused on how to increase the quantum efficiency and reduce the dark current, in order to improve the detectivity (D*), and the 3dB bandwidth performance of the corresponding MWIR photodetectors is still not fully studied. In this work, we report and characterize a MWIR interband cascade photodetector based on InAs/GaSb type-II superlattice with a 50% cutoff wavelength at ~5.3 um at 300 K. The 3 dB cutoff frequency is 2.4 GHz at 300 K, for a 40 μm circular diameter device under -5 V applied bias. Limitations on the detector high speed performance are also discussed.High speed mid-wave infrared (MWIR) photodetectors have applications in the areas such as free space optical communication and frequency comb spectroscopy. However, most of the research on the MWIR photodetectors is focused on how to increase the quantum efficiency and reduce the dark current, in order to improve the detectivity (D*), and the 3dB bandwidth performance of the corresponding MWIR photodetectors is still not fully studied. In this work, we report and characterize a MWIR interband cascade photodetector based on InAs/GaSb type-II superlattice with a 50% cutoff wavelength at ~5.3 um at 300 K. The 3 dB cutoff frequency is 2.4 GHz at 300 K, for a 40 um circular diameter device under -5 V applied bias. Limitations on the detector high speed performance are also discussed.

preprint2019arXiv

On some conjectures by Lu and Wenzel

In order to give a unified generalization of the BW inequality and the DDVV inequality, Lu and Wenzel proposed three Conjectures 1, 2, 3 and an open Question 1 in 2016. In this paper we discuss further these conjectures and put forward several new conjectures which will be shown equivalent to Conjecture 2. In particular, we prove Conjecture 2 and hence all conjectures in some special cases. For Conjecture 3, we obtain a bigger upper bound $2+\sqrt{10}/2$, and we also give a weaker answer for the more general Question 1. In addition, we obtain some new simple proofs of the complex BW inequality and the condition for equality.

preprint2019arXiv

Superconductivity, pair density wave, and Neel order in cuprates

We investigate in underdoped cuprates possible coexistence of the superconducting (SC) order at zero momentum and pair density wave (PDW) at momentum ${\bf Q}=(π, π)$ in the presence of a Neel order. By symmetry, the $d$-wave uniform singlet pairing $dS_0$ can coexist with the $d$-wave triplet PDW $dT_{\bf Q}$, and the $p$-wave singlet PDW $pS_{\bf Q}$ can coexist with the $p$-wave uniform triplet $pT_0$. At half filling, we find the novel $pS_{\bf Q}+pT_0$ state is energetically more favorable than the $dS_0+dT_{\bf Q}$ state. At finite doping, however, the $dS_0+dT_{\bf Q}$ state is more favorable. In both types of states, the variational triplet parameters, $dT_{\bf Q}$ and $pT_0$, are of secondary significance. Our results point to a fully symmetric $\mathrm{Z_2}$ quantum spin liquid with spinon Fermi surface in proximity to the Neel order at zero doping, and to intertwined $d$-wave triplet PDW fluctuations and spin moment fluctuations along with the dominant $d$-wave singlet SC at finite doping. The results are obtained by variational quantum Monte Carlo simulations.

preprint2019arXiv

Supervised Encoding for Discrete Representation Learning

Classical supervised classification tasks search for a nonlinear mapping that maps each encoded feature directly to a probability mass over the labels. Such a learning framework typically lacks the intuition that encoded features from the same class tend to be similar and thus has little interpretability for the learned features. In this paper, we propose a novel supervised learning model named Supervised-Encoding Quantizer (SEQ). The SEQ applies a quantizer to cluster and classify the encoded features. We found that the quantizer provides an interpretable graph where each cluster in the graph represents a class of data samples that have a particular style. We also trained a decoder that can decode convex combinations of the encoded features from similar and different clusters and provide guidance on style transfer between sub-classes.

preprint2016arXiv

A Set Theoretic Approach for Knowledge Representation: the Representation Part

In this paper, we propose a set theoretic approach for knowledge representation. While the syntax of an application domain is captured by set theoretic constructs including individuals, concepts and operators, knowledge is formalized by equality assertions. We first present a primitive form that uses minimal assumed knowledge and constructs. Then, assuming naive set theory, we extend it by definitions, which are special kinds of knowledge. Interestingly, we show that the primitive form is expressive enough to define logic operators, not only propositional connectives but also quantifiers.

preprint2016arXiv

DAP3D-Net: Where, What and How Actions Occur in Videos?

Action parsing in videos with complex scenes is an interesting but challenging task in computer vision. In this paper, we propose a generic 3D convolutional neural network in a multi-task learning manner for effective Deep Action Parsing (DAP3D-Net) in videos. Particularly, in the training phase, action localization, classification and attributes learning can be jointly optimized on our appearancemotion data via DAP3D-Net. For an upcoming test video, we can describe each individual action in the video simultaneously as: Where the action occurs, What the action is and How the action is performed. To well demonstrate the effectiveness of the proposed DAP3D-Net, we also contribute a new Numerous-category Aligned Synthetic Action dataset, i.e., NASA, which consists of 200; 000 action clips of more than 300 categories and with 33 pre-defined action attributes in two hierarchical levels (i.e., low-level attributes of basic body part movements and high-level attributes related to action motion). We learn DAP3D-Net using the NASA dataset and then evaluate it on our collected Human Action Understanding (HAU) dataset. Experimental results show that our approach can accurately localize, categorize and describe multiple actions in realistic videos.

preprint2016arXiv

DAVE: A Unified Framework for Fast Vehicle Detection and Annotation

Vehicle detection and annotation for streaming video data with complex scenes is an interesting but challenging task for urban traffic surveillance. In this paper, we present a fast framework of Detection and Annotation for Vehicles (DAVE), which effectively combines vehicle detection and attributes annotation. DAVE consists of two convolutional neural networks (CNNs): a fast vehicle proposal network (FVPN) for vehicle-like objects extraction and an attributes learning network (ALN) aiming to verify each proposal and infer each vehicle's pose, color and type simultaneously. These two nets are jointly optimized so that abundant latent knowledge learned from the ALN can be exploited to guide FVPN training. Once the system is trained, it can achieve efficient vehicle detection and annotation for real-world traffic surveillance data. We evaluate DAVE on a new self-collected UTS dataset and the public PASCAL VOC2007 car and LISA 2010 datasets, with consistent improvements over existing algorithms.

preprint2016arXiv

Desiging Artificial Lieb Lattice on Metal Surface

Recently, several experiments have illustrated that metal surface electrons can be manipulated to form a two dimensional (2D) lattice by depositing a designer molecule lattice on metal surface. This offers a promising new technique to construct artificial 2D electron lattices. Here we theoretically propose a molecule lattice pattern to realize an artificial Lieb lattice on metal surface, which shows a flat electronic band due to the lattice geometry. We show that the localization of electrons in the flat band may be understood from the viewpoint of electron interference, which may be probed by measuring the local density of states with the scanning tunnelling microscopy. Our proposal may be readily implemented in experiment and may offer an ideal solid state platform to investigate the novel flat band physics of the Lieb lattice.

preprint2016arXiv

Electric control of inverted gap and hybridization gap in type II InAs/GaSb quantum wells

The quantum spin Hall effect has been predicted theoretically and observed experimentally in InAs/GaSb quantum wells as a result of inverted band structures, for which electron bands in InAs layers are below heavy hole bands in GaSb layers in energy. The hybridization between electron bands and heavy hole bands leads to a hybridization gap away from k=0. A recent puzzling observation in experiments is that when the system is tuned to more inverted regime by a gate voltage (a larger inverted gap at k=0), the hybridization gap decreases. Motivated by this experiment, we explore the dependence of hybridization gap as a function of external electric fields based on the eight-band Kane model. We identify two regimes when varying electric fields: (1) both inverted and hybridization gaps increase and (2) inverted gap increases while hybridization gap decreases. Based on the effective model, we find that light-hole bands in GaSb layers play an important role in determining hybridization gap. In addition, a large external electric field can induce a strong Rashba splitting and also influence hybridization gap.

preprint2016arXiv

Helicity protected ultrahigh mobility Weyl fermions in NbP

Non-centrosymmetric transition metal monopnictides, including TaAs, TaP, NbAs, and NbP, are emergent topological Weyl semimetals (WSMs) hosting exotic relativistic Weyl fermions. In this letter, we elucidate the physical origin of the unprecedented charge carrier mobility of NbP, which can reach $1\times10^{7}$ cm $^{2}$V$^{-1}$s$^{-1}$ at 1.5 K. Angle- and temperature-dependent quantum oscillations, supported by density function theory calculations, reveal that NbP has the coexistence of p- and n-type WSM pockets in the $k_{z}$=1.16$π$/c plane (W1-WSM) and in the $k_{z}$=0 plane near the high symmetry points $Σ$ (W2-WSM), respectively. Uniquely, each W2-WSM pocket forms a large dumbbell-shaped Fermi surface (FS) enclosing two neighboring Weyl nodes with the opposite chirality. The magneto-transport in NbP is dominated by these highly anisotropic W2-WSM pockets, in which Weyl fermions are well protected from defect backscattering by real spin conservation associated to the chiral nodes. However, with a minimal doping of $\sim$1\% Cr, the mobility of NbP is degraded by more than two order of magnitude, due to the invalid of helicity protection to magnetic impurities. Helicity protected Weyl fermion transport is also manifested in chiral anomaly induced negative magnetoresistance, controlled by the W1-WSM states. In the quantum regime below 10 K, the intervalley scattering time by impurities becomes a large constant, producing the sharp and nearly identical conductivity enhancement at low magnetic field.

preprint2016arXiv

Instability of three-band Tomonaga-Luttinger liquid: renormalization group analysis and possible application to K2Cr3As3

Motivated by recently discovered quasi-one-dimensional superconductor K$_{2}$Cr$_{3}$As$_{3}$ with $D_{3h}$ lattice symmetry, we study one-dimensional three-orbital Hubbard model with generic electron repulsive interaction described by intra-orbital repulsion $U$, inter-orbital repulsion, and Hund's coupling $J$. As extracted from density functional theory calculation, two of the three atomic orbitals are degenerate ($E^{\prime}$ states) and the third one is non-degenerate ($A^{\prime}_1$), and the system is presumed to be at an incommensurate filling. With the help of bosonization, we have usual three-band Tomonaga-Luttinger liquid for the normal state. Possible charge density wave (CDW), spin density wave (SDW) and superconducting (SC) instabilities are analyzed by renormalization group. The ground state depends on the ratio $J/U$ and is sensitive to the degeneracy of $E^{\prime}$ bands. At $0<J<U/3$, spin-singlet SC state is favored, while spin-triplet superconductivity will be favored in the region $U/3<J<U/2$. The SDW state has the lowest energy only in the unphysical parameter region $J>U/2$. When the two-fold degeneracy of $E^{\prime}$ bands is lifted, SDW instability has the tendency to dominate over the spin-singlet SC state at $0<J<U/3$, while the order parameter of the spin-triplet SC state will be modulated by a phase factor $2Δk_F x$ at $U/3<J<U/2$. Possible experimental consequences and applications to K$_{2}$Cr$_{3}$As$_{3}$ are discussed.

preprint2016arXiv

Observation of Majorana fermions with spin selective Andreev reflection in the vortex of topological superconductor

Majorana fermion (MF) whose antiparticle is itself has been predicted in condensed matter systems. Signatures of the MFs have been reported as zero energy modes in various systems. More definitive evidences are highly desired to verify the existence of the MF. Very recently, theory has predicted MFs to induce spin selective Andreev reflection (SSAR), a novel magnetic property which can be used to detect the MFs. Here we report the first observation of the SSAR from MFs inside vortices in Bi2Te3/NbSe2 hetero-structure, in which topological superconductivity was previously established. By using spin-polarized scanning tunneling microscopy/spectroscopy (STM/STS), we show that the zero-bias peak of the tunneling differential conductance at the vortex center is substantially higher when the tip polarization and the external magnetic field are parallel than anti-parallel to each other. Such strong spin dependence of the tunneling is absent away from the vortex center, or in a conventional superconductor. The observed spin dependent tunneling effect is a direct evidence for the SSAR from MFs, fully consistent with theoretical analyses. Our work provides definitive evidences of MFs and will stimulate the MFs research on their novel physical properties, hence a step towards their statistics and application in quantum computing.

preprint2016arXiv

Reshaped Wirtinger Flow and Incremental Algorithm for Solving Quadratic System of Equations

We study the phase retrieval problem, which solves quadratic system of equations, i.e., recovers a vector $\boldsymbol{x}\in \mathbb{R}^n$ from its magnitude measurements $y_i=|\langle \boldsymbol{a}_i, \boldsymbol{x}\rangle|, i=1,..., m$. We develop a gradient-like algorithm (referred to as RWF representing reshaped Wirtinger flow) by minimizing a nonconvex nonsmooth loss function. In comparison with existing nonconvex Wirtinger flow (WF) algorithm \cite{candes2015phase}, although the loss function becomes nonsmooth, it involves only the second power of variable and hence reduces the complexity. We show that for random Gaussian measurements, RWF enjoys geometric convergence to a global optimal point as long as the number $m$ of measurements is on the order of $n$, the dimension of the unknown $\boldsymbol{x}$. This improves the sample complexity of WF, and achieves the same sample complexity as truncated Wirtinger flow (TWF) \cite{chen2015solving}, but without truncation in gradient loop. Furthermore, RWF costs less computationally than WF, and runs faster numerically than both WF and TWF. We further develop the incremental (stochastic) reshaped Wirtinger flow (IRWF) and show that IRWF converges linearly to the true signal. We further establish performance guarantee of an existing Kaczmarz method for the phase retrieval problem based on its connection to IRWF. We also empirically demonstrate that IRWF outperforms existing ITWF algorithm (stochastic version of TWF) as well as other batch algorithms.

preprint2016arXiv

The effect of in-plane magnetic field and applied strain in quantum spin Hall systems: application to InAs/GaSb quantum wells

Motivated by the recent discovery of quantized spin Hall effect in InAs/GaSb quantum wells\cite{du2013}$^,$\cite{xu2014}, we theoretically study the effects of in-plane magnetic field and strain effect to the quantization of charge conductance by using Landauer-Butikker formalism. Our theory predicts a robustness of the conductance quantization against the magnetic field up to a very high field of 20 tesla. We use a disordered hopping term to model the strain and show that the strain may help the quantization of the conductance. Relevance to the experiments will be discussed.

preprint2016arXiv

Theory for Spin Selective Andreev Reflection in Vortex Core of Topological Superconductor: Majorana Zero Modes on Spherical Surface and Application to Spin Polarized Scanning Tunneling Microscope Probe

Majorana zero modes (MZMs) have been predicted to exist in the topological insulator (TI)/superconductor (SC) heterostructure. Recent spin polarized scanning tunneling microscope (STM) experiment$^{1}$ has observed spin-polarization dependence of the zero bias differential tunneling conductance at the center of vortex core, which may be attributed to the spin selective Andreev reflection, a novel property of the MZMs theoretically predicted in 1-dimensional nanowire$^{2}$. Here we consider a helical electron system described by a Rashba spin orbit coupling Hamiltonian on a spherical surface with a s-wave superconducting pairing due to proximity effect. We examine in-gap excitations of a pair of vortices with one at the north pole and the other at the south pole. While the MZM is not a spin eigenstate, the spin wavefunction of the MZM at the center of the vortex core, r = 0, is parallel to the magnetic field, and the local Andreev reflection of the MZM is spin selective, namely occurs only when the STM tip has the spin polarization parallel to the magnetic field, similar to the case in 1-dimensional nanowire2. The total local differential tunneling conductance consists of the normal term proportional to the local density of states and an additional term arising from the Andreev reflection. We also discuss the finite size effect, for which the MZM at the north pole is hybridized with the MZM at the south pole. We apply our theory to examine the recently reported spin-polarized STM experiments and show good agreement with the experiments.

preprint2015arXiv

A Cosmic Ray Test Platform Based on the High Time Resolution MRPC Technology

In order to test the performance of detector/prototype in environment of laboratory, we design and build a larger area ($90\times52$ $cm^2$) test platform of cosmic ray based on well-designed Multi-gap Resistive Plate Chamber (MRPC) with an excellent time resolution and a high detection efficiency for the minimum ionizing particles (MIPs). The time resolution of the MRPC module used is tested to be ~80 ps, and the position resolution along the strip is ~5 mm, while the position resolution perpendicular to the strip is ~12.7 mm. The platform constructed by four MRPC modules can be functional for tracking the cosmic rays with a spatial resolution ~6.3 mm, and provide a reference time ~40 ps.

preprint2015arXiv

A magnetic Impurity in a Weyl semimetal

We utilize the variational method to study the Kondo screening of a spin-$1/2$ magnetic impurity in a three-dimensional (3D) Weyl semimetal with two Weyl nodes along the $k_z$-axis. The model reduces to a 3D Dirac semimetal when the separation of the two Weyl nodes vanishes. When the chemical potential lies at the nodal point, $μ=0$, the impurity spin is screened only if the coupling between the impurity and the conduction electron exceeds a critical value. For finite but small $μ$, the impurity spin is weakly bound due to the low density of state, which is proportional to $μ^2$, contrary to that in a 2D Dirac metal such as graphene and 2D helical metal where the density of states is proportional to $|μ|$. The spin-spin correlation function $J_{uv}(\mathbf{r})$ between the spin $v$-component of the magnetic impurity at the origin and the spin $u$-component of a conduction electron at spatial point $\mathbf{r}$, is found to be strongly anisotropic due to the spin-orbit coupling, and it decays in the power-law. The main difference of the Kondo screening in 3D Weyl semimetals and in Dirac semimetals is in the spin $x$- ($y$-) component of the correlation function in the spatial direction of the $z$-axis.

preprint2015arXiv

A two-stage video coding framework with both self-adaptive redundant dictionary and adaptively orthonormalized DCT basis

In this work, we propose a two-stage video coding framework, as an extension of our previous one-stage framework in [1]. The two-stage frameworks consists two different dictionaries. Specifically, the first stage directly finds the sparse representation of a block with a self-adaptive dictionary consisting of all possible inter-prediction candidates by solving an L0-norm minimization problem using an improved orthogonal matching pursuit with embedded orthonormalization (eOMP) algorithm, and the second stage codes the residual using DCT dictionary adaptively orthonormalized to the subspace spanned by the first stage atoms. The transition of the first stage and the second stage is determined based on both stages' quantization stepsizes and a threshold. We further propose a complete context adaptive entropy coder to efficiently code the locations and the coefficients of chosen first stage atoms. Simulation results show that the proposed coder significantly improves the RD performance over our previous one-stage coder. More importantly, the two-stage coder, using a fixed block size and inter-prediction only, outperforms the H.264 coder (x264) and is competitive with the HEVC reference coder (HM) over a large rate range.

preprint2015arXiv

An optimal randomized incremental gradient method

In this paper, we consider a class of finite-sum convex optimization problems whose objective function is given by the summation of $m$ ($\ge 1$) smooth components together with some other relatively simple terms. We first introduce a deterministic primal-dual gradient (PDG) method that can achieve the optimal black-box iteration complexity for solving these composite optimization problems using a primal-dual termination criterion. Our major contribution is to develop a randomized primal-dual gradient (RPDG) method, which needs to compute the gradient of only one randomly selected smooth component at each iteration, but can possibly achieve better complexity than PDG in terms of the total number of gradient evaluations. More specifically, we show that the total number of gradient evaluations performed by RPDG can be ${\cal O} (\sqrt{m})$ times smaller, both in expectation and with high probability, than those performed by deterministic optimal first-order methods under favorable situations. We also show that the complexity of the RPDG method is not improvable by developing a new lower complexity bound for a general class of randomized methods for solving large-scale finite-sum convex optimization problems. Moreover, through the development of PDG and RPDG, we introduce a novel game-theoretic interpretation for these optimal methods for convex optimization.

preprint2015arXiv

Distributed Machine Learning via Sufficient Factor Broadcasting

Matrix-parametrized models, including multiclass logistic regression and sparse coding, are used in machine learning (ML) applications ranging from computer vision to computational biology. When these models are applied to large-scale ML problems starting at millions of samples and tens of thousands of classes, their parameter matrix can grow at an unexpected rate, resulting in high parameter synchronization costs that greatly slow down distributed learning. To address this issue, we propose a Sufficient Factor Broadcasting (SFB) computation model for efficient distributed learning of a large family of matrix-parameterized models, which share the following property: the parameter update computed on each data sample is a rank-1 matrix, i.e., the outer product of two "sufficient factors" (SFs). By broadcasting the SFs among worker machines and reconstructing the update matrices locally at each worker, SFB improves communication efficiency --- communication costs are linear in the parameter matrix's dimensions, rather than quadratic --- without affecting computational correctness. We present a theoretical convergence analysis of SFB, and empirically corroborate its efficiency on four different matrix-parametrized ML models.

preprint2015arXiv

Distributed Machine Learning via Sufficient Factor Broadcasting

preprint2015arXiv

Global existence of radial solutions for general semilinear hyperbolic systems in 3D

We study the well-posedness of radial solutions for general nonlinear hyperbolic systems in three dimensions. We give a proof of the global existence of radial solutions for general semilinear hyperbolic systems in 3D under null condition, with small scaling invariant $\dot{W}^{2,1}(\mathbb{R}^3)$ data. We obtain a bilinear estimate that is effective to the hyperbolic systems which do not have any time decay. It allows us to achieve the boundedness of the weighted BV norm of the radial solution.

preprint2015arXiv

Lifespan of Classical Solutions to Quasilinear Wave Equations Outside of a Star-Shaped Obstacle in Four Space Dimensions

We study the initial-boundary value problem of quasilinear wave equations outside of a star-shaped obstacle in four space dimensions, in which the nonlinear term under consideration may explicitly depend on the unknown function itself. By some new $L^{\infty}_{t}L^{2}_{x}$ and weighted $L^{2}_{t,x}$ estimates for the unknown function itself, together with energy estimates and KSS estimates, for the quasilinear obstacle problem we obtain a lower bound of the lifespan $T_{\varepsilon}\geq \exp{(\frac{c}{\varepsilon^2})}$, which coincides with the sharp lower bound of lifespan estimate for the corresponding Cauchy problem.

preprint2015arXiv

Lower bounds on blowing-up solutions of the 3D Navier--Stokes equations in $\dot H^{3/2}$, $\dot H^{5/2}$, and $\dot B^{5/2}_{2,1}$

If $u$ is a smooth solution of the Navier--Stokes equations on ${\mathbb R}^3$ with first blowup time $T$, we prove lower bounds for $u$ in the Sobolev spaces $\dot H^{3/2}$, $\dot H^{5/2}$, and the Besov space $\dot B^{5/2}_{2,1}$, with optimal rates of blowup: we prove the strong lower bounds $\|u(t)\|_{\dot H^{3/2}}\ge c(T-t)^{-1/2}$ and $\|u(t)\|_{\dot B^{5/2}_{2,1}}\ge c(T-t)^{-1}$, but in $\dot H^{5/2}$ we only obtain the weaker result $\limsup_{t\to T^-}(T-t)\|u(t)\|_{\dot H^{5/2}}\ge c$. The proofs involve new inequalities for the nonlinear term in Sobolev and Besov spaces, both of which are obtained using a dyadic decomposition of $u$.

preprint2015arXiv

Structure of Helicity and Global Solutions of Incompressible Navier-Stokes Equation

In this paper we derive a new energy identity for the three-dimensional incompressible Navier-Stokes equations by a special structure of helicity. The new energy functional is critical with respect to the natural scalings of the Navier-Stokes equations. Moreover, it is conditionally coercive. As an application we construct a family of finite energy smooth solutions to the Navier-Stokes equations whose critical norms can be arbitrarily large.

preprint2014arXiv

A Large area GEM Detector Using an improved Self-stretch Technique

A GEM detector with an effective area of 30*30 cm2 has been constructed using an improved self-stretch technique, which enables an easy and fast GEM assembling. The design and assembling of the detector is described. Results from tests of the detector with 8 keV X-rays on effective gain and energy resolution are presented.

preprint2014arXiv

A Logical Study of Partial Entailment

We introduce a novel logical notion--partial entailment--to propositional logic. In contrast with classical entailment, that a formula P partially entails another formula Q with respect to a background formula set Γintuitively means that under the circumstance of Γ, if P is true then some "part" of Q will also be true. We distinguish three different kinds of partial entailments and formalize them by using an extended notion of prime implicant. We study their semantic properties, which show that, surprisingly, partial entailments fail for many simple inference rules. Then, we study the related computational properties, which indicate that partial entailments are relatively difficult to be computed. Finally, we consider a potential application of partial entailments in reasoning about rational agents.

preprint2014arXiv

Electrically controllable magnetic order in the bilayer Hubbard model on honeycomb lattice --- a determinant quantum Monte Carlo study

Layered antiferromagnetic spin density wave (LAF) state is one of the plausible ground states of charge neutral Bernal stacked bilayer graphene. In this paper, we use determinant quantum Monte Carlo method to study the effect of the electric field on the magnetic order in bilayer Hubbard model on a honeycomb lattice. Our results qualitatively support the LAF ground state found in the mean field theory. The obtained magnetic moments, however, are much smaller than what are estimated in the mean field theory. As electric field increases, the magnetic order parameter rapidly decreases.

preprint2014arXiv

Gutzwiller Approach for Elementary Excitations in $S=1$ Antiferromagnetic Chains

In a previous paper [Phys. Rev. B 85,195144 (2012)], variational Monte Carlo method (based on Gutzwiller projected states) was generalized to $S=1$ systems. This method provided very good trial ground states for the gapped phases of $S=1$ bilinear-biquadratic (BLBQ) Heisenberg chain. In the present paper, we extend the approach to study the low-lying elementary excitations in $S=1$ chains. We calculate the one-magnon and two-magnon excitation spectra of the BLBQ Heisenberg chain and the results agree very well with recent data in literature. In our approach, the difference of the excitation spectrum between the Haldane phase and the dimer phase (such as the even/odd size effect) can be understood from their different topology of corresponding mean field theory. We especially study the Takhtajan-Babujian critical point. Despite the fact that the `elementary excitations' are spin-1 magnons which are different from the spin-1/2 spinons in Bethe solution, we show that the excitation spectrum, critical exponent ($η=0.74$) and central charge ($c=1.45$) calculated from our theory agree well with Bethe ansatz solution and conformal field theory predictions.

preprint2014arXiv

Localized States and Quantum Spin Hall Effect in Si-Doped InAs/GaSb Quantum Wells

We study localized in-gap states and quantum spin Hall effect in Si-doped InAs/GaSb quantum wells. We propose a model describing donor and/or acceptor impurities to describe Si dopants. This model shows in-gap bound states and wide conductance plateau with the quantized value $2e^2/h$ in light dopant concentration, consistent with recent experiments by Du et al. We predict a conductance dip structure due to backward scattering in the region where the localization length $ξ$ is comparable with the sample width $L_y$ but much smaller than the sample length $L_x$.

preprint2014arXiv

Reconstructing human organ cross-sectional imaging along any axis

Cross-sectional imaging of human organ serves as a critical tool to provide diagnostic results of many diseases. Based on a unique body coordinate system, we present a method that we use to reconstruct any cross-sectional imaging of organ regardless of its original section going along which scanning or cutting axis. In clinical medicine, this method enables a patient to undergo only one scanning, and then the doctor can observe the structure of lesion sections along any axis, and it can help find changes of lesions at the same section from different scanning results and thus quantify diagnosis by cross-sectional imaging. Significant progress has thus been made towards quantitative diagnosis cross-sectional imaging.

preprint2014arXiv

Superconductivity in a molecular graphene

We propose that constructing a molecule super-lattice on a superconducting ultrathin film is a promising way to manipulate superconductivity in experiment. We theoretically study superconductivity in a molecule graphene system, which is built by fabricating a hexagonal molecule super-lattice on 2-dimensional electron gas. The super-lattice potential dramatically changes the electron density of states, which oscillates as function of the energy. We show that such a molecular graphene may increase superconducting gap by a few times, which may open a new route to realize high temperature superconductivity.

preprint2013arXiv

Majority Rule for Belief Evolution in Social Networks

In this paper, we study how an agent's belief is affected by her neighbors in a social network. We first introduce a general framework, where every agent has an initial belief on a statement, and updates her belief according to her and her neighbors' current beliefs under some belief evolution functions, which, arguably, should satisfy some basic properties. Then, we focus on the majority rule belief evolution function, that is, an agent will (dis)believe the statement iff more than half of her neighbors (dis)believe it. We consider some fundamental issues about majority rule belief evolution, for instance, whether the belief evolution process will eventually converge. The answer is no in general. However, for random asynchronous belief evolution, this is indeed the case.

preprint2013arXiv

Near-field focusing of dielectric microspheres: Super-resolution and field-invariant parameter scaling

Optical near-fields of small dielectric particles are of particular importance and interests for nanoscale optical engineering such as field localization, fabrication, characterization, sensing and imaging. This paper represents a systematic investigation on the focusing characteristics (focal length, field enhancement, spot size) for a given refractive-index microsphere (n=1.6) with a varying size parameter across the range of pi<q0<20*pi. Conditions for super-resolution foci were analysised in details. Particularly strong super-resolution foci with spot size falling at least 50% below the diffraction limit were identified and possible new applications were suggested. To understand how the super-resolution conditions could be scaled to other refractive-index particles or background medium, principles of field-invariant parameters scaling (size, wavelength, and refractive index) were revealed and demonstrated with example cases. It offers the new freedom to choose particles and background medium to gain super-resolution at any frequency across the whole electromagnetic spectrum.

preprint2013arXiv

Spin Liquid States at the vicinity of metal-insulator transition

We study in this paper quantum spin liquid states (QSLs) at the vicinity of metal-insulator transition. Assuming that the low energy excitations in the QSLs are labeled by "spinon" occupation numbers with the same Fermi surface structure as in the corresponding metal (Fermi-liquid) side, we propose a phenomenological Landau-like low energy theory for the QSLs and show that the usual U(1) QSLs is a representative member of this class of spin liquids. Based on our effective low energy theory, an alternative picture to the Brinkman-Rice picture of Mott metal-insulator transition is proposed. The charge, spin and thermal responses of QSLs are discussed under such a phenomenology.

preprint2012arXiv

Almost Global Existence for 2-D Incompressible Isotropic Elastodynamics

We consider the Cauchy problem for 2-D incompressible isotropic elastodynamics. Standard energy methods yield local solutions on a time interval $[0,{T}/ε]$, for initial data of the form $εU_0$, where $T$ depends only on some Sobolev norm of $U_0$. We show that for such data there exists a unique solution on a time interval $[0, \exp{T}/ε]$, provided that $ε$ is sufficiently small. This is achieved by careful consideration of the structure of the nonlinearity. The incompressible elasticity equation is inherently linearly degenerate in the isotropic case; in other words, the equation satisfies a null condition. This is essential for time decay estimates. The pressure, which arises as a Lagrange multiplier to enforce the incompressibility constraint, is estimated in a novel way as a nonlocal nonlinear term with null structure. The proof employs the generalized energy method of Klainerman, enhanced by weighted $L^2$ estimates and the ghost weight introduced by Alinhac.

preprint2012arXiv

Blow up for some semilinear wave equations in multi-space dimensions

In this paper, we discuss a new nonlinear phenomenon. We find that in $n\geq 2$ space dimensions, there exists two indexes $p$ and $q$ such that the cauchy problems for the nonlinear wave equations {equation} \label{0.1} \Box u(t,x) = |u(t,x)|^{q}, \ \ x\in R^{n}, {equation} and {equation} \label{0.2} \Box u(t,x) = |u_{t}(t,x)|^{p}, \ \ x\in R^{n} {equation} both have global existence for small initial data, while for the combined nonlinearity, the solutions to the Cauchy problem for the nonlinear wave equation {equation} \label{0.3} \Box u(t,x) = | u_{t}(t,x)|^{p} + |u(t,x)|^{q}, \ \ x\in R^{n}, {equation} with small initial data will blow up in finite time. In the two dimensional case, we also find that if $ q=4$, the Cauchy problem for the equation \eqref{0.1} has global existence, and the Cauchy problem for the equation {equation} \label{0.4} \Box u(t,x) = u (t,x)u_{t}(t,x)^{2}, \ \ x\in R^{2} {equation} has almost global existence, that is, the life span is at least $ \exp (c\varepsilon^{-2}) $ for initial data of size $ \varepsilon$. However, in the combined nonlinearity case, the Cauchy problem for the equation {equation} \label{0.5} \Box u(t,x) = u(t,x) u_{t}(t,x)^{2} + u(t,x)^{4}, \ \ x\in R^{2} {equation} has a life span which is of the order of $ \varepsilon^{-18} $ for the initial data of size $ \varepsilon$, this is considerably shorter in magnitude than that of the first two equations. This solves an open optimality problem for general theory of fully nonlinear wave equations (see \cite{Katayama}).

preprint2012arXiv

Bond distortion effects and electric orders in spiral multiferroic magnets

We study in this paper bond distortion effect on electric polarization in spiral multiferroic magnets based on cluster and chain models. The bond distortion break inversion symmetry and modify the $d$-$p$ hybridization. Consequently, it will affect electric polarization which can be divided into spin-current part and lattice-mediated part. The spin-current polarization can be written in terms of $\vec{e}_{i,j}\times(\vec{e}_{i}\times\vec{e}_{j}) $ and the lattice-mediated polarization exists only when the M-O-M bond is distorted. The electric polarization for three-atom M-O-M and four-atom M-O$_{2}$-M clusters is calculated. We also study possible electric ordering in three kinds of chains made of different clusters. We apply our theory to multiferroics cuprates and find that the results are in agreement with experimental observations.

preprint2012arXiv

Edge superconducting state in attractive U Kane-Mele-Hubbard model

We theoretically investigate the phase transition from topological insulator (TI) to superconductor in the attractive U Kane-Mele-Hubbard model with self-consistent mean field method. We demonstrate the existence of edge superconducting state (ESS), in which the bulk is still an insulator and the superconductivity only appears near the edges. The ESS results from the special energy dispersion of TI, and is a general property of the superconductivity in TI. The phase transition in this model essentially consists of two steps. When the attractive U becomes nonzero, ESS appears immediately. After the attractive U exceeds a critical value $U_c$, the whole system becomes a superconductor. The effective model of the ESS has also been discussed and we believe that the conception of ESS can be realized in atomic optical lattice system.

preprint2012arXiv

Formulation of finite-time singularity for free-surface Euler equations

We give an extremely short proof that the free-surface incompressible, irrotational Euler equations with regular initial condition can form a finite time singularity in 2D or 3D. Thus, we provide a simple view of the problem studied by Castro, Cordoba, Fefferman, Gancedo, Lopez-Fernadez, Gomez-Serrano and Coutand, Shkoller.

preprint2012arXiv

Global Solutions of Evolutionary Faddeev Model With Small Initial Data

We consider the Cauchy problem for evolutionary Faddeev model corresponding to maps from the Minkowski space $\mathbb{R}^{1 + n}$ to the unit sphere $\mathbb{S}^2$, which obey a system of non-linear wave equations. The nonlinearity enjoys the null structure and contains semi-linear terms, quasi-linear terms and unknowns themselves. We prove that the Cauchy problem is globally well-posed for sufficiently small initial data in Sobolev space.

preprint2012arXiv

Gutzwiller Projected wavefunctions in the fermonic theory of S=1 spin chains

We study in this paper a series of Gutzwiller Projected wavefunctions for S=1 spin chains obtained from a fermionic mean-field theory for general S>1/2 spin systems [Phys. Rev. B 81, 224417] applied to the bilinear-biquadratic (J-K) model. The free-fermion mean field states before the projection are 1D paring states. By comparing the energies and correlation functions of the projected pairing states with those obtained from known results, we show that the optimized Gutzwiller projected wavefunctions are very good trial ground state wavefunctions for the antiferromagnetic bilinear-biquadratic model in the regime K<J, (J>0). We find that different topological phases of the free-fermion paring states correspond to different spin phases: the weak pairing (topologically non-trivial) state gives rise to the Haldane phase, whereas the strong pairing (topologically trivial) state gives rise to the dimer phase. In particular the mapping between the Haldane phase and Gutwziller wavefunction is exact at the AKLT point K=1/3. The transition point between the two phases determined by the optimized Gutzwiller Projected wavefunction is in good agreement with the known result. The effect of Z2 gauge fluctuations above the mean field theory is analyzed.

preprint2012arXiv

Stacking order, interaction and weak surface magnetism in layered graphene sheets

Recent transport experiments have demonstrated that the rhombohedral stacking trilayer graphene is an insulator with an intrinsic gap of 6meV and the Bernal stacking trilayer one is a metal. We propose a Hubbard model with a moderate $U$ for layered graphene sheets, and show that the model well explains the experiments of the stacking dependent energy gap. The on-site Coulomb repulsion drives the metallic phase of the non-interacting system to a weak surface antiferromagnetic insulator for the rhombohedral stacking layers, but does not alter the metallic phase for the Bernal stacking layers.

preprint2011arXiv

Electrical spin injection and transport in Germanium

We report the first experimental demonstration of electrical spin injection, transport and detection in bulk germanium (Ge). The non-local magnetoresistance in n-type Ge is observable up to 225K. Our results indicate that the spin relaxation rate in the n-type Ge is closely related to the momentum scattering rate, which is consistent with the predicted Elliot-Yafet spin relaxation mechanism for Ge. The bias dependence of the nonlocal magnetoresistance and the spin lifetime in n-type Ge is also investigated.

preprint2011arXiv

Life-Span of Solutions to Critical Semilinear Wave Equations

The final open part of the famous Strauss conjecture on semilinear wave equations of the form \Box u=|u|^{p}, i.e., blow-up theorem for the critical case in high dimensions was solved by Yordanov and Zhang, or Zhou independently. But the estimate for the lifespan, the maximal existence time, of solutions was not clarified in both papers. Recently, Takamura and Wakasa have obtained the sharp upper bound of the lifespan of the solution to the critical semilinear wave equations, and their method is based on the method in Yordanov and Zhang. In this paper, we give a much simple proof of the result of Takamura and Wakasa by using the method in Y. Zhou for space dimensions n\geq 2. Simultaneously, this estimate of the life span also proves the last open optimality problem of the general theory for fully nonlinear wave equations with small initial data in the case n=4 and quadratic nonlinearity(One can see Li and Chen for references on the whole history).

preprint2011arXiv

Theory for superconductivity in (Tl,K)Fe$_x$Se$_2$ as a doped Mott insulator

Possible superconductivity in recently discovered (Tl,K)Fe$_x$Se$_2$ compounds is studied from the viewpoint of doped Mott insulator. The Mott insulating phase is examined to be preferred in the parent compound at $x=1.5$ due to the presence of Fe vacancies. Partial filling of vacancies at the Fe-sites introduces electron carriers and leads to electron doped superconductivity. By using a two-orbital Hubbard model in the strong coupling limit, we find that the s-wave pairing is more favorable at small Hund's coupling, and d$_{x^2-y^2}$ wave pairing is more favorable at large Hund's coupling.

preprint2011arXiv

Unified Spin Order Theory via Gauge Landau-Lifshitz Equation

The continuum limit of the tilted SU(2) spin model is shown to give rise to the gauge Landau-Lifshitz equation which provides a unified description for various spin orders. For a definite gauge, we find a double periodic solution, where the conical spiral, in-plane spiral, helical, and ferromagnetic spin orders become special cases, respectively. For another gauge, we obtain the skyrmion-crystal solution. By simulating the influence of magnetic field and temperature for our covariant model, we find a spontaneous formation of skyrmion-fragment lattice and obtain a wider range of skyrmion-crystal phase in comparison to the conventional Dzyaloshinsky-Moriya model.

preprint2010arXiv

Blow up of Solutions to Semilinear Wave Equations with variable coefficients and boundary

This paper is devoted to studying the following two initial-boundary value problems for semilinear wave equations with variable coefficients on exterior domain with subcritical exponent in $n$ space dimensions: u_{tt}-partial_{i}(a_{ij}(x)\partial_{j}u)=|u|^{p}, (x,t)\in Ω^{c}\times(0,+\infty), n\geq 3 and u_{tt}-\partial_{i}(a_{ij}(x)\partial_{j}u)=|u_{t}|^{p}, (x,t)\in Ω^{c}\times (0,+\infty), n\geq 1, where $a_{ij}(x)=δ_{ij}, when |x|\geq R. The exponents $p$ satisfies $ 1<p<p_{1}(n)$ in (0.1), and $p \leq p_{2}(n)$ in (0.2), where $p_{1}(n)$ is the larger root of the quadratic equation (n-1)p^{2}-(n+1)p-2=0, and p_{2}(n)=\frac{2}{n-1}+1, respectively. It is well-known that the numbers p_{1}(n) and p_{2}(n) are the critical exponents. We will establish two blowup results for the above two initial-boundary value problems, it is proved that there can be no global solutions no matter how small the initial data are, and also we give the lifespan estimate of solutions for above problems.

preprint2010arXiv

Effect of Spatial Charge Inhomogeneity on 1/f Noise Behavior in Graphene

Scattering mechanisms in graphene are critical to understanding the limits of signal-to-noise-ratios of unsuspended graphene devices. Here we present the four-probe low frequency noise (1/f) characteristics in back-gated single layer graphene (SLG) and bilayer graphene (BLG) samples. Contrary to the expected noise increase with the resistance, the noise for SLG decreases near the Dirac point, possibly due to the effects of the spatial charge inhomogeneity. For BLG, a similar noise reduction near the Dirac point is observed, but with a different gate dependence of its noise behavior. Some possible reasons for the different noise behavior between SLG and BLG are discussed.

preprint2010arXiv

Fermionic theory for quantum antiferromagnets with spin S > 1/2

The fermion representation for S = 1/2 spins is generalized to spins with arbitrary magnitudes. The symmetry properties of the representation is analyzed where we find that the particle-hole symmetry in the spinon Hilbert space of S =1/2 fermion representation is absent for S > 1/2. As a result, different path integral representations and mean field theories can be formulated for spin models. In particular, we construct a Lagrangian with restored particle-hole symmetry, and apply the corresponding mean field theory to one dimensional (1D) S = 1 and S = 3/2 antiferromagnetic Heisenberg models, with results that agree with Haldane's conjecture. For a S = 1 open chain, we show that Majorana fermion edge states exist in our mean field theory. The generalization to spins with arbitrary magnitude S is discussed. Our approach can be applied to higher dimensional spin systems. As an example, we study the geometrically frustrated S = 1 AFM on triangular lattice. Two spin liquids with different pairing symmetries are discussed: the gapped px + ipy-wave spin liquid and the gapless f-wave spin liquid. We compare our mean field result with the experiment on NiGa2S4, which remains disordered at low temperature and was proposed to be in a spin liquid state. Our fermionic mean field theory provide a framework to study S > 1/2 spin liquids with fermionic spinon excitations.

preprint2010arXiv

Giant mesoscopic spin Hall effect on surface of topological insulator

We study mesoscopic spin Hall effect on the surface of topological insulator with a step-function potential. The giant spin polarization induced by a transverse electric current is derived analytically by using McMillan method in the ballistic transport limit, which oscillates across the potential boundary with no confinement from the potential barrier due to the Klein paradox, and should be observable in spin resolved scanning tunneling microscope.

preprint2010arXiv

Global existence of critical nonlinear wave equation with time dependent variable coefficients

In this paper, we establish global existence of smooth solutions for the Cauchy problem of the critical nonlinear wave equation with time dependent variable coefficients in three space dimensions {equation}\partial_{tt}ϕ-\partial_{x_i}\big(g^{ij}(t,x)\partial_{x_j}ϕ\big)+ϕ^5=0, mathbb{R}_t \times \mathbb{R}_x^3,{equation} where $\big(g_{ij}(t,x)\big)$ is a regular function valued in the spacetime of $3\times3$ positive definite matrix and $\big(g^{ij}(t,x)\big)$ its inverse matrix. Here and in the sequence, a repeated sum on an index in lower and upper position is never indicated. In the constant coefficients case, the result of global existence is due to Grillakis \cite{Grillakis1}; and in the time-independent variable coefficients case, the result of global existence and regularity is due to Ibrahim and Majdoub \cite{Ibrahim}. The key point of our proofs is to show that the energy cannot concentrate at any point. For that purpose, following Christodoulou and Klainerman \cite{Chris}, we use a null frame associated to an optical function to construct a geometric multiplier similar to the well-known Morawetz multiplier. Then we use comparison theorem originated from Riemannian Geometry to estimate the error terms. Finally, using Strichartz inequality due to \cite{Smith} as Ibrahim and Majdoub \cite{Ibrahim}, we obtain global existence.

preprint2010arXiv

Global Existence of the Critical Semilinear Wave Equations with Variable Coefficients Outside Obstacles

In this paper, we consider exterior problem of the critical semilinear wave equation in three space dimensions with variable coefficients and prove global existence of smooth solutions. Similar to the constant coefficients case, we show that the energy cannot concentrate at any point $(t,x)\in(0,\infty)\timesΩ$. For that purpose, following Ibrahim and Majdoub \cite{Ibrahim}, we use a geometric multiplier close to the well-known Morawetz multiplier used in the constant coefficients case. Then we use comparison theorem from Riemannian Geometry to estimate the error terms. Finally, using Strichartz inequality as in Smith and Sogge \cite{Sogge}, we get the global existence.

preprint2010arXiv

Possibility of S=1 spin liquids with fermionic spinons on triangular lattices

In this paper we generalize the fermionic representation for $S=1/2$ spins to arbitrary spins. Within a mean field theory we obtain several spin liquid states for spin $S=1$ antiferromagnets on triangular lattices, including gapless f-wave spin liquid and topologically nontrivial $p_x+ip_y$ spin liquid. After considering different competing orders, we construct a phase diagram for the $J_1$-$J_3$-$K$ model. The application to recently discovered material $\mathrm{NiGa_2S_4}$ is discussed.

preprint2010arXiv

Self doping effect and successive magnetic transitions in superconducting Sr$_2$VFeAsO$_3$

We have studied a quinary Fe-based superconductor Sr$_2$VFeAsO$_3$ by the measurements of x-ray diffraction, x-ray absorption, Mössbauer spectrum, resistivity, magnetization and specific heat. This apparently undoped oxyarsenide is shown to be self doped via electron transfer from the V$^{3+}$ ions. We observed successive magnetic transitions within the VO$_2$ layers: an antiferromagnetic transition at 150 K followed by a weak ferromagnetic transition at 55 K. The spin orderings within the VO$_2$ planes are discussed based on mixed valence of V$^{3+}$ and V$^{4+}$.

preprint2010arXiv

Spinon Phonon Interaction and Ultrasonic Attenuation in Quantum Spin Liquids

Several experimental candidates for quantum spin liquids have been discovered in the past few years which appear to support gapless fermionic $S = {1\over 2}$ excitations called spinons. The spinons may form a Fermi sea coupled to a $U(1)$ gauge field, and may undergo a pairing instability. We show that despite being charge neutral, the spinons couple to phonons in exactly the same way that electrons do in the long wavelength limit. Therefore we can use sound attenuation to measure the spinon mass and lifetime. Furthermore, transverse ultrasonic attenuation is a direct probe of the onset of pairing because the Meissner effect of the gauge field causes a "rapid fall" of the attenuation at $T_c$ in addition to the reduction due to the opening of the energy gap. This phenomenon, well known in clean superconductors, may reveal the existence of the U(1) gauge field.

preprint2009arXiv

Fermi-edge problem in the presence of AC electric field

We study in this paper a non-equilibrium Fermi-edge problem where the system under investigation is a single electron reservoir putting under an AC electric field. We show that the electron Green's function and other correlation functions in the problem can be solved and expressed exactly in terms of a well-defined integral. The qualitative behaviors of the solution is studied and compared with the situation where the impurity is coupled to more than one reservoirs at different chemical potentials.

preprint2009arXiv

On Abstract Strichartz Estimates and the Strauss Conjecture for Nontrapping Obstacles

The purpose of this paper is to show how local energy decay estimates for certain linear wave equations involving compact perturbations of the standard Laplacian lead to optimal global existence theorems for the corresponding small amplitude nonlinear wave equations with power nonlinearities. To achieve this goal, at least for spatial dimensions $n=3$ and 4, we shall show how the aforementioned linear decay estimates can be combined with "abstract Strichartz" estimates for the free wave equation to prove corresponding estimates for the perturbed wave equation when $n\ge3$. As we shall see, we are only partially successful in the latter endeavor when the dimension is equal to two, and therefore, at present, our applications to nonlinear wave equations in this case are limited.

preprint2009arXiv

Topological glass states

In connection with recent discussion of topological order and topological phase transitions in quantum systems, we reexamine circumstances that lead to the appearance of a topological glass in certain classical lattice spin models. Local bonding enforces constraints on low energy states which organize themselves into topologically distinct classes that break ergodicity but not any apparent symmetry as in the usual Landau theory of phase transitions. Various properties of such a topological glass are demonstrated using two classical Ising-like models.

preprint2007arXiv

Concerning the Strauss conjecture and almost global existence for nonlinear Dirichlet-wave equations in 4-dimensions

We show the obstacle version of the Strauss conjecture holds when the spatial dimension is equal to 4. We also show that an almost global existence theorem of Hörmander for (4+1)-dimensional Minkowski space holds in the obstacle setting. We use weighed space-time variants of the energy inequality and a variant of the classical Hardy inequality.

preprint2003arXiv

Quantum coherence of double-well BEC: a SU(2)-coherent-state path-integral approach

Macroscopic quantum coherence of Bose gas in a double-well potential is studied based on SU(2)-coherent-state path-integral. The ground state and fluctuations around it can be obtained by this method. In this picture, one can obtain macroscopic quantum superposition states for attractive Bose gas. The coherent gap of degenerate ground states is obtained with the instanton technique. The phenomenon of macroscopic quantum self-trapping is also discussed.

preprint2001arXiv

Spin-phase interference, coherent superposition, and quantum tunneling at excited levels in nano-antiferromagnets

The spin-phase interference effects are studied analytically in resonant quantum tunneling of the Néel vector between degenerate excited levels in nanometer-scale single-domain antiferromagnets in the absence of an external magnetic field. We consider a model for mesoscopic antiferromagnets with uncompensated excess spins for the more general structure of magnetic anisotropy, such as biaxial, trigonal, tetragonal and hexagonal crystal symmetry. This study provides a nontrivial generalization of the Kramers degeneracy for double-well system to coherently spin tunneling at ground states as well as low-lying excited states in AFM system with $m$-fold rotational symmetry around the $\hat{z}$ axis. The energy level spectrum and the thermodynamic properties of magnetic tunneling states are found to depend significantly on the parity of the excess spins at sufficiently low temperatures. Possible relevance to experiments is also discussed.

preprint2000arXiv

Field-dependent quantum nucleation of antiferromagnetic bubbles

The phenomenon of quantum nucleation is studied in a nanometer-scale antiferromagnet with biaxial symmetry in the presence of a magnetic field at an arbitrary angle. Within the instanton approach, we calculate the dependence of the rate of quantum nucleation and the crossover temperature on the orientation and strength of the field for bulk solids and two-dimensional films of antiferromagnets, respectively. Our results show that the rate of quantum nucleation and the crossover temperature from thermal-to-quantum transitions depend on the orientation and strength of the field distinctly, which can be tested with the use of existing experimental techniques.

preprint2000arXiv

Phase interference of spin tunneling in an arbitrarily directed magnetic field

We present an exact analytic study on the topological phase interference effect in resonant quantum tunneling of the magnetization between degenerate excited levels for biaxial ferromagnets in an arbitrarily directed magnetic field. We show that the topological phase interference effect depends on the orientation of the field distinctly. The transition from classical to quantum behavior is also discussed.

Yi Zhou

What is connected

Connect this record

See the researcher in context

Building this map preview

131 published item(s)

SemEval-2026 Task 7: Everyday Knowledge Across Diverse Languages and Cultures

A Pure Integral-Type PLL with a Damping Branch to Enhance the Stability of Grid-Tied Inverter under Weak Grids

On Unbalanced Optimal Transport: Gradient Methods, Sparsity and Approximation Error

Extended Load Flexibility of Utility-Scale P2H Plants: Optimal Production Scheduling Considering Dynamic Thermal and HTO Impurity Effects

A Fast and Convergent Proximal Algorithm for Regularized Nonconvex and Nonsmooth Bi-level Optimization

A likelihood based sensitivity analysis for publication bias on summary ROC in meta-analysis of diagnostic test accuracy

Accelerated Proximal Alternating Gradient-Descent-Ascent for Nonconvex Minimax Machine Learning

Coordinated Frequency Control through Safe Reinforcement Learning

Data Sampling Affects the Complexity of Online SGD over Dependent Data

DDDM: a Brain-Inspired Framework for Robust Classification

Delving into the Estimation Shift of Batch Normalization in a Network

Desingularization and p-Curvature of Recurrence Operators

DeTrust-FL: Privacy-Preserving Federated Learning in Decentralized Trust Setting

Extended Load Flexibility of Industrial P2H Plants: A Process Constraint-Aware Scheduling Approach

Extracting Densest Sub-hypergraph with Convex Edge-weight Functions

Generalized persistence of entropy weak solutions for system of hyperbolic conservation laws

Inhomogeneous superconducting states in two weakly linked superconducting ultra thin films

Learning Visibility for Robust Dense Human Body Estimation

Matrix product states for Hartree-Fock-Bogoliubov wave functions

Plasma Image Classification Using Cosine Similarity Constrained CNN

Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis

Sense Embeddings are also Biased--Evaluating Social Biases in Static and Contextualised Sense Embeddings

Single-shot Hyper-parameter Optimization for Federated Learning: A General Algorithm & Analysis

Specificity-preserving RGB-D Saliency Detection

Two-dimensional superconductivity at the surfaces of KTaO3 gated with ionic liquid

UNISON: Unpaired Cross-lingual Image Captioning

Unveiling a critical stripy state in the triangular-lattice SU(4) spin-orbital model

Curse or Redemption? How Data Heterogeneity Affects the Robustness of Federated Learning

Emergence of high-temperature superconductivity at the interface of two Mott insulators

Event-based Motion Segmentation with Spatio-Temporal Graph Cuts

Global existence for semilinear wave equations with scaling invariant damping in 3-D

Global Existence of Ideal Invicid Compressible and Heat Conductive Fluids with Radial Symmetry

Graph topology invariant gradient and sampling complexity for decentralized and stochastic optimization

Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for Thoracic Disease Identification

Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry

Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data

Accelerating Power Methods for Higher-order Markov Chains

An Investigation into the Stochasticity of Batch Whitening

Chinese Named Entity Recognition Augmented with Lexicon Memory

Classical and quantum order in hyperkagome antiferromagnets

Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble

Efficient tensor network representation for Gutzwiller projected states of paired fermions

Exploring the Hierarchy in Relation Labels for Scene Graph Generation

Generative Tweening: Long-term Inbetweening of 3D Human Motions

GFTE: Graph-based Financial Table Extraction

History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

IBM Federated Learning: an Enterprise Framework White Paper V0.1

Inf-Net: Automatic COVID-19 Lung Infection Segmentation from CT Images

Learning to Generate Diverse Dance Motions with Transformer

Measurement of the neutron beam profile of the Back-n white neutron facility at CSNS with a Micromegas detector

Momentum with Variance Reduction for Nonconvex Composition Optimization

Nanoscale structure detection and monitoring of tumour growth with optical coherence tomography

Nanosensitive optical coherence tomography to assess wound healing within the cornea

Non-invasive detection of nanoscale structural changes in cornea associated with cross-linking treatment

On the Continuity of Rotation Representations in Neural Networks

Proximal Gradient Algorithm with Momentum and Flexible Parameter Restart for Nonconvex Optimization

Reanalysis of Variance Reduced Temporal Difference Learning

Small-floating Target Detection in Sea Clutter via Visual Feature Classifying in the Time-Doppler Spectra

Spatio-temporal Attention Model for Tactile Texture Recognition

SpiderBoost and Momentum: Faster Stochastic Variance Reduction Algorithms

The Complexity of the Partition Coloring Problem

TiFL: A Tier-based Federated Learning System

Timing Performance of a Micro-Channel-Plate Photomultiplier Tube

Understanding the Impact of Model Incoherence on Convergence of Incremental SGD with Random Reshuffle

Evidence for nematic superconductivity of topological surface states in PbTaSe2

Extensive beam test study of prototype MRPCs for the T0 detector at the CSR external-target experiment

Formation of finite-time singularities for nonlinear elastodynamics with small initial disturbances

High Speed Mid-Infrared Interband Cascade Photodetector Based on InAs/GaSb Type-II Superlattice

On some conjectures by Lu and Wenzel

Superconductivity, pair density wave, and Neel order in cuprates

Supervised Encoding for Discrete Representation Learning

A Set Theoretic Approach for Knowledge Representation: the Representation Part

DAP3D-Net: Where, What and How Actions Occur in Videos?

DAVE: A Unified Framework for Fast Vehicle Detection and Annotation