Source author record

Zhe Wang

Zhe Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

152works

55topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A Breast Vision Pathology Foundation Model for Real-world Clinical Utility

Pathology foundation models have shown strong retrospective performance, but whether such systems can support clinically relevant use remains unclear. This challenge is particularly important in breast cancer, where pathological assessment serves as the gold standard for diagnosis and guides treatment planning, surgical decision-making and risk stratification across pre-, intra- and post-operative stages. Here we present \textbf{BRAVE}, a breast-adaptive pathology foundation model developed and evaluated using a total resource of 101,638 breast whole-slide images from 32 sources across Asia, Europe and North America. We assessed BRAVE across 34 tasks in 82 cohorts spanning pre-operative biopsy, intra-operative frozen section and post-operative resection, using an evidence chain comprising retrospective benchmarking, clinically challenging scenarios, workflow-oriented clinical impact simulations, prospective observational validation with the thresholds locked in the retrospective cohorts and crossover pathologist-AI interaction studies. Across these settings, BRAVE supported practical roles in the clinical workflow, including safe exclusion of low-risk cases from routine review, AI-assisted second-review rescue of initially missed positives and prioritization of cases for further assessment. In prospective validation across three centres, BRAVE excluded 76.9% of negative biopsy cases (NPV 0.953) and 70.1% of negative frozen-section cases (NPV 0.973), and triaged 78.8% of post-operative subtyping cases as high-confidence clear-cut cases (NPV 1.000). In reader studies, AI assistance improved balanced accuracy from 88.5% to 95.1% (OR 3.14, P<0.001), with better efficiency, confidence and inter-rater agreement. BRAVE-derived scores also independently predicted disease-free survival (adjusted HR 4.79, P<0.001) and overall survival (adjusted HR 8.14, P<0.001).

preprint2026arXiv

Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment

While Graph Foundation Models (GFMs) have achieved remarkable success in homogeneous graphs, extending them to multi-domain heterogeneous graphs (MDHGs) remains a formidable challenge due to cross-type feature shifts and intra-domain relation gaps. Existing global feature alignment methods (PCA or SVD) enforce a shared feature space blindly, which distorts type-specific semantics and disrupts original topologies, inevitably leading to "Type Collapse" and "Relation Confusion". To address these fundamental limitations, we propose Decoupled relation Subspace Alignment (DRSA), a novel, plug-and-play relation-driven alignment framework. DRSA fundamentally shifts the paradigm by decoupling feature semantics from relation structures. Specifically, it introduces a dual-relation subspace projection mechanism to coordinate cross-type interactions within a shared low-rank relation subspace explicitly. Furthermore, a feature-structure decoupled representation is designed to decompose aligned features into a semantic projection component and a structural residual term, adaptively absorbing intra-domain variations. Optimized via a stable alternating minimization strategy based on Block Coordinate Descent, DRSA constructs a well-calibrated, structure-aware latent space. Extensive experiments on multiple real-world benchmark datasets demonstrate that DRSA can be seamlessly integrated as a universal preprocessing module, significantly and consistently enhancing the cross-domain and few-shot knowledge transfer capabilities of state-of-the-art GFMs. The code is available at: https://github.com/zhengziyu77/DSRA.

preprint2024arXiv

A Tutorial on Extremely Large-Scale MIMO for 6G: Fundamentals, Signal Processing, and Applications

Extremely large-scale multiple-input-multiple-output (XL-MIMO), which offers vast spatial degrees of freedom, has emerged as a potentially pivotal enabling technology for the sixth generation (6G) of wireless mobile networks. With its growing significance, both opportunities and challenges are concurrently manifesting. This paper presents a comprehensive survey of research on XL-MIMO wireless systems. In particular, we introduce four XL-MIMO hardware architectures: uniform linear array (ULA)-based XL-MIMO, uniform planar array (UPA)-based XL-MIMO utilizing either patch antennas or point antennas, and continuous aperture (CAP)-based XL-MIMO. We comprehensively analyze and discuss their characteristics and interrelationships. Following this, we introduce several electromagnetic characteristics and general distance boundaries in XL-MIMO. Given the distinct electromagnetic properties of near-field communications, we present a range of channel models to demonstrate the benefits of XL-MIMO. We further discuss and summarize signal processing schemes for XL-MIMO. It is worth noting that the low-complexity signal processing schemes and deep learning empowered signal processing schemes are reviewed and highlighted to promote the practical implementation of XL-MIMO. Furthermore, we explore the interplay between XL-MIMO and other emergent 6G technologies. Finally, we outline several compelling research directions for future XL-MIMO wireless communication systems.

preprint2023arXiv

Uplink Precoding Design for Cell-Free Massive MIMO with Iteratively Weighted MMSE

In this paper, we investigate a cell-free massive multiple-input multiple-output system with both access points and user equipments equipped with multiple antennas over the Weichselberger Rayleigh fading channel. We study the uplink spectral efficiency (SE) for the fully centralized processing scheme and large-scale fading decoding (LSFD) scheme. To further improve the SE performance, we design the uplink precoding schemes based on the weighted sum SE maximization. Since the weighted sum SE maximization problem is not jointly over all optimization variables, two efficient uplink precoding schemes based on Iteratively Weighted sum-Minimum Mean Square Error (I-WMMSE) algorithms, which rely on the iterative minimization of weighted MSE, are proposed for two processing schemes investigated. Furthermore, with maximum ratio combining applied in the LSFD scheme, we derive novel closed-form achievable SE expressions and optimal precoding schemes. Numerical results validate the proposed results and show that the I-WMMSE precoding schemes can achieve excellent sum SE performance with a large number of UE antennas.

preprint2023arXiv

Wasserstein convergence rates in the invariance principle for deterministic dynamical systems

In this paper, we consider the convergence rate with respect to Wasserstein distance in the invariance principle for deterministic nonuniformly hyperbolic systems, where both discrete time systems and flows are included. Our results apply to uniformly hyperbolic systems and large classes of nonuniformly hyperbolic systems including intermittent maps, Viana maps, finite horizon planar periodic Lorentz gases and others. Furthermore, as a nontrivial application to homogenization problem, we investigate the $\mathcal{W}_2$-convergence rate of a fast-slow discrete deterministic system to a stochastic differential equation.

preprint2022arXiv

A Competitive Method for Dog Nose-print Re-identification

Vision-based pattern identification (such as face, fingerprint, iris etc.) has been successfully applied in human biometrics for a long history. However, dog nose-print authentication is a challenging problem since the lack of a large amount of labeled data. For that, this paper presents our proposed methods for dog nose-print authentication (Re-ID) task in CVPR 2022 pet biometric challenge. First, considering the problem that each class only with few samples in the training set, we propose an automatic offline data augmentation strategy. Then, for the difference in sample styles between the training and test datasets, we employ joint cross-entropy, triplet and pair-wise circle losses function for network optimization. Finally, with multiple models ensembled adopted, our methods achieve 86.67\% AUC on the test set. Codes are available at https://github.com/muzishen/Pet-ReID-IMAG.

preprint2022arXiv

Accelerating Real-Time Coupled Cluster Methods with Single-Precision Arithmetic and Adaptive Numerical Integration

We explore the framework of a real-time coupled cluster method with a focus on improving its computational efficiency. Propagation of the wave function via the time-dependent Schrödinger equation places high demands on computing resources, particularly for high level theories such as coupled cluster with polynomial scaling. Similar to earlier investigations of coupled cluster properties, we demonstrate that the use of single-precision arithmetic reduces both the storage and multiplicative costs of the real-time simulation by approximately a factor of two with no significant impact on the resulting UV/vis absorption spectrum computed via the Fourier transform of the time-dependent dipole moment. Additional speedups of up to a factor of 14 in test simulations of water clusters are obtained via a straightforward GPU-based implementation as compared to conventional CPU calculations. We also find that further performance optimization is accessible through sagacious selection of numerical integration algorithms, and the adaptive methods, such as the Cash-Karp integrator provide an effective balance between computing costs and numerical stability. Finally, we demonstrate that a simple mixed-step integrator based on the conventional fourth-order Runge-Kutta approach is capable of stable propagations even for strong external fields, provided the time step is appropriately adapted to the duration of the laser pulse with only minimal computational overhead.

preprint2022arXiv

AMinerGNN: Heterogeneous Graph Neural Network for Paper Click-through Rate Prediction with Fusion Query

Paper recommendation with user-generated keyword is to suggest papers that simultaneously meet user's interests and are relevant to the input keyword. This is a recommendation task with two queries, a.k.a. user ID and keyword. However, existing methods focus on recommendation according to one query, a.k.a. user ID, and are not applicable to solving this problem. In this paper, we propose a novel click-through rate (CTR) prediction model with heterogeneous graph neural network, called AMinerGNN, to recommend papers with two queries. Specifically, AMinerGNN constructs a heterogeneous graph to project user, paper, and keyword into the same embedding space by graph representation learning. To process two queries, a novel query attentive fusion layer is designed to recognize their importances dynamically and then fuse them as one query to build a unified and end-to-end recommender system. Experimental results on our proposed dataset and online A/B tests prove the superiority of AMinerGNN.

preprint2022arXiv

Band Gap Opening in Bilayer Graphene-CrCl$_3$/CrBr$_3$/CrI$_3$ van der Waals Interfaces

We report experimental investigations of transport through bilayer graphene (BLG)/chromium trihalide (CrX$_3$; X=Cl, Br, I) van der Waals interfaces. In all cases, a large charge transfer from BLG to CrX$_3$ takes place (reaching densities in excess of $10^{13}$ cm$^{-2}$), and generates an electric field perpendicular to the interface that opens a band gap in BLG. We determine the gap from the activation energy of the conductivity and find excellent agreement with the latest theory accounting for the contribution of the $σ$ bands to the BLG dielectric susceptibility. We further show that for BLG/CrCl$_3$ and BLG/CrBr$_3$ the band gap can be extracted from the gate voltage dependence of the low-temperature conductivity, and use this finding to refine the gap dependence on the magnetic field. Our results allow a quantitative comparison of the electronic properties of BLG with theoretical predictions and indicate that electrons occupying the CrX$_3$ conduction band are correlated.

preprint2022arXiv

Beyond Data Samples: Aligning Differential Networks Estimation with Scientific Knowledge

Learning the differential statistical dependency network between two contexts is essential for many real-life applications, mostly in the high dimensional low sample regime. In this paper, we propose a novel differential network estimator that allows integrating various sources of knowledge beyond data samples. The proposed estimator is scalable to a large number of variables and achieves a sharp asymptotic convergence rate. Empirical experiments on extensive simulated data and four real-world applications (one on neuroimaging and three from functional genomics) show that our approach achieves improved differential network estimation and provides better supports to downstream tasks like classification. Our results highlight significant benefits of integrating group, spatial and anatomic knowledge during differential genetic network identification and brain connectome change discovery.

preprint2022arXiv

Ellipticity control of terahertz high-harmonic generation in a Dirac semimetal

We report on terahertz high-harmonic generation in a Dirac semimetal as a function of the driving-pulse ellipticity and on a theoretical study of the field-driven intraband kinetics of massless Dirac fermions.Very efficient control of third-harmonic yield and polarization state is achieved in electron-doped Cd$_3$As$_2$ thin films at room temperature. The observed tunability is understood as resulting from terahertz-field driven intraband kinetics of the Dirac fermions. Our study paves the way for exploiting nonlinear optical properties of Dirac matter for applications in signal processing and optical communications.

preprint2022arXiv

FFConv: Fast Factorized Convolutional Neural Network Inference on Encrypted Data

Homomorphic Encryption (HE), allowing computations on encrypted data (ciphertext) without decrypting it first, enables secure but prohibitively slow Convolutional Neural Network (CNN) inference for privacy-preserving applications in clouds. To reduce the inference latency, one approach is to pack multiple messages into a single ciphertext in order to reduce the number of ciphertexts and support massive parallelism of Homomorphic Multiply-Accumulate (HMA) operations between ciphertexts. Despite the faster HECNN inference, the mainstream packing schemes Dense Packing (DensePack) and Convolution Packing (ConvPack) introduce expensive rotation overhead, which prolongs the inference latency of HECNN for deeper and wider CNN architectures. In this paper, we propose a low-rank factorization method named FFConv dedicated to efficient ciphertext packing for reducing both the rotation overhead and HMA operations. FFConv approximates a d x d convolution layer with low-rank factorized convolutions, in which a d x d low-rank convolution with fewer channels is followed by a 1 x 1 convolution to restore the channels. The d x d low-rank convolution with DensePack leads to significantly reduced rotation operations, while the rotation overhead of 1 x 1 convolution with ConvPack is close to zero. To our knowledge, FFConv is the first work that is capable of reducing the rotation overhead incurred by DensePack and ConvPack simultaneously, without introducing additional special blocks into the HECNN inference pipeline. Compared to prior art LoLa and Falcon, our method reduces the inference latency by up to 88% and 21%, respectively, with comparable accuracy on MNIST and CIFAR-10.

preprint2022arXiv

FuncFooler: A Practical Black-box Attack Against Learning-based Binary Code Similarity Detection Methods

The binary code similarity detection (BCSD) method measures the similarity of two binary executable codes. Recently, the learning-based BCSD methods have achieved great success, outperforming traditional BCSD in detection accuracy and efficiency. However, the existing studies are rather sparse on the adversarial vulnerability of the learning-based BCSD methods, which cause hazards in security-related applications. To evaluate the adversarial robustness, this paper designs an efficient and black-box adversarial code generation algorithm, namely, FuncFooler. FuncFooler constrains the adversarial codes 1) to keep unchanged the program's control flow graph (CFG), and 2) to preserve the same semantic meaning. Specifically, FuncFooler consecutively 1) determines vulnerable candidates in the malicious code, 2) chooses and inserts the adversarial instructions from the benign code, and 3) corrects the semantic side effect of the adversarial code to meet the constraints. Empirically, our FuncFooler can successfully attack the three learning-based BCSD models, including SAFE, Asm2Vec, and jTrans, which calls into question whether the learning-based BCSD is desirable.

preprint2022arXiv

Identifying and Exploiting Sparse Branch Correlations for Optimizing Branch Prediction

Branch prediction is arguably one of the most important speculative mechanisms within a high-performance processor architecture. A common approach to improve branch prediction accuracy is to employ lengthy history records of previously seen branch directions to capture distant correlations between branches. The larger the history, the richer the information that the predictor can exploit for discovering predictive patterns. However, without appropriate filtering, such an approach may also heavily disorganize the predictor's internal mechanisms, leading to diminishing returns. This paper studies a fundamental control-flow property: the sparsity in the correlation between branches and recent history. First, we show that sparse branch correlations exist in standard applications and, more importantly, such correlations can be computed efficiently using sparse modeling methods. Second, we introduce a sparsity-aware branch prediction mechanism that can compactly encode and store sparse models to unlock essential performance opportunities. We evaluated our approach for various design parameters demonstrating MPKI improvements of up to 42% (2.3% on average) with 2KB of additional storage overhead. Our circuit-level evaluation of the design showed that it can operate within accepted branch prediction latencies, and under reasonable power and area limitations.

preprint2022arXiv

Iteratively Weighted MMSE Uplink Precoding for Cell-Free Massive MIMO

In this paper, we investigate a cell-free massive MIMO system with both access points and user equipments equipped with multiple antennas over the Weichselberger Rayleigh fading channel. We study the uplink spectral efficiency (SE) based on a two-layer decoding structure with maximum ratio (MR) or local minimum mean-square error (MMSE) combining applied in the first layer and optimal large-scale fading decoding method implemented in the second layer, respectively. To maximize the weighted sum SE, an uplink precoding structure based on an Iteratively Weighted sum-MMSE (I-WMMSE) algorithm using only channel statistics is proposed. Furthermore, with MR combining applied in the first layer, we derive novel achievable SE expressions and optimal precoding structures in closed-form. Numerical results validate our proposed results and show that the I-WMMSE precoding can achieve excellent sum SE performance.

preprint2022arXiv

Joint Learning of Deep Texture and High-Frequency Features for Computer-Generated Image Detection

Distinguishing between computer-generated (CG) and natural photographic (PG) images is of great importance to verify the authenticity and originality of digital images. However, the recent cutting-edge generation methods enable high qualities of synthesis in CG images, which makes this challenging task even trickier. To address this issue, a joint learning strategy with deep texture and high-frequency features for CG image detection is proposed. We first formulate and deeply analyze the different acquisition processes of CG and PG images. Based on the finding that multiple different modules in image acquisition will lead to different sensitivity inconsistencies to the convolutional neural network (CNN)-based rendering in images, we propose a deep texture rendering module for texture difference enhancement and discriminative texture representation. Specifically, the semantic segmentation map is generated to guide the affine transformation operation, which is used to recover the texture in different regions of the input image. Then, the combination of the original image and the high-frequency components of the original and rendered images are fed into a multi-branch neural network equipped with attention mechanisms, which refines intermediate features and facilitates trace exploration in spatial and channel dimensions respectively. Extensive experiments on two public datasets and a newly constructed dataset with more realistic and diverse images show that the proposed approach outperforms existing methods in the field by a clear margin. Besides, results also demonstrate the detection robustness and generalization ability of the proposed approach to postprocessing operations and generative adversarial network (GAN) generated images.

preprint2022arXiv

Learning Versatile Neural Architectures by Propagating Network Codes

This work explores how to design a single neural network capable of adapting to multiple heterogeneous vision tasks, such as image segmentation, 3D detection, and video recognition. This goal is challenging because both network architecture search (NAS) spaces and methods in different tasks are inconsistent. We solve this challenge from both sides. We first introduce a unified design space for multiple tasks and build a multitask NAS benchmark (NAS-Bench-MR) on many widely used datasets, including ImageNet, Cityscapes, KITTI, and HMDB51. We further propose Network Coding Propagation (NCP), which back-propagates gradients of neural predictors to directly update architecture codes along the desired gradient directions to solve various tasks. In this way, optimal architecture configurations can be found by NCP in our large search space in seconds. Unlike prior arts of NAS that typically focus on a single task, NCP has several unique benefits. (1) NCP transforms architecture optimization from data-driven to architecture-driven, enabling joint search an architecture among multitasks with different data distributions. (2) NCP learns from network codes but not original data, enabling it to update the architecture efficiently across datasets. (3) In addition to our NAS-Bench-MR, NCP performs well on other NAS benchmarks, such as NAS-Bench-201. (4) Thorough studies of NCP on inter-, cross-, and intra-tasks highlight the importance of cross-task neural architecture design, i.e., multitask neural architectures and architecture transferring between different tasks. Code is available at https://github.com/dingmyu/NCP.

preprint2022arXiv

Magneto-optical study of metamagnetic transitions in the antiferromagnetic phase of $α$-RuCl$_3$

$α$-RuCl$_3$ is a promising candidate material to realize the so far elusive quantum spin liquid ground state. However, at low temperatures, the coexistence of different exchange interactions couple the effective pseudospins into an antiferromagnetically zigzag (ZZ) ordered state. The low-field evolution of spin structure is still a matter of debate and the magnetic anisotropy within the honeycomb planes is an open and challenging question. Here, we investigate the evolution of the ZZ order parameter by second-order magneto-optical effects, the magnetic linear dichroism and magnetic linear birefringence. Our results clarify the presence and nature of metamagnetic transitions in the ZZ phase of $α$-RuCl$_3$. Our experimental observations show the presence of initial magnetic domain repopulation followed by a spin-flop transition for small in-plane applied magnetic fields ($\approx$ 1.6 T) along specific crystallographic directions. In addition, using a magneto-optical approach, we detected the recently reported emergence of a field-induced intermediate phase before suppressing the ZZ order. Our results disclose the details of various angle-dependent in-plane metamagnetic transitions quantifying the bond-anisotropic interactions present in $α$-RuCl$_3$

preprint2022arXiv

Mass Testing and Characterization of 20-inch PMTs for JUNO

Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).

preprint2022arXiv

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

We introduce DeepNash, an autonomous agent capable of learning to play the imperfect information game Stratego from scratch, up to a human expert level. Stratego is one of the few iconic board games that Artificial Intelligence (AI) has not yet mastered. This popular game has an enormous game tree on the order of $10^{535}$ nodes, i.e., $10^{175}$ times larger than that of Go. It has the additional complexity of requiring decision-making under imperfect information, similar to Texas hold'em poker, which has a significantly smaller game tree (on the order of $10^{164}$ nodes). Decisions in Stratego are made over a large number of discrete actions with no obvious link between action and outcome. Episodes are long, with often hundreds of moves before a player wins, and situations in Stratego can not easily be broken down into manageably-sized sub-problems as in poker. For these reasons, Stratego has been a grand challenge for the field of AI for decades, and existing AI methods barely reach an amateur level of play. DeepNash uses a game-theoretic, model-free deep reinforcement learning method, without search, that learns to master Stratego via self-play. The Regularised Nash Dynamics (R-NaD) algorithm, a key component of DeepNash, converges to an approximate Nash equilibrium, instead of 'cycling' around it, by directly modifying the underlying multi-agent learning dynamics. DeepNash beats existing state-of-the-art AI methods in Stratego and achieved a yearly (2022) and all-time top-3 rank on the Gravon games platform, competing with human expert players.

preprint2022arXiv

Maximising the Influence of Temporary Participants in Opinion Formation

DeGroot-style opinion formation presumes a continuous interaction among agents of a social network. Hence, it cannot handle agents external to the social network that interact only temporarily with the permanent ones. Many real-world organisations and individuals fall into such a category. For instance, a company tries to persuade as many as possible to buy its products and, due to various constraints, can only exert its influence for a limited amount of time. We propose a variant of the DeGroot model that allows an external agent to interact with the permanent ones for a preset period of time. We obtain several insights on maximising an external agent's influence in opinion formation by analysing and simulating the variant.

preprint2022arXiv

Monolithically integrated active passive waveguide array fabricated on thin film lithium niobate using a single continuous photolithography process

We demonstrate a robust low-loss optical interface by tiling passive (i.e., without doping of active ions) thin film lithium niobate (TFLN) and active (i.e., doped with rare earth ions) TFLN substrates for monolithic integration of passive/active lithium niobate photonics. The tiled substrates composed of both active and passive areas allow to pattern the mask of the integrated active passive photonic device at once using a single continuous photolithography process. The interface loss of tiled substrate is measured as low as 0.26 dB. Thanks to the stability provided by this approach, a four-channel waveguide amplifier is realized in a straightforward manner, which shows a net gain of ~5 dB at 1550-nm wavelength and that of ~8 dB at 1530-nm wavelength for each channel. The robust low-loss optical interface for passive/active photonic integration will facilitate large-scale high performance photonic devices which require on-chip light sources and amplifiers.

preprint2022arXiv

Multifunctional Two-dimensional van der Waals Janus Magnet Cr-based Dichalcogenide Halides

Two-dimensional van der Waals Janus materials and their heterostructures offer fertile platforms for designing fascinating functionalities. Here, by means of systematic first-principles studies on van der Waals Janus monolayer Cr-based dichalcogenide halides CrYX (Y=S, Se, Te; X=Cl, Br, I), we find that CrSX (X=Cl, Br, I) are the very desirable high TC ferromagnetic semiconductors with an out-of-plane magnetization. Excitingly, by the benefit of the large magnetic moments on ligand S2- anions, the sought-after large-gap quantum anomalous Hall effect and sizable valley splitting can be achieved through the magnetic proximity effect in van der Waals heterostructures CrSBr/Bi2Se3/CrSBr and MoTe2/CrSBr, respectively. Additionally, we show that large Dzyaloshinskii-Moriya interactions give rise to skyrmion states in CrTeX (X=Cl, Br, I) under external magnetic fields. Our work reveals that two-dimensional Janus magnet Cr-based dichalcogenide halides have appealing multifunctionalities in the applications of topological electronic and valleytronic devices.

preprint2022arXiv

Observation of three superconducting transitions in the pressurized CDW-bearing compound TaTe2

Transition metal dichalcogenides host a wide variety of lattice and electronic structures, as well as corresponding exotic physical properties, especially under certain tuning conditions. Here, we are the first to report the observation of pressure-induced three superconducting transitions in TaTe2, a charge density wave (CDW) - bearing layered transition-metal dichalcogenide that is metallic but not superconducting at ambient pressure. We find that its CDW state can be easily suppressed upon increasing pressure up to ~ 1 GPa. A superconducting state then emerges from the suppressed CDW state and persists to the pressure about 7 GPa. Unexpectedly, another superconducting state appears at ~ 11 GPa within the same monoclinic (M) structure of its ambient-pressure one. Upon further compression to 21 GPa, a third superconducting state with higher Tc appears from a high-pressure (HP) phase. Our experimental results suggest that the pressure-induced three superconducting transitions in TaTe2 are respectively driven by the suppression of the CDW state, the change of the angle in the M phase and the transition of M-to-HP phase. These results demonstrate not only the versatile nature of this correlated electron system, but also the first experimental example that shows the pressure-induced evolution from a CDW state to three superconducting states driven by different mechanisms.

preprint2022arXiv

On-chip integrated Yb3+-doped waveguide amplifiers on thin film lithium niobate

We report the fabrication and optical characterization of Yb3+-doped waveguide amplifiers (YDWA) on the thin film lithium niobate fabricated by photolithography assisted chemo-mechanical etching. The fabricated Yb3+-doped lithium niobate waveguides demonstrates low propagation loss of 0.13 dB/cm at 1030 nm and 0.1 dB/cm at 1060 nm. The internal net gain of 5 dB at 1030 nm and 8 dB at 1060 nm are measured on a 4.0 cm long waveguide pumped by 976nm laser diodes, indicating the gain per unit length of 1.25 dB/cm at 1030 nm and 2 dB/cm at 1060 nm, respectively. The integrated Yb3+-doped lithium niobate waveguide amplifiers will benefit the development of a powerful gain platform and are expected to contribute to the high-density integration of thin film lithium niobate based photonic chip.

preprint2022arXiv

Reduction of the 2D Toda Hierarchy and Linear Hodge Integrals

We construct a certain reduction of the 2D Toda hierarchy and obtain a tau-symmetric Hamiltonian integrable hierarchy. This reduced integrable hierarchy controls the linear Hodge integrals in the way that one part of its flows yields the intermediate long wave hierarchy, and the remaining flows coincide with a certain limit of the flows of the fractional Volterra hierarchy which controls the special cubic Hodge integrals.

preprint2022arXiv

Subtype-Former: a deep learning approach for cancer subtype discovery with multi-omics data

Motivation: Cancer is heterogeneous, affecting the precise approach to personalized treatment. Accurate subtyping can lead to better survival rates for cancer patients. High-throughput technologies provide multiple omics data for cancer subtyping. However, precise cancer subtyping remains challenging due to the large amount and high dimensionality of omics data. Results: This study proposed Subtype-Former, a deep learning method based on MLP and Transformer Block, to extract the low-dimensional representation of the multi-omics data. K-means and Consensus Clustering are also used to achieve accurate subtyping results. We compared Subtype-Former with the other state-of-the-art subtyping methods across the TCGA 10 cancer types. We found that Subtype-Former can perform better on the benchmark datasets of more than 5000 tumors based on the survival analysis. In addition, Subtype-Former also achieved outstanding results in pan-cancer subtyping, which can help analyze the commonalities and differences across various cancer types at the molecular level. Finally, we applied Subtype-Former to the TCGA 10 types of cancers. We identified 50 essential biomarkers, which can be used to study targeted cancer drugs and promote the development of cancer treatments in the era of precision medicine.

preprint2022arXiv

TBI-GAN: An Adversarial Learning Approach for Data Synthesis on Traumatic Brain Segmentation

Brain network analysis for traumatic brain injury (TBI) patients is critical for its consciousness level assessment and prognosis evaluation, which requires the segmentation of certain consciousness-related brain regions. However, it is difficult to construct a TBI segmentation model as manually annotated MR scans of TBI patients are hard to collect. Data augmentation techniques can be applied to alleviate the issue of data scarcity. However, conventional data augmentation strategies such as spatial and intensity transformation are unable to mimic the deformation and lesions in traumatic brains, which limits the performance of the subsequent segmentation task. To address these issues, we propose a novel medical image inpainting model named TBI-GAN to synthesize TBI MR scans with paired brain label maps. The main strength of our TBI-GAN method is that it can generate TBI images and corresponding label maps simultaneously, which has not been achieved in the previous inpainting methods for medical images. We first generate the inpainted image under the guidance of edge information following a coarse-to-fine manner, and then the synthesized intensity image is used as the prior for label inpainting. Furthermore, we introduce a registration-based template augmentation pipeline to increase the diversity of the synthesized image pairs and enhance the capacity of data augmentation. Experimental results show that the proposed TBI-GAN method can produce sufficient synthesized TBI images with high quality and valid label maps, which can greatly improve the 2D and 3D traumatic brain segmentation performance compared with the alternatives.

preprint2022arXiv

The Potential to Probe Solar Neutrino Physics with LiCl Water Solution

Lithium chloride water solution is a good option for solar neutrino detection. The $ν_e$ charged-current (CC) interaction cross-section on $\rm{{}^{7}Li}$ is evaluated with new B(GT) experimental measurements. The total CC interaction cross-section weighted by the solar $^8$B electron neutrino spectrum is $3.759\times10^{-42} \rm{cm}^2$, which is about 60 times that of the neutrino-electron elastic scattering process. The final state effective kinetic energy after the CC interaction on $\rm{{}^{7}Li}$ directly reflects the neutrino energy, which stands in sharp contrast to the plateau structure of recoil electrons of the elastic scattering. With the high solubility of LiCl of 74.5 g/100 g water at 10$^\circ$C and the high natural abundance of 92.41%, the molarity of $\rm{{}^{7}Li}$ in water can reach 11 mol/L for safe operation at room temperature. The CC event rate of $ν_e$ on $\rm{{}^{7}Li}$ in the LiCl water solution is comparable to that of neutrino-electron elastic scattering. In addition, the $ν_e$ CC interaction with the contained $\rm{{}^{37}Cl}$ also contributes a few percent of the total CC event rate. The contained $\rm{{}^{35}Cl}$ and $\rm{{}^{6}Li}$ also make a delay-coincidence detection for electron antineutrinos possible. The recrystallization method is found to be applicable for LiCl sample purification. The measured attenuation length of $11\pm1$ m at 430 nm shows that the LiCl solution is practicable for a 10-m diameter detector for solar neutrino detection. Clear advantages are found in studying the upturn effect of solar neutrino oscillation, light sterile neutrinos, and Earth matter effect. The sensitivities in discovering solar neutrino upturn and light sterile neutrinos are shown.

preprint2022arXiv

Towards the ultimate PMT waveform analysis for neutrino and dark matter experiments

Photomultiplier tube (PMT) voltage waveforms are the raw data of many neutrino and dark matter experiments. Waveform analysis is the cornerstone of data processing. We evaluate the performance of all the waveform analysis algorithms known to us and find fast stochastic matching pursuit the best in accuracy. Significant time (up to 2 times) and energy (up to 1.07 times) resolution boosts are attainable with fast stochastic matching pursuit, approaching theoretical limits. Other methods also outperform the traditional threshold crossing approach in time resolution.

preprint2022arXiv

Uplink Performance of Cell-Free Massive MIMO with Multi-Antenna Users Over Jointly-Correlated Rayleigh Fading Channels

In this paper, we investigate a cell-free massive MIMO system with both access points (APs) and user equipments (UEs) equipped with multiple antennas over jointly-correlated Rayleigh fading channels. We study four uplink implementations, from fully centralized processing to fully distributed processing, and derive their achievable spectral efficiency (SE) expressions with minimum mean-squared error successive interference cancellation (MMSE-SIC) detectors and arbitrary combining schemes. Furthermore, the global and local MMSE combining schemes are derived based on full and local channel state information (CSI) obtained under pilot contamination, which can maximize the achievable SE for the fully centralized and distributed implementation, respectively. We study a two-layer decoding implementation with an arbitrary combining scheme in the first layer and optimal large-scale fading decoding (LSFD) in the second layer. Besides, we compute novel closed-form SE expressions for the two-layer decoding implementation with maximum ratio (MR) combining. In the numerical results, we compare the SE performance for different implementation levels, combining schemes, and channel models. It is important to note that increasing the number of antennas per UE may degrade the SE performance.

preprint2022arXiv

WaveGAN: Frequency-aware GAN for High-Fidelity Few-shot Image Generation

Existing few-shot image generation approaches typically employ fusion-based strategies, either on the image or the feature level, to produce new images. However, previous approaches struggle to synthesize high-frequency signals with fine details, deteriorating the synthesis quality. To address this, we propose WaveGAN, a frequency-aware model for few-shot image generation. Concretely, we disentangle encoded features into multiple frequency components and perform low-frequency skip connections to preserve outline and structural information. Then we alleviate the generator's struggles of synthesizing fine details by employing high-frequency skip connections, thus providing informative frequency information to the generator. Moreover, we utilize a frequency L1-loss on the generated and real images to further impede frequency information loss. Extensive experiments demonstrate the effectiveness and advancement of our method on three datasets. Noticeably, we achieve new state-of-the-art with FID 42.17, LPIPS 0.3868, FID 30.35, LPIPS 0.5076, and FID 4.96, LPIPS 0.3822 respectively on Flower, Animal Faces, and VGGFace. GitHub: https://github.com/kobeshegu/ECCV2022_WaveGAN

preprint2021arXiv

Exploit Camera Raw Data for Video Super-Resolution via Hidden Markov Model Inference

To the best of our knowledge, the existing deep-learning-based Video Super-Resolution (VSR) methods exclusively make use of videos produced by the Image Signal Processor (ISP) of the camera system as inputs. Such methods are 1) inherently suboptimal due to information loss incurred by non-invertible operations in ISP, and 2) inconsistent with the real imaging pipeline where VSR in fact serves as a pre-processing unit of ISP. To address this issue, we propose a new VSR method that can directly exploit camera sensor data, accompanied by a carefully built Raw Video Dataset (RawVD) for training, validation, and testing. This method consists of a Successive Deep Inference (SDI) module and a reconstruction module, among others. The SDI module is designed according to the architectural principle suggested by a canonical decomposition result for Hidden Markov Model (HMM) inference; it estimates the target high-resolution frame by repeatedly performing pairwise feature fusion using deformable convolutions. The reconstruction module, built with elaborately designed Attention-based Residual Dense Blocks (ARDBs), serves the purpose of 1) refining the fused feature and 2) learning the color information needed to generate a spatial-specific transformation for accurate color correction. Extensive experiments demonstrate that owing to the informativeness of the camera raw data, the effectiveness of the network architecture, and the separation of super-resolution and color correction processes, the proposed method achieves superior VSR results compared to the state-of-the-art and can be adapted to any specific camera-ISP. Code and dataset are available at https://github.com/proteus1991/RawVSR.

preprint2021arXiv

Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms

The actor-critic (AC) algorithm is a popular method to find an optimal policy in reinforcement learning. In the infinite horizon scenario, the finite-sample convergence rate for the AC and natural actor-critic (NAC) algorithms has been established recently, but under independent and identically distributed (i.i.d.) sampling and single-sample update at each iteration. In contrast, this paper characterizes the convergence rate and sample complexity of AC and NAC under Markovian sampling, with mini-batch data for each iteration, and with actor having general policy class approximation. We show that the overall sample complexity for a mini-batch AC to attain an $ε$-accurate stationary point improves the best known sample complexity of AC by an order of $\mathcal{O}(ε^{-1}\log(1/ε))$, and the overall sample complexity for a mini-batch NAC to attain an $ε$-accurate globally optimal point improves the existing sample complexity of NAC by an order of $\mathcal{O}(ε^{-1}/\log(1/ε))$. Moreover, the sample complexity of AC and NAC characterized in this work outperforms that of policy gradient (PG) and natural policy gradient (NPG) by a factor of $\mathcal{O}((1-γ)^{-3})$ and $\mathcal{O}((1-γ)^{-4}ε^{-1}/\log(1/ε))$, respectively. This is the first theoretical study establishing that AC and NAC attain orderwise performance improvement over PG and NPG under infinite horizon due to the incorporation of critic.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2021arXiv

Network Pruning via Resource Reallocation

Channel pruning is broadly recognized as an effective approach to obtain a small compact model through eliminating unimportant channels from a large cumbersome network. Contemporary methods typically perform iterative pruning procedure from the original over-parameterized model, which is both tedious and expensive especially when the pruning is aggressive. In this paper, we propose a simple yet effective channel pruning technique, termed network Pruning via rEsource rEalLocation (PEEL), to quickly produce a desired slim model with negligible cost. Specifically, PEEL first constructs a predefined backbone and then conducts resource reallocation on it to shift parameters from less informative layers to more important layers in one round, thus amplifying the positive effect of these informative layers. To demonstrate the effectiveness of PEEL , we perform extensive experiments on ImageNet with ResNet-18, ResNet-50, MobileNetV2, MobileNetV3-small and EfficientNet-B0. Experimental results show that structures uncovered by PEEL exhibit competitive performance with state-of-the-art pruning algorithms under various pruning settings. Our code is available at https://github.com/cardwing/Codes-for-PEEL.

preprint2021arXiv

On-chip integrated waveguide amplifiers on Erbium-doped thin film lithium niobate on insulator

We demonstrate on-chip light amplification with integrated optical waveguide fabricated on erbium-doped thin film lithium niobate on insulator (TFLNOI) using the photolithography assisted chemo-mechanical etching (PLACE) technique. A maximum internal net gain of 18 dB in the small-signal-gain regime is measured at the peak emission wavelength of 1530 nm for a waveguide length of 3.6 cm, indicating a differential gain per unit length of 5 dB/cm. This work paves the way to the monolithic integration of diverse active and passive photonic components on the TFLNOI platform.

preprint2021arXiv

Relate and Predict: Structure-Aware Prediction with Jointly Optimized Neural DAG

Understanding relationships between feature variables is one important way humans use to make decisions. However, state-of-the-art deep learning studies either focus on task-agnostic statistical dependency learning or do not model explicit feature dependencies during prediction. We propose a deep neural network framework, dGAP, to learn neural dependency Graph and optimize structure-Aware target Prediction simultaneously. dGAP trains towards a structure self-supervision loss and a target prediction loss jointly. Our method leads to an interpretable model that can disentangle sparse feature relationships, informing the user how relevant dependencies impact the target task. We empirically evaluate dGAP on multiple simulated and real datasets. dGAP is not only more accurate, but can also recover correct dependency structure.

preprint2021arXiv

Symmetric Rigidity for Circle Endomorphisms with Bounded Geometry

Let $f$ and $g$ be two circle endomorphisms of degree $d\geq 2$ such that each has bounded geometry, preserves the Lebesgue measure, and fixes $1$. Let $h$ fixing $1$ be the topological conjugacy from $f$ to $g$. That is, $h\circ f=g\circ h$. We prove that $h$ is a symmetric circle homeomorphism if and only if $h=Id$. Many other rigidity results in circle dynamics follow from this very general symmetric rigidity result.

preprint2021arXiv

The ANTARES Astronomical Time-Domain Event Broker

We describe the Arizona-NOIRLab Temporal Analysis and Response to Events System (ANTARES), a software instrument designed to process large-scale streams of astronomical time-domain alerts. With the advent of large-format CCDs on wide-field imaging telescopes, time-domain surveys now routinely discover tens of thousands of new events each night, more than can be evaluated by astronomers alone. The ANTARES event broker will process alerts, annotating them with catalog associations and filtering them to distinguish customizable subsets of events. We describe the data model of the system, the overall architecture, annotation, implementation of filters, system outputs, provenance tracking, system performance, and the user interface.

preprint2021arXiv

The Role of the Hercules Autonomous Vehicle During the COVID-19 Pandemic: An Autonomous Logistic Vehicle for Contactless Goods Transportation

Since early 2020, the coronavirus disease 2019 (COVID-19) has spread rapidly across the world. As at the date of writing this article, the disease has been globally reported in 223 countries and regions, infected over 108 million people and caused over 2.4 million deaths (https://covid19.who.int/, accessed on Feb. 17, 2021). Avoiding person-to-person transmission is an effective approach to control and prevent the pandemic. However, many daily activities, such as transporting goods in our daily life, inevitably involve person-to-person contact. Using an autonomous logistic vehicle to achieve contact-less goods transportation could alleviate this issue. For example, it can reduce the risk of virus transmission between the driver and customers. Moreover, many countries have imposed tough lockdown measures to reduce the virus transmission (e.g., retail, catering) during the pandemic, which causes inconveniences for human daily life. Autonomous vehicle can deliver the goods bought by humans, so that humans can get the goods without going out. These demands motivate us to develop an autonomous vehicle, named as Hercules, for contact-less goods transportation during the COVID-19 pandemic. The vehicle is evaluated through real-world delivering tasks under various traffic conditions.

preprint2021arXiv

Variational Bihamiltonian Cohomologies and Integrable Hierarchies II: Virasoro symmetries

We prove that for any tau-symmetric bihamiltonian deformation of the tau-cover of the Principal Hierarchy associated with a semisimple Frobenius manifold, the deformed tau-cover admits an infinite set of Virasoro symmetries.

preprint2020arXiv

A Generalized Training Approach for Multiagent Learning

This paper investigates a population-based training regime based on game-theoretic principles called Policy-Spaced Response Oracles (PSRO). PSRO is general in the sense that it (1) encompasses well-known algorithms such as fictitious play and double oracle as special cases, and (2) in principle applies to general-sum, many-player games. Despite this, prior studies of PSRO have been focused on two-player zero-sum games, a regime wherein Nash equilibria are tractably computable. In moving from two-player zero-sum games to more general settings, computation of Nash equilibria quickly becomes infeasible. Here, we extend the theoretical underpinnings of PSRO by considering an alternative solution concept, $α$-Rank, which is unique (thus faces no equilibrium selection issues, unlike Nash) and applies readily to general-sum, many-player settings. We establish convergence guarantees in several games classes, and identify links between Nash equilibria and $α$-Rank. We demonstrate the competitive performance of $α$-Rank-based PSRO against an exact Nash solver-based PSRO in 2-player Kuhn and Leduc Poker. We then go beyond the reach of prior PSRO applications by considering 3- to 5-player poker games, yielding instances where $α$-Rank achieves faster convergence than approximate Nash solvers, thus establishing it as a favorable general games solver. We also carry out an initial empirical validation in MuJoCo soccer, illustrating the feasibility of the proposed approach in another complex domain.

preprint2020arXiv

Achieving 50 femtosecond resolution in MeV ultrafast electron diffraction with a double bend achromat compressor

We propose and demonstrate a novel scheme to produce ultrashort and ultrastable MeV electron beam. In this scheme, the electron beam produced in a photocathode radio-frequency (rf) gun first expands under its own Coulomb force with which a positive energy chirp is imprinted in the beam longitudinal phase space. The beam is then sent through a double bend achromat with positive longitudinal dispersion where electrons at the bunch tail with lower energies follow shorter paths and thus catch up with the bunch head, leading to longitudinal bunch compression. We show that with optimized parameter sets, the whole beam path from the electron source to the compression point can be made isochronous such that the time of flight for the electron beam is immune to the fluctuations of rf amplitude. With a laser-driven THz deflector, the bunch length and arrival time jitter for a 20 fC beam after bunch compression are measured to be about 29 fs (FWHM) and 22 fs (FWHM), respectively. Such an ultrashort and ultrastable electron beam allows us to achieve 50 femtosecond (FWHM) resolution in MeV ultrafast electron diffraction where lattice oscillation at 2.6 THz corresponding to Bismuth A1g mode is clearly observed without correcting both the short-term timing jitter and long-term timing drift. Furthermore, oscillating weak diffuse scattering signal related to phonon coupling and decay is also clearly resolved thanks to the improved temporal resolution and increased electron flux. We expect that this technique will have a strong impact in emerging ultrashort electron beam based facilities and applications.

preprint2020arXiv

ACMo: Angle-Calibrated Moment Methods for Stochastic Optimization

Due to its simplicity and outstanding ability to generalize, stochastic gradient descent (SGD) is still the most widely used optimization method despite its slow convergence. Meanwhile, adaptive methods have attracted rising attention of optimization and machine learning communities, both for the leverage of life-long information and for the profound and fundamental mathematical theory. Taking the best of both worlds is the most exciting and challenging question in the field of optimization for machine learning. Along this line, we revisited existing adaptive gradient methods from a novel perspective, refreshing understanding of second moments. Our new perspective empowers us to attach the properties of second moments to the first moment iteration, and to propose a novel first moment optimizer, \emph{Angle-Calibrated Moment method} (\method). Our theoretical results show that \method is able to achieve the same convergence rate as mainstream adaptive methods. Furthermore, extensive experiments on CV and NLP tasks demonstrate that \method has a comparable convergence to SOTA Adam-type optimizers, and gains a better generalization performance in most cases.

preprint2020arXiv

Adaptive Gradient Methods Can Be Provably Faster than SGD after Finite Epochs

Adaptive gradient methods have attracted much attention of machine learning communities due to the high efficiency. However their acceleration effect in practice, especially in neural network training, is hard to analyze, theoretically. The huge gap between theoretical convergence results and practical performances prevents further understanding of existing optimizers and the development of more advanced optimization methods. In this paper, we provide adaptive gradient methods a novel analysis with an additional mild assumption, and revise AdaGrad to \radagrad for matching a better provable convergence rate. To find an $ε$-approximate first-order stationary point in non-convex objectives, we prove random shuffling \radagrad achieves a $\tilde{O}(T^{-1/2})$ convergence rate, which is significantly improved by factors $\tilde{O}(T^{-1/4})$ and $\tilde{O}(T^{-1/6})$ compared with existing adaptive gradient methods and random shuffling SGD, respectively. To the best of our knowledge, it is the first time to demonstrate that adaptive gradient methods can deterministically be faster than SGD after finite epochs. Furthermore, we conduct comprehensive experiments to validate the additional mild assumption and the acceleration effect benefited from second moments and random shuffling.

preprint2020arXiv

COLD: Towards the Next Generation of Pre-Ranking System

Multi-stage cascade architecture exists widely in many industrial systems such as recommender systems and online advertising, which often consists of sequential modules including matching, pre-ranking, ranking, etc. For a long time, it is believed pre-ranking is just a simplified version of the ranking module, considering the larger size of the candidate set to be ranked. Thus, efforts are made mostly on simplifying ranking model to handle the explosion of computing power for online inference. In this paper, we rethink the challenge of the pre-ranking system from an algorithm-system co-design view. Instead of saving computing power with restriction of model architecture which causes loss of model performance, here we design a new pre-ranking system by joint optimization of both the pre-ranking model and the computing power it costs. We name it COLD (Computing power cost-aware Online and Lightweight Deep pre-ranking system). COLD beats SOTA in three folds: (i) an arbitrary deep model with cross features can be applied in COLD under a constraint of controllable computing power cost. (ii) computing power cost is explicitly reduced by applying optimization tricks for inference acceleration. This further brings space for COLD to apply more complex deep models to reach better performance. (iii) COLD model works in an online learning and severing manner, bringing it excellent ability to handle the challenge of the data distribution shift. Meanwhile, the fully online pre-ranking system of COLD provides us with a flexible infrastructure that supports efficient new model developing and online A/B testing.Since 2019, COLD has been deployed in almost all products involving the pre-ranking module in the display advertising system in Alibaba, bringing significant improvements.

preprint2020arXiv

Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic Segmentation

State-of-the-art methods for large-scale driving-scene LiDAR semantic segmentation often project and process the point clouds in the 2D space. The projection methods includes spherical projection, bird-eye view projection, etc. Although this process makes the point cloud suitable for the 2D CNN-based networks, it inevitably alters and abandons the 3D topology and geometric relations. A straightforward solution to tackle the issue of 3D-to-2D projection is to keep the 3D representation and process the points in the 3D space. In this work, we first perform an in-depth analysis for different representations and backbones in 2D and 3D spaces, and reveal the effectiveness of 3D representations and networks on LiDAR segmentation. Then, we develop a 3D cylinder partition and a 3D cylinder convolution based framework, termed as Cylinder3D, which exploits the 3D topology relations and structures of driving-scene point clouds. Moreover, a dimension-decomposition based context modeling module is introduced to explore the high-rank context information in point clouds in a progressive manner. We evaluate the proposed model on a large-scale driving-scene dataset, i.e. SematicKITTI. Our method achieves state-of-the-art performance and outperforms existing methods by 6% in terms of mIoU.

preprint2020arXiv

Discrete Darboux system with self-consistent sources and its symmetric reduction

The discrete non-commutative Darboux system of equations with self-consistent sources is constructed, utilizing both the vectorial fundamental (binary Darboux) transformation and the method of additional independent variables. Then the symmetric reduction of discrete Darboux equations with sources is presented. In order to provide a simpler version of the resulting equations we introduce the $τ/σ$ form of the (commutative) discrete Darboux system. Our equations give, in continuous limit, the version with self-consistent sources of the classical symmetric Darboux system.

preprint2020arXiv

Dynamics of entanglement in the one-dimensional anisotropic XXZ model

The dynamics of entanglement in the one-dimensional spin-1/2 anisotropic XXZ model is studied using the quantum renormalization-group method. We obtain the analytical expression of the concurrence, for two different quenching methods, it is found that initial state plays a key role in the evolution of system entanglement, i.e., the system returns completely to the initial state every other period. Our computations and analysis indicate that the first derivative of the characteristic time at which the concurrence reaches its maximum or minimum with respect to the anisotropic parameter occurs nonanalytic behaviors at the quantum critical point. Interestingly, the minimum value of the first derivative of the characteristic time versus the size of the system exhibits the scaling behavior which is the same as the scaling behavior of the system ground-state entanglement in equilibrium. In particular, the scaling behavior near the critical point is independent of the initial state.

preprint2020arXiv

Exploring Trade-offs in Dynamic Task Triggering for Loosely Coupled Scientific Workflows

In order to achieve near-time insights, scientific workflows tend to be organized in a flexible and dynamic way. Data-driven triggering of tasks has been explored as a way to support workflows that evolve based on the data. However, the overhead introduced by such dynamic triggering of tasks is an under-studied topic. This paper discusses different facets of dynamic task triggers. Particularly, we explore different ways of constructing a data-driven dynamic workflow and then evaluate the overheads introduced by such design decisions. We evaluate workflows with varying data size, percentage of interesting data, temporal data distribution, and number of tasks triggered. Finally, we provide advice based upon analysis of the evaluation results for users looking to construct data-driven scientific workflows.

preprint2020arXiv

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

The Jiangmen Underground Neutrino Observatory~(JUNO) features a 20~kt multi-purpose underground liquid scintillator sphere as its main detector. Some of JUNO's features make it an excellent experiment for $^8$B solar neutrino measurements, such as its low-energy threshold, its high energy resolution compared to water Cherenkov detectors, and its much large target mass compared to previous liquid scintillator detectors. In this paper we present a comprehensive assessment of JUNO's potential for detecting $^8$B solar neutrinos via the neutrino-electron elastic scattering process. A reduced 2~MeV threshold on the recoil electron energy is found to be achievable assuming the intrinsic radioactive background $^{238}$U and $^{232}$Th in the liquid scintillator can be controlled to 10$^{-17}$~g/g. With ten years of data taking, about 60,000 signal and 30,000 background events are expected. This large sample will enable an examination of the distortion of the recoil electron spectrum that is dominated by the neutrino flavor transformation in the dense solar matter, which will shed new light on the tension between the measured electron spectra and the predictions of the standard three-flavor neutrino oscillation framework. If $Δm^{2}_{21}=4.8\times10^{-5}~(7.5\times10^{-5})$~eV$^{2}$, JUNO can provide evidence of neutrino oscillation in the Earth at the about 3$σ$~(2$σ$) level by measuring the non-zero signal rate variation with respect to the solar zenith angle. Moveover, JUNO can simultaneously measure $Δm^2_{21}$ using $^8$B solar neutrinos to a precision of 20\% or better depending on the central value and to sub-percent precision using reactor antineutrinos. A comparison of these two measurements from the same detector will help elucidate the current tension between the value of $Δm^2_{21}$ reported by solar neutrino experiments and the KamLAND experiment.

preprint2020arXiv

From Points to Parts: 3D Object Detection from Point Cloud with Part-aware and Part-aggregation Network

3D object detection from LiDAR point cloud is a challenging problem in 3D scene understanding and has many practical applications. In this paper, we extend our preliminary work PointRCNN to a novel and strong point-cloud-based 3D object detection framework, the part-aware and aggregation neural network (Part-$A^2$ net). The whole framework consists of the part-aware stage and the part-aggregation stage. Firstly, the part-aware stage for the first time fully utilizes free-of-charge part supervisions derived from 3D ground-truth boxes to simultaneously predict high quality 3D proposals and accurate intra-object part locations. The predicted intra-object part locations within the same proposal are grouped by our new-designed RoI-aware point cloud pooling module, which results in an effective representation to encode the geometry-specific features of each 3D proposal. Then the part-aggregation stage learns to re-score the box and refine the box location by exploring the spatial relationship of the pooled intra-object part locations. Extensive experiments are conducted to demonstrate the performance improvements from each component of our proposed framework. Our Part-$A^2$ net outperforms all existing 3D detection methods and achieves new state-of-the-art on KITTI 3D object detection dataset by utilizing only the LiDAR point cloud data. Code is available at https://github.com/sshaoshuai/PointCloudDet3D.

preprint2020arXiv

Hierarchical Transformer Network for Utterance-level Emotion Recognition

While there have been significant advances in de-tecting emotions in text, in the field of utter-ance-level emotion recognition (ULER), there are still many problems to be solved. In this paper, we address some challenges in ULER in dialog sys-tems. (1) The same utterance can deliver different emotions when it is in different contexts or from different speakers. (2) Long-range contextual in-formation is hard to effectively capture. (3) Unlike the traditional text classification problem, this task is supported by a limited number of datasets, among which most contain inadequate conversa-tions or speech. To address these problems, we propose a hierarchical transformer framework (apart from the description of other studies, the "transformer" in this paper usually refers to the encoder part of the transformer) with a lower-level transformer to model the word-level input and an upper-level transformer to capture the context of utterance-level embeddings. We use a pretrained language model bidirectional encoder representa-tions from transformers (BERT) as the lower-level transformer, which is equivalent to introducing external data into the model and solve the problem of data shortage to some extent. In addition, we add speaker embeddings to the model for the first time, which enables our model to capture the in-teraction between speakers. Experiments on three dialog emotion datasets, Friends, EmotionPush, and EmoryNLP, demonstrate that our proposed hierarchical transformer network models achieve 1.98%, 2.83%, and 3.94% improvement, respec-tively, over the state-of-the-art methods on each dataset in terms of macro-F1.

preprint2020arXiv

High-index-contrast single-mode optical waveguides fabricated on lithium niobate by photolithography assisted chemo-mechanical etching (PLACE)

We report fabrication of low loss single mode waveguides on lithium niobate on insulator (LNOI) cladded by a layer of SiO2. Our technique, termed photolithography assisted chemo-mechanical etching (PLACE), relies on patterning of a chromium film into the mask shape by femtosecond laser micromachining and subsequent chemo-mechanical etching of the lithium niobate thin film. The high-index-contrast single mode waveguide is measured to have a propagation loss of 0.13 dB/cm. Furthermore, waveguide tapers are fabricated for boosting the coupling efficiency.

preprint2020arXiv

History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

Variance-reduced algorithms, although achieve great theoretical performance, can run slowly in practice due to the periodic gradient estimation with a large batch of data. Batch-size adaptation thus arises as a promising approach to accelerate such algorithms. However, existing schemes either apply prescribed batch-size adaption rule or exploit the information along optimization path via additional backtracking and condition verification steps. In this paper, we propose a novel scheme, which eliminates backtracking line search but still exploits the information along optimization path by adapting the batch size via history stochastic gradients. We further theoretically show that such a scheme substantially reduces the overall complexity for popular variance-reduced algorithms SVRG and SARAH/SPIDER for both conventional nonconvex optimization and reinforcement learning problems. To this end, we develop a new convergence analysis framework to handle the dependence of the batch size on history stochastic gradients. Extensive experiments validate the effectiveness of the proposed batch-size adaptation scheme.

preprint2020arXiv

Hunting potassium geoneutrinos with liquid scintillator Cherenkov neutrino detectors

The research of geoneutrino is a new interdisciplinary subject of particle experiments and geo-science. Potassium-40 ($^\text{40}$K) decays contribute roughly 1/3 of the radiogenic heat of the Earth, but it is still missing from the experimental observation. Solar neutrino experiments with liquid scintillators have observed uranium and thorium geoneutrinos and are the most promising in the low-background neutrino detection. In this article, we present the new concept of using liquid-scintillator Cherenkov detectors to detect the neutrino-electron elastic scattering process of $^\text{40}$K geoneutrinos. Liquid-scintillator Cherenkov detectors using a slow liquid scintillator can achieve this goal with both energy and direction measurements for charged particles. Given the directionality, we can significantly suppress the dominant intrinsic background originating from solar neutrinos in conventional liquid-scintillator detectors. We simulated the solar- and geo-neutrino scatterings in the slow liquid scintillator detector, and implemented energy and directional reconstructions for the recoiling electrons. We found that $^\text{40}$K geoneutrinos can be detected with three standard deviation accuracy in a kiloton-scale detector.

preprint2020arXiv

Non-asymptotic Convergence Analysis of Two Time-scale (Natural) Actor-Critic Algorithms

As an important type of reinforcement learning algorithms, actor-critic (AC) and natural actor-critic (NAC) algorithms are often executed in two ways for finding optimal policies. In the first nested-loop design, actor's one update of policy is followed by an entire loop of critic's updates of the value function, and the finite-sample analysis of such AC and NAC algorithms have been recently well established. The second two time-scale design, in which actor and critic update simultaneously but with different learning rates, has much fewer tuning parameters than the nested-loop design and is hence substantially easier to implement. Although two time-scale AC and NAC have been shown to converge in the literature, the finite-sample convergence rate has not been established. In this paper, we provide the first such non-asymptotic convergence rate for two time-scale AC and NAC under Markovian sampling and with actor having general policy class approximation. We show that two time-scale AC requires the overall sample complexity at the order of $\mathcal{O}(ε^{-2.5}\log^3(ε^{-1}))$ to attain an $ε$-accurate stationary point, and two time-scale NAC requires the overall sample complexity at the order of $\mathcal{O}(ε^{-4}\log^2(ε^{-1}))$ to attain an $ε$-accurate global optimal point. We develop novel techniques for bounding the bias error of the actor due to dynamically changing Markovian sampling and for analyzing the convergence rate of the linear critic with dynamically changing base functions and transition kernel.

preprint2020arXiv

Nonequilibrium quasistationary spin disordered state in the Kitaev-Heisenberg magnet $α$-RuCl$_3$

Excitation by light pulses enables the manipulation of phases of quantum condensed matter. Here, we photoexcite high-energy holon-doublon pairs as a way to alter the magnetic free energy landscape of the Kitaev-Heisenberg magnet $α$-RuCl$_3$, with the aim to dynamically stabilize a proximate spin liquid phase. The holon-doublon pair recombination through multimagnon emission is tracked through the time-evolution of the magnetic linear dichroism originating from the competing zigzag spin ordered ground state. A small holon-doublon density suffices to reach a spin disordered state. The phase transition is described within a dynamic Ginzburg-Landau framework, corroborating the quasistationary nature of the transient spin disordered phase. Our work provides insight into the coupling between the electronic and magnetic degrees of freedom in $α$-RuCl$_3$ and suggests a new route to reach a proximate spin liquid phase in Kitaev-Heisenberg magnets.

preprint2020arXiv

Observation of E8 Particles in an Ising Chain Antiferromagnet

Near the transverse-field induced quantum critical point of the Ising chain, an exotic dynamic spectrum consisting of exactly eight particles was predicted, which is uniquely described by an emergent quantum integrable field theory with the symmetry of the $E_8$ Lie algebra, but rarely explored experimentally. Here we use high-resolution terahertz spectroscopy to resolve quantum spin dynamics of the quasi-one-dimensional Ising antiferromagnet BaCo$_2$V$_2$O$_8$ in an applied transverse field. By comparing to an analytical calculation of the dynamical spin correlations, we identify $E_8$ particles as well as their two-particle excitations.

preprint2020arXiv

On the Allowable or Forbidden Nature of Vapor-Deposited Glasses

Vapor deposition can yield glasses that are more stable than those obtained by the traditional melt-quenching route. However, it remains unclear whether vapor-deposited glasses are "allowable" or "forbidden," that is, if they are equivalent to glasses formed by cooling extremely slowly a liquid or if they differ in nature from melt-quenched glasses. Here, based on reactive molecular dynamics simulation (MD) of silica glasses, we demonstrate that the allowable or forbidden nature of vapor-deposited glasses depends on the temperature of the substrate and, in turn, is found to be encoded in their medium-range order structure.

preprint2020arXiv

Phase-resolved Higgs response in superconducting cuprates

In high energy physics, the Higgs field couples to gauge bosons and fermions and gives mass to their elementary excitations. Experimentally, such couplings can be inferred from the decay product of the Higgs boson, i.e. the scalar (amplitude) excitation of the Higgs field. In superconductors, Cooper pairs bear a close analogy to the Higgs field. Interaction between the Cooper pairs and other degrees of freedom provides dissipation channel for the amplitude mode, which may reveal important information about the microscopic pairing mechanism. To this end, we investigate the Higgs (amplitude) mode of several cuprate thin films using phase-resolved terahertz third harmonic generation (THG). In addition to the heavily damped Higgs mode itself, we observe a universal jump in the phase of the driven Higgs oscillation as well as a non-vanishing THG above Tc. These findings indicate coupling of the Higgs mode to other collective modes and potentially a nonzero pairing amplitude above Tc.

preprint2020arXiv

Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation

Monocular estimation of 3d human pose has attracted increased attention with the availability of large ground-truth motion capture datasets. However, the diversity of training data available is limited and it is not clear to what extent methods generalize outside the specific datasets they are trained on. In this work we carry out a systematic study of the diversity and biases present in specific datasets and its effect on cross-dataset generalization across a compendium of 5 pose datasets. We specifically focus on systematic differences in the distribution of camera viewpoints relative to a body-centered coordinate frame. Based on this observation, we propose an auxiliary task of predicting the camera viewpoint in addition to pose. We find that models trained to jointly predict viewpoint and pose systematically show significantly improved cross-dataset generalization.

preprint2020arXiv

Proximal Gradient Algorithm with Momentum and Flexible Parameter Restart for Nonconvex Optimization

Various types of parameter restart schemes have been proposed for accelerated gradient algorithms to facilitate their practical convergence in convex optimization. However, the convergence properties of accelerated gradient algorithms under parameter restart remain obscure in nonconvex optimization. In this paper, we propose a novel accelerated proximal gradient algorithm with parameter restart (named APG-restart) for solving nonconvex and nonsmooth problems. Our APG-restart is designed to 1) allow for adopting flexible parameter restart schemes that cover many existing ones; 2) have a global sub-linear convergence rate in nonconvex and nonsmooth optimization; and 3) have guaranteed convergence to a critical point and have various types of asymptotic convergence rates depending on the parameterization of local geometry in nonconvex and nonsmooth optimization. Numerical experiments demonstrate the effectiveness of our proposed algorithm.

preprint2020arXiv

Reanalysis of Variance Reduced Temporal Difference Learning

Temporal difference (TD) learning is a popular algorithm for policy evaluation in reinforcement learning, but the vanilla TD can substantially suffer from the inherent optimization variance. A variance reduced TD (VRTD) algorithm was proposed by Korda and La (2015), which applies the variance reduction technique directly to the online TD learning with Markovian samples. In this work, we first point out the technical errors in the analysis of VRTD in Korda and La (2015), and then provide a mathematically solid analysis of the non-asymptotic convergence of VRTD and its variance reduction performance. We show that VRTD is guaranteed to converge to a neighborhood of the fixed-point solution of TD at a linear convergence rate. Furthermore, the variance error (for both i.i.d.\ and Markovian sampling) and the bias error (for Markovian sampling) of VRTD are significantly reduced by the batch size of variance reduction in comparison to those of vanilla TD. As a result, the overall computational complexity of VRTD to attain a given accurate solution outperforms that of TD under Markov sampling and outperforms that of TD under i.i.d.\ sampling for a sufficiently small conditional number.

preprint2020arXiv

Resisting Crowd Occlusion and Hard Negatives for Pedestrian Detection in the Wild

Pedestrian detection has been heavily studied in the last decade due to its wide application. Despite incremental progress, crowd occlusion and hard negatives are still challenging current state-of-the-art pedestrian detectors. In this paper, we offer two approaches based on the general region-based detection framework to tackle these challenges. Specifically, to address the occlusion, we design a novel coulomb loss as a regulator on bounding box regression, in which proposals are attracted by their target instance and repelled by the adjacent non-target instances. For hard negatives, we propose an efficient semantic-driven strategy for selecting anchor locations, which can sample informative negative examples at training phase for classification refinement. It is worth noting that these methods can also be applied to general object detection domain, and trainable in an end-to-end manner. We achieves consistently high performance on the Caltech-USA and CityPersons benchmarks.

preprint2020arXiv

Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction

Rich user behavior data has been proven to be of great value for click-through rate prediction tasks, especially in industrial applications such as recommender systems and online advertising. Both industry and academy have paid much attention to this topic and propose different approaches to modeling with long sequential user behavior data. Among them, memory network based model MIMN proposed by Alibaba, achieves SOTA with the co-design of both learning algorithm and serving system. MIMN is the first industrial solution that can model sequential user behavior data with length scaling up to 1000. However, MIMN fails to precisely capture user interests given a specific candidate item when the length of user behavior sequence increases further, say, by 10 times or more. This challenge exists widely in previously proposed approaches. In this paper, we tackle this problem by designing a new modeling paradigm, which we name as Search-based Interest Model (SIM). SIM extracts user interests with two cascaded search units: (i) General Search Unit acts as a general search from the raw and arbitrary long sequential behavior data, with query information from candidate item, and gets a Sub user Behavior Sequence which is relevant to candidate item; (ii) Exact Search Unit models the precise relationship between candidate item and SBS. This cascaded search paradigm enables SIM with a better ability to model lifelong sequential behavior data in both scalability and accuracy. Apart from the learning algorithm, we also introduce our hands-on experience on how to implement SIM in large scale industrial systems. Since 2019, SIM has been deployed in the display advertising system in Alibaba, bringing 7.1\% CTR and 4.4\% RPM lift, which is significant to the business. Serving the main traffic in our real system now, SIM models user behavior data with maximum length reaching up to 54000, pushing SOTA to 54x.

preprint2020arXiv

SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud

3D vehicle detection based on point cloud is a challenging task in real-world applications such as autonomous driving. Despite significant progress has been made, we observe two aspects to be further improved. First, the semantic context information in LiDAR is seldom explored in previous works, which may help identify ambiguous vehicles. Second, the distribution of point cloud on vehicles varies continuously with increasing depths, which may not be well modeled by a single model. In this work, we propose a unified model SegVoxelNet to address the above two problems. A semantic context encoder is proposed to leverage the free-of-charge semantic segmentation masks in the bird's eye view. Suspicious regions could be highlighted while noisy regions are suppressed by this module. To better deal with vehicles at different depths, a novel depth-aware head is designed to explicitly model the distribution differences and each part of the depth-aware head is made to focus on its own target detection range. Extensive experiments on the KITTI dataset show that the proposed method outperforms the state-of-the-art alternatives in both accuracy and efficiency with point cloud as input only.

preprint2020arXiv

SpiderBoost and Momentum: Faster Stochastic Variance Reduction Algorithms

SARAH and SPIDER are two recently developed stochastic variance-reduced algorithms, and SPIDER has been shown to achieve a near-optimal first-order oracle complexity in smooth nonconvex optimization. However, SPIDER uses an accuracy-dependent stepsize that slows down the convergence in practice, and cannot handle objective functions that involve nonsmooth regularizers. In this paper, we propose SpiderBoost as an improved scheme, which allows to use a much larger constant-level stepsize while maintaining the same near-optimal oracle complexity, and can be extended with proximal mapping to handle composite optimization (which is nonsmooth and nonconvex) with provable convergence guarantee. In particular, we show that proximal SpiderBoost achieves an oracle complexity of $\mathcal{O}(\min\{n^{1/2}ε^{-2},ε^{-3}\})$ in composite nonconvex optimization, improving the state-of-the-art result by a factor of $\mathcal{O}(\min\{n^{1/6},ε^{-1/3}\})$. We further develop a novel momentum scheme to accelerate SpiderBoost for composite optimization, which achieves the near-optimal oracle complexity in theory and substantial improvement in experiments.

preprint2020arXiv

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

The Taishan Antineutrino Observatory (TAO, also known as JUNO-TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO). A ton-level liquid scintillator detector will be placed at about 30 m from a core of the Taishan Nuclear Power Plant. The reactor antineutrino spectrum will be measured with sub-percent energy resolution, to provide a reference spectrum for future reactor neutrino experiments, and to provide a benchmark measurement to test nuclear databases. A spherical acrylic vessel containing 2.8 ton gadolinium-doped liquid scintillator will be viewed by 10 m^2 Silicon Photomultipliers (SiPMs) of >50% photon detection efficiency with almost full coverage. The photoelectron yield is about 4500 per MeV, an order higher than any existing large-scale liquid scintillator detectors. The detector operates at -50 degree C to lower the dark noise of SiPMs to an acceptable level. The detector will measure about 2000 reactor antineutrinos per day, and is designed to be well shielded from cosmogenic backgrounds and ambient radioactivities to have about 10% background-to-signal ratio. The experiment is expected to start operation in 2022.

preprint2020arXiv

Towards Reducing Severe Defocus Spread Effects for Multi-Focus Image Fusion via an Optimization Based Strategy

Multi-focus image fusion (MFF) is a popular technique to generate an all-in-focus image, where all objects in the scene are sharp. However, existing methods pay little attention to defocus spread effects of the real-world multi-focus images. Consequently, most of the methods perform badly in the areas near focus map boundaries. According to the idea that each local region in the fused image should be similar to the sharpest one among source images, this paper presents an optimization-based approach to reduce defocus spread effects. Firstly, a new MFF assessmentmetric is presented by combining the principle of structure similarity and detected focus maps. Then, MFF problem is cast into maximizing this metric. The optimization is solved by gradient ascent. Experiments conducted on the real-world dataset verify superiority of the proposed model. The codes are available at https://github.com/xsxjtu/MFF-SSIM.

preprint2020arXiv

ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language

Person search by natural language aims at retrieving a specific person in a large-scale image pool that matches the given textual descriptions. While most of the current methods treat the task as a holistic visual and textual feature matching one, we approach it from an attribute-aligning perspective that allows grounding specific attribute phrases to the corresponding visual regions. We achieve success as well as the performance boosting by a robust feature learning that the referred identity can be accurately bundled by multiple attribute visual cues. To be concrete, our Visual-Textual Attribute Alignment model (dubbed as ViTAA) learns to disentangle the feature space of a person into subspaces corresponding to attributes using a light auxiliary attribute segmentation computing branch. It then aligns these visual features with the textual attributes parsed from the sentences by using a novel contrastive learning loss. Upon that, we validate our ViTAA framework through extensive experiments on tasks of person search by natural language and by attribute-phrase queries, on which our system achieves state-of-the-art performances. Code will be publicly available upon publication.

preprint2020arXiv

Weak Supervision and Referring Attention for Temporal-Textual Association Learning

A system capturing the association between video frames and textual queries offer great potential for better video analysis. However, training such a system in a fully supervised way inevitably demands a meticulously curated video dataset with temporal-textual annotations. Therefore we provide a Weak-Supervised alternative with our proposed Referring Attention mechanism to learn temporal-textual association (dubbed WSRA). The weak supervision is simply a textual expression (e.g., short phrases or sentences) at video level, indicating this video contains relevant frames. The referring attention is our designed mechanism acting as a scoring function for grounding the given queries over frames temporally. It consists of multiple novel losses and sampling strategies for better training. The principle in our designed mechanism is to fully exploit 1) the weak supervision by considering informative and discriminative cues from intra-video segments anchored with the textual query, 2) multiple queries compared to the single video, and 3) cross-video visual similarities. We validate our WSRA through extensive experiments for temporally grounding by languages, demonstrating that it outperforms the state-of-the-art weakly-supervised methods notably.

preprint2019arXiv

A compact and efficient three-dimensional microfluidic mixer

Microfluidic mixing is a fundamental functionality in most lab on a chip (LOC) systems,whereas realization of efficient mixing is challenging in microfluidic channels due to the small Reynolds numbers. Here, we design and fabricate a compact three-dimensional (3D) micromixer to enable efficient mixing at various flow rates. The performance of the fabricated micromixer was examined using blue and red inks. The extreme flexibility in fabricating microfluidic structures of arbitrary 3D geometries using femtosecond laser micromachining allows us to tackle the major disadvantageous effects for optimizing the mixing efficiency.

preprint2019arXiv

Coordinate-wise descent methods for leading eigenvalue problem

Leading eigenvalue problems for large scale matrices arise in many applications. Coordinate-wise descent methods are considered in this work for such problems based on a reformulation of the leading eigenvalue problem as a non-convex optimization problem. The convergence of several coordinate-wise methods is analyzed and compared. Numerical examples of applications to quantum many-body problems demonstrate the efficiency and provide benchmarks of the proposed coordinate-wise descent methods.

preprint2019arXiv

Determining the phase diagram of atomically thin layered antiferromagnet CrCl$_3$

Changes in the spin configuration of atomically-thin, magnetic van-der-Waals multilayers can cause drastic modifications in their opto-electronic properties. Conversely, the opto-electronic response of these systems provides information about the magnetic state, very difficult to obtain otherwise. Here we show that in CrCl$_3$ multilayers, the dependence of the tunnelling conductance on applied magnetic field ($H$), temperature ($T$), and number of layers ($N$) tracks the evolution of the magnetic state, enabling the magnetic phase diagram of these systems to be determined experimentally. Besides a high-field spin-flip transition occurring for all thicknesses, the in-plane magnetoconductance exhibits an even-odd effect due to a low-field spin-flop transition. If the layer number $N$ is even, the transition occurs at $μ_0 H \sim 0$ T due to the very small in-plane magnetic anisotropy, whereas for odd $N$ the net magnetization of the uncompensated layer causes the transition to occur at finite $H$. Through a quantitative analysis of the phenomena, we determine the interlayer exchange coupling as well as the staggered magnetization, and show that in CrCl$_3$ shape anisotropy dominates. Our results reveal the rich behaviour of atomically-thin layered antiferromagnets with weak magnetic anisotropy.

preprint2019arXiv

Dispersions of Many-Body Bethe strings

Complex bound states of magnetic excitations, known as Bethe string, were predicted almost a century ago to exist in one-dimensional quantum magnets 1. The dispersions of the string states have so far remained the subject of intensive theoretical studies 2-7. By performing neutron scattering experiments on the one-dimensional Heisenberg-Ising antiferromagnet SrCo2V2O8 in high longitudinal magnetic fields, we reveal in detail the dispersion relations of the string states over the full Brillouin zone, as well as their magnetic field dependences. Furthermore the characteristic energy, the scattering intensity and linewidth of the observed string states exhibit excellent agreement with our precise Bethe Ansatz calculations. Our results establish the important role of string states in the quantum spin dynamics of one-dimensional systems, and will invoke studies of their dynamical properties in more general many-body systems.

preprint2019arXiv

High-Field Quantum Disordered State in $α$-RuCl3: Spin Flips, Bound States, and a Multi-Particle Continuum

Layered $α$-RuCl3 has been discussed as a proximate Kitaev spin liquid compound. Raman and THz spectroscopy of magnetic excitations confirm that the low-temperature antiferromagnetic ordered phase features a broad Raman continuum, together with two magnon-like excitations at 2.7 and 3.6 meV, respectively. The continuum strength is maximized as long-range order is suppressed by an external magnetic field. The state above the field-induced quantum phase transition around 7.5 T is characterized by a gapped multi-particle continuum out of which a two-particle bound state emerges, together with a well-defined single-particle excitation at lower energy. Exact diagonalization calculations demonstrate that Kitaev and off-diagonal exchange terms in the Fleury-Loudon operator are crucial for the occurrence of these features in the Raman spectra. Our study firmly establishes the partially-polarized quantum disordered character of the high-field phase.

preprint2019arXiv

Model-free posterior inference on the area under the receiver operating characteristic curve

The area under the receiver operating characteristic curve (AUC) serves as a summary of a binary classifier's performance. Methods for estimating the AUC have been developed under a binormality assumption which restricts the distribution of the score produced by the classifier. However, this assumption introduces an infinite-dimensional nuisance parameter and can be inappropriate, especially in the context of machine learning. This motivates us to adopt a model-free Gibbs posterior distribution for the AUC. We present the asymptotic Gibbs posterior concentration rate, and a strategy for tuning the learning rate so that the corresponding credible intervals achieve the nominal frequentist coverage probability. Simulation experiments and a real data analysis demonstrate the Gibbs posterior's strong performance compared to existing methods based on a rank likelihood.

preprint2019arXiv

Non-perturbative high-harmonic generation in the three-dimensional Dirac semimetal Cd$_3$As$_2$

Harmonic generation is a general characteristic of driven nonlinear systems, and serves as an efficient tool for investigating the fundamental principles that govern the ultrafast nonlinear dynamics. In atomic gases, high-harmonic radiation is produced via a three-step process of ionization, acceleration, and recollision by strong-field infrared laser. This mechanism has been intensively investigated in the extreme ultraviolet and soft X-ray regions, forming the basis of attosecond research. In solid-state materials, which are characterized by crystalline symmetry and strong interactions, yielding of harmonics has just recently been reported. The observed high-harmonic generation was interpreted with fundamentally different mechanisms, such as interband tunneling combined with dynamical Bloch oscillations, intraband thermodynamics and nonlinear dynamics, and many-body electronic interactions. Here, in a distinctly different context of three-dimensional Dirac semimetal, we report on experimental observation of high-harmonic generation up to the seventh order driven by strong-field terahertz pulses. The observed non-perturbative high-harmonic generation is interpreted as a generic feature of terahertz-field driven nonlinear intraband kinetics of Dirac fermions. We anticipate that our results will trigger great interest in detection, manipulation, and coherent control of the nonlinear response in the vast family of three-dimensional Dirac and Weyl materials.

preprint2019arXiv

Spin-flop transition in atomically thin MnPS$_3$ crystals

The magnetic state of atomically thin semiconducting layered antiferromagnets such as CrI$_3$ and CrCl$_3$ can be probed by forming tunnel barriers and measuring their resistance as a function of magnetic field ($H$) and temperature ($T$). This is possible because the tunneling magnetoresistance originates from a spin-filtering effect sensitive to the relative orientation of the magnetization in different layers, i.e., to the magnetic state of the multilayers. For systems in which antiferromagnetism occurs within an individual layer, however, no spin-filtering occurs: it is unclear whether this strategy can work. To address this issue, we investigate tunnel transport through atomically thin crystals of MnPS$_3$, a van der Waals semiconductor that in the bulk exhibits easy-axis antiferromagnetic order within the layers. For thick multilayers below $T\simeq 78$ K, a $T$-dependent magnetoresistance sets-in at $\sim 5$ T, and is found to track the boundary between the antiferromagnetic and the spin-flop phases known from bulk magnetization measurements. The magnetoresistance persists down to individual MnPS$_3$ monolayers with nearly unchanged characteristic temperature and magnetic field scales, albeit with a different dependence on $H$. We discuss the implications of these finding for the magnetic state of atomically thin MnPS$_3$ crystals, conclude that antiferromagnetic correlations persist down to the level of individual monolayers, and that tunneling magnetoresistance does allow magnetism in 2D insulating materials to be detected even in the absence of spin-filtering.

preprint2019arXiv

Two-dimensional Ferromagnetic van der Waals CrX3 (X=Cl, Br, I) Monolayers with Enhanced Anisotropy and Curie Temperature

Among the recently widely studied van der Waals layered magnets CrX3 (X=Cl, Br, I), CrCl3 monolayer (ML) is particularly puzzling as it is solely shown by experiments to have an in-plane magnetic easy axis and, furthermore, all of previous first-principles calculation results contradict this. Through systematical first-principles calculations,we unveil that its in-plane shape anisotropy that dominates over its weak perpendicular magnetocrystalline anisotropy is responsible for the in-plane magnetic easy axis of CrCl3 ML. To tune the in-plane ferromagnetism of CrCl3 ML into the desirable perpendicular one, we propose substituting Cr with isovalent tungsten (W). We find that CrWCl6 has a strong perpendicular magnetic anisotropy and a high Curie temperature up to 76 K. Our work not only gives insight into understanding the two-dimensional ferromagnetism of van der Waals MLs but also sheds new light on engineering their performances for nanodevices.

preprint2016arXiv

ANTARES: Progress towards building a `Broker' of time-domain alerts

The Arizona-NOAO Temporal Analysis and Response to Events System (ANTARES) is a joint effort of NOAO and the Department of Computer Science at the University of Arizona to build prototype software to process alerts from time-domain surveys, especially LSST, to identify those alerts that must be followed up immediately. Value is added by annotating incoming alerts with existing information from previous surveys and compilations across the electromagnetic spectrum and from the history of past alerts. Comparison against a knowledge repository of properties and features of known or predicted kinds of variable phenomena is used for categorization. The architecture and algorithms being employed are described.

preprint2016arXiv

Correlation between non-centrosymmetry and superconductivity in quasi-one-dimensional compounds A2Cr3As3 (A=K, Rb)

Non-centrosymmetric superconductors, whose crystal structure is absent of inversion symmetry, have recently received special attentions due to the expectation of unconventional pairings and exotic physics associated with such pairings. The newly discovered superconductors A2Cr3As3 (A=K, Rb), featured by the quasi-one dimensional structure with conducting CrAs chains, belongs to such kind of superconductor. In this study, we are the first to report the finding that the superconductivity of A2Cr3As3 (A=K, Rb) has a positive correlation with the extent of non-centrosymmetry. Our in-situ high pressure ac susceptibility and synchrotron x-ray diffraction measurements reveal that the larger bond angle of As-Cr-As in the CrAs chains can be taken as a key factor controlling superconductivity. While the smaller bond angle and the distance between the CrAs chains also affect the superconductivity due to their structural connections with the angle. We find that the larger value of the difference between the larger and samller angles, which is associated with the extent of the non-centrosymmetry of the lattice structure, is in favor of superconductivity. These results are expected to shed a new light on the underlying mechanism of the superconductivity in these Q1D superconductors and also to provide new perspective in understanding other non-centrosymmetric superconductors.

preprint2016arXiv

Crafting GBD-Net for Object Detection

The visual cues from multiple support regions of different sizes and resolutions are complementary in classifying a candidate box in object detection. Effective integration of local and contextual visual cues from these regions has become a fundamental problem in object detection. In this paper, we propose a gated bi-directional CNN (GBD-Net) to pass messages among features from different support regions during both feature learning and feature extraction. Such message passing can be implemented through convolution between neighboring support regions in two directions and can be conducted in various layers. Therefore, local and contextual visual patterns can validate the existence of each other by learning their nonlinear relationships and their close interactions are modeled in a more complex way. It is also shown that message passing is not always helpful but dependent on individual samples. Gated functions are therefore needed to control message transmission, whose on-or-offs are controlled by extra visual evidence from the input sample. The effectiveness of GBD-Net is shown through experiments on three object detection datasets, ImageNet, Pascal VOC2007 and Microsoft COCO. This paper also shows the details of our approach in wining the ImageNet object detection challenge of 2016, with source code provided on \url{https://github.com/craftGBD/craftGBD}.

preprint2016arXiv

CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016

This paper presents the method that underlies our submission to the untrimmed video classification task of ActivityNet Challenge 2016. We follow the basic pipeline of temporal segment networks and further raise the performance via a number of other techniques. Specifically, we use the latest deep model architecture, e.g., ResNet and Inception V3, and introduce new aggregation schemes (top-k and attention-weighted pooling). Additionally, we incorporate the audio as a complementary channel, extracting relevant information via a CNN applied to the spectrograms. With these techniques, we derive an ensemble of deep models, which, together, attains a high classification accuracy (mAP $93.23\%$) on the testing set and secured the first place in the challenge.

preprint2016arXiv

Detection of the liquid-liquid transition in the deeply cooled water confined in MCM-41 with elastic neutron scattering technique

In this paper we present a review on our recent experimental investigations into the phase behavior of the deeply cooled water confined in a nanoporous silica material, MCM-41, with elastic neutron scattering technique. Under such strong confinement, the homogeneous nucleation process of water is avoided, which allows the confined water to keep its liquid state at temperatures and pressures that are inaccessible to the bulk water. By measuring the average density of the confined heavy water, we observe a likely first-order low-density liquid (LDL) to high-density liquid (HDL) transition in the deeply cooled region of the confined heavy water. The phase separation starts from 1.12 +- 0.17 kbar and 215 +- 1K and extends to higher pressures and lower temperatures in the phase diagram. This starting point could be the liquid-liquid critical point of the confined water. The locus of the Widom line is also estimated. The observation of the liquid-liquid transition in the confined water has potential to explain the mysterious behaviors of water at low temperatures. In addition, it may also have impacts on other disciplines, because the confined water system represents many biological and geological systems in which water resides in nanoscopic pores or in the vicinity of hydrophilic or hydrophobic surfaces.

preprint2016arXiv

Electromagnons, magnons, and phonons in Eu$_{1-x}$Ho$_x$MnO$_3$

Here we present a detailed study of the THz and FIR response of the mixed perovskite manganite system Eu$_{1-x}$Ho$_x$MnO$_3$ for holmium concentrations x = 0.1 and 0.3. We compare the magnetic excitations of the four different magnetically ordered phases (A-type antiferromagnetic, sinusoidally modulated collinear, helical phases with spin planes perpendicular to the crystallographic a and c axes). The transition between the two latter phases goes hand in hand with a switching of the ferroelectric polarization from P||a to P||c. Special emphasis is paid to the temperature dependence of the excitations at this transition. We find a significant change of intensity indicating that the exchange striction mechanism cannot be the only mechanism to induce dipolar weight to spin-wave excitations. We also focus on excitations within the incommensurate collinear antiferromagnetic phase and find a so far unobserved excitation close to 40 cm$^{-1}$. A detailed analysis of optical weight gives a further unexpected result: In the multiferroic phase with P||c all the spectral weight of the electromagnons comes from the lowest phonon mode. However, for the phase with the polarization P||a additional spectral weight must be transferred from higher frequencies.

preprint2016arXiv

From confined spinons to emergent fermions: Observation of elementary magnetic excitations in a transverse-field Ising chain

We report on spectroscopy study of elementary magnetic excitations in an Ising-like antiferromagnetic chain compound SrCo$_2$V$_2$O$_8$ as a function of temperature and applied transverse magnetic field up to 25 T. An optical as well as an acoustic branch of confined spinons, the elementary excitations at zero field, are identified in the antiferromagnetic phase below the Néel temperature of 5 K and described by a one-dimensional Schrödinger equation. The confinement can be suppressed by an applied transverse field and a quantum disordered phase is induced at 7 T. In this disordered paramagnetic phase, we observe three emergent fermionic excitations with different transverse-field dependencies. The nature of these modes is clarified by studying spin dynamic structure factor of a 1D transverse-field Heisenberg-Ising (XXZ) model using the method of infinite time evolving block decimation. Our work reveals emergent quantum phenomena and provides a concrete system for testifying theoretical predications of one-dimension quantum spin models.

preprint2016arXiv

Orbital Angular Momentum-based Space Division Multiplexing for High-capacity Underwater Optical Communications

To increase system capacity of underwater optical communications, we employ the spatial domain to simultaneously transmit multiple orthogonal spatial beams, each carrying an independent data channel. In this paper, we multiplex and transmit four green orbital angular momentum (OAM) beams through a single aperture. Moreover, we investigate the degrading effects of scattering/turbidity, water current, and thermal gradient-induced turbulence, and we find that thermal gradients cause the most distortions and turbidity causes the most loss. We show systems results using two different data generation techniques, one at 1064 nm for 10-Gbit/s/beam and one at 520 nm for 1-Gbit/s/beam, we use both techniques since present data-modulation technologies are faster for infrared (IR) than for green. For the higher-rate link, data is modulated in the IR, and OAM imprinting is performed in the green using a specially-designed metasurface phase mask. For the lower rates, a green laser diode is directly modulated. Finally, we show that inter-channel crosstalk induced by thermal gradients can be mitigated using multi-channel equalisation processing.

preprint2016arXiv

Origin and magnitude of 'designer' spin-orbit interaction in graphene on semiconducting transition metal dichalcogenides

We use a combination of experimental techniques to demonstrate a general occurrence of spin-orbit interaction (SOI) in graphene on transition metal dichalcogenide (TMD) substrates. Our measurements indicate that SOI is ultra-strong and extremely robust, despite it being merely interfacially-induced, with neither graphene nor the TMD substrates changing their structure. This is found to be the case irrespective of the TMD material used, of the transport regime, of the carrier type in the graphene band, and of the thickness of the graphene multilayer. Specifically, we perform weak antilocalization measurements as the simplest and most general diagnostic of SOI, and show that the spin relaxation time is very short in all cases regardless of the elastic scattering time. Such a short spin-relaxation time strongly suggests that the SOI originates from a modification of graphene band structure. We confirmed this expectation by measuring a gate-dependent beating, and a corresponding frequency splitting, in the low-field Shubnikov-de Haas magneto-resistance oscillations in high quality bilayer graphene on WSe$_2$. These measurements provide an unambiguous diagnostic of a SOI-induced splitting in the electronic band structure, and their analysis allows us to determine the SOI coupling constants for the Rashba term and the so-called spin-valley coupling term, i.e., the terms that were recently predicted theoretically for interface-induced SOI in graphene. The magnitude of the SOI splitting is found to be on the order of 10 meV, more than 100 times greater than the SOI intrinsic to graphene. Both the band character of the interfacially induced SOI, as well as its robustness and large magnitude make graphene-on-TMD a promising system to realize and explore a variety of spin-dependent transport phenomena, such as, in particular, spin-Hall and valley-Hall topological insulating states.

preprint2016arXiv

Origin and Suppression of $1/f$ Magnetic Flux Noise

Magnetic flux noise is a dominant source of dephasing and energy relaxation in superconducting qubits. The noise power spectral density varies with frequency as $1/f^α$ with $α\sim 1$ and spans 13 orders of magnitude. Recent work indicates that the noise is from unpaired magnetic defects on the surfaces of the superconducting devices. Here, we demonstrate that adsorbed molecular O$_2$ is the dominant contributor to magnetism in superconducting thin films. We show that this magnetism can be suppressed by appropriate surface treatment or improvement in the sample vacuum environment. We observe a suppression of static spin susceptibility by more than an order of magnitude and a suppression of $1/f$ magnetic flux noise power spectral density by more than a factor of 5. These advances open the door to realization of superconducting qubits with improved quantum coherence.

preprint2016arXiv

Real-time Action Recognition with Enhanced Motion Vector CNNs

The deep two-stream architecture exhibited excellent performance on video based action recognition. The most computationally expensive step in this approach comes from the calculation of optical flow which prevents it to be real-time. This paper accelerates this architecture by replacing optical flow with motion vector which can be obtained directly from compressed videos without extra calculation. However, motion vector lacks fine structures, and contains noisy and inaccurate motion patterns, leading to the evident degradation of recognition performance. Our key insight for relieving this problem is that optical flow and motion vector are inherent correlated. Transferring the knowledge learned with optical flow CNN to motion vector CNN can significantly boost the performance of the latter. Specifically, we introduce three strategies for this, initialization transfer, supervision transfer and their combination. Experimental results show that our method achieves comparable recognition performance to the state-of-the-art, while our method can process 390.7 frames per second, which is 27 times faster than the original two-stream method.

preprint2016arXiv

Reversible tuning of superconductivity in pressurized qausi-one-dimensional A2Cr3As3 (A=K and Rb)

In-situ hydrostatic and uniaxial high pressure studies were performed on recently discovered CrAs-based qausi-one-dimensional superconductors A2Cr3As3 (A=K and Rb). The established Pressure-Temperature phase diagram in this study clearly demonstrates that either hydrostatic pressure or uniaxial pressure globally suppresses the superconducting transition temperature (Tc), and the latter is more effective than the former. Interestingly, in the same hydrostatic pressure environment, the suppressing rate of Tc in Rb2Cr3As3 is nearly twice as that of K2Cr3As3. Significantly, the reduced Tc in these superconductors can fully recover to its ambient-pressure value after the applied pressure is entirely released. Our results suggest that the bonding distance and angle between Cr-Cr in the Cr3As3 chains are the key factor in determining Tc and that the optimal lattice for superconductivity is hosted in the pristine K2Cr3As3.

preprint2016arXiv

Search for the rare decay $K^+\toμ^+ν\barνν$

Evidence of the $K^+\toμ^+ν\barνν$ decay was searched for using E949 experimental data with an exposure of $1.70\times 10^{12}$ stopped kaons. The data sample is dominated by the backgrond process $K^+\toμ^+ν_μγ$. An upper limit on the decay rate $Γ(K^+\toμ^+ν\barνν)< 2.4\times 10^{-6}Γ(K^+\to all)$ at 90% confidence level was set assuming the Standard Model muon spectrum. The data are presented in such a way as to allow calculation of rates for any assumed $μ^+$ spectrum.

preprint2016arXiv

Spatial Phase and Amplitude Structuring of Beams Using a Combination of Multiple Orthogonal Spatial Functions with Complex Coefficients

Analogous to time signals that can be composed of multiple frequency functions, we use uniquely structured orthogonal spatial modes to create different beam shapes. We tailor the spatial structure by judiciously choosing a weighted combination of multiple modal states within an orthogonal basis set, and we can tunably create beam phase and intensity "shapes" that are not otherwise readily achievable. As an example shape, we use a series of orbital-angular-momentum (OAM) functions with adjustable complex weights to create a reconfigurable spatial region of higher localized power as compared to traditional beam combining. We simulate a structured beam created by coherently combining several orthogonal OAM beams with different complex weights, and we achieve a >10X localized power density enhancement with 19 beams. Additionally, we can create unique shapes by passing a single beam through a specially designed phase and intensity mask that contains the combination of multiple OAM functions each with complex weights. Using this approach, we experimentally demonstrate a ~2.5X localized power density increase when utilizing 9 functions.

preprint2016arXiv

Temporal Segment Networks: Towards Good Practices for Deep Action Recognition

Deep convolutional networks have achieved great success for visual recognition in still images. However, for action recognition in videos, the advantage over traditional methods is not so evident. This paper aims to discover the principles to design effective ConvNet architectures for action recognition in videos and learn these models given limited training samples. Our first contribution is temporal segment network (TSN), a novel framework for video-based action recognition. which is based on the idea of long-range temporal structure modeling. It combines a sparse temporal sampling strategy and video-level supervision to enable efficient and effective learning using the whole action video. The other contribution is our study on a series of good practices in learning ConvNets on video data with the help of temporal segment network. Our approach obtains the state-the-of-art performance on the datasets of HMDB51 ( $ 69.4\% $) and UCF101 ($ 94.2\% $). We also visualize the learned ConvNet models, which qualitatively demonstrates the effectiveness of temporal segment network and the proposed good practices.

preprint2016arXiv

Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images

Event recognition in still images is an intriguing problem and has potential for real applications. This paper addresses the problem of event recognition by proposing a convolutional neural network that exploits knowledge of objects and scenes for event classification (OS2E-CNN). Intuitively, it stands to reason that there exists a correlation among the concepts of objects, scenes, and events. We empirically demonstrate that the recognition of objects and scenes substantially contributes to the recognition of events. Meanwhile, we propose an iterative selection method to identify a subset of object and scene classes, which help to more efficiently and effectively transfer their deep representations to event recognition. Specifically, we develop three types of transferring techniques: (1) initialization-based transferring, (2) knowledge-based transferring, and (3) data-based transferring. These newly designed transferring techniques exploit multi-task learning frameworks to incorporate extra knowledge from other networks and additional datasets into the training procedure of event CNNs. These multi-task learning frameworks turn out to be effective in reducing the effect of over-fitting and improving the generalization ability of the learned CNNs. With OS2E-CNN, we design a multi-ratio and multi-scale cropping strategy, and propose an end-to-end event recognition pipeline. We perform experiments on three event recognition benchmarks: the ChaLearn Cultural Event Recognition dataset, the Web Image Dataset for Event Recognition (WIDER), and the UIUC Sports Event dataset. The experimental results show that our proposed algorithm successfully adapts object and scene representations towards the event dataset and that it achieves the current state-of-the-art performance on these challenging datasets.

preprint2015arXiv

A Distance-based Paraconsistent Semantics for DL-Lite

DL-Lite is an important family of description logics. Recently, there is an increasing interest in handling inconsistency in DL-Lite as the constraint imposed by a TBox can be easily violated by assertions in ABox in DL-Lite. In this paper, we present a distance-based paraconsistent semantics based on the notion of feature in DL-Lite, which provides a novel way to rationally draw meaningful conclusions even from an inconsistent knowledge base. Finally, we investigate several important logical properties of this entailment relation based on the new semantics and show its promising advantages in non-monotonic reasoning for DL-Lite.

preprint2015arXiv

Adaptively Directional Wireless Power Transfer for Large-scale Sensor Networks

Wireless power transfer (WPT) prolongs the lifetime of wireless sensor network by providing sustainable power supply to the distributed sensor nodes (SNs) via electromagnetic waves. To improve the energy transfer efficiency in a large WPT system, this paper proposes an adaptively directional WPT (AD-WPT) scheme, where the power beacons (PBs) adapt the energy beamforming strategy to SNs' locations by concentrating the transmit power on the nearby SNs within the efficient charging radius. With the aid of stochastic geometry, we derive the closed-form expressions of the distribution metrics of the aggregate received power at a typical SN and further approximate the complementary cumulative distribution function using Gamma distribution with second-order moment matching. To design the charging radius for the optimal AD-WPT operation, we exploit the tradeoff between the power intensity of the energy beams and the number of SNs to be charged. Depending on different SN task requirements, the optimal AD-WPT can maximize the average received power or the active probability of the SNs, respectively. It is shown that both the maximized average received power and the maximized sensor active probability increase with the increased deployment density and transmit power of the PBs, and decrease with the increased density of the SNs and the energy beamwidth. Finally, we show that the optimal AD-WPT can significantly improve the energy transfer efficiency compared with the traditional omnidirectional WPT.

preprint2015arXiv

Baselining Network-Wide Traffic by Time-Frequency Constrained Stable Principal Component Pursuit

The Internet traffic analysis is important to network management,and extracting the baseline traffic patterns is especially helpful for some significant network applications.In this paper, we study on the baseline problem of the traffic matrix satisfying a refined traffic matrix decomposition model,since this model extends the assumption of the baseline traffic component to characterize its smoothness, and is more realistic than the existing traffic matrix models. We develop a novel baseline scheme, named Stable Principal Component Pursuit with Time-Frequency Constraints (SPCP-TFC), which extends the Stable Principal Component Pursuit (SPCP) by applying new time-frequency constraints. Then we design an efficient numerical algorithm for SPCP-TFC. At last, we evaluate this baseline scheme through simulations, and show it has superior performance than the existing baseline schemes RBL and PCA.

preprint2015arXiv

Better Exploiting OS-CNNs for Better Event Recognition in Images

Event recognition from still images is one of the most important problems for image understanding. However, compared with object recognition and scene recognition, event recognition has received much less research attention in computer vision community. This paper addresses the problem of cultural event recognition in still images and focuses on applying deep learning methods on this problem. In particular, we utilize the successful architecture of Object-Scene Convolutional Neural Networks (OS-CNNs) to perform event recognition. OS-CNNs are composed of object nets and scene nets, which transfer the learned representations from the pre-trained models on large-scale object and scene recognition datasets, respectively. We propose four types of scenarios to explore OS-CNNs for event recognition by treating them as either "end-to-end event predictors" or "generic feature extractors". Our experimental results demonstrate that the global and local representations of OS-CNNs are complementary to each other. Finally, based on our investigation of OS-CNNs, we come up with a solution for the cultural event recognition track at the ICCV ChaLearn Looking at People (LAP) challenge 2015. Our team secures the third place at this challenge and our result is very close to the best performance.

preprint2015arXiv

DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection

In this paper, we propose deformable deep convolutional neural networks for generic object detection. This new deep learning object detection framework has innovations in multiple aspects. In the proposed new deep architecture, a new deformation constrained pooling (def-pooling) layer models the deformation of object parts with geometric constraint and penalty. A new pre-training strategy is proposed to learn feature representations more suitable for the object detection task and with good generalization capability. By changing the net structures, training strategies, adding and removing some key components in the detection pipeline, a set of models with large diversity are obtained, which significantly improves the effectiveness of model averaging. The proposed approach improves the mean averaged precision obtained by RCNN \cite{girshick2014rich}, which was the state-of-the-art, from 31\% to 50.3\% on the ILSVRC2014 detection test set. It also outperforms the winner of ILSVRC2014, GoogLeNet, by 6.1\%. Detailed component-wise analysis is also provided through extensive experimental evaluation, which provide a global view for people to understand the deep learning object detection pipeline.

preprint2015arXiv

Design, characterization, and sensitivity of the supernova trigger system at Daya Bay

Providing an early warning of galactic supernova explosions from neutrino signals is important in studying supernova dynamics and neutrino physics. A dedicated supernova trigger system has been designed and installed in the data acquisition system at Daya Bay and integrated into the worldwide Supernova Early Warning System (SNEWS). Daya Bay's unique feature of eight identically-designed detectors deployed in three separate experimental halls makes the trigger system naturally robust against cosmogenic backgrounds, enabling a prompt analysis of online triggers and a tight control of the false-alert rate. The trigger system is estimated to be fully sensitive to 1987A-type supernova bursts throughout most of the Milky Way. The significant gain in sensitivity of the eight-detector configuration over a mass-equivalent single detector is also estimated. The experience of this online trigger system is applicable to future projects with spatially distributed detectors.

preprint2015arXiv

Internet Traffic Matrix Structural Analysis Based on Multi-Resolution RPCA

The Internet traffic matrix plays a significant roll in network operation and management, therefore, the structural analysis of traffic matrix, which decomposes different traffic components of this high-dimensional traffic dataset, is quite valuable to some network applications. In this study, based on the Robust Principal Component Analysis (RPCA) theory, a novel traffic matrix structural analysis approach named Multi-Resolution RPCA is created, which utilizes the wavelet multi-resolution analysis. Firstly, we build the Multi-Resolution Traffic Matrix Decomposition Model (MR-TMDM), which characterizes the smoothness of the deterministic traffic by its wavelet coefficients. Secondly, based on this model, we improve the Stable Principal Component Pursuit (SPCP), propose a new traffic matrix decomposition method named SPCP-MRC with Multi-Resolution Constraints, and design its numerical algorithm. Specifically, we give and prove the closed-form solution to a sub-problem in the algorithm. Lastly, we evaluate different traffic decomposition methods by multiple groups of simulated traffic matrices containing different kinds of anomalies and distinct noise levels. It is demonstrated that SPCP-MRC, compared with other methods, achieves more accurate and more reasonable traffic decompositions.

preprint2015arXiv

Neutrino Physics with JUNO

The Jiangmen Underground Neutrino Observatory (JUNO), a 20 kton multi-purpose underground liquid scintillator detector, was proposed with the determination of the neutrino mass hierarchy as a primary physics goal. It is also capable of observing neutrinos from terrestrial and extra-terrestrial sources, including supernova burst neutrinos, diffuse supernova neutrino background, geoneutrinos, atmospheric neutrinos, solar neutrinos, as well as exotic searches such as nucleon decays, dark matter, sterile neutrinos, etc. We present the physics motivations and the anticipated performance of the JUNO detector for various proposed measurements. By detecting reactor antineutrinos from two power plants at 53-km distance, JUNO will determine the neutrino mass hierarchy at a 3-4 sigma significance with six years of running. The measurement of antineutrino spectrum will also lead to the precise determination of three out of the six oscillation parameters to an accuracy of better than 1\%. Neutrino burst from a typical core-collapse supernova at 10 kpc would lead to ~5000 inverse-beta-decay events and ~2000 all-flavor neutrino-proton elastic scattering events in JUNO. Detection of DSNB would provide valuable information on the cosmic star-formation rate and the average core-collapsed neutrino energy spectrum. Geo-neutrinos can be detected in JUNO with a rate of ~400 events per year, significantly improving the statistics of existing geoneutrino samples. The JUNO detector is sensitive to several exotic searches, e.g. proton decay via the $p\to K^++\barν$ decay channel. The JUNO detector will provide a unique facility to address many outstanding crucial questions in particle and astrophysics. It holds the great potential for further advancing our quest to understanding the fundamental properties of neutrinos, one of the building blocks of our Universe.

preprint2015arXiv

Object-Scene Convolutional Neural Networks for Event Recognition in Images

Event recognition from still images is of great importance for image understanding. However, compared with event recognition in videos, there are much fewer research works on event recognition in images. This paper addresses the issue of event recognition from images and proposes an effective method with deep neural networks. Specifically, we design a new architecture, called Object-Scene Convolutional Neural Network (OS-CNN). This architecture is decomposed into object net and scene net, which extract useful information for event understanding from the perspective of objects and scene context, respectively. Meanwhile, we investigate different network architectures for OS-CNN design, and adapt the deep (AlexNet) and very-deep (GoogLeNet) networks to the task of event recognition. Furthermore, we find that the deep and very-deep networks are complementary to each other. Finally, based on the proposed OS-CNN and comparative study of different network architectures, we come up with a solution of five-stream CNN for the track of cultural event recognition at the ChaLearn Looking at People (LAP) challenge 2015. Our method obtains the performance of 85.5% and ranks the $1^{st}$ place in this challenge.

preprint2015arXiv

Polar Dynamics at the Jahn-Teller Transition in Ferroelectric GaV4S8

We present a dielectric spectroscopy study of the polar dynamics linked to the orbitally driven ferroelectric transition in the skyrmion host GaV4S8. By combining THz and MHz-GHz spectroscopy techniques, we succeed in detecting the relaxational dynamics arising from coupled orbital and polar fluctuations in this material and traced its temperature dependence in the paraelectric as well as in the ferroelectric phase. The relaxation time significantly increases when approaching the critical temperature from both sides of the transition. It is natural to assume that these polar fluctuations map the orbital dynamics at the Jahn-Teller transition. Due to the first-order character of the orbital-ordering transition, the relaxation time shows an enormous jump of about five orders of magnitude at the polar and structural phase transition.

preprint2015arXiv

Search for heavy neutrinos in $K^+\toμ^+ν_H$ decays

Evidence of a heavy neutrino, $ν_H$, in the $K^+\toμ^+ν_H$ decays was sought using the E949 experimental data with an exposure of $1.70\times 10^{12}$ stopped kaons. With the major background from the radiative $K^+\toμ^+ν_μγ$ decay understood and suppressed, upper limits (90% C.L.) on the neutrino mixing matrix element between muon and heavy neutrino, $|U_{μH}|^2$, were set at the level of $10^{-7}$ to $10^{-9}$ for the heavy neutrino mass region 175 to 300 MeV/$c^2$.

preprint2015arXiv

Spin-orbiton and quantum criticality in FeSc2S4

In FeSc2S4 spin-orbital exchange competes with strong spin-orbit coupling, suppressing long-range spin and orbital order and, hence, this material represents one of the rare examples of a spin-orbital liquid ground state. Moreover, it is close to a quantum-critical point separating the ordered and disordered regimes. Using THz and FIR spectroscopy we study low-lying excitations in FeSc2S4 and provide clear evidence for a spin-orbiton, an excitation of strongly entangled spins and orbitals. It becomes particularly well pronounced upon cooling, when advancing deep into the quantum-critical regime. Moreover, indications of an underlying structureless excitation continuum are found, a possible signature of quantum criticality.

preprint2015arXiv

Spinon Confinement in the One-Dimensional Ising-Like Antiferromagnet SrCo2V2O8

For quasi-one dimensional quantum spin systems theory predicts the occurrence of a confinement of spinon excitation due to interchain couplings. Here we investigate the system SrCo2V2O8, a realization of the weakly-coupled Ising-like XXZ antiferromagnetic chains, by terahertz spectroscopy with and without applied magnetic field. At low temperatures a series of excitations is observed, which split in a Zeeman-like fashion in an applied magnetic field. These magnetic excitations are identified as the theoretically predicted spinon-pair excitations. Using a one dimensional Schrödinger equation with a linear confinement potential imposed by weak interchain couplings, the hierarchy of the confined spinons can be fully described.

preprint2015arXiv

Strong interface-induced spin-orbit coupling in graphene on WS2

Interfacial interactions allow the electronic properties of graphene to be modified, as recently demonstrated by the appearance of satellite Dirac cones in the band structure of graphene on hexagonal boron nitride (hBN) substrates. Ongoing research strives to explore interfacial interactions in a broader class of materials in order to engineer targeted electronic properties. Here we show that at an interface with a tungsten disulfide (WS2) substrate, the strength of the spin-orbit interaction (SOI) in graphene is very strongly enhanced. The induced SOI leads to a pronounced low-temperature weak anti-localization (WAL) effect, from which we determine the spin-relaxation time. We find that spin-relaxation time in graphene is two-to-three orders of magnitude smaller on WS2 than on SiO2 or hBN, and that it is comparable to the intervalley scattering time. To interpret our findings we have performed first-principle electronic structure calculations, which both confirm that carriers in graphene-on-WS2 experience a strong SOI and allow us to extract a spin-dependent low-energy effective Hamiltonian. Our analysis further shows that the use of WS2 substrates opens a possible new route to access topological states of matter in graphene-based systems.

preprint2015arXiv

Towards Good Practices for Very Deep Two-Stream ConvNets

Deep convolutional networks have achieved great success for object recognition in still images. However, for action recognition in videos, the improvement of deep convolutional networks is not so evident. We argue that there are two reasons that could probably explain this result. First the current network architectures (e.g. Two-stream ConvNets) are relatively shallow compared with those very deep models in image domain (e.g. VGGNet, GoogLeNet), and therefore their modeling capacity is constrained by their depth. Second, probably more importantly, the training dataset of action recognition is extremely small compared with the ImageNet dataset, and thus it will be easy to over-fit on the training dataset. To address these issues, this report presents very deep two-stream ConvNets for action recognition, by adapting recent very deep architectures into video domain. However, this extension is not easy as the size of action recognition is quite small. We design several good practices for the training of very deep two-stream ConvNets, namely (i) pre-training for both spatial and temporal nets, (ii) smaller learning rates, (iii) more data augmentation techniques, (iv) high drop out ratio. Meanwhile, we extend the Caffe toolbox into Multi-GPU implementation with high computational efficiency and low memory consumption. We verify the performance of very deep two-stream ConvNets on the dataset of UCF101 and it achieves the recognition accuracy of $91.4\%$.

preprint2014arXiv

A model combining spectrum standardization and dominant factor based partial least square method for carbon analysis in coal by laser-induced breakdown spectroscopy

Successful quantitative measurement of carbon content in coal using laser-induced breakdown spectroscopy (LIBS) is suffered from relatively low precision and accuracy. In the present work, the spectrum standardization method was combined with the dominant factor based partial least square (PLS) method to improve the measurement accuracy of carbon content in coal by LIBS. The combination model employed the spectrum standardization method to convert the carbon line intensity into standard state for more accurately calculating the dominant carbon concentration, and then applied PLS with full spectrum information to correct the residual errors. The combination model was applied to the measurement of carbon content for 24 bituminous coal samples. The results demonstrated that the combination model could further improve the measurement accuracy compared with both our previously established spectrum standardization model and dominant factor based PLS model using spectral area normalized intensity for the dominant factor model. For example, the coefficient of determination (R2), the root-mean-square error of prediction (RMSEP), and the average relative error (ARE) for the combination model were 0.99, 1.75%, and 2.39%, respectively; while those values for the spectrum standardization method were 0.83, 2.71%, and 3.40%, respectively; and those values for the dominant factor based PLS model were 0.99, 2.66%, and 3.64%, respectively.

preprint2014arXiv

A Precise Calculation of Delayed Coincidence Selection Efficiency and Accidental Coincidence Rate

A model is proposed to address issues on the precise background evaluation due to the complex data structure defined by the delayed coincidence method, which is widely used in reactor electron-antineutrino oscillation experiments. In this model, the effects from the muon veto, uncorrelated random background, coincident signal and background are all studied with the analytical solutions, simplifying the estimation of the systematic uncertainties of signal efficiency and accidental background rate determined by the unstable single rate. The result of calculation is validated numerically with a number of simulation studies and is also applied and validated in the recent Daya Bay hydrogen-capture based oscillation measurement.

preprint2014arXiv

Acyclicity Notions for Existential Rules and Their Application to Query Answering in Ontologies

Answering conjunctive queries (CQs) over a set of facts extended with existential rules is a prominent problem in knowledge representation and databases. This problem can be solved using the chase algorithm, which extends the given set of facts with fresh facts in order to satisfy the rules. If the chase terminates, then CQs can be evaluated directly in the resulting set of facts. The chase, however, does not terminate necessarily, and checking whether the chase terminates on a given set of rules and facts is undecidable. Numerous acyclicity notions were proposed as sufficient conditions for chase termination. In this paper, we present two new acyclicity notions called model-faithful acyclicity (MFA) and model-summarising acyclicity (MSA). Furthermore, we investigate the landscape of the known acyclicity notions and establish a complete taxonomy of all notions known to us. Finally, we show that MFA and MSA generalise most of these notions. Existential rules are closely related to the Horn fragments of the OWL 2 ontology language; furthermore, several prominent OWL 2 reasoners implement CQ answering by using the chase to materialise all relevant facts. In order to avoid termination problems, many of these systems handle only the OWL 2 RL profile of OWL 2; furthermore, some systems go beyond OWL 2 RL, but without any termination guarantees. In this paper we also investigate whether various acyclicity notions can provide a principled and practical solution to these problems. On the theoretical side, we show that query answering for acyclic ontologies is of lower complexity than for general ontologies. On the practical side, we show that many of the commonly used OWL 2 ontologies are MSA, and that the number of facts obtained by materialisation is not too large. Our results thus suggest that principled development of materialisation-based OWL 2 reasoners is practically feasible.

preprint2014arXiv

DeepID-Net: multi-stage and deformable deep convolutional neural networks for object detection

In this paper, we propose multi-stage and deformable deep convolutional neural networks for object detection. This new deep learning object detection diagram has innovations in multiple aspects. In the proposed new deep architecture, a new deformation constrained pooling (def-pooling) layer models the deformation of object parts with geometric constraint and penalty. With the proposed multi-stage training strategy, multiple classifiers are jointly optimized to process samples at different difficulty levels. A new pre-training strategy is proposed to learn feature representations more suitable for the object detection task and with good generalization capability. By changing the net structures, training strategies, adding and removing some key components in the detection pipeline, a set of models with large diversity are obtained, which significantly improves the effectiveness of modeling averaging. The proposed approach ranked \#2 in ILSVRC 2014. It improves the mean averaged precision obtained by RCNN, which is the state-of-the-art of object detection, from $31\%$ to $45\%$. Detailed component-wise analysis is also provided through extensive experimental evaluation.

preprint2014arXiv

Establishing Global Policies over Decentralized Online Social Networks

Conventional online social networks (OSNs) are implemented in a centralized manner. Although centralization is a convenient way for implementing OSNs, it has several well known drawbacks. Chief among them are the risks they pose to the security and privacy of the information maintained by the OSN; and the loss of control over the information contributed by individual members. These concerns prompted several attempts to create decentralized OSNs, or DOSNs. The basic idea underlying these attempts, is that each member of a social network keeps its data under its own control, instead of surrendering it to a central host; providing access to it to other members of the OSN according to its own access-control policy. Unfortunately all existing DOSN projects have a very serious limitation. Namely, they are unable to subject the membership of a DOSN, and the interaction between its members, to any global policy. We adopt the decentralization idea underlying DOSNs, complementing it with a means for specifying and enforcing a wide range of policies over the membership of a social community, and over the interaction between its disparate distributed members. And we do so in a scalable fashion.

preprint2014arXiv

Hete-CF: Social-Based Collaborative Filtering Recommendation using Heterogeneous Relations

Collaborative filtering algorithms haven been widely used in recommender systems. However, they often suffer from the data sparsity and cold start problems. With the increasing popularity of social media, these problems may be solved by using social-based recommendation. Social-based recommendation, as an emerging research area, uses social information to help mitigate the data sparsity and cold start problems, and it has been demonstrated that the social-based recommendation algorithms can efficiently improve the recommendation performance. However, few of the existing algorithms have considered using multiple types of relations within one social network. In this paper, we investigate the social-based recommendation algorithms on heterogeneous social networks and proposed Hete-CF, a Social Collaborative Filtering algorithm using heterogeneous relations. Distinct from the exiting methods, Hete-CF can effectively utilize multiple types of relations in a heterogeneous social network. In addition, Hete-CF is a general approach and can be used in arbitrary social networks, including event based social networks, location based social networks, and any other types of heterogeneous information networks associated with social information. The experimental results on two real-world data sets, DBLP (a typical heterogeneous information network) and Meetup (a typical event based social network) show the effectiveness and efficiency of our algorithm.

preprint2014arXiv

High-field spectroscopy of singlet-triplet transitions in the spin-dimer systems Sr3Cr2O8 and Ba3Cr2O8

Magnetic excitations in the isostructural spin-dimer systems Sr3Cr2O8 and Ba3Cr2O8 are probed by means of high-field electron spin resonance at sub-terahertz frequencies. Three types of magnetic modes were observed. One mode is gapless and corresponds to transitions within excited states, while two other sets of modes are gapped and correspond to transitions from the ground to the first excited states. The selection rules of the gapped modes are analyzed in terms of a dynamical Dzyaloshinskii-Moriya interaction, suggesting the presence of phonon-assisted effects in the low-temperature spin dynamics of Sr3Cr2O8 and Ba3Cr2O8

preprint2014arXiv

Magnetic Proximity Effect and Interlayer Exchange Coupling of Ferromagnetic/Topological Insulator/Ferromagnetic Trilayer

Magnetic proximity effect between topological insulator (TI) and ferromagnetic insulator (FMI) is considered to have great potential in spintronics. However, a complete determination of interfacial magnetic structure has been highly challenging. We theoretically investigate the interlayer exchange coupling of two FMIs separated by a TI thin film, and show that the particular electronic states of the TI contributing to the proximity effect can be directly identified through the coupling behavior between two FMIs, together with a tunability of coupling constant. Such FMI/TI/FMI structure not only serves as a platform to clarify the magnetic structure of FMI/TI interface, but also provides insights into designing the magnetic storage devices with ultrafast response.

preprint2014arXiv

MRF denoising with compressed sensing and adaptive filtering

The recently proposed Magnetic Resonance Fingerprinting (MRF) technique can simultaneously estimate multiple parameters through dictionary matching. It has promising potentials in a wide range of applications. However, MRF introduces errors due to undersampling during the data acquisition process and the limit of dictionary resolution. In this paper, we investigate the error source of MRF and propose the technologies of improving the quality of MRF with compressed sensing, error prediction by decision trees, and adaptive filtering. Experimental results support our observations and show significant improvement of the proposed technologies.

preprint2014arXiv

Multiuser Joint Energy-Bandwidth Allocation with Energy Harvesting - Part I: Optimum Algorithm & Multiple Point-to-Point Channels

In this paper, we develop optimal energy-bandwidth allocation algorithms in fading channels for multiple energy harvesting transmitters, each may communicate with multiple receivers via orthogonal channels. We first assume that the side information of both the channel states and the energy harvesting states is known for $K$ time slots {\em a priori}, and the battery capacity and the maximum transmission power in each time slot are bounded. The objective is to maximize the weighted sum-rate of all transmitters over the $K$ time slots by assigning the transmission power and bandwidth for each transmitter in each slot. The problem is formulated as a convex optimization problem with ${\cal O}(MK)$ constraints, where $M$ is the number of the receivers, making it hard to solve with a generic convex solver. An iterative algorithm is proposed that alternatively solves two subproblems in each iteration. The convergence and the optimality of this algorithm are also shown. We then consider the special case that each transmitter only communicates with one receiver and the objective is to maximize the total throughput. We develop efficient algorithms for solving the two subproblems and the optimal energy-bandwidth allocation can be obtained with an overall complexity of ${\cal O}(MK^2)$. Moreover, a heuristic algorithm is also proposed for energy-bandwidth allocation based on causal information of channel and energy harvesting states.

preprint2014arXiv

Multiuser Joint Energy-Bandwidth Allocation with Energy Harvesting - Part II: Multiple Broadcast Channels & Proportional Fairness

In this paper, we consider the energy-bandwidth allocation for a network with multiple broadcast channels, where the transmitters access the network orthogonally on the assigned frequency band and each transmitter communicates with multiple receivers orthogonally or non-orthogonally. We assume that the energy harvesting state and channel gain of each transmitter can be predicted for $K$ slots {\em a priori}. To maximize the weighted throughput, we formulate an optimization problem with $O(MK)$ constraints, where $M$ is the number of the receivers, and decompose it into the energy and bandwidth allocation subproblems. In order to use the iterative algorithm proposed in [1] to solve the problem, we propose efficient algorithms to solve the two subproblems, so that the optimal energy-bandwidth allocation can be obtained with an overall complexity of ${\cal O}(MK^2)$, even though the problem is non-convex when the broadcast channel is non-orthogonal. For the orthogonal broadcast channel, we further formulate a proportionally-fair (PF) throughput maximization problem and derive the equivalence conditions such that the optimal solution can be obtained by solving a weighted throughput maximization problem. Further, the algorithm to obtain the proper weights is proposed. Simulation results show that the proposed algorithm can make efficient use of the harvested energy and the available bandwidth, and achieve significantly better performance than some heuristic policies for energy and bandwidth allocation. Moreover, it is seen that with energy-harvesting transmitters, non-orthogonal broadcast offers limited gain over orthogonal broadcast.

preprint2014arXiv

Network of Time-Multiplexed Optical Parametric Oscillators as a Coherent Ising Machine

Finding the ground states of the Ising Hamiltonian [1] maps to various combinatorial optimization problems in biology, medicine, wireless communications, artificial intelligence, and social network. So far no efficient classical and quantum algorithm is known for these problems, and intensive research is focused on creating physical systems - Ising machines - capable of finding the absolute or approximate ground states of the Ising Hamiltonian [2-6]. Here we report a novel Ising machine using a network of degenerate optical parametric oscillators (OPOs). Spins are represented with above-threshold binary phases of the OPOs and the Ising couplings are realized by mutual injections [7]. The network is implemented in a single OPO ring cavity with multiple trains of femtosecond pulses and configurable mutual couplings, and operates at room temperature. We programed the smallest non-deterministic polynomial time (NP)- hard Ising problem on the machine, and in 1000 runs of the machine no computational error was detected.

preprint2014arXiv

The application of spectrum standardization method for carbon analysis in coal using laser-induced breakdown spectroscopy

Measurements of carbon content in coal using laser-induced breakdown spectroscopy (LIBS) is limited by its low measurement precision and accuracy. A spectrum standardization method was proposed to achieve both reproducible and accurate results for the quantitative analysis of carbon content in coal with LIBS. The proposed method utilized the molecular carbon emissions to compensate the diminution of atomic carbon emission caused by matrix effect. The compensated carbon line intensities were further converted into an assumed standard state with fixed plasma temperature, electron density, and total number density of elemental carbon, which is proportional to its concentration in the coal samples. In addition, in order to obtained better compensation for total carbon number density fluctuations, an iterative algorithm was applied, which is different from our previous standardization calculations. The modified spectrum standardization model was applied to the measurement of carbon content in 24 bituminous coal samples. The results demonstrated that the proposed method had superior performance over the generally applied normalization methods. The average relative standard deviation, the coefficient of determination, the root-mean-square error of prediction, and the average maximum relative error for the modified model were 3.44%, 0.83, 2.71%, and 12.61%, respectively, while the corresponding values for the normalization with segmental spectrum area were 6.00%, 0.75, 3.77%, and 15.40%, respectively, showing an overwhelming improvement.

preprint2014arXiv

Tunable THz Surface Plasmon Polariton based on Topological Insulator-Layered Superconductor Hybrid Structure

We theoretically investigate the surface plasmon polariton (SPP) at the interface between 3D strong topological insulator (TI) and layered superconductor-magnetic insulator structure. The tunability of SPP through electronic doping can be enhanced when the magnetic permeability of the layered structure becomes higher. When the interface is gapped by superconductivity or perpendicular magnetism, SPP dispersion is further distorted, accompanied by a shift of group velocity and penetration depth. Such a shift of SPP reaches maximum when the magnitude of Fermi level approaches the gap value, and may lead to observable effects. The tunable SPP at the interface between layered superconductor and magnetism materials in proximity to TI surface may provide new insight in the detection of Majorana Fermions.

preprint2013arXiv

A Coherent Ising Machine Based On Degenerate Optical Parametric Oscillators

A degenerate optical parametric oscillator network is proposed to solve the NP-hard problem of finding a ground state of the Ising model. The underlying operating mechanism originates from the bistable output phase of each oscillator and the inherent preference of the network in selecting oscillation modes with the minimum photon decay rate. Computational experiments are performed on all instances reducible to the NP-hard MAX-CUT problems on cubic graphs of order up to 20. The numerical results reasonably suggest the effectiveness of the proposed network.

preprint2013arXiv

Exciton-magnon transitions in the frustrated chromium antiferromagnets CuCrO2, alpha-CaCr2O4, CdCr2O4, and ZnCr2O4

We report on optical transmission spectroscopy of the Cr-based frustrated triangular antiferromagnets CuCrO2 and alpha-CaCr2O4, and the spinels CdCr2O4 and ZnCr2O4 in the near-infrared to visible-light frequency range. We explore the possibility to search for spin correlations far above the magnetic ordering temperature and for anomalies in the magnon lifetime in the magnetically ordered state by probing exciton-magnon sidebands of the spin-forbidden crystal-field transitions of the Cr3+ ions (spin S = 3/2). In CuCrO2 and alpha-CaCr2O4 the appearance of fine structures below T_N is assigned to magnon sidebands by comparison with neutron scattering results. The temperature dependence of the line width of the most intense sidebands in both compounds can be described by an Arrhenius law. For CuCrO2 the sideband associated with the 4A2 -> 2T2 transition can be observed even above T_N. Its line width does not show a kink at the magnetic ordering temperature and can alternatively be described by a Z2 vortex scenario proposed previously for similar materials. The exciton-magnon features in alpha-CaCr2O4 are more complex due to the orthorhombic distortion. While for CdCr2O4 magnon sidebands are identified below T_N and one sideband excitation is found to persist across the magnetic ordering transition, only a weak fine structure related to magnetic ordering has been observed in ZnCr2O4.

preprint2013arXiv

Genetic Algorithm with Ensemble Learning for Detecting Community Structure in Complex Networks

Community detection in complex networks is a topic of considerable recent interest within the scientific community. For dealing with the problem that genetic algorithm are hardly applied to community detection, we propose a genetic algorithm with ensemble learning (GAEL) for detecting community structure in complex networks. GAEL replaces its traditional crossover operator with a multi-individual crossover operator based on ensemble learning. Therefore, GAEL can avoid the problems that are brought by traditional crossover operator which is only able to mix string blocks of different individuals, but not able to recombine clustering contexts of different individuals into new better ones. In addition, the local search strategy, which makes mutated node be placed into the community where most of its neighbors are, is used in mutation operator. At last, a Markov random walk based method is used to initialize population in this paper, and it can provide us a population of accurate and diverse clustering solutions. Those diverse and accurate individuals are suitable for ensemble learning based multi-individual crossover operator. The proposed GAEL is tested on both computer-generated and real-world networks, and compared with current representative algorithms for community detection in complex networks. Experimental results demonstrate that GAEL is highly effective at discovering community structure.

preprint2013arXiv

Low-energy magnetic excitations in the quasi-one-dimensional spin-1 chain compound SrNi2V2O8

Multi-frequency electron spin resonance (ESR) transmission spectra have been measured as function of temperature and magnetic field on single crystals of the quasi-one-dimensional spin-1 chain compound SrNi2V2O8 in the GHz frequency range. Magnetic resonance modes above 50 K have been observed with an effective g-factor of 2.24 at 100 K. Below 30 K, intra-triplet excitations have been observed in the ESR spectra, which reveal the presence of single-ion anisotropy with D = -0.29 meV.

preprint2013arXiv

Orbital-selective metal-insulator transition and gap formation above Tc in superconducting Rb1-xFe2-ySe2

We report on a hierarchy of temperatures Tc < Tgap < Tmet in superconducting Rb1-xFe2-ySe2 observed by THz spectroscopy. Above Tmet = 90 K the material reveals semiconducting characteristics. Below Tmet a coherent metallic THz response emerges. This metal-to-insulator type, orbital selective transition is clearly indicated by an isosbestic point in the temperature dependence of the optical conductivity and dielectric constant at THz-frequencies. At Tgap = 61 K a gap opens in the THz regime and then the superconducting transition occurs at Tc = 32 K. This sequence of temperatures seems to reflect a corresponding hierarchy of the electronic correlations in the different bands.

preprint2013arXiv

Real Space Renormalization in Statistical Mechanics

This paper discusses methods for the construction of approximate real space renormalization transformations in statistical mechanics. In particular, it compares two methods of transformation: the "potential-moving" approach most used in the period 1975-1980 and the "rewiring method" as it has been developed in the last five years. These methods both employ a parameter, called χ, that measures the complexity of the localized stochastic variable forming the basis of the analysis. Both methods are here exemplified by calculations in terms of fixed points for the smallest possible values of χ. These calculations describe three models for two-dimensional systems: The Ising model solved by Onsager, the tricritical point of that model, and the three-state Potts model. The older method, often described as lower bound renormalization theory, provides a heuristic method giving reasonably accurate results for critical indices at the lowest degree of complexity, i.e. χ=2. In contrast, the rewiring method, employing "singular value decomposition", does not perform as well for low χvalues but offers an error that apparently decreases slowly toward zero as χis increased. It appears likely that no such improvement occurs in the older approach. A detailed comparison of the two methods is performed, with a particular eye to describing the reasons why they are so different. For example, the old method quite naturally employed fixed points for its analysis; these are hard to use in the newer approach. A discussion is given of why the fixed point approach proves to be hard in this context. In the new approach the calculated the thermal critical indices are satisfactory for the smallest values of χbut hardly improve as χis increased, while the magnetic critical indices do not agree well with the known theoretical values.

preprint2012arXiv

An Improved Traffic Matrix Decomposition Method with Frequency-Domain Regularization

We propose a novel network traffic matrix decomposition method named Stable Principal Component Pursuit with Frequency-Domain Regularization (SPCP-FDR), which improves the Stable Principal Component Pursuit (SPCP) method by using a frequency-domain noise regularization function. An experiment demonstrates the feasibility of this new decomposition method.

preprint2012arXiv

Electron spin resonance and exchange paths in the orthorhombic dimer system Sr2VO4

We report on magnetization and electron spin resonance (ESR) measurements of Sr$_{2}$VO$_4$ with orthorhombic symmetry. In this dimer system the $V^{4+}$ ions are in tetrahedral environment and are coupled by an antiferromagnetic intra-dimer exchange constant $J/k_B \approx$ 100 K to form a singlet ground state without any phase transitions between room temperature and 2 K. Based on an extended-Hückel-Tight-Binding analysis we identify the strongest exchange interaction to occur between two inequivalent vanadium sites via two intermediate oxygen ions. The ESR absorption spectra can be well described by a single Lorentzian line with an effective g-factor $g$ = 1.89. The temperature dependence of the ESR intensity is well described by a dimer model in agreement with the magnetization data. The temperature dependence of the ESR linewidth can be modeled by a superposition of a linear increase with temperature with a slope $α$ = 1.35 Oe/K and a thermally activated behavior with an activation energy $Δ/k_B$ = 1418 K, both of which point to spin-phonon coupling as the dominant relaxation mechanism in this compound.

preprint2012arXiv

Infrared phonons and specific heat in Ba3Cr2O8

We report on the phonon spectrum of Ba3Cr2O8 determined by infrared spectroscopy, and on specific heat measurements across the Jahn-Teller transition in magnetic fields up to 9 T. Phonon modes split below the Jahn-Teller transition, which occurs at T_{JT} = 70 K as detected by specific heat measurements. The field-dependent specific heat data is analyzed in terms of the contributions from lattice, magnetic and orbital degrees of freedom. In contrast to the isostructural compound Sr3Cr2O8 our analysis does not indicate the existence of orbital fluctuations below the Jahn-Teller transition in Ba3Cr2O8.

preprint2012arXiv

Orbital fluctuations and orbital order below the Jahn-Teller transition in Sr3Cr2O8

We report on the magnetic and phononic excitation spectrum of Sr3Cr2O8 determined by THz and infrared (IR) spectroscopy, and electron spin resonance (ESR) measurements across the Jahn-Teller transition, which is detected by specific-heat measurements to occur at T_{JT} = 285 K. We identify the singlet-triplet excitations in the dimerized ground state and estimate the exchange couplings in the system. Moreover, ESR absorptions were observed up to T* = 120 K with a linewidth proportional to exp{-Delta/k_{B}T} and Delta/k_{B} = 388 K indicating a phonon-mediated spin relaxation via the excited orbital state of the Cr $e$ doublet in the orbitally ordered state. In contrast to the expected drastic change of the IR active phonons upon entering the low-symmetry Jahn-Teller distorted phase below T_{JT}, we find an extended regime T*<T<T_{JT} where the IR active phonons change only gradually with decreasing temperature. This regime is associated with strong fluctuations in the orbital and lattice degrees of freedom in agreement with the loss of the ESR signal above T*. Using the measured magnetic and phononic excitation spectrum we model the orbital contribution to the specific heat and find the persistence of strong fluctuations far below T_{JT}.

preprint2012arXiv

Phonon-induced dephasing of chromium colour centres in diamond

We report on the coherence properties of single photons from chromium-based colour centres in diamond. We use field-correlation and spectral lineshape measurements to reveal the interplay between slow spectral wandering and fast dephasing mechanisms as a function of temperature. We show that the zero-phonon transition frequency and its linewidth follow a power-law dependence on temperature indicating that the dominant fast dephasing mechanisms for these centres are direct electron-phonon coupling and phonon-modulated Coulomb coupling to nearby impurities. Further, the observed reduction in the quantum yield for photon emission as a function of temperature is consistent with the opening of additional nonradiative channels through thermal activation to higher energy states predominantly and indicates a near-unity quantum efficiency at 4 K.

preprint2012arXiv

Structural Analysis of Network Traffic Matrix via Relaxed Principal Component Pursuit

The network traffic matrix is widely used in network operation and management. It is therefore of crucial importance to analyze the components and the structure of the network traffic matrix, for which several mathematical approaches such as Principal Component Analysis (PCA) were proposed. In this paper, we first argue that PCA performs poorly for analyzing traffic matrix that is polluted by large volume anomalies, and then propose a new decomposition model for the network traffic matrix. According to this model, we carry out the structural analysis by decomposing the network traffic matrix into three sub-matrices, namely, the deterministic traffic, the anomaly traffic and the noise traffic matrix, which is similar to the Robust Principal Component Analysis (RPCA) problem previously studied in [13]. Based on the Relaxed Principal Component Pursuit (Relaxed PCP) method and the Accelerated Proximal Gradient (APG) algorithm, we present an iterative approach for decomposing a traffic matrix, and demonstrate its efficiency and flexibility by experimental results. Finally, we further discuss several features of the deterministic and noise traffic. Our study develops a novel method for the problem of structural analysis of the traffic matrix, which is robust against pollution of large volume anomalies.

preprint2012arXiv

THz spectroscopy in the pseudo-Kagome system Cu3Bi(SeO3)2O2Br

Terahertz (THz) transmission spectra have been measured as function of temperature and magnetic field on single crystals of Cu3Bi(SeO3)2O2Br. In the time-domain THz spectra without magnetic field, two resonance absorptions are observed below the magnetic ordering temperature T_N~27.4 K. The corresponding resonance frequencies increase with decreasing temperature and reach energies of 1.28 and 1.23 meV at 3.5 K. Multi-frequency electron spin resonance transmission spectra as a function of applied magnetic field show the field dependence of four magnetic resonance modes, which can be modeled as a ferromagnetic resonance including demagnetization and anisotropy effects.

preprint2011arXiv

A Non-linearized PLS Model Based on Multivariate Dominant Factor for Laser-induced Breakdown Spectroscopy Measurements

A multivariate dominant factor based non-linearized PLS model is proposed. The intensities of different lines were taken to construct a multivariate dominant factor model, which describes the dominant concentration information of the measured species. In constructing such a multivariate model, non-linear transformation of multi characteristic line intensities according to the physical mechanisms of lased induced plasma spectrum were made, combined with linear-correlation-based PLS method, to model the nonlinear self-absorption and inter-element interference effects. This enables the linear PLS method to describe non-linear relationship more accurately and provides the statistics-based PLS method with physical backgrounds. Moreover, a secondary PLS is applied utilizing the whole spectra information to further correct the model results. Experiments were conducted using standard brass samples. Taylor expansion was applied to make the nonlinear transformation to describe the self-absorption effect of Cu. Then, line intensities of another two elements, Pb and Zn, were taken into account for inter-element interference. The proposed method shows a significant improvement when compared with conventional PLS model. Results also show that, even compared with the already-improved baseline dominant-factor-based PLS model, the present PLS model based on the multivariate dominant factor yields the same calibration quality (R2=0.999) while decreasing the RMSEP from 2.33% to 1.97%. The overall RMSE was also improved to 1.05% from 1.27%.

preprint2011arXiv

Bed-inventory Overturn Mechanism for Pant-leg Circulating Fluidized Bed Boilers

A numerical model was established to investigate the lateral mass transfer as well as the mechanism of bed-inventory overturn inside a pant-leg circulating fluidized bed (CFB), which are of great importance to maintain safe and efficient operation of the CFB. Results show that the special flow structure in which the solid particle volume fraction along the central line of the pant-leg CFB is relative high enlarges the lateral mass transfer rate and make it more possible for bed inventory overturn. Although the lateral pressure difference generated from lateral mass transfer inhibits continuing lateral mass transfer, providing the pant-leg CFB with self-balancing ability to some extent, the primary flow rate change due to the outlet pressure change often disable the self-balancing ability by continually enhancing the flow rate difference. As the flow rate of the primary air fan is more sensitive to its outlet pressure, it is easier to lead to bed inventory overturn. While when the solid particle is easier to change its flow patter to follow the surrounding air flow,the self-balancing ability is more active.

preprint2011arXiv

Daya Bay Neutrino Experiment: Goal, Progress and Schedule

Daya Bay Neutrino Experiment is dedicated to measuring the last unobserved neutrino mixing angle theta_13. The predicted precision on sin^2(2theta_13) is 0.01 at 90% confidence level. This document briefly reviews the measurement method and detector construction status. The first two anti-neutrino detectors' dry run result is also discussed. The Daya Bay near hall data taking is expected to commence in the summer of 2011 and the data taking of all of the three halls in the summer of 2012.

preprint2011arXiv

Efficient generation of isolated attosecond pulses with high beam-quality by two-color Bessel-Gauss beams

The generation of isolated attosecond pulses with high efficiency and high beam quality is essential for attosec- ond spectroscopy. We numerically investigate the supercontinuum generation in a neutral rare-gas medium driven by a two-color Bessel-Gauss beam. The results show that an efficient smooth supercontinuum in the plateau is obtained after propagation, and the spatial profile of the generated attosecond pulse is Gaussian-like with the divergence angle of 0.1 degree in the far field. This bright source with high beam quality is beneficial for detecting and controlling the microscopic processes on attosecond time scale.

preprint2011arXiv

Magnetization and specific heat of the dimer system CuTe2O5

We report on magnetization and specific heat measurements on single-crystalline CuTe2O5. The experimental data are directly compared to theoretical results for two different spin structures, namely an alternating spin-chain and a two-dimensional (2D) coupled dimer model, obtained by Das et al. [Phys. Rev. B 77, 224437 (2008)]. While the analysis of the specific heat does not allow to distinguish between the two models, the magnetization data is in good agreement with the 2D coupled dimer model.

preprint2011arXiv

Observation of the Meissner state in superconducting arrays of 4-Ångstrom carbon nanotubes

We report clear observations of the magnetic Meissner effect in arrays of superconducting 4 Å carbon nanotubes grown in the linear channels of AlPO4-5 (AFI) zeolite single crystals. Both bulk magnetization and magnetic torque experiments show a clear signature of the lower critical Hc1 transition, a pronounced difference in zero-field cooled and field cooled branches during temperature sweeps below 6K, and signatures of 1D superconducting fluctuations below ~15-18 K. These experiments extend the magnetic phase diagram we obtained previously by resistive experiments [Z. Wang et al., Phys. Rev. B 81, 174530 (2010)] towards low magnetic fields and within the range of zero resistance.

preprint2011arXiv

Simulation Study on neutrino nucleus cross section measurement in Segmented Detector at Spallation Neutron Source

Knowledge of $ν_e$-$\mathrm{Fe}/\mathrm{Pb}$ differential cross sections for $ν_e$ energy below several tens of MeV scale is believed to be crucial in understanding Supernova physics. In a segmented detector at Spallation Neutrino Source, $ν_e$ energy reconstructed from the electron range measurement is strongly affected because of both multiple scattering and electromagnetic showers occurring along the electron passage in target materials. In order to estimate the effect, a simulation study has been performed with a cube block model assuming a perfect tracking precision. The distortion of energy spectrum is observed to be proportional to the atomic number of target material. Feasibility of unfolding the distorted $ν_e$ energy spectrum is studied for both Fe and Pb cases. Evaluation of statistical accuracy attainable is therefore provided for a segmented detector.

preprint2011arXiv

Singlet-triplet excitations and high-field magnetization in CuTe2O5

By measuring the THz electron spin resonance (ESR) transmission spectra and high-field magnetization on the spin-gapped system CuTe$_2$O$_5$, we identified the singlet-triplet excitations in the dimerized non-magnetic ground state. The determined spin-gap value of $hν_0=4.94$ meV at the $Γ$ point ($\mathbf{Q}\simeq\mathbf{0}$) is significantly smaller than the strongest antiferromagnetic exchange interaction between the Cu ions predicted by theoretical investigations. We also observed the critical field $H_{c1}^{a^*}=37.6$ T for \textbf{H} $\bot$ \emph{bc}-plane and $H_{c1}^{bc}=40.6$ T for \textbf{H} $\|$ \emph{bc}-plane at the onset of non-zero magnetization, consistent with the gap value and corresponding anisotropic \emph{g}-factors determined previously. The observed singlet-triplet excitations in Faraday and Voigt configurations suggest a mixing of the singlet state with the $S_z=0$ triplet state and the $S_z=\pm 1$ triplet states, respectively, due to the Dzyaloshinskii-Moriya (DM) interaction with a DM vector perpendicular to the crystalline \emph{bc}-plane.

preprint2011arXiv

Spectrum standardization for laser-induced breakdown spectroscopy measurements

This paper presents a spectra normalization method for laser-induced breakdown spectroscopy (LIBS) measurements by converting the recorded characteristic line intensity at varying conditions to the intensity under a standard condition with standard plasma temperature, degree of ionization, and total number density of the interested species to reduce the measurement uncertainty. The characteristic line intensities of the interested species are first converted to the intensity at a fixed temperature and standard degree of ionization but varying total number density for each laser pulse analysis. Under this state, if the influence of the variation of plasma morphology is neglected, the sum of multiple spectral line intensities for the measured element can be regarded proportional to the total number density of the specific element, and the fluctuation of the total number density, or the variation of ablation mass, was compensated for by the application of this relationship. In the experiments with 29 brass alloy samples, the application of this method to determine Cu concentration shows a significant improvement over generally applied normalization method for measurement precision and accuracy. The average RSD value, average value of the error bar, R2, RMSEP, and average value of the maximum relative error were: 5.29%, 0.68%, 0.98, 2.72%, 16.97%, respectively, while the above parameter values for normalization with the whole spectrum area were: 8.61%, 1.37%, 0.95, 3.28%, 29.19%, respectively.

preprint2010arXiv

A Novel Multivariate Model Based on Dominant Factor for Laser-induced Breakdown Spectroscopy Measurements

This paper presents a new approach of applying partial least squares method combined with a physical principle based dominant factor. The characteristic line intensity of the specific element was taken to build up the dominant factor to reflect the major elemental concentration and partial least squares (PLS) approach was then applied to further improve the model accuracy. The deviation evolution of characteristic line intensity from the ideal condition was depicted and according to the deviation understanding, efforts were taken to model the non-linear self-absorption and inter-element interference effects to improve the accuracy of dominant factor model. With a dominant factor to carry the main quantitative information, the novel multivariate model combines advantages of both the conventional univariate and PLS models and partially avoids the overuse of the unrelated noise in the spectrum for PLS application. The dominant factor makes the combination model more robust over a wide concentration range and PLS application improves the model accuracy for samples with matrices within the calibration sample set. Results show that RMSEP of the final dominant factor based PLS model decreased to 2.33% from 5.25% when using the conventional PLS approach with full spectral information. Furthermore, with the development in understanding the physics of the laser-induced plasma, there is potential to easily improve the accuracy of the dominant factor model as well as the proposed novel multivariate model.

preprint2009arXiv

1D goes 2D: A Kosterlitz Thouless transition in superconducting arrays of 4-Angstrom carbon nanotubes

We report superconducting resistive transition characteristics for array(s) of coupled 4-Angstrom single wall carbon nanotubes embedded in aluminophosphate-five (AFI) zeolite. The transition was observed to initiate at 15K with a slow resistance decrease switching to a sharp, order of magnitude drop between 7.5-6.0K. The transition has strong (anisotropic) magnetic field dependence. Differential resistance versus current (voltage) measurements indicate that the establishment of coherence proceeds in stages as the temperature is lowered below 15K. In particular, the sharp resistance drop and its attendant nonlinear IV characteristics are consistent with the manifestations of a Kosterlitz-Thouless (KT) transition that establishes quasi long range order in the plane transverse to the c-axis of the nanotubes, leading to an inhomogeneous system comprising 3D superconducting regions connected by weak links. Global coherence is established at below 5K with the appearance of a well-defined supercurrent gap at 2K.

preprint2008arXiv

Holomorphic Motions and Related Topics

In this article we give an expository account of the holomorphic motion theorem based on work of Màñé-Sad-Sullivan, Bers-Royden, and Chirka. After proving this theorem, we show that tangent vectors to holomorphic motions have $|ε\log ε|$ moduli of continuity and then show how this type of continuity for tangent vectors can be combined with Schwarz's lemma and integration over the holomorphic variable to produce Hölder continuity on the mappings. We also prove, by using holomorphic motions, that Kobayashi's and Teichmüller's metrics on the Teichmüller space of a Riemann surface coincide. Finally, we present an application of holomorphic motions to complex dynamics, that is, we prove the Fatou linearization theorem for parabolic germs by involving holomorphic motions.

Zhe Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

152 published item(s)

A Breast Vision Pathology Foundation Model for Real-world Clinical Utility

Empowering Heterogeneous Graph Foundation Models via Decoupled Relation Alignment

A Tutorial on Extremely Large-Scale MIMO for 6G: Fundamentals, Signal Processing, and Applications

Uplink Precoding Design for Cell-Free Massive MIMO with Iteratively Weighted MMSE

Wasserstein convergence rates in the invariance principle for deterministic dynamical systems

A Competitive Method for Dog Nose-print Re-identification

Accelerating Real-Time Coupled Cluster Methods with Single-Precision Arithmetic and Adaptive Numerical Integration

AMinerGNN: Heterogeneous Graph Neural Network for Paper Click-through Rate Prediction with Fusion Query

Band Gap Opening in Bilayer Graphene-CrCl$_3$/CrBr$_3$/CrI$_3$ van der Waals Interfaces

Beyond Data Samples: Aligning Differential Networks Estimation with Scientific Knowledge

Ellipticity control of terahertz high-harmonic generation in a Dirac semimetal

FFConv: Fast Factorized Convolutional Neural Network Inference on Encrypted Data

FuncFooler: A Practical Black-box Attack Against Learning-based Binary Code Similarity Detection Methods

Identifying and Exploiting Sparse Branch Correlations for Optimizing Branch Prediction

Iteratively Weighted MMSE Uplink Precoding for Cell-Free Massive MIMO

Joint Learning of Deep Texture and High-Frequency Features for Computer-Generated Image Detection

Learning Versatile Neural Architectures by Propagating Network Codes

Magneto-optical study of metamagnetic transitions in the antiferromagnetic phase of $α$-RuCl$_3$

Mass Testing and Characterization of 20-inch PMTs for JUNO

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Maximising the Influence of Temporary Participants in Opinion Formation

Monolithically integrated active passive waveguide array fabricated on thin film lithium niobate using a single continuous photolithography process

Multifunctional Two-dimensional van der Waals Janus Magnet Cr-based Dichalcogenide Halides

Observation of three superconducting transitions in the pressurized CDW-bearing compound TaTe2

On-chip integrated Yb3+-doped waveguide amplifiers on thin film lithium niobate

Reduction of the 2D Toda Hierarchy and Linear Hodge Integrals

Subtype-Former: a deep learning approach for cancer subtype discovery with multi-omics data

TBI-GAN: An Adversarial Learning Approach for Data Synthesis on Traumatic Brain Segmentation

The Potential to Probe Solar Neutrino Physics with LiCl Water Solution

Towards the ultimate PMT waveform analysis for neutrino and dark matter experiments

Uplink Performance of Cell-Free Massive MIMO with Multi-Antenna Users Over Jointly-Correlated Rayleigh Fading Channels

WaveGAN: Frequency-aware GAN for High-Fidelity Few-shot Image Generation

Exploit Camera Raw Data for Video Super-Resolution via Hidden Markov Model Inference

Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms

JUNO Physics and Detector

Network Pruning via Resource Reallocation

On-chip integrated waveguide amplifiers on Erbium-doped thin film lithium niobate on insulator

Relate and Predict: Structure-Aware Prediction with Jointly Optimized Neural DAG

Symmetric Rigidity for Circle Endomorphisms with Bounded Geometry

The ANTARES Astronomical Time-Domain Event Broker

The Role of the Hercules Autonomous Vehicle During the COVID-19 Pandemic: An Autonomous Logistic Vehicle for Contactless Goods Transportation

Variational Bihamiltonian Cohomologies and Integrable Hierarchies II: Virasoro symmetries

A Generalized Training Approach for Multiagent Learning

Achieving 50 femtosecond resolution in MeV ultrafast electron diffraction with a double bend achromat compressor

ACMo: Angle-Calibrated Moment Methods for Stochastic Optimization

Adaptive Gradient Methods Can Be Provably Faster than SGD after Finite Epochs

COLD: Towards the Next Generation of Pre-Ranking System

Cylinder3D: An Effective 3D Framework for Driving-scene LiDAR Semantic Segmentation

Discrete Darboux system with self-consistent sources and its symmetric reduction

Dynamics of entanglement in the one-dimensional anisotropic XXZ model

Exploring Trade-offs in Dynamic Task Triggering for Loosely Coupled Scientific Workflows

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

From Points to Parts: 3D Object Detection from Point Cloud with Part-aware and Part-aggregation Network

Hierarchical Transformer Network for Utterance-level Emotion Recognition

High-index-contrast single-mode optical waveguides fabricated on lithium niobate by photolithography assisted chemo-mechanical etching (PLACE)

History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

Hunting potassium geoneutrinos with liquid scintillator Cherenkov neutrino detectors

Non-asymptotic Convergence Analysis of Two Time-scale (Natural) Actor-Critic Algorithms

Nonequilibrium quasistationary spin disordered state in the Kitaev-Heisenberg magnet $α$-RuCl$_3$

Observation of E8 Particles in an Ising Chain Antiferromagnet

On the Allowable or Forbidden Nature of Vapor-Deposited Glasses

Phase-resolved Higgs response in superconducting cuprates

Predicting Camera Viewpoint Improves Cross-dataset Generalization for 3D Human Pose Estimation

Proximal Gradient Algorithm with Momentum and Flexible Parameter Restart for Nonconvex Optimization

Reanalysis of Variance Reduced Temporal Difference Learning

Resisting Crowd Occlusion and Hard Negatives for Pedestrian Detection in the Wild

Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction

SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud

SpiderBoost and Momentum: Faster Stochastic Variance Reduction Algorithms

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

Towards Reducing Severe Defocus Spread Effects for Multi-Focus Image Fusion via an Optimization Based Strategy

ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language

Weak Supervision and Referring Attention for Temporal-Textual Association Learning

A compact and efficient three-dimensional microfluidic mixer