Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
60works
0followers
41topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

60 published item(s)

preprint2026arXiv

A Non-compact Positivity-Preserving Scheme for Parabolic PDE via Conditional Expectation

We propose a novel non-compact, positivity-preserving scheme for linear non-divergence form parabolic equations. Based on the Feynman-Kac formula, the solution is expressed as a conditional expectation of an associated diffusion process. Instead of using compact Markov chain approximations, we employ a wide stencil scheme to approximate the conditional expectation, ensuring consistency and positivity preservation. This method is effective for anisotropic diffusion with mixed derivatives, where classical schemes often fail unless the covariance matrix is diagonally dominated. A key feature of our framework is its robust treatment of boundary conditions, which avoids the accuracy loss commonly encountered in BZ and semi-Lagrangian schemes. For Dirichlet boundaries, we introduce (i) a quad-tree non-uniform stopping time scheme with O($Δt^{1/2}$) accuracy and (ii) a quad-tree uniform stopping time scheme with O($Δt$) accuracy. For Neumann boundaries, we use discrete specular reflection with O($Δt^{1/2}$) convergence, while periodic boundaries are treated using modular wrapping, achieving O($Δt$) accuracy. All analyses are conducted under the practical scaling $Δt \sim h$. Except for the uniform stopping time scheme, all schemes are explicit. The schemes are unconditionally stable and positive preserving, thanks to the probabilistic structure. To ensure consistency, a non-compact stencil is involved, which leads to the large time step constraint $Δt \sim h$. Numerical experiments confirm the predicted $L^\infty$ convergence rates for all types of boundary conditions.

preprint2026arXiv

A Pilot Kinematic Study on the Forehand Reverse Flick: Feasibility of a Novel Short Return Technique in Table Tennis

Background Following changes in table tennis ball materials, offensive returns have become more important for initiating sustained topspin offense. However, using the backhand flick (BF) to return forehand short balls often increases the difficulty of recovery and continuity, revealing a technical gap. This study preliminarily verified a novel forehand short return technique, the forehand reverse flick (FRF), and analyzed its similarities and differences with the BF. Methods Four elite athletes completed seven consecutive days of FRF specific training. Infrared motion capture and ultra-high-speed cameras were used to collect data on racket kinematics, movement duration, and ball performance. Results The success rate of the FRF increased steadily, reaching 86%. Racket trajectories of the two techniques were highly similar along the X (r = 1) and Y (r = 0.99) axes but differed along the Z (r = -0.04) axis. Racket and ball velocities were comparable between techniques, whereas the FRF showed lower resultant acceleration (approximately 265.57 m/s) and required about 0.03 s more for movement duration. Ball velocity was comparable between techniques, for the ball spin, the FRF generated lower spin (approximately 76.61 r/s) about 64% of the BF value (approximately 120.13 r/s). The highest participant mean spin rate reached 93 r/s, about 77% of the BF mean. Conclusion Overall, the FRF was found to have favorable learnability and training value, with potential for further optimization and competitive application.

preprint2026arXiv

Artificial Gauge Field Engineered Excited-State Topology: Control of Dynamical Evolution of Localized Spinons

Spinons are elementary excitations at the core of frustrated quantum magnets. Although it is well-established that a pair of spinons can emerge from a magnon via deconfinement, controlled manipulation of individual spinons and direct observation of their deconfinement remain elusive. We propose an artificial gauge field scenario that enables the engineering of specific excited states in quantum spin models. This generates spatially localized individual spinons with high controllability. By applying time-dependent gauge fields, we realize adiabatic braiding of these spinons, as well as their dynamical evolution in a controllable manner. These results not only provide the first direct visualization of individual spinons localized in the bulk, but also point to new possibilities to simulate their confinement process. Finally, we demonstrate the feasibility of our scenario in Rydberg atoms, which suggests an experimentally viable direction--gauge field engineering of correlated phenomena in excited states.

preprint2026arXiv

Why Retrieval-Augmented Generation Fails: A Graph Perspective

Retrieval-Augmented Generation (RAG) has become a powerful and widely used approach for improving large language models by grounding generation in retrieved evidence. However, RAG systems still produce incorrect answers in many cases. Why RAG fails despite having access to external information remains poorly understood. We present a model-internal study of retrieval-augmented generation that examines how retrieved evidence influences answer generation. Using circuit tracing, we construct attribution graphs that model the flow of information through transformer layers during decoding. These graphs represent interactions among retrieved context, intermediate model activations, and generated tokens, providing a graph, circuit-level view of how external evidence is integrated into the model's reasoning process across multiple question answering benchmarks, we observe consistent structural differences: correct predictions exhibit deeper reasoning paths, more distributed evidence flow, and a more structured pattern of local connectivity, while failed predictions show shallower, fragmented, and overly concentrated evidence flow. Building on these findings, we develop a graph-based error detection framework that uses attribution-graph topology features. Furthermore, we show that attribution graphs enable targeted interventions. By reinforcing question-constrained evidence grounding, we reshape internal routing so that answer generation remains guided by the question, leading to more effective integration of retrieved information and fewer errors.

preprint2025arXiv

Boundary-layer transition in the age of data: from a comprehensive dataset to fine-grained prediction

The laminar-to-turbulent transition remains a fundamental and enduring challenge in fluid mechanics. Its complexity arises from the intrinsic nonlinearity and extreme sensitivity to external disturbances. This transition is critical in a wide range of applications, including aerospace, marine engineering, geophysical flows, and energy systems. While the governing physics can be well described by the Navier-Stokes equations, practical prediction efforts often fall short due to the lack of comprehensive models for perturbation initialization and turbulence generation in numerical simulations. To address the uncertainty introduced by unforeseeable environmental perturbations, we propose a fine-grained predictive framework that accurately predicts the transition location. The framework generates an extensive dataset using nonlinear parabolized stability equations (NPSE). NPSE simulations are performed over a wide range of randomly prescribed initial conditions for the generic zero-pressure-gradient flat-plate boundary-layer flow, resulting in a large dataset that captures the nonlinear evolution of instability waves across three canonical transition pathways (Type-K, -H, and -O). From a database of 3000 simulation cases, we extract diagnostic quantities (e.g., wall pressure signals and skin-friction coefficients) from each simulation to construct a feature set that links pre-transition flow characteristics to transition onset locations. Machine learning models are systematically evaluated, with ensemble methods-particularly XGBoost-demonstrating exceptional predictive accuracy (mean relative error of approximately 0.001). Compared to methods currently available (e.g., N-factor, transitional turbulence model), this approach accounts for the physical process and achieves transition prediction without relying on any empirical parameters.

preprint2025arXiv

Phase transitions of eutectic high entropy alloy AlCoCrFeNi2.1 under shock compression

High entropy alloys (HEAs) are a new class of metals that exhibit unique mechanical performance. Among HEAs, additively manufactured eutectic high entropy alloys (AM-EHEAs) have recently emerged as candidate materials for use in extreme conditions due to their simultaneous high strength and ductility. However, the deformation and structural evolution of AM-EHEAs under conditions of high pressure have not been well characterized, limiting their use in extreme applications. We present dynamic compression experiments and molecular dynamics simulations studying the structural evolution of AM-EHEA AlCoCrFeNi2.1 when compressed to pressures up to 400 GPa. Our in-situ X-ray diffraction measurements capture the appearance of fcc and bcc phases at different pressure conditions, with pure- and mixed-phase regions. Understanding the phase stability and structural evolution of the AM EHEA offers new insights to guide the development of high-performance complex materials for extreme conditions.

preprint2024arXiv

Spin-Transfer-Torque Induced Spatially Nonuniform Switching in Ferrimagnets

Ferrimagnet (FiM), (FeCo)1-xGdx, attracts research attention due to its ultrafast magnetic dynamics and finite net magnetization. Incorporating FiM into the magnetic tunnel junction will be beneficial to further improve the writing speed of magnetic random access memory (MRAM). It is commonly assumed that the FeCo and Gd atoms are switched together due to the strong exchange coupling, which remains valid even if one performs the two-sublattice macrospin simulation. Interestingly, using the atomistic model developed by our group, it is clearly seen that different atoms are not switched together. In addition, our study reveals that the nature of switching is spatially nonuniform even in the small sample with the dimension of 20 nm-20 nm. Furthermore, the characteristics of nonuniformity are completely different for samples with different Gd composition (x). When x is close to the magnetization compensation point, successful switching cannot be obtained, but is accompanied by the stable oscillation. The atom type that dominates the oscillation is different from that predicted by the two-sublattice macrospin model. In addition, the size of singular region is a non-monotonic function of current density. All these results can only be understood by considering the spatial nonuniform magnetization dynamics.

preprint2022arXiv

A Simple Approach to Improve Single-Model Deep Uncertainty via Distance-Awareness

Accurate uncertainty quantification is a major challenge in deep learning, as neural networks can make overconfident errors and assign high confidence predictions to out-of-distribution (OOD) inputs. The most popular approaches to estimate predictive uncertainty in deep learning are methods that combine predictions from multiple neural networks, such as Bayesian neural networks (BNNs) and deep ensembles. However their practicality in real-time, industrial-scale applications are limited due to the high memory and computational cost. Furthermore, ensembles and BNNs do not necessarily fix all the issues with the underlying member networks. In this work, we study principled approaches to improve uncertainty property of a single network, based on a single, deterministic representation. By formalizing the uncertainty quantification as a minimax learning problem, we first identify distance awareness, i.e., the model's ability to quantify the distance of a testing example from the training data, as a necessary condition for a DNN to achieve high-quality (i.e., minimax optimal) uncertainty estimation. We then propose Spectral-normalized Neural Gaussian Process (SNGP), a simple method that improves the distance-awareness ability of modern DNNs with two simple changes: (1) applying spectral normalization to hidden weights to enforce bi-Lipschitz smoothness in representations and (2) replacing the last output layer with a Gaussian process layer. On a suite of vision and language understanding benchmarks, SNGP outperforms other single-model approaches in prediction, calibration and out-of-domain detection. Furthermore, SNGP provides complementary benefits to popular techniques such as deep ensembles and data augmentation, making it a simple and scalable building block for probabilistic deep learning. Code is open-sourced at https://github.com/google/uncertainty-baselines

preprint2022arXiv

Analytic AC conductivities from holography

We find exact, analytic solutions of the holographic AC conductivity at arbitrary frequency $ω$ and temperature $T$, in contrast to previous works where the AC conductivity was analytically obtained usually at small $ω$ and $T$. These solutions enable us to study the analyticity properties of the current-current correlator $G(ω)$ in detail. The first system we study is the AdS$_5$ planar black hole with momentum dissipation, whose extremal limit has an AdS$_2$ factor. Then we study AdS$_4$ and AdS$_5$ Einstein-dilaton systems whose special cases are maximal gauged supergravities. The solutions show how the poles move and how branch cuts emerge as the temperature varies. As a byproduct, we obtain an R-current correlator in $\mathcal{N}=4$ super-Yang-Mills theory on a sphere at finite temperature in the large $N$ and strong coupling limit.

preprint2022arXiv

Analytic critical points of charged Renyi entropies from hyperbolic black holes

We analytically study phase transitions of holographic charged Renyi entropies in two gravitational systems dual to the $\mathcal{N}=4$ super-Yang-Mills theory at finite density and zero temperature. The first system is the Reissner-Nordstrom-AdS$_5$ black hole, which has finite entropy at zero temperature. The second system is a charged dilatonic black hole in AdS$_5$, which has zero entropy at zero temperature. Hyperbolic black holes are employed to calculate the Renyi entropies with the entangling surface being a sphere. We perturb each system by a charged scalar field, and look for a zero mode signaling the instability of the extremal hyperbolic black hole. Zero modes as well as the leading order of the full retarded Green's function are analytically solved for both systems, in contrast to previous studies in which only the IR (near horizon) instability was analytically treated.

preprint2022arXiv

Color Image Edge Detection using Multi-scale and Multi-directional Gabor filter

In this paper, a color edge detection method is proposed where the multi-scale Gabor filter are used to obtain edges from input color images. The main advantage of the proposed method is that high edge detection accuracy is attained while maintaining good noise robustness. The proposed method consists of three aspects: First, the RGB color image is converted to CIE L*a*b* space because of its wide coloring area and uniform color distribution. Second, a set of Gabor filters are used to smooth the input images and the color edge strength maps are extracted, which are fused into a new ESM with the noise robustness and accurate edge extraction. Third, Embedding the fused ESM in the route of the Canny detector yields a noise-robust color edge detector. The results show that the proposed detector has the better experience in detection accuracy and noise-robustness.

preprint2022arXiv

Defense Against Gradient Leakage Attacks via Learning to Obscure Data

Federated learning is considered as an effective privacy-preserving learning mechanism that separates the client's data and model training process. However, federated learning is still under the risk of privacy leakage because of the existence of attackers who deliberately conduct gradient leakage attacks to reconstruct the client data. Recently, popular strategies such as gradient perturbation methods and input encryption methods have been proposed to defend against gradient leakage attacks. Nevertheless, these defenses can either greatly sacrifice the model performance, or be evaded by more advanced attacks. In this paper, we propose a new defense method to protect the privacy of clients' data by learning to obscure data. Our defense method can generate synthetic samples that are totally distinct from the original samples, but they can also maximally preserve their predictive features and guarantee the model performance. Furthermore, our defense strategy makes the gradient leakage attack and its variants extremely difficult to reconstruct the client data. Through extensive experiments, we show that our proposed defense method obtains better privacy protection while preserving high accuracy compared with state-of-the-art methods.

preprint2022arXiv

DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise Annotations

AI-aided drug discovery (AIDD) is gaining increasing popularity due to its promise of making the search for new pharmaceuticals quicker, cheaper and more efficient. In spite of its extensive use in many fields, such as ADMET prediction, virtual screening, protein folding and generative chemistry, little has been explored in terms of the out-of-distribution (OOD) learning problem with \emph{noise}, which is inevitable in real world AIDD applications. In this work, we present DrugOOD, a systematic OOD dataset curator and benchmark for AI-aided drug discovery, which comes with an open-source Python package that fully automates the data curation and OOD benchmarking processes. We focus on one of the most crucial problems in AIDD: drug target binding affinity prediction, which involves both macromolecule (protein target) and small-molecule (drug compound). In contrast to only providing fixed datasets, DrugOOD offers automated dataset curator with user-friendly customization scripts, rich domain annotations aligned with biochemistry knowledge, realistic noise annotations and rigorous benchmarking of state-of-the-art OOD algorithms. Since the molecular data is often modeled as irregular graphs using graph neural network (GNN) backbones, DrugOOD also serves as a valuable testbed for \emph{graph OOD learning} problems. Extensive empirical studies have shown a significant performance gap between in-distribution and out-of-distribution experiments, which highlights the need to develop better schemes that can allow for OOD generalization under noise for AIDD.

preprint2022arXiv

Elastic Valley Spin Controlled Chiral Coupling in Topological Valley Phononic Crystals

Distinct from the phononic valley pseudo-spin, the real physical spin of elastic waves adds a novel tool-kit capable of envisaging the valley-spin physics of topological valley phononic crystals from a local viewpoint. Here, we report the observation of local elastic valley spin as well as the hidden elastic spin-valley locking mechanism overlooked before. We demonstrate that the selective one-way routing of valley phonon states along the topological interface can be reversed by imposing the elastic spin meta-source at different interface locations with opposite valley-spin correspondence. We unveil the physical mechanism of selective directionality as the elastic spin controlled chiral coupling of valley phonon states, through both analytical theory and experimental measurement of the opposite local elastic spin density at different interface locations for different transport directions. The elastic spin of valley topological edge phonons can be extended to other topological states and offers new tool to explore topological metamaterials.

preprint2022arXiv

Exemplar-Based Image Colorization with A Learning Framework

Image learning and colorization are hot spots in multimedia domain. Inspired by the learning capability of humans, in this paper, we propose an automatic colorization method with a learning framework. This method can be viewed as a hybrid of exemplar-based and learning-based method, and it decouples the colorization process and learning process so as to generate various color styles for the same gray image. The matching process in the exemplar-based colorization method can be regarded as a parameterized function, and we employ a large amount of color images as the training samples to fit the parameters. During the training process, the color images are the ground truths, and we learn the optimal parameters for the matching process by minimizing the errors in terms of the parameters for the matching function. To deal with images with various compositions, a global feature is introduced, which can be used to classify the images with respect to their compositions, and then learn the optimal matching parameters for each image category individually. What's more, a spatial consistency based post-processing is design to smooth the extracted color information from the reference image to remove matching errors. Extensive experiments are conducted to verify the effectiveness of the method, and it achieves comparable performance against the state-of-the-art colorization algorithms.

preprint2022arXiv

Exploring the Impact of Negative Samples of Contrastive Learning: A Case Study of Sentence Embedding

Contrastive learning is emerging as a powerful technique for extracting knowledge from unlabeled data. This technique requires a balanced mixture of two ingredients: positive (similar) and negative (dissimilar) samples. This is typically achieved by maintaining a queue of negative samples during training. Prior works in the area typically uses a fixed-length negative sample queue, but how the negative sample size affects the model performance remains unclear. The opaque impact of the number of negative samples on performance when employing contrastive learning aroused our in-depth exploration. This paper presents a momentum contrastive learning model with negative sample queue for sentence embedding, namely MoCoSE. We add the prediction layer to the online branch to make the model asymmetric and together with EMA update mechanism of the target branch to prevent the model from collapsing. We define a maximum traceable distance metric, through which we learn to what extent the text contrastive learning benefits from the historical information of negative samples. Our experiments find that the best results are obtained when the maximum traceable distance is at a certain range, demonstrating that there is an optimal range of historical information for a negative sample queue. We evaluate the proposed unsupervised MoCoSE on the semantic text similarity (STS) task and obtain an average Spearman's correlation of $77.27\%$. Source code is available at https://github.com/xbdxwyh/mocose.

preprint2022arXiv

Geometric thermodynamic uncertainty relation in periodically driven thermoelectric heat engine

Thermodynamic uncertainty relation, quantifying a trade-off among average current, the associated fluctuation (precision), and entropy production (cost), has been formulated in nonequilibrium steady state and various stochastic systems. Herein, we study the thermodynamic uncertainty relation in generic thermoelectric heat engines under a periodic control protocol, by uncovering the underlying Berry-phase-like contribution. We show that our thermodynamic uncertainty relation breaks the seminal steady-state results, originating from the non-vanishing geometric effect. Furthermore, by deriving the consequent trade-off relation binding efficiency, power, and constancy, we prove that the periodically driven thermoelectric heat engines can generally outperform the steady-state analogies. The general bounds are illustrated by an analytically solvable two-terminal single quantum dot heat engine under the periodic modulation. Our work provides a geometric framework in bounding and optimizing a wide range of periodically driven thermoelectric thermal machines.

preprint2022arXiv

Long-Range Order and Quantum Criticality in Antiferromagnetic Chains with Long-Range Staggered Interactions

We study quantum phase transitions in Heisenberg antiferromagnetic chains with a staggered power-law decaying long-range interactions. Employing the density-matrix renormalization group (DMRG) algorithm and the fidelity susceptibility as the criticality measure, we establish more accurate values of quantum critical points than the results obtained from the spin-wave approximation, quantum Monte Carlo and DMRG in literatures. The deviation is especially evident for strong long-range interactions. We extend isotropic long-range interactions to the anisotropic cases and find that kaleidoscope of quantum phases emerge from the interplay of anisotropy of the long-range exchange interaction and symmetry breaking. We demonstrate nonfrustrating long-range interactions induce the true long-range order in Heisenberg antiferromagnetic chains with a continuous symmetry breaking, lifting the restrictions imposed by the Mermin-Wagner theorem.

preprint2022arXiv

Mass Testing and Characterization of 20-inch PMTs for JUNO

Main goal of the JUNO experiment is to determine the neutrino mass ordering using a 20kt liquid-scintillator detector. Its key feature is an excellent energy resolution of at least 3 % at 1 MeV, for which its instruments need to meet a certain quality and thus have to be fully characterized. More than 20,000 20-inch PMTs have been received and assessed by JUNO after a detailed testing program which began in 2017 and elapsed for about four years. Based on this mass characterization and a set of specific requirements, a good quality of all accepted PMTs could be ascertained. This paper presents the performed testing procedure with the designed testing systems as well as the statistical characteristics of all 20-inch PMTs intended to be used in the JUNO experiment, covering more than fifteen performance parameters including the photocathode uniformity. This constitutes the largest sample of 20-inch PMTs ever produced and studied in detail to date, i.e. 15,000 of the newly developed 20-inch MCP-PMTs from Northern Night Vision Technology Co. (NNVT) and 5,000 of dynode PMTs from Hamamatsu Photonics K. K.(HPK).

preprint2022arXiv

MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment

Large-scale Bundle Adjustment (BA) requires massive memory and computation resources which are difficult to be fulfilled by existing BA libraries. In this paper, we propose MegBA, a GPU-based distributed BA library. MegBA can provide massive aggregated memory by automatically partitioning large BA problems, and assigning the solvers of sub-problems to parallel nodes. The parallel solvers adopt distributed Precondition Conjugate Gradient and distributed Schur Elimination, so that an effective solution, which can match the precision of those computed by a single node, can be efficiently computed. To accelerate BA computation, we implement end-to-end BA computation using high-performance primitives available on commodity GPUs. MegBA exposes easy-to-use APIs that are compatible with existing popular BA libraries. Experiments show that MegBA can significantly outperform state-of-the-art BA libraries: Ceres (41.45$\times$), RootBA (64.576$\times$) and DeepLM (6.769$\times$) in several large-scale BA benchmarks. The code of MegBA is available at https://github.com/MegviiRobot/MegBA.

preprint2022arXiv

Plex: Towards Reliability using Pretrained Large Model Extensions

A recent trend in artificial intelligence is the use of pretrained models for language and vision tasks, which have achieved extraordinary performance but also puzzling failures. Probing these models' abilities in diverse ways is therefore critical to the field. In this paper, we explore the reliability of models, where we define a reliable model as one that not only achieves strong predictive performance but also performs well consistently over many decision-making tasks involving uncertainty (e.g., selective prediction, open set recognition), robust generalization (e.g., accuracy and proper scoring rules such as log-likelihood on in- and out-of-distribution datasets), and adaptation (e.g., active learning, few-shot uncertainty). We devise 10 types of tasks over 40 datasets in order to evaluate different aspects of reliability on both vision and language domains. To improve reliability, we developed ViT-Plex and T5-Plex, pretrained large model extensions for vision and language modalities, respectively. Plex greatly improves the state-of-the-art across reliability tasks, and simplifies the traditional protocol as it improves the out-of-the-box performance and does not require designing scores or tuning the model for each task. We demonstrate scaling effects over model sizes up to 1B parameters and pretraining dataset sizes up to 4B examples. We also demonstrate Plex's capabilities on challenging tasks including zero-shot open set recognition, active learning, and uncertainty in conversational language understanding.

preprint2022arXiv

Pluralistic Image Completion with Probabilistic Mixture-of-Experts

Pluralistic image completion focuses on generating both visually realistic and diverse results for image completion. Prior methods enjoy the empirical successes of this task. However, their used constraints for pluralistic image completion are argued to be not well interpretable and unsatisfactory from two aspects. First, the constraints for visual reality can be weakly correlated to the objective of image completion or even redundant. Second, the constraints for diversity are designed to be task-agnostic, which causes the constraints to not work well. In this paper, to address the issues, we propose an end-to-end probabilistic method. Specifically, we introduce a unified probabilistic graph model that represents the complex interactions in image completion. The entire procedure of image completion is then mathematically divided into several sub-procedures, which helps efficient enforcement of constraints. The sub-procedure directly related to pluralistic results is identified, where the interaction is established by a Gaussian mixture model (GMM). The inherent parameters of GMM are task-related, which are optimized adaptively during training, while the number of its primitives can control the diversity of results conveniently. We formally establish the effectiveness of our method and demonstrate it with comprehensive experiments.

preprint2022arXiv

Proximal Quasi-Newton Methods for Multiobjective Optimization Problems

We introduce some new proximal quasi-Newton methods for unconstrained multiobjective optimization problems (in short, UMOP), where each objective function is the sum of a twice continuously differentiable strongly convex function and a proper lower semicontinuous convex but not necessarily differentiable function. We propose proximal BFGS method, proximal self-scaling BFGS method, and proximal Huang BFGS method for (UMOP) with both line searches and without line searches cases. Under mild assumputions, we show that each accumulation point of the sequence generated by these algorithms, if exists, is a Pareto stationary point of the (UMOP). Moreover, we present their applications in both constrained multiobjective optimization problems and robust multiobjective optimization problems. In particular, for robust multiobjective optimization problems, we show that the subproblems of proximal quasi-Newton algorithms can be regarded as quadratic minimization problems with quadratic inequality constraints. Numerical experiments are also carried out to verify the effectiveness of the proposed proximal quasi-Newton methods.

preprint2022arXiv

Quantum Many-Body Scars in Spin-1 Kitaev Chains

To provide a physical example of quantum scars, we study the many-body scars in the spin-1 Kitaev chain where the so-called PXP Hamiltonian is exactly embedded in the spectra. Regarding the conserved quantities, the Hilbert space is fragmented into disconnected subspaces and we explore the associated constrained dynamics. The continuous revivals of the fidelity and the entanglement entropy when the initial state is prepared in $\vert\mathbb{Z}_k\rangle$ ($k=2,3$) state illustrate the essential physics of the PXP model. We study the quantum phase transitions in the one-dimensional spin-1 Kitaev-Heisenberg model using the density-matrix renormalization group and Lanczos exact diagonalization methods, and determine the phase diagram. We parametrize the two terms in the Hamiltonian by the angle $ϕ$, where the Kitaev term is $K\equiv\sin(ϕ)$ and competes with the Heisenberg $J\equiv\cos(ϕ)$ term. One finds a rich ground state phase diagram as a function of the angle $ϕ$. Depending on the ratio $K/J\equiv\tan(ϕ)$, the system either breaks the symmetry to one of distinct symmetry broken phases, or preserves the symmetry in a quantum spin liquid phase with frustrated interactions. We find that the scarred state is stable for perturbations which obey $\mathbb{Z}_2$-symmetry, while it becomes unstable against Heisenberg-type perturbations.\\ \textit{Accepted for publication in Physical Review Research}

preprint2022arXiv

Sequential Circuits Synthesis for Rapid Single Flux Quantum Logic Based on Finite State Machine Decomposition

Rapid Single Flux Quantum (RSFQ) logic is a promising technology to supersede Complementary metal-oxide-semiconductor (CMOS) logic in some specialized areas due to providing ultra-fast and energy-efficient circuits. To realize a large-scale integration design, electronic design automation (EDA) tools specialized for RSFQ logic are required due to the divergences in logic type, timing constraints, and circuit structure compared with CMOS logic. Logic synthesis is crucial in converting behavioral circuit description into circuit netlist, typically combining combinational and sequential circuit synthesis. For the RSFQ logic, the sequential circuit synthesis is challenging, especially for non-linear sequential blocks with feedback loops. Thus, this paper presents a sequential circuit synthesis algorithm based on finite state machine (FSM) decomposition, which ensures design functionality, lowers costs, and improves the RSFQ circuit performance. Additionally, we present the synthesis processes of the feedback logic and the 2-bit counter to demonstrate how the proposed algorithm operates, and ISCAS89 benchmark circuits reveal our method's ability to synthesize large-scale sequential circuits.

preprint2022arXiv

Theory of Topological Superconductivity in Doped IV-VI Semiconductors

We theoretically study potential unconventional superconductivity in doped AB-type IV-VI semi-conductors, based on a minimal effective model with interaction up to the next-nearest neighbors. According to the experimental implications, we focus on the spin-triplet channels and obtain the superconducting phase diagram with respect to the anisotropy of the Fermi surfaces and the interaction strength. Abundant nodal and nodeless states with different symmetry breaking appear in the phase diagram, and all the states are time reversal invariant and topologically nontrivial. Specifically, the various nodal superconducting ground states, dubbed as the topological Dirac superconductors, are featured by Dirac nodes in the bulk and Majorana arcs on the surface; among the full-gap states, there exist a mirror-symmetry-protected second-order topological superconductor state favoring helical Majorana hinge cones, and different first-order topological superconductor states supporting 4 surface Majorana cones. The experimental verification of the different kinds of superconducting ground states is also discussed.

preprint2022arXiv

Trap of Feature Diversity in the Learning of MLPs

In this paper, we focus on a typical two-phase phenomenon in the learning of multi-layer perceptrons (MLPs), and we aim to explain the reason for the decrease of feature diversity in the first phase. Specifically, people find that, in the training of MLPs, the training loss does not decrease significantly until the second phase. To this end, we further explore the reason why the diversity of features over different samples keeps decreasing in the first phase, which hurts the optimization of MLPs. We explain such a phenomenon in terms of the learning dynamics of MLPs. Furthermore, we theoretically explain why four typical operations can alleviate the decrease of the feature diversity.

preprint2022arXiv

Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning

High-quality estimates of uncertainty and robustness are crucial for numerous real-world applications, especially for deep learning which underlies many deployed ML systems. The ability to compare techniques for improving these estimates is therefore very important for research and practice alike. Yet, competitive comparisons of methods are often lacking due to a range of reasons, including: compute availability for extensive tuning, incorporation of sufficiently many baselines, and concrete documentation for reproducibility. In this paper we introduce Uncertainty Baselines: high-quality implementations of standard and state-of-the-art deep learning methods on a variety of tasks. As of this writing, the collection spans 19 methods across 9 tasks, each with at least 5 metrics. Each baseline is a self-contained experiment pipeline with easily reusable and extendable components. Our goal is to provide immediate starting points for experimentation with new methods or applications. Additionally we provide model checkpoints, experiment outputs as Python notebooks, and leaderboards for comparing results. Code available at https://github.com/google/uncertainty-baselines.

preprint2022arXiv

Why Adversarial Training of ReLU Networks Is Difficult?

This paper mathematically derives an analytic solution of the adversarial perturbation on a ReLU network, and theoretically explains the difficulty of adversarial training. Specifically, we formulate the dynamics of the adversarial perturbation generated by the multi-step attack, which shows that the adversarial perturbation tends to strengthen eigenvectors corresponding to a few top-ranked eigenvalues of the Hessian matrix of the loss w.r.t. the input. We also prove that adversarial training tends to strengthen the influence of unconfident input samples with large gradient norms in an exponential manner. Besides, we find that adversarial training strengthens the influence of the Hessian matrix of the loss w.r.t. network parameters, which makes the adversarial training more likely to oscillate along directions of a few samples, and boosts the difficulty of adversarial training. Crucially, our proofs provide a unified explanation for previous findings in understanding adversarial training.

preprint2021arXiv

Anomalous Transient Heat Conduction in Fractal Metamaterials

Transient dynamics of heat conduction in isotropic fractal metamaterials is investigated. By using the Laplacian operator in non-integer dimension, we analytically and numerically study the effect of fractal dimensionality on the evolution of the temperature profile, heat flux and excess energy under certain initial and boundary conditions. Particularly, with randomly distributed absorbing heat sinks in the fractal metamaterials, we obtain an anomalous non-exponential decay behavior of the heat pulse diffusion. and an optimal dimension for efficient heat absorption as a function of sink concentrations. Our results may have potential applications in controlling transient heat conduction in fractal media, which will be ubiquitous as porous, composite, networked, hierarchical meta-materials.

preprint2021arXiv

Back-n White Neutron Source at CSNS and its Applications

Back-streaming neutrons from the spallation target of the China Spallation Neutron Source (CSNS) that emit through the incoming proton channel were exploited to build a white neutron beam facility (the so-called Back-n white neutron source), which was completed in March 2018. The Back-n neutron beam is very intense, at approximately 2*10^7 n/cm^2/s at 55 m from the target, and has a nominal proton beam with a power of 100 kW in the CSNS-I phase and a kinetic energy of 1.6 GeV and a thick tungsten target in multiple slices with modest moderation from the cooling water through the slices. In addition, the excellent energy spectrum spanning from 0.5 eV to 200 MeV, and a good time resolution related to the time-of-flight measurements make it a typical white neutron source for nuclear data measurements; its overall performance is among that of the best white neutron sources in the world. Equipped with advanced spectrometers, detectors, and application utilities, the Back-n facility can serve wide applications, with a focus on neutron-induced cross-section measurements. This article presents an overview of the neutron beam characteristics, the experimental setups, and the ongoing applications at Back-n.

preprint2021arXiv

Cycle Flux Ranking of Network Analysis in Quantum Thermal Device

Manipulating quantum thermal transport relies on uncovering the principle working cycles of quantum devices. Here, we apply the cycle flux ranking of network analysis to nonequilibrium thermal devices described by graphs of quantum state transitions. To excavate the principal mechanism out of complex transport behaviors, we decompose the quantum-transition network into cycles, calculate the cycle flux by algebraic graph theory, and pick out the dominant cycles with top-ranked fluxes, i.e., the cycle trajectories with highest probabilities. We demonstrate the cycle flux ranking in typical quantum device models, such as a thermal-drag spin-Seebeck pump, and a quantum thermal transistor as thermal switch or heat amplifier. The dominant cycle trajectories indeed elucidate the principal working mechanisms of those quantum devices. The cycle flux analysis provides an alternative perspective that naturally describes the working cycle corresponding to the main functionality of quantum thermal devices, which would further guide the device optimization with desired performance

preprint2021arXiv

Deformed Symmetry Structures and Quantum Many-body Scar Subspaces

A quantum many-body scar system usually contains a special non-thermal subspace (approximately) decoupled from the rest of the Hilbert space. In this work, we propose a general structure called deformed symmetric spaces for the decoupled subspaces hosting quantum many-body scars, which are irreducible sectors of simple Lie groups transformed by matrix-product operators (or projected entangled pair operators), of which the entanglement entropies are proved to obey sub-volume-law scaling and thus violate the eigenstate thermalization hypothesis. A deformed symmetric space, in general, is required to have at least a U(1) sub-Lie-group symmetry to allow coherent periodic dynamics from certain low-entangled initial states. We enumerate several possible deforming transformations based on the sub-group symmetry requirement and recover many existing models whose scar states are not connected by symmetry. In particular, a two-dimensional scar model is proposed, which hosts a periodic dynamical trajectory on which all states are topologically ordered.

preprint2021arXiv

Directional Design of Materials Based on the Multi-Objective Optimization: A Case Study of Two-Dimensional Thermoelectric SnSe

Directional design of functional materials with multi-objective constraints is a big challenge, whose performance and stability are determined by different physics factors entangled with each other complicatedly. In this work, we apply the multi-objective optimization based on the Pareto Efficiency and Particle-Swarm Optimization methods to design new functional materials directionally. As a demonstration, we achieve the thermoelectric design of 2D SnSe materials through the methods. We identify several novel metastable 2D SnSe structures with simultaneously lower free energy and better thermoelectric performance over the experimentally-reported monolayer structures. We hope our results about the multi-objective Pareto Optimization method can make a step towards the integrative design of multi-objective and multi-functional materials in the future.

preprint2021arXiv

Does Your Dermatology Classifier Know What It Doesn't Know? Detecting the Long-Tail of Unseen Conditions

We develop and rigorously evaluate a deep learning based system that can accurately classify skin conditions while detecting rare conditions for which there is not enough data available for training a confident classifier. We frame this task as an out-of-distribution (OOD) detection problem. Our novel approach, hierarchical outlier detection (HOD) assigns multiple abstention classes for each training outlier class and jointly performs a coarse classification of inliers vs. outliers, along with fine-grained classification of the individual classes. We demonstrate the effectiveness of the HOD loss in conjunction with modern representation learning approaches (BiT, SimCLR, MICLe) and explore different ensembling strategies for further improving the results. We perform an extensive subgroup analysis over conditions of varying risk levels and different skin types to investigate how the OOD detection performance changes over each subgroup and demonstrate the gains of our framework in comparison to baselines. Finally, we introduce a cost metric to approximate downstream clinical impact. We use this cost metric to compare the proposed method against a baseline system, thereby making a stronger case for the overall system effectiveness in a real-world deployment scenario.

preprint2021arXiv

Identifying Gene-environment interactions with robust marginal Bayesian variable selection

In high-throughput genetics studies, an important aim is to identify gene-environment interactions associated with the clinical outcomes. Recently, multiple marginal penalization methods have been developed and shown to be effective in G$\times$E studies. However, within the Bayesian framework, marginal variable selection has not received much attention. In this study, we propose a novel marginal Bayesian variable selection method for G$\times$E studies. In particular, our marginal Bayesian method is robust to data contamination and outliers in the outcome variables. With the incorporation of spike-and-slab priors, we have implemented the Gibbs sampler based on MCMC. The proposed method outperforms a number of alternatives in extensive simulation studies. The utility of the marginal robust Bayesian variable selection method has been further demonstrated in the case studies using data from the Nurse Health Study (NHS). Some of the identified main and interaction effects from the real data analysis have important biological implications.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2021arXiv

ZeRO-Offload: Democratizing Billion-Scale Model Training

Large-scale model training has been a playing ground for a limited few requiring complex model refactoring and access to prohibitively expensive GPU clusters. ZeRO-Offload changes the large model training landscape by making large model training accessible to nearly everyone. It can train models with over 13 billion parameters on a single GPU, a 10x increase in size compared to popular framework such as PyTorch, and it does so without requiring any model change from the data scientists or sacrificing computational efficiency. ZeRO-Offload enables large model training by offloading data and compute to CPU. To preserve compute efficiency, it is designed to minimize the data movement to/from GPU, and reduce CPU compute time while maximizing memory savings on GPU. As a result, ZeRO-Offload can achieve 40 TFlops/GPU on a single NVIDIA V100 GPU for 10B parameter model compared to 30TF using PyTorch alone for a 1.4B parameter model, the largest that can be trained without running out of memory. ZeRO-Offload is also designed to scale on multiple-GPUs when available, offering near linear speedup on up to 128 GPUs. Additionally, it can work together with model parallelism to train models with over 70 billion parameters on a single DGX-2 box, a 4.5x increase in model size compared to using model parallelism alone. By combining compute and memory efficiency with ease-of-use, ZeRO-Offload democratizes large-scale model training making it accessible to even data scientists with access to just a single GPU.

preprint2020arXiv

Demystifying the Performance of HPC Scientific Applications on NVM-based Memory Systems

The emergence of high-density byte-addressable non-volatile memory (NVM) is promising to accelerate data- and compute-intensive applications. Current NVM technologies have lower performance than DRAM and, thus, are often paired with DRAM in a heterogeneous main memory. Recently, byte-addressable NVM hardware becomes available. This work provides a timely evaluation of representative HPC applications from the &#34;Seven Dwarfs&#34; on NVM-based main memory. Our results quantify the effectiveness of DRAM-cached-NVM for accelerating HPC applications and enabling large problems beyond the DRAM capacity. On uncached-NVM, HPC applications exhibit three tiers of performance sensitivity, i.e., insensitive, scaled, and bottlenecked. We identify write throttling and concurrency control as the priorities in optimizing applications. We highlight that concurrency change may have a diverging effect on read and write accesses in applications. Based on these findings, we explore two optimization approaches. First, we provide a prediction model that uses datasets from a small set of configurations to estimate performance at various concurrency and data sizes to avoid exhaustive search in the configuration space. Second, we demonstrate that write-aware data placement on uncached-NVM could achieve $2$x performance improvement with a 60% reduction in DRAM usage.

preprint2020arXiv

Dynamical universality classes towards an infinite temperature state

Dynamical universality is the observation that the dynamical properties of different systems might exhibit universal behavior that are independent of the system details. In this paper, we study the long-time dynamics of an one-dimensional noisy quantum magnetic model, and find that even though the system are inevitably driven to an infinite temperature state, the relaxation dynamics towards such featureless state can be highly nontrivial and universal. The effect of various mode-coupling mechanisms (external potential, disorder, interaction, and the interplay between them) as well as the conservation law on the long-time dynamics of the systems have been studied, and their relevance with current ultracold atomic experiments have been discussed.

preprint2020arXiv

Entanglements and correlations of one-dimensional quantum spin-1/2 chain with anisotropic power-law long range interactions

The correlations, entanglement entropy, and fidelity susceptibility are calculated for a one-dimensional spin-1/2 XXZ chain with anisotropic power-law long range interactions by employing the density matrix renormalization group method. In particular, this long-range interaction is assigned to ferromagnetic for transversal components, while it can be either ferro- or antiferromagnetic for the longitudinal spin component. Two ground-state phase diagrams are established versus the anisotropy of the interactions which not only changes the phase boundaries of the counterparts with short-range interactions, but also leads to the emergence of exotic phases. We found that the long-range interactions of the z-component results in a Wigner crystal phase, whereas the transversal one may break a continuous symmetry, resulting in a continuous symmetry breaking phase.

preprint2020arXiv

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

The Jiangmen Underground Neutrino Observatory~(JUNO) features a 20~kt multi-purpose underground liquid scintillator sphere as its main detector. Some of JUNO&#39;s features make it an excellent experiment for $^8$B solar neutrino measurements, such as its low-energy threshold, its high energy resolution compared to water Cherenkov detectors, and its much large target mass compared to previous liquid scintillator detectors. In this paper we present a comprehensive assessment of JUNO&#39;s potential for detecting $^8$B solar neutrinos via the neutrino-electron elastic scattering process. A reduced 2~MeV threshold on the recoil electron energy is found to be achievable assuming the intrinsic radioactive background $^{238}$U and $^{232}$Th in the liquid scintillator can be controlled to 10$^{-17}$~g/g. With ten years of data taking, about 60,000 signal and 30,000 background events are expected. This large sample will enable an examination of the distortion of the recoil electron spectrum that is dominated by the neutrino flavor transformation in the dense solar matter, which will shed new light on the tension between the measured electron spectra and the predictions of the standard three-flavor neutrino oscillation framework. If $Δm^{2}_{21}=4.8\times10^{-5}~(7.5\times10^{-5})$~eV$^{2}$, JUNO can provide evidence of neutrino oscillation in the Earth at the about 3$σ$~(2$σ$) level by measuring the non-zero signal rate variation with respect to the solar zenith angle. Moveover, JUNO can simultaneously measure $Δm^2_{21}$ using $^8$B solar neutrinos to a precision of 20\% or better depending on the central value and to sub-percent precision using reactor antineutrinos. A comparison of these two measurements from the same detector will help elucidate the current tension between the value of $Δm^2_{21}$ reported by solar neutrino experiments and the KamLAND experiment.

preprint2020arXiv

Gene-Environment Interaction: A Variable Selection Perspective

Gene-environment interactions have important implications to elucidate the genetic basis of complex diseases beyond the joint function of multiple genetic factors and their interactions (or epistasis). In the past, G$\times$E interactions have been mainly conducted within the framework of genetic association studies. The high dimensionality of G$\times$E interactions, due to the complicated form of environmental effects and presence of a large number of genetic factors including gene expressions and SNPs, has motivated the recent development of penalized variable selection methods for dissecting G$\times$E interactions, which has been ignored in majority of published reviews on genetic interaction studies. In this article, we first survey existing overviews on both gene-environment and gene-gene interactions. Then, after a brief introduction on the variable selection methods, we review penalization and relevant variable selection methods in marginal and joint paradigms respectively under a variety of conceptual models. Discussions on strengths and limitations, as well as computational aspects of the variable selection methods tailored for G$\times$E studies have also been provided.

preprint2020arXiv

Interpretable Complex-Valued Neural Networks for Privacy Protection

Previous studies have found that an adversary attacker can often infer unintended input information from intermediate-layer features. We study the possibility of preventing such adversarial inference, yet without too much accuracy degradation. We propose a generic method to revise the neural network to boost the challenge of inferring input attributes from features, while maintaining highly accurate outputs. In particular, the method transforms real-valued features into complex-valued ones, in which the input is hidden in a randomized phase of the transformed features. The knowledge of the phase acts like a key, with which any party can easily recover the output from the processing result, but without which the party can neither recover the output nor distinguish the original input. Preliminary experiments on various datasets and network structures have shown that our method significantly diminishes the adversary&#39;s ability in inferring about the input while largely preserves the resulting accuracy.

preprint2020arXiv

Managing quantum heat transfer in nonequilibrium qubit-phonon hybrid system

We investigate quantum heat transfer and thermal management in the nonequilibrium qubit-phonon hybrid system by applying the quantum master equation embedded with phononic coherent state. We obtain the steady state heat flow by tuning the arbitrary qubit-phonon coupling strength, which particularly exhibits the power-law scaling behavior and the turnover behavior in the weak and strong coupling regimes, respectively. Moreover, we analyze the negative differential thermal conductance and thermal rectification, which becomes profound with weak qubit-phonon interaction and large temperature bias. These results would contribute to smart energy control and design of phononic hybrid quantum devices.

preprint2020arXiv

Measurement of the neutron beam profile of the Back-n white neutron facility at CSNS with a Micromegas detector

The Back-n white neutron beam line, which uses back-streaming white neutrons from the spallation target of the China Spallation Neutron Source, is used for nuclear data measurements. A Micromegas-based neutron detector with two variants was specially developed to measure the beam spot distribution for this beam line. In this article, the design, fabrication, and characterization of the detector are described. The results of the detector performance tests are presented, which include the relative electron transparency, the gain and the gain uniformity, and the neutron beam profile reconstruction capability. The result of the first measurement of the Back-n neutron beam spot distribution is also presented.

preprint2020arXiv

Quantum phase transitions in a spin-1 antiferromagnetic chain with long-range interactions and modulated single-ion anisotropy

We study the phase diagram of spin-1 antiferromagnetic chain with isotropic antiferromagnetic interactions decaying with a power-law $\propto r^{-α}$ ($α\ge 1$) accompanied by modulated single-ion anisotropy. Employing the techniques of the density-matrix renormalization group, effects of long-range interactions and single-ion anisotropy on a variety of correlations are investigated. In order to check the consistency, the fidelity susceptibilities are evaluated across quantum phase transitions. The quantum critical points are faithfully detected and orders of phase transitions are determined. The correlation-length critical exponent is extracted from scaling functions of the fidelity susceptibility. The presence of long-range interactions leads to quantitative change of the phase boundaries and reduces the order of phase transition under certain conditions. A direct first-order transition between the periodic Néel phase and the large-$D$ phase occurs for slowly decaying antiferromagnetic interactions.

preprint2020arXiv

Revisiting One-vs-All Classifiers for Predictive Uncertainty and Out-of-Distribution Detection in Neural Networks

Accurate estimation of predictive uncertainty in modern neural networks is critical to achieve well calibrated predictions and detect out-of-distribution (OOD) inputs. The most promising approaches have been predominantly focused on improving model uncertainty (e.g. deep ensembles and Bayesian neural networks) and post-processing techniques for OOD detection (e.g. ODIN and Mahalanobis distance). However, there has been relatively little investigation into how the parametrization of the probabilities in discriminative classifiers affects the uncertainty estimates, and the dominant method, softmax cross-entropy, results in misleadingly high confidences on OOD data and under covariate shift. We investigate alternative ways of formulating probabilities using (1) a one-vs-all formulation to capture the notion of &#34;none of the above&#34;, and (2) a distance-based logit representation to encode uncertainty as a function of distance to the training manifold. We show that one-vs-all formulations can improve calibration on image classification tasks, while matching the predictive performance of softmax without incurring any additional training or test-time complexity.

preprint2020arXiv

Robust Bayesian variable selection for gene-environment interactions

Gene-environment (G$\times$E) interactions have important implications to elucidate the etiology of complex diseases beyond the main genetic and environmental effects. Outliers and data contamination in disease phenotypes of G$\times$E studies have been commonly encountered, leading to the development of a broad spectrum of robust regularization methods. Nevertheless, within the Bayesian framework, the issue has not been taken care of in existing studies. We develop a fully Bayesian robust variable selection method for G$\times$E interaction studies. The proposed Bayesian method can effectively accommodate heavy-tailed errors and outliers in the response variable while conducting variable selection by accounting for structural sparsity. In particular, for the robust sparse group selection, the spike-and-slab priors have been imposed on both individual and group levels to identify important main and interaction effects robustly. An efficient Gibbs sampler has been developed to facilitate fast computation. Extensive simulation studies and analysis of both the diabetes data with SNP measurements from the Nurses&#39; Health Study and TCGA melanoma data with gene expression measurements demonstrate the superior performance of the proposed method over multiple competing alternatives.

preprint2020arXiv

Smart, Adaptive Energy Optimization for Mobile Web Interactions

Web technology underpins many interactive mobile applications. However, energy-efficient mobile web interactions is an outstanding challenge. Given the increasing diversity and complexity of mobile hardware, any practical optimization scheme must work for a wide range of users, mobile platforms and web workloads. This paper presents CAMEL , a novel energy optimization system for mobile web interactions. CAMEL leverages machine learning techniques to develop a smart, adaptive scheme to judiciously trade performance for reduced power consumption. Unlike prior work, C AMEL directly models how a given web content affects the user expectation and uses this to guide energy optimization. It goes further by employing transfer learning and conformal predictions to tune a previously learned model in the end-user environment and improve it over time. We apply CAMEL to Chromium and evaluate it on four distinct mobile systems involving 1,000 testing webpages and 30 users. Compared to four state-of-the-art web-event optimizers, CAMEL delivers 22% more energy savings, but with 49% fewer violations on the quality of user experience, and exhibits orders of magnitudes less overhead when targeting a new computing environment.

preprint2020arXiv

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

The Taishan Antineutrino Observatory (TAO, also known as JUNO-TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO). A ton-level liquid scintillator detector will be placed at about 30 m from a core of the Taishan Nuclear Power Plant. The reactor antineutrino spectrum will be measured with sub-percent energy resolution, to provide a reference spectrum for future reactor neutrino experiments, and to provide a benchmark measurement to test nuclear databases. A spherical acrylic vessel containing 2.8 ton gadolinium-doped liquid scintillator will be viewed by 10 m^2 Silicon Photomultipliers (SiPMs) of >50% photon detection efficiency with almost full coverage. The photoelectron yield is about 4500 per MeV, an order higher than any existing large-scale liquid scintillator detectors. The detector operates at -50 degree C to lower the dark noise of SiPMs to an acceptable level. The detector will measure about 2000 reactor antineutrinos per day, and is designed to be well shielded from cosmogenic backgrounds and ambient radioactivities to have about 10% background-to-signal ratio. The experiment is expected to start operation in 2022.

preprint2020arXiv

Unsupervised Manifold Clustering of Topological Phononics

Classification of topological phononics is challenging due to the lack of universal topological invariants and the randomness of structure patterns. Here, we show the unsupervised manifold learning for clustering topological phononics without any priori knowledge, neither topological invariants nor supervised trainings, even when systems are imperfect or disordered. This is achieved by exploiting the real-space projection operator about finite phononic lattices to describe the correlation between oscillators. We exemplify the efficient unsupervised manifold clustering in typical phononic systems, including one-dimensional Su-Schrieffer-Heeger-type phononic chain with random couplings, amorphous phononic topological insulators, higher-order phononic topological states and non-Hermitian phononic chain with random dissipations. The results would inspire more efforts on applications of unsupervised machine learning for topological phononic devices and beyond.

preprint2019arXiv

Measurements of differential and angle-integrated cross sections for the $^{10}$B($n, α$)$^{7}$Li reaction in the neutron energy range from 1.0 eV to 2.5 MeV

Differential and angle-integrated cross sections for the $^{10}$B($n, α$)$^{7}$Li, $^{10}$B($n, α$$_{0}$)$^{7}$Li and $^{10}$B($n, α$$_{1}$)$^{7}$Li$^{*}$ reactions have been measured at CSNS Back-n white neutron source. Two enriched (90%) $^{10}$B samples 5.0 cm in diameter and ~85.0 $μ$g/cm$^{2}$ in thickness each with an aluminum backing were prepared, and back-to-back mounted at the sample holder. The charged particles were detected using the silicon-detector array of the Light-charged Particle Detector Array (LPDA) system. The neutron energy E$_{n}$ was determined by TOF (time-of-flight) method, and the valid $α$ events were extracted from the E$_{n}$-Amplitude two-dimensional spectrum. With 15 silicon detectors, the differential cross sections of $α$-particles were measured from 19.2° to 160.8°. Fitted with the Legendre polynomial series, the ($n, α$) cross sections were obtained through integration. The absolute cross sections were normalized using the standard cross sections of the $^{10}$B($n, α$)$^{7}$Li reaction in the 0.3 - 0.5 MeV neutron energy region. The measurement neutron energy range for the $^{10}$B($n, α$)$^{7}$Li reaction is 1.0 eV $\le$ En < 2.5 MeV (67 energy points), and for the $^{10}$B($n, α$$_{0}$)$^{7}$Li and $^{10}$B($n, α$$_{1}$)$^{7}$Li$^{*}$ reactions is 1.0 eV $\le$ En < 1.0 MeV (59 energy points). The present results have been analyzed by the resonance reaction mechanism and the level structure of the $^{11}$B compound system, and compared with existing measurements and evaluations.

preprint2019arXiv

Unveiling CP property of top-Higgs coupling with graph neural networks at the LHC

The top-Higgs coupling plays an important role in particle physics and cosmology. The precision measurements of this coupling can provide an insight to new physics beyond the Standard Model. In this paper, we propose to use Message Passing Neural Network (MPNN) to reveal the CP nature of top-Higgs interaction through semi-leptonic channel $pp \to t(\to b\ell^-ν_\ell)\bar{t}(\to \bar{b}jj)h(\to b\bar{b})$. Using the test statistics constructed from the event classification probabilities given by the MPNN, we find that the pure CP-even and CP-odd components can be well distinguished at the LHC, with at most 300 fb$^{-1}$ experimental data.

preprint2018arXiv

Linear instability of Poiseuille flows with highly non-ideal fluids

The objective of this work is to investigate linear modal and algebraic instability in Poiseuille flows with fluids close to their vapour-liquid critical point. Close to this critical point, the ideal gas assumption does not hold and large non-ideal fluid behaviours occur. As a representative non-ideal fluid, we consider supercritical carbon dioxide (CO$_2$) at pressure of 80 bar, which is above its critical pressure of 73.9 bar. The Poiseuille flow is characterized by the Reynolds number ($Re=ρ_{w}^{*}u_{r}^{*}h^{*}/μ_{w}^{*}$), the product of Prandtl ($Pr=μ_{w}^{*}C_{pw}^{*}/κ_{w}^{*}$) and Eckert number ($Ec=u_{r}^{*2}/C_{pw}^{*}T_{w}^{*}$), and the wall temperature that in addition to pressure determines the thermodynamic reference condition. For low Eckert numbers, the flow is essentially isothermal and no difference with the well-known stability behaviour of incompressible flows is observed. However, if the Eckert number increases, the viscous heating causes gradients of thermodynamic and transport properties, and non-ideal gas effects become significant. Three regimes of the laminar base flow can be considered, subcritical (temperature in the channel is entirely below its pseudo-critical value), transcritical, and supercritical temperature regime. If compared to the linear stability of an ideal gas Poiseuille flow, we show that the base flow is more unstable in the subcritical regime, inviscid unstable in the transcritical regime, while significantly more stable in the supercritical regime. Following the corresponding states principle, we expect that qualitatively similar results will be obtained for other fluids at equivalent thermodynamic states.

preprint2018arXiv

Observation of acoustic spin

Unlike optical waves, acoustic waves in fluids are described by scalar pressure fields, and therefore are considered spinless. Here, we demonstrate experimentally the existence of spin in acoustics. In the interference of two acoustic waves propagating perpendicularly to each other, we observed the spin angular momentum in free space as a result of the rotation of local particle velocity. We successfully measured the acoustic spin, and spin induced torque acting on a lossy acoustic meta-atom that results from absorption of the spin angular momentum. The acoustic spin is also observed in the evanescent field of a guided mode traveling along a metamaterial waveguide. We found spin-momentum locking in acoustic waves whose propagation direction is determined by the sign of spin. The observed acoustic spin could open a new door in acoustics and their applications for the control of wave propagation and particle rotation.

preprint2018arXiv

The new mode of instability in viscous high-speed boundary layer flows

The new mode of instability found by Tunney et al. is studied with viscous stability theory in this article. When the high-speed boundary layer is subject to certain values of favorable pressure gradient and wall heating, a new mode becomes unstable due to the appearance of the streamwise velocity overshoot ($U(y)>U_\infty$) in the base flow. The present study shows that under practical Reynolds numbers, the new mode can hardly co-exist with conventional first mode and Mack&#39;s second mode. Due to the requirement for additional wall heating, the new mode may only lead to laminar-turbulent transition under experimental (artificial) conditions.

preprint2018arXiv

Unusually low thermal conductivity of atomically thin 2D tellurium

Tellurium is a high-performance thermoelectric material due to its superior electronic transport and low lattice thermal conductivity ($κ_L$). Here, we report the ultralow $κ_L$ in the monolayer tellurium, i.e., tellurene, which has been successfully synthesized in recent experiments. We find tellurene has a compellingly low room temperature $κ_L$ of 2.16 and 4.08 W m$^{-1}$ K$^{-1}$ along the armchair and zigzag directions, respectively, which is lower than any reported values for other 2D materials. We attribute this unusually low $κ_L$ to the soft acoustic modes, extremely low-energy optical modes and the strong scattering among optical-acoustic phonons, which place tellurene as a potential novel thermoelectric material. Finally, we disclose that $κ_L$ is proportional to the largest acoustic phonon frequency ($ω_{D}^{a}$) and the lowest optical phonon frequency at $Γ$ point ($ω_Γ^{o}$) in 2D materials, which reflect both harmonic and anharmonic thermal properties respectively.