Source author record

Xiang Zhou

Xiang Zhou appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

40works

26topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Distributed NLI: Learning to Predict Human Opinion Distributions for Language Reasoning

We introduce distributed NLI, a new NLU task with a goal to predict the distribution of human judgements for natural language inference. We show that by applying additional distribution estimation methods, namely, Monte Carlo (MC) Dropout, Deep Ensemble, Re-Calibration, and Distribution Distillation, models can capture human judgement distribution more effectively than the softmax baseline. We show that MC Dropout is able to achieve decent performance without any distribution annotations while Re-Calibration can give further improvements with extra distribution annotations, suggesting the value of multiple annotations for one example in modeling the distribution of human judgements. Despite these improvements, the best results are still far below the estimated human upper-bound, indicating that predicting the distribution of human judgements is still an open, challenging problem with a large room for improvements. We showcase the common errors for MC Dropout and Re-Calibration. Finally, we give guidelines on the usage of these methods with different levels of data availability and encourage future work on modeling the human opinion distribution for language reasoning. Our code and data are publicly available at https://github.com/easonnie/ChaosNLI

preprint2022arXiv

Efficient primal heuristics for mixed-integer linear programs

This paper is a short report about our work for the primal task in the Machine Learning for Combinatorial Optimization NeurIPS 2021 Competition. For each dataset of our interest in the competition, we propose customized primal heuristic methods to efficiently identify high-quality feasible solutions. The computational studies demonstrate the superiority of our proposed approaches over the competitors'.

preprint2022arXiv

Electromagnetic Dalitz Decays of $D_{(s)}^\ast$ Mesons

Rare electromagnetic decays of charmed mesons are useful laboratories to explore the structure of hadronic states and the interactions between photon and charmed mesons, to test the chiral perturbation theory in flavor sector and to search for new physics including dark photons. In this paper, we calculate the relative branching ratios of electromagnetic Dalitz decays $D_{(s)}^\ast\to D_{(s)}\ell^+\ell^-$ to their corresponding radiative decays $D_{(s)}^\ast\to D_{(s)}γ$, the dileptonic invariant mass spectra and the leptonic angular distributions with transition form factor in Vector-Meson Dominance model, where $D_{(s)}^\ast$ represents $D^\ast(2007)^0$, $D^\ast(2010)^\pm$, $D^\ast(2640)^\pm$, $D_s^{\ast\pm}$, $D_{s1}^\ast(2700)^\pm$ and $D_{s1}^\ast(2860)^\pm$.

preprint2022arXiv

Learn Quasi-stationary Distributions of Finite State Markov Chain

We propose a reinforcement learning (RL) approach to compute the expression of quasi-stationary distribution. Based on the fixed-point formulation of quasi-stationary distribution, we minimize the KL-divergence of two Markovian path distributions induced by the candidate distribution and the true target distribution. To solve this challenging minimization problem by gradient descent, we apply the reinforcement learning technique by introducing the reward and value functions. We derive the corresponding policy gradient theorem and design an actor-critic algorithm to learn the optimal solution and the value function. The numerical examples of finite state Markov chain are tested to demonstrate the new method.

preprint2022arXiv

Masked Part-Of-Speech Model: Does Modeling Long Context Help Unsupervised POS-tagging?

Previous Part-Of-Speech (POS) induction models usually assume certain independence assumptions (e.g., Markov, unidirectional, local dependency) that do not hold in real languages. For example, the subject-verb agreement can be both long-term and bidirectional. To facilitate flexible dependency modeling, we propose a Masked Part-of-Speech Model (MPoSM), inspired by the recent success of Masked Language Models (MLM). MPoSM can model arbitrary tag dependency and perform POS induction through the objective of masked POS reconstruction. We achieve competitive results on both the English Penn WSJ dataset as well as the universal treebank containing 10 diverse languages. Though modeling the long-term dependency should ideally help this task, our ablation study shows mixed trends in different languages. To better understand this phenomenon, we design a novel synthetic experiment that can specifically diagnose the model's ability to learn tag agreement. Surprisingly, we find that even strong baselines fail to solve this problem consistently in a very simplified setting: the agreement between adjacent words. Nonetheless, MPoSM achieves overall better performance. Lastly, we conduct a detailed error analysis to shed light on other remaining challenges. Our code is available at https://github.com/owenzx/MPoSM

preprint2022arXiv

Methods for Large-scale Single Mediator Hypothesis Testing: Possible Choices and Comparisons

Mediation hypothesis testing for a large number of mediators is challenging due to the composite structure of the null hypothesis, H0:alpha*beta=0 (alpha: effect of the exposure on the mediator after adjusting for confounders; beta: effect of the mediator on the outcome after adjusting for exposure and confounders). In this paper, we reviewed three classes of methods for multiple mediation hypothesis testing. In addition to these existing methods, we developed the Sobel-comp method, which uses a corrected mixture reference distribution for Sobel's test statistic. We performed extensive simulation studies to compare all six methods in terms of the false positive rates under the null hypothesis and the true positive rates under the alternative hypothesis. We found that the class of methods which uses a mixture reference distribution could best maintain the false positive rates at the nominal level under the null hypothesis and had the greatest true positive rates under the alternative hypothesis. We applied all methods to study the mediation mechanism of DNA methylation sites in the pathway from adult socioeconomic status to glycated hemoglobin level using data from the Multi-Ethnic Study of Atherosclerosis (MESA). We also provide guidelines for choosing the optimal mediation hypothesis testing method in practice.

preprint2022arXiv

Mission Apollo: Landing Optical Circuit Switching at Datacenter Scale

In this paper, we describe Apollo, to the best of our knowledge, the world's first large-scale production deployment of optical circuit switches (OCSes) for datacenter networking. We will first describe the infrastructure challenges and use cases that motivated optical switching inside datacenters. We then delve into the requirements of OCSes for datacenter applications: balancing cost, port count, switching time, and optical performance, which drive design choices and implementation details of our internally developed 3D MEMS-based OCS. To enable the Apollo optical switching layer, we employ circulators to realize bidirectional links through the OCS, effectively doubling the OCS radix. The OCS and circulator design choices were critical for meeting network bandwidth, scale, and cost targets. We review the critical co-design of WDM transceiver technology for these OCS plus circulator-based bidirectional links and their corresponding physical impairments, delivered over four generations/speeds of optical interconnect. Finally, we conclude with thoughts on future directions in hardware development and associated applications.

preprint2022arXiv

Sublinear Regret for Learning POMDPs

We study the model-based undiscounted reinforcement learning for partially observable Markov decision processes (POMDPs). The oracle we consider is the optimal policy of the POMDP with a known environment in terms of the average reward over an infinite horizon. We propose a learning algorithm for this problem, building on spectral method-of-moments estimations for hidden Markov models, the belief error control in POMDPs and upper-confidence-bound methods for online learning. We establish a regret bound of $O(T^{2/3}\sqrt{\log T})$ for the proposed learning algorithm where $T$ is the learning horizon. This is, to the best of our knowledge, the first algorithm achieving sublinear regret with respect to our oracle for learning general POMDPs.

preprint2022arXiv

The Machine Learning for Combinatorial Optimization Competition (ML4CO): Results and Insights

Combinatorial optimization is a well-established area in operations research and computer science. Until recently, its methods have focused on solving problem instances in isolation, ignoring that they often stem from related data distributions in practice. However, recent years have seen a surge of interest in using machine learning as a new approach for solving combinatorial problems, either directly as solvers or by enhancing exact solvers. Based on this context, the ML4CO aims at improving state-of-the-art combinatorial optimization solvers by replacing key heuristic components. The competition featured three challenging tasks: finding the best feasible solution, producing the tightest optimality certificate, and giving an appropriate solver configuration. Three realistic datasets were considered: balanced item placement, workload apportionment, and maritime inventory routing. This last dataset was kept anonymous for the contestants.

preprint2021arXiv

JUNO Physics and Detector

The Jiangmen Underground Neutrino Observatory (JUNO) is a 20 kton LS detector at 700-m underground. An excellent energy resolution and a large fiducial volume offer exciting opportunities for addressing many important topics in neutrino and astro-particle physics. With 6 years of data, the neutrino mass ordering can be determined at 3-4 sigma and three oscillation parameters can be measured to a precision of 0.6% or better by detecting reactor antineutrinos. With 10 years of data, DSNB could be observed at 3-sigma; a lower limit of the proton lifetime of 8.34e33 years (90% C.L.) can be set by searching for p->nu_bar K^+; detection of solar neutrinos would shed new light on the solar metallicity problem and examine the vacuum-matter transition region. A core-collapse supernova at 10 kpc would lead to ~5000 IBD and ~2000 (300) all-flavor neutrino-proton (electron) scattering events. Geo-neutrinos can be detected with a rate of ~400 events/year. We also summarize the final design of the JUNO detector and the key R&D achievements. All 20-inch PMTs have been tested. The average photon detection efficiency is 28.9% for the 15,000 MCP PMTs and 28.1% for the 5,000 dynode PMTs, higher than the JUNO requirement of 27%. Together with the >20 m attenuation length of LS, we expect a yield of 1345 p.e. per MeV and an effective energy resolution of 3.02%/\sqrt{E (MeV)}$ in simulations. The underwater electronics is designed to have a loss rate <0.5% in 6 years. With degassing membranes and a micro-bubble system, the radon concentration in the 35-kton water pool could be lowered to <10 mBq/m^3. Acrylic panels of radiopurity <0.5 ppt U/Th are produced. The 20-kton LS will be purified onsite. Singles in the fiducial volume can be controlled to ~10 Hz. The JUNO experiment also features a double calorimeter system with 25,600 3-inch PMTs, a LS testing facility OSIRIS, and a near detector TAO.

preprint2021arXiv

Regime Switching Bandits

We study a multi-armed bandit problem where the rewards exhibit regime switching. Specifically, the distributions of the random rewards generated from all arms are modulated by a common underlying state modeled as a finite-state Markov chain. The agent does not observe the underlying state and has to learn the transition matrix and the reward distributions. We propose a learning algorithm for this problem, building on spectral method-of-moments estimations for hidden Markov models, belief error control in partially observable Markov decision processes and upper-confidence-bound methods for online learning. We also establish an upper bound $O(T^{2/3}\sqrt{\log T})$ for the proposed learning algorithm where $T$ is the learning horizon. Finally, we conduct proof-of-concept experiments to illustrate the performance of the learning algorithm.

preprint2020arXiv

Bayesian Sparse Mediation Analysis with Targeted Penalization of Natural Indirect Effects

Causal mediation analysis aims to characterize an exposure's effect on an outcome and quantify the indirect effect that acts through a given mediator or a group of mediators of interest. With the increasing availability of measurements on a large number of potential mediators, like the epigenome or the microbiome, new statistical methods are needed to simultaneously accommodate high-dimensional mediators while directly target penalization of the natural indirect effect (NIE) for active mediator identification. Here, we develop two novel prior models for identification of active mediators in high-dimensional mediation analysis through penalizing NIEs in a Bayesian paradigm. Both methods specify a joint prior distribution on the exposure-mediator effect and mediator-outcome effect with either (a) a four-component Gaussian mixture prior or (b) a product threshold Gaussian prior. By jointly modeling the two parameters that contribute to the NIE, the proposed methods enable penalization on their product in a targeted way. Resultant inference can take into account the four-component composite structure underlying the NIE. We show through simulations that the proposed methods improve both selection and estimation accuracy compared to other competing methods. We applied our methods for an in-depth analysis of two ongoing epidemiologic studies: the Multi-Ethnic Study of Atherosclerosis (MESA) and the LIFECODES birth cohort. The identified active mediators in both studies reveal important biological pathways for understanding disease mechanisms.

preprint2020arXiv

DCMN+: Dual Co-Matching Network for Multi-choice Reading Comprehension

Multi-choice reading comprehension is a challenging task to select an answer from a set of candidate options when given passage and question. Previous approaches usually only calculate question-aware passage representation and ignore passage-aware question representation when modeling the relationship between passage and question, which obviously cannot take the best of information between passage and question. In this work, we propose dual co-matching network (DCMN) which models the relationship among passage, question and answer options bidirectionally. Besides, inspired by how human solve multi-choice questions, we integrate two reading strategies into our model: (i) passage sentence selection that finds the most salient supporting sentences to answer the question, (ii) answer option interaction that encodes the comparison information between answer options. DCMN integrated with the two strategies (DCMN+) obtains state-of-the-art results on five multi-choice reading comprehension datasets which are from different domains: RACE, SemEval-2018 Task 11, ROCStories, COIN, MCTest.

preprint2020arXiv

Deep Reinforcement Learning for On-line Dialogue State Tracking

Dialogue state tracking (DST) is a crucial module in dialogue management. It is usually cast as a supervised training problem, which is not convenient for on-line optimization. In this paper, a novel companion teaching based deep reinforcement learning (DRL) framework for on-line DST optimization is proposed. To the best of our knowledge, this is the first effort to optimize the DST module within DRL framework for on-line task-oriented spoken dialogue systems. In addition, dialogue policy can be further jointly updated. Experiments show that on-line DST optimization can effectively improve the dialogue manager performance while keeping the flexibility of using predefined policy. Joint training of both DST and policy can further improve the performance.

preprint2020arXiv

Enumerating Chemical Graphs with Two Disjoint Cycles Satisfying Given Path Frequency Specifications

Enumerating chemical graphs satisfying given constraints is a fundamental problem in mathematical and computational chemistry, and plays an essential part in a recently proposed framework for the inverse QSAR/QSPR. In this paper, constraints are given by feature vectors each of which consists of the frequencies of paths in a given set of paths. We consider the problem of enumerating chemical graphs that satisfy the path frequency constraints, which are given by a pair of feature vectors specifying upper and lower bounds of the frequency of each path. We design a branch-and-bound algorithm for enumerating chemical graphs of bi-block 2-augmented structure, that is, graphs that contain two edge-disjoint cycles. We present some computational experiments with an implementation of our proposed algorithm.

preprint2020arXiv

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

The Jiangmen Underground Neutrino Observatory~(JUNO) features a 20~kt multi-purpose underground liquid scintillator sphere as its main detector. Some of JUNO's features make it an excellent experiment for $^8$B solar neutrino measurements, such as its low-energy threshold, its high energy resolution compared to water Cherenkov detectors, and its much large target mass compared to previous liquid scintillator detectors. In this paper we present a comprehensive assessment of JUNO's potential for detecting $^8$B solar neutrinos via the neutrino-electron elastic scattering process. A reduced 2~MeV threshold on the recoil electron energy is found to be achievable assuming the intrinsic radioactive background $^{238}$U and $^{232}$Th in the liquid scintillator can be controlled to 10$^{-17}$~g/g. With ten years of data taking, about 60,000 signal and 30,000 background events are expected. This large sample will enable an examination of the distortion of the recoil electron spectrum that is dominated by the neutrino flavor transformation in the dense solar matter, which will shed new light on the tension between the measured electron spectra and the predictions of the standard three-flavor neutrino oscillation framework. If $Δm^{2}_{21}=4.8\times10^{-5}~(7.5\times10^{-5})$~eV$^{2}$, JUNO can provide evidence of neutrino oscillation in the Earth at the about 3$σ$~(2$σ$) level by measuring the non-zero signal rate variation with respect to the solar zenith angle. Moveover, JUNO can simultaneously measure $Δm^2_{21}$ using $^8$B solar neutrinos to a precision of 20\% or better depending on the central value and to sub-percent precision using reactor antineutrinos. A comparison of these two measurements from the same detector will help elucidate the current tension between the value of $Δm^2_{21}$ reported by solar neutrino experiments and the KamLAND experiment.

preprint2020arXiv

Machine Learning and Control Theory

We survey in this article the connections between Machine Learning and Control Theory. Control Theory provide useful concepts and tools for Machine Learning. Conversely Machine Learning can be used to solve large control problems. In the first part of the paper, we develop the connections between reinforcement learning and Markov Decision Processes, which are discrete time control problems. In the second part, we review the concept of supervised learning and the relation with static optimization. Deep learning which extends supervised learning, can be viewed as a control problem. In the third part, we present the links between stochastic gradient descent and mean-field theory. Conversely, in the fourth and fifth parts, we review machine learning approaches to stochastic control problems, and focus on the deterministic case, to explain, more easily, the numerical algorithms.

preprint2020arXiv

Semantics-aware BERT for Language Understanding

The latest work on language representations carefully integrates contextualized features into language model training, which enables a series of success especially in various machine reading comprehension and natural language inference tasks. However, the existing language representation models including ELMo, GPT and BERT only exploit plain context-sensitive features such as character or word embeddings. They rarely consider incorporating structured semantic information which can provide rich semantics for language representation. To promote natural language understanding, we propose to incorporate explicit contextual semantics from pre-trained semantic role labeling, and introduce an improved language representation model, Semantics-aware BERT (SemBERT), which is capable of explicitly absorbing contextual semantics over a BERT backbone. SemBERT keeps the convenient usability of its BERT precursor in a light fine-tuning way without substantial task-specific modifications. Compared with BERT, semantics-aware BERT is as simple in concept but more powerful. It obtains new state-of-the-art or substantially improves results on ten reading comprehension and language inference tasks.

preprint2020arXiv

Stochastic Modified Equations for Continuous Limit of Stochastic ADMM

Stochastic version of alternating direction method of multiplier (ADMM) and its variants (linearized ADMM, gradient-based ADMM) plays a key role for modern large scale machine learning problems. One example is the regularized empirical risk minimization problem. In this work, we put different variants of stochastic ADMM into a unified form, which includes standard, linearized and gradient-based ADMM with relaxation, and study their dynamics via a continuous-time model approach. We adapt the mathematical framework of stochastic modified equation (SME), and show that the dynamics of stochastic ADMM is approximated by a class of stochastic differential equations with small noise parameters in the sense of weak approximation. The continuous-time analysis would uncover important analytical insights into the behaviors of the discrete-time algorithm, which are non-trivial to gain otherwise. For example, we could characterize the fluctuation of the solution paths precisely, and decide optimal stopping time to minimize the variance of solution paths.

preprint2020arXiv

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

The Taishan Antineutrino Observatory (TAO, also known as JUNO-TAO) is a satellite experiment of the Jiangmen Underground Neutrino Observatory (JUNO). A ton-level liquid scintillator detector will be placed at about 30 m from a core of the Taishan Nuclear Power Plant. The reactor antineutrino spectrum will be measured with sub-percent energy resolution, to provide a reference spectrum for future reactor neutrino experiments, and to provide a benchmark measurement to test nuclear databases. A spherical acrylic vessel containing 2.8 ton gadolinium-doped liquid scintillator will be viewed by 10 m^2 Silicon Photomultipliers (SiPMs) of >50% photon detection efficiency with almost full coverage. The photoelectron yield is about 4500 per MeV, an order higher than any existing large-scale liquid scintillator detectors. The detector operates at -50 degree C to lower the dark noise of SiPMs to an acceptable level. The detector will measure about 2000 reactor antineutrinos per day, and is designed to be well shielded from cosmogenic backgrounds and ambient radioactivities to have about 10% background-to-signal ratio. The experiment is expected to start operation in 2022.

preprint2020arXiv

Towards Robustifying NLI Models Against Lexical Dataset Biases

While deep learning models are making fast progress on the task of Natural Language Inference, recent studies have also shown that these models achieve high accuracy by exploiting several dataset biases, and without deep understanding of the language semantics. Using contradiction-word bias and word-overlapping bias as our two bias examples, this paper explores both data-level and model-level debiasing methods to robustify models against lexical dataset biases. First, we debias the dataset through data augmentation and enhancement, but show that the model bias cannot be fully removed via this method. Next, we also compare two ways of directly debiasing the model without knowing what the dataset biases are in advance. The first approach aims to remove the label bias at the embedding level. The second approach employs a bag-of-words sub-model to capture the features that are likely to exploit the bias and prevents the original model from learning these biased features by forcing orthogonality between these two sub-models. We performed evaluations on new balanced datasets extracted from the original MNLI dataset as well as the NLI stress tests, and show that the orthogonality approach is better at debiasing the model while maintaining competitive overall accuracy. Our code and data are available at: https://github.com/owenzx/LexicalDebias-ACL2020

preprint2016arXiv

Explore Stochastic Instabilities of Periodic Points by Transition Path Theory

We consider the noise-induced transitions in the randomly perturbed discrete logistic map from a linearly stable periodic orbit consisting of T periodic points. The traditional large deviation theory and asymptotic analysis for small noise limit as well as the derived quasi-potential can not distinguish the quantitative difference in noise-induced stochastic instabilities of these T periodic points. We generalize the transition path theory to the discrete-time continuous-space stochastic process to attack this problem. As a first criterion of quantifying the relative instability among T periodic points, we compare the distribution of the last passage locations in the transitions from the whole periodic orbit to a prescribed set far away. This distribution is related to the contributions to the transition rate from each periodic points. The second criterion is based on the competency of the transition paths associated with each periodic point. Both criteria utilise the reactive probability current in the transition path theory. Our numerical results for the logistic map reveal the transition mechanism of escaping from the stable periodic orbit and identify which peri- odic point is more prone to lose stability so as to make successful transitions under random perturbations.

preprint2015arXiv

Densities, isobaric thermal expansion coefficients and isothermal compressibilities of linear alkylbenzene

We report the measurements of the densities of linear alkylbenzene at three temperatures over 4 to 23 Celsius degree with pressures up to 10 MPa. The measurements have been analysed to yield the isobaric thermal expansion coefficients and, so far for the first time, isothermal compressibilities of linear alkylbenzene. Relevance of results for current generation (i.e. Daya Bay) and next generation (i.e. JUNO) large liquid scintillator neutrino detectors are discussed.

preprint2015arXiv

Electrical control of exchange spring in antiferromagnetic metals

Manipulation of antiferromagnetic (AFM) spins by electrical means is on great demand to develop the AFM spintronics with low power consumption. In spite of the electrical modulation of insulated AFMs through coupling between their intrinsic ferroelectricity and antiferromagnetism, direct electrical control of AFM metals remains challenging due to the screening effect by the surface charge, and the manipulation is confined to a limited depth of atomic dimensions, which is insufficient to form a stable AFM exchange spring. In the present letter we primarily report a reversible electrical control of exchange spring in AFM metals, using an ionic liquid to exert a substantial electric-field effect. The exchange spring could transfer the force to the ferromagnet/antiferromagnet interface, enabling a deeper modulation depth in AFM metals. The manipulation of AFM moments by gate voltage is demonstrated in [Co/Pt]/IrMn model system and a single IrMn layer with the IrMn thickness up to 5 nm. Besides the fundamental significance of modulating the spin structures in metallic AFM via all-electrical fashion, the present finding would advance the development of low-power-consumption AFM spintronics.

preprint2015arXiv

Escaping from an attractor: Importance sampling and rest points I

We discuss importance sampling schemes for the estimation of finite time exit probabilities of small noise diffusions that involve escape from an equilibrium. A factor that complicates the analysis is that rest points are included in the domain of interest. We build importance sampling schemes with provably good performance both pre-asymptotically, that is, for fixed size of the noise, and asymptotically, that is, as the size of the noise goes to zero, and that do not degrade as the time horizon gets large. Simulation studies demonstrate the theoretical results.

preprint2015arXiv

Nature of magnetotransport in metal/insulating-ferromagnet heterostructures: Spin Hall magnetoresistance or magnetic proximity effect

We study the anomalous Hall-like effect (AHLE) and the effective anisotropic magnetoresistance (EAMR) in antiferromagnetic γ -IrMn3/Y3Fe5O12(YIG) and Pt/YIG heterostructures. For γ -IrMn3/YIG, the EAMR and the AHLE resistivity change sign with temperature due to the competition between the spin Hall magnetoresistance (SMR) and the magnetic proximity effect (MPE) induced by the interfacial antiferromagnetic uncompensated magnetic moment. In contrast, for Pt/YIG the AHLE resistivity changes sign with temperature whereas no sign change is observed in the EAMR. This is because the MPE and the SMR play a dominant role in the AHLE and the EAMR, respectively. As new types of galvanomagnetic property, the AHLE and the EAMR have proved vital in disentangling the MPE and the SMR in metal/insulating-ferromagnet heterostructures.

preprint2015arXiv

Neutrino Physics with JUNO

The Jiangmen Underground Neutrino Observatory (JUNO), a 20 kton multi-purpose underground liquid scintillator detector, was proposed with the determination of the neutrino mass hierarchy as a primary physics goal. It is also capable of observing neutrinos from terrestrial and extra-terrestrial sources, including supernova burst neutrinos, diffuse supernova neutrino background, geoneutrinos, atmospheric neutrinos, solar neutrinos, as well as exotic searches such as nucleon decays, dark matter, sterile neutrinos, etc. We present the physics motivations and the anticipated performance of the JUNO detector for various proposed measurements. By detecting reactor antineutrinos from two power plants at 53-km distance, JUNO will determine the neutrino mass hierarchy at a 3-4 sigma significance with six years of running. The measurement of antineutrino spectrum will also lead to the precise determination of three out of the six oscillation parameters to an accuracy of better than 1\%. Neutrino burst from a typical core-collapse supernova at 10 kpc would lead to ~5000 inverse-beta-decay events and ~2000 all-flavor neutrino-proton elastic scattering events in JUNO. Detection of DSNB would provide valuable information on the cosmic star-formation rate and the average core-collapsed neutrino energy spectrum. Geo-neutrinos can be detected in JUNO with a rate of ~400 events per year, significantly improving the statistics of existing geoneutrino samples. The JUNO detector is sensitive to several exotic searches, e.g. proton decay via the $p\to K^++\barν$ decay channel. The JUNO detector will provide a unique facility to address many outstanding crucial questions in particle and astrophysics. It holds the great potential for further advancing our quest to understanding the fundamental properties of neutrinos, one of the building blocks of our Universe.

preprint2015arXiv

Preliminary study of light yield dependence on LAB liquid scintillator composition

Liquid scintillator (LS) will be adopted as the detector material in JUNO (Jiangmen Underground Neutrino Observatory). The energy resolution requirement of JUNO is 3%, which has never previously been reached. To achieve this energy resolution, the light yield of liquid scintillator is an important factor. PPO (the fluor) and bis-MSB (the wavelength shifter) are the two main materials dissolved in LAB. To study the influence of these two materials on the transmission of scintillation photons in LS, 25 and 12 cm-long quartz vessels were used in a light yield experiment. LS samples with different concentration of PPO and bis-MSB were tested. At these lengths, the light yield growth is not obvious when the concentration of PPO is higher than 4 g/L. The influence from bis-MSB becomes insignificant when its concentration is higher than 8 mg/L. This result could provide some useful suggestions for the JUNO LS.

preprint2015arXiv

Rayleigh scattering and depolarization ratio in linear alkylbenzene

Linear alkylbenzene (LAB) is adopted to be the organic solvent for the Jiangmen Underground Neutrino Observatory (JUNO) liquid scintillator detectors due to the ultra-transparency. However the current Rayleigh scattering length calculation disagrees with the measurement. The present paper for the first time reports the Rayleigh scattering of LAB being anisotropic and the depolarization ratio being 0.31+-0.01(stat.)+-0.01(sys.). We proposed an indirectly method for Rayleigh scattering measurement with Einstein-Smoluchowski-Cabannes formula, and the Rayleigh scattering length of LAB is determined to be 28.2+-1.0 m at 430 nm.

preprint2015arXiv

Rayleigh scattering of linear alkylbenzene in large liquid scintillator detectors

Rayleigh scattering poses an intrinsic limit for the transparency of organic liquid scintillators. This work focuses on the Rayleigh scattering length of linear alkylbenzene (LAB), which will be used as the solvent of the liquid scintillator in the central detector of the Jiangmen Underground Neutrino Observatory. We investigate the anisotropy of the Rayleigh scattering in LAB, showing that the resulting Rayleigh scattering length will be significantly shorter than reported before. Given the same overall light attenuation, this will result in a more efficient transmission of photons through the scintillator, increasing the amount of light collected by the photosensors and thereby the energy resolution of the detector.

preprint2015arXiv

Spectroscopic study of light scattering in linear alkylbenzene for liquid scintillator neutrino detectors

We has set up a light scattering spectrometer to study the depolarization of light scattering in linear alkylbenzene. From the scattering spectra it can be unambiguously shown that the depolarized part of light scattering belongs to Rayleigh scattering. The additional depolarized Rayleigh scattering can make the effective transparency of linear alkylbenzene much better than it was expected. Therefore sufficient scintillation photons can transmit through the large liquid scintillator detector of JUNO. Our study is crucial to achieving the unprecedented energy resolution 3\%/$\sqrt{E\mathrm{(MeV)}}$ for JUNO experiment to determine the neutrino mass hierarchy. The spectroscopic method can also be used to judge the attribution of the depolarization of other organic solvents used in neutrino experiments.

preprint2014arXiv

An Iterative Minimization Formulation for Saddle-Point Search

This paper proposes and analyzes an iterative minimization formulation for search- ing index-1 saddle points of an energy function. This formulation differs from other eigenvector-following methods by constructing a new objective function near the guess at each iteration step. This leads to a quadratic convergence rate, in comparison to the linear case of the gentlest ascent dynamics (E and Zhou, nonlinearity, vol 24, p1831, 2011) and many other existing methods. We also propose the generalization of the new methodology for saddle points of higher index and for constrained energy functions on manifold.

preprint2014arXiv

Efficient rare event simulation for failure problems in random media

In this paper we study rare events associated to solutions of elliptic partial differential equations with spatially varying random coefficients. The random coefficients follow the lognormal distribution, which is determined by a Gaussian process. This model is employed to study the failure problem of elastic materials in random media in which the failure is characterized by that the strain field exceeds a high threshold. We propose an efficient importance sampling scheme to compute small failure probabilities in the high threshold limit. The change of measure in our scheme is parametrized by two density functions. The efficiency of the importance sampling scheme is validated by numerical examples.

preprint2014arXiv

Finding Transition Pathways on Manifolds

We consider noise-induced transition paths in randomly perturbed dynami- cal systems on a smooth manifold. The classical Freidlin-Wentzell large devia- tion theory in Euclidean spaces is generalized and new forms of action functionals are derived in the spaces of functions and the space of curves to accommodate the intrinsic constraints associated with the manifold. Numerical meth- ods are proposed to compute the minimum action paths for the systems with constraints. The examples of conformational transition paths for a single and double rod molecules arising in polymer science are numerically investigated.

preprint2013arXiv

An Explicit Cross Entropy Scheme for Mixtures

The key issue in importance sampling is the choice of the alternative sampling distribution, which is often chosen from the exponential tilt family of the underlying distribution. However, when the problem exhibits certain kind of nonconvexity, it is very likely that a single exponential change of measure will never attain asymptotic optimality and can lead to erroneous estimates. In this paper we introduce an explicit iterative scheme which combines the traditional cross-entropy method and the EM algorithm to find an efficient alternative sampling distribution in the form of mixtures. We also study the applications of this scheme to the estimation of rainbow option prices.

preprint2013arXiv

Efficient Algorithms for Multivariate Linear Mixed Models in Genome-wide Association Studies

Multivariate linear mixed models (mvLMMs) have been widely used in many areas of genetics, and have attracted considerable recent interest in genome-wide association studies (GWASs). However, fitting mvLMMs is computationally non-trivial, and no existing method is computationally practical for performing the likelihood ratio test (LRT) for mvLMMs in GWAS settings with moderate sample size n. The existing software MTMM perform an approximate LRT for two phenotypes, and as we find, its p values can substantially understate the significance of associations. Here, we present novel computationally-efficient algorithms for fitting mvLMMs, and computing the LRT in GWAS settings. After a single initial eigen-decomposition (with complexity O(n^3)) the algorithms i) reduce computational complexity (per iteration of the optimizer) from cubic to linear in n; and ii) in GWAS analyses, reduces per-marker complexity from cubic to quadratic in n. These innovations make it practical to compute the LRT for mvLMMs in GWASs for tens of thousands of samples and a moderate number of phenotypes (~2-10). With simulations, we show that the LRT provides correct control for type I error. With both simulations and real data we find that the LRT is more powerful than the approximate LRT from MTMM, and illustrate the benefits of analyzing more than two phenotypes. The method is implemented in the GEMMA software package, freely available at http://stephenslab.uchicago.edu/software.html

preprint2013arXiv

Extreme Analysis of a Non-convex and Nonlinear Functional of Gaussian Processes -- On the Tail Asymptotics of Random Ordinary Differential Equations

In this paper, we consider a stochastic system described by a differential equation admitting a spatially varying random coefficient. The differential equation has been employed to model various static physics systems such as elastic deformation, water flow, electric-magnetic fields, temperature distribution, etc. A random coefficient is introduced to account for the system's uncertainty and/or imperfect measurements. This random coefficient is described by a Gaussian process (the input process) and thus the solution to the differential equation (under certain boundary conditions) is a complexed functional of the input Gaussian process. In this paper, we focus the analysis on the one-dimensional case and derive asymptotic approximations of the tail probabilities of the solution to the equation that has various physics interpretations under different contexts. This analysis rests on the literature of the extreme analysis of Gaussian processes (such as the tail approximations of the supremum) and extends the analysis to more complexed functionals.

preprint2012arXiv

Polygenic Modeling with Bayesian Sparse Linear Mixed Models

Both linear mixed models (LMMs) and sparse regression models are widely used in genetics applications, including, recently, polygenic modeling in genome-wide association studies. These two approaches make very different assumptions, so are expected to perform well in different situations. However, in practice, for a given data set one typically does not know which assumptions will be more accurate. Motivated by this, we consider a hybrid of the two, which we refer to as a "Bayesian sparse linear mixed model" (BSLMM) that includes both these models as special cases. We address several key computational and statistical issues that arise when applying BSLMM, including appropriate prior specification for the hyper-parameters, and a novel Markov chain Monte Carlo algorithm for posterior inference. We apply BSLMM and compare it with other methods for two polygenic modeling applications: estimating the proportion of variance in phenotypes explained (PVE) by available genotypes, and phenotype (or breeding value) prediction. For PVE estimation, we demonstrate that BSLMM combines the advantages of both standard LMMs and sparse regression modeling. For phenotype prediction it considerably outperforms either of the other two methods, as well as several other large-scale regression methods previously suggested for this problem. Software implementing our method is freely available from http://stephenslab.uchicago.edu/software.html

preprint2011arXiv

The Gentlest Ascent Dynamics

Dynamical systems that describe the escape from the basins of attraction of stable invariant sets are presented and analyzed. It is shown that the stable fixed points of such dynamical systems are the index-1 saddle points. Generalizations to high index saddle points are discussed. Both gradient and non-gradient systems are considered. Preliminary results on the nature of the dynamical behavior are presented.

preprint2010arXiv

Maximum relative excitation of a specific vibrational mode via optimum laser pulse duration

For molecules and materials responding to femtosecond-scale optical laser pulses, we predict maximum relative excitation of a Raman-active vibrational mode with period T when the pulse has an FWHM duration of 0.42 T. This result follows from a general analytical model, and is precisely confirmed by detailed density-functional-based dynamical simulations for C60 and a carbon nanotube, which include anharmonicity, nonlinearity, no assumptions about the polarizability tensor, and no averaging over rapid oscillations within the pulse. The mode specificity is, of course, best at low temperature and for pulses that are electronically off-resonance, and the energy deposited in any mode is proportional to the fourth power of the electric field.

Xiang Zhou

What is connected

Connect this record

See the researcher in context

Building this map preview

40 published item(s)

Distributed NLI: Learning to Predict Human Opinion Distributions for Language Reasoning

Efficient primal heuristics for mixed-integer linear programs

Electromagnetic Dalitz Decays of $D_{(s)}^\ast$ Mesons

Learn Quasi-stationary Distributions of Finite State Markov Chain

Masked Part-Of-Speech Model: Does Modeling Long Context Help Unsupervised POS-tagging?

Methods for Large-scale Single Mediator Hypothesis Testing: Possible Choices and Comparisons

Mission Apollo: Landing Optical Circuit Switching at Datacenter Scale

Sublinear Regret for Learning POMDPs

The Machine Learning for Combinatorial Optimization Competition (ML4CO): Results and Insights

JUNO Physics and Detector

Regime Switching Bandits

Bayesian Sparse Mediation Analysis with Targeted Penalization of Natural Indirect Effects

DCMN+: Dual Co-Matching Network for Multi-choice Reading Comprehension

Deep Reinforcement Learning for On-line Dialogue State Tracking

Enumerating Chemical Graphs with Two Disjoint Cycles Satisfying Given Path Frequency Specifications

Feasibility and physics potential of detecting $^8$B solar neutrinos at JUNO

Machine Learning and Control Theory

Semantics-aware BERT for Language Understanding

Stochastic Modified Equations for Continuous Limit of Stochastic ADMM

TAO Conceptual Design Report: A Precision Measurement of the Reactor Antineutrino Spectrum with Sub-percent Energy Resolution

Towards Robustifying NLI Models Against Lexical Dataset Biases

Explore Stochastic Instabilities of Periodic Points by Transition Path Theory

Densities, isobaric thermal expansion coefficients and isothermal compressibilities of linear alkylbenzene

Electrical control of exchange spring in antiferromagnetic metals

Escaping from an attractor: Importance sampling and rest points I

Nature of magnetotransport in metal/insulating-ferromagnet heterostructures: Spin Hall magnetoresistance or magnetic proximity effect

Neutrino Physics with JUNO

Preliminary study of light yield dependence on LAB liquid scintillator composition

Rayleigh scattering and depolarization ratio in linear alkylbenzene

Rayleigh scattering of linear alkylbenzene in large liquid scintillator detectors

Spectroscopic study of light scattering in linear alkylbenzene for liquid scintillator neutrino detectors

An Iterative Minimization Formulation for Saddle-Point Search

Efficient rare event simulation for failure problems in random media

Finding Transition Pathways on Manifolds

An Explicit Cross Entropy Scheme for Mixtures

Efficient Algorithms for Multivariate Linear Mixed Models in Genome-wide Association Studies

Extreme Analysis of a Non-convex and Nonlinear Functional of Gaussian Processes -- On the Tail Asymptotics of Random Ordinary Differential Equations

Polygenic Modeling with Bayesian Sparse Linear Mixed Models

The Gentlest Ascent Dynamics

Maximum relative excitation of a specific vibrational mode via optimum laser pulse duration