Source author record

Yiming Xu

Yiming Xu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Machine Learning, hep-ph, math.NA, Numerical Analysis, Applications, hep-th, math.OC, math.ST, Statistics Theory, Artificial Intelligence, astro-ph.CO, Computation, Computer Vision, math.PR, Social and Information Networks

Catalog footprint

What is connected

14 works

15 topics

4 close collaborators

preprint 2022 arXiv

A bandit-learning approach to multifidelity approximation

Multifidelity approximation is an important technique in scientific computation and simulation. In this paper, we introduce a bandit-learning approach for leveraging data of varying fidelities to achieve precise estimates of the parameters of interest. Under a linear model assumption, we formulate a multifidelity approximation as a modified stochastic bandit, and analyze the loss for a class of policies that uniformly explore each model before exploiting. Utilizing the estimated conditional mean-squared error, we propose a consistent algorithm, adaptive Explore-Then-Commit (AETC), and establish a corresponding trajectory-wise optimality result. These results are then extended to the case of vector-valued responses, where we demonstrate that the algorithm is efficient without the need to worry about estimating high-dimensional parameters. The main advantage of our approach is that we require neither hierarchical model structure nor a priori knowledge of statistical information (e.g., correlations) about or between models. Instead, the AETC algorithm requires only knowledge of which model is a trusted high-fidelity model, along with (relative) computational cost estimates of querying each model. Numerical experiments are provided at the end to support our theoretical findings.
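To make the phrase "uniformly explore each model before exploiting" concrete, here is a minimal sketch of the textbook explore-then-commit bandit policy. It is not the AETC algorithm from the abstract: there is no linear model, no cost weighting, and no adaptive stopping. The function name `explore_then_commit` and its parameters are hypothetical and chosen only for illustration.

```python
def explore_then_commit(arms, explore_rounds, total_rounds):
    """Generic explore-then-commit policy (illustrative, not AETC).

    arms: list of zero-argument callables, each returning a numeric reward.
    explore_rounds: number of pulls given to every arm during exploration.
    total_rounds: overall pull budget, exploration included.
    Returns (index of the committed arm, total collected reward).
    """
    if explore_rounds < 1:
        raise ValueError("explore_rounds must be at least 1")
    if explore_rounds * len(arms) > total_rounds:
        raise ValueError("exploration budget exceeds total_rounds")

    sums = [0.0] * len(arms)
    total = 0.0

    # Exploration phase: pull every arm the same number of times.
    for i, arm in enumerate(arms):
        for _ in range(explore_rounds):
            reward = arm()
            sums[i] += reward
            total += reward

    # Commit to the arm with the highest empirical mean reward.
    means = [s / explore_rounds for s in sums]
    best = max(range(len(arms)), key=lambda i: means[i])

    # Exploitation phase: spend the remaining budget on the committed arm.
    for _ in range(total_rounds - explore_rounds * len(arms)):
        total += arms[best]()

    return best, total


# Deterministic toy arms, so the outcome is easy to check by hand.
toy_arms = [lambda: 1.0, lambda: 3.0, lambda: 2.0]
best_arm, collected = explore_then_commit(toy_arms, explore_rounds=2, total_rounds=10)
print(best_arm, collected)  # 1 24.0
```

In the toy run, six pulls are spent on exploration (reward 12) and the remaining four go to the second arm (reward 12). The fixed exploration length is the main simplification here; the abstract describes an adaptive variant.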

preprint 2022 arXiv

A General Pairwise Comparison Model for Extremely Sparse Networks

Statistical inference using pairwise comparison data is an effective approach to analyzing large-scale sparse networks. In this paper, we propose a general framework to model the mutual interactions in a network, which enjoys ample flexibility in terms of model parametrization. Under this setup, we show that the maximum likelihood estimator for the latent score vector of the subjects is uniformly consistent under a near-minimal condition on network sparsity. This condition is sharp in terms of the leading order asymptotics describing the sparsity. Our analysis utilizes a novel chaining technique and illustrates an important connection between graph topology and model consistency. Our results guarantee that the maximum likelihood estimator is justified for estimation in large-scale pairwise comparison networks where data are asymptotically deficient. Simulation studies are provided in support of our theoretical findings.

preprint 2022 arXiv

Examining spatial heterogeneity of ridesourcing demand determinants with explainable machine learning

The growing significance of ridesourcing services in recent years suggests a need to examine the key determinants of ridesourcing demand. However, little is known regarding the nonlinear effects and spatial heterogeneity of ridesourcing demand determinants. This study applies an explainable-machine-learning-based analytical framework to identify the key factors that shape ridesourcing demand and to explore their nonlinear associations across various spatial contexts (airport, downtown, and neighborhood). We use the ridesourcing-trip data in Chicago for empirical analysis. The results reveal that the importance of built environment varies across spatial contexts, and it collectively contributes the largest importance in predicting ridesourcing demand for airport trips. Additionally, the nonlinear effects of built environment on ridesourcing demand show strong spatial variations. Ridesourcing demand is usually most responsive to the built environment changes for downtown trips, followed by neighborhood trips and airport trips. These findings offer transportation professionals nuanced insights for managing ridesourcing services.

preprint 2022 arXiv

Nonparametric Embeddings of Sparse High-Order Interaction Events

High-order interaction events are common in real-world applications. Learning embeddings that encode the complex relationships of the participants from these events is of great importance in knowledge mining and predictive tasks. Despite the success of existing approaches, e.g., Poisson tensor factorization, they ignore the sparse structure underlying the data, namely that the interactions that occur are far fewer than the possible interactions among all the participants. In this paper, we propose Nonparametric Embeddings of Sparse High-order interaction events (NESH). We hybridize a sparse hypergraph (tensor) process and a matrix Gaussian process to capture both the asymptotic structural sparsity within the interactions and nonlinear temporal relationships between the participants. We prove strong asymptotic bounds (both a lower and an upper bound) on the sparsity ratio, which reveal the asymptotic properties of the sampled structure. We use batch normalization, a stick-breaking construction, and sparse variational GP approximations to develop an efficient, scalable model inference algorithm. We demonstrate the advantage of our approach in several real-world applications.

preprint 2022 arXiv

Probabilistic methods for approximate archetypal analysis

Archetypal analysis is an unsupervised learning method for exploratory data analysis. One major challenge that limits the applicability of archetypal analysis in practice is the inherent computational complexity of the existing algorithms. In this paper, we provide a novel approximation approach to partially address this issue. Utilizing probabilistic ideas from high-dimensional geometry, we introduce two preprocessing techniques to reduce the dimension and representation cardinality of the data, respectively. We prove that provided the data is approximately embedded in a low-dimensional linear subspace and the convex hull of the corresponding representations is well approximated by a polytope with a few vertices, our method can effectively reduce the scaling of archetypal analysis. Moreover, the solution of the reduced problem is near-optimal in terms of prediction errors. Our approach can be combined with other acceleration techniques to further mitigate the intrinsic complexity of archetypal analysis. We demonstrate the usefulness of our results by applying our method to summarize several moderately large-scale datasets.

preprint 2021 arXiv

Analysis of The Ratio of $\ell_1$ and $\ell_2$ Norms in Compressed Sensing

We first propose a novel criterion that guarantees that an $s$-sparse signal is the local minimizer of the $\ell_1/\ell_2$ objective; our criterion is interpretable and useful in practice. We also give the first uniform recovery condition using a geometric characterization of the null space of the measurement matrix, and show that this condition is easily satisfied for a class of random matrices. We also present analysis on the robustness of the procedure when noise pollutes data. Numerical experiments are provided that compare $\ell_1/\ell_2$ with some other popular non-convex methods in compressed sensing. Finally, we propose a novel initialization approach to accelerate the numerical optimization procedure. We call this initialization approach "support selection", and we demonstrate that it empirically improves the performance of existing $\ell_1/\ell_2$ algorithms.
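As a small illustration of why the $\ell_1/\ell_2$ ratio can serve as a sparsity measure, the sketch below evaluates it on a few vectors. It only computes the objective and does not implement the recovery procedure or the support selection initialization from the abstract. The helper name `l1_over_l2` is hypothetical.

```python
import math


def l1_over_l2(x):
    """Return ||x||_1 / ||x||_2 for a nonzero sequence of numbers."""
    l1 = sum(abs(v) for v in x)
    l2 = math.sqrt(sum(v * v for v in x))
    if l2 == 0.0:
        raise ValueError("the ratio is undefined for the zero vector")
    return l1 / l2


# A 1-sparse vector attains the smallest possible ratio, 1.
print(l1_over_l2([0.0, 3.0, 0.0, 0.0]))  # 1.0

# A vector with n equal-magnitude entries attains the largest, sqrt(n).
print(l1_over_l2([1.0, -1.0, 1.0, 1.0]))  # 2.0

# Intermediate case: entries 3 and 4 give 7 / 5.
print(l1_over_l2([3.0, 4.0]))  # 1.4
```

The ratio is invariant to rescaling of the vector, which is one reason it behaves differently from the plain $\ell_1$ norm as a sparsity surrogate.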

preprint 2021 arXiv

Infections Forecasting and Intervention Effect Evaluation for COVID-19 via a Data-Driven Markov Process and Heterogeneous Simulation

The Coronavirus Disease 2019 (COVID-19) pandemic has caused a tremendous number of deaths and has had a devastating impact on economic development all over the world. It is therefore paramount to control its further transmission, which requires understanding the mechanism of its transmission process and evaluating the effect of different control strategies. To address these issues, we describe the transmission of COVID-19 as an explosive Markov process with four parameters. The state transitions of the proposed Markov process clearly disclose the explosive growth and complex heterogeneity of COVID-19. Based on this, we further propose a simulation approach with heterogeneous infections. Experiments show that our approach can closely track the real transmission process of COVID-19, disclose its transmission mechanism, and forecast the transmission under different non-drug intervention strategies. More importantly, our approach can help develop effective strategies for controlling COVID-19 and appropriately compare their control effect in different countries/cities.

preprint 2020 arXiv

Consistency of archetypal analysis

Archetypal analysis is an unsupervised learning method that uses a convex polytope to summarize multivariate data. For fixed $k$, the method finds a convex polytope with $k$ vertices, called archetype points, such that the polytope is contained in the convex hull of the data and the mean squared distance between the data and the polytope is minimal. In this paper, we prove a consistency result that shows if the data is independently sampled from a probability measure with bounded support, then the archetype points converge to a solution of the continuum version of the problem, of which we identify and establish several properties. We also obtain the convergence rate of the optimal objective values under appropriate assumptions on the distribution. If the data is independently sampled from a distribution with unbounded support, we also prove a consistency result for a modified method that penalizes the dispersion of the archetype points. Our analysis is supported by detailed computational experiments of the archetype points for data sampled from the uniform distribution in a disk, the normal distribution, an annular distribution, and a Gaussian mixture model.

preprint 2020 arXiv

Open Set Domain Adaptation by Extreme Value Theory

Common domain adaptation techniques assume that the source domain and the target domain share an identical label space, which is problematic because, when target samples are unlabeled, we have no knowledge of whether the two domains share the same label space. When this is not the case, the existing methods fail to perform well because the additional unknown classes are also matched with the source domain during adaptation. In this paper, we tackle the open set domain adaptation problem under the assumption that the source and the target label spaces only partially overlap. The task then becomes, when unknown classes exist, how to detect the target unknown classes and avoid aligning them with the source domain. We propose to utilize an instance-level reweighting strategy for domain adaptation, where the weights indicate the likelihood of a sample belonging to known classes, and to model the tail of the entropy distribution with Extreme Value Theory for unknown class detection. Experiments on conventional domain adaptation datasets show that the proposed method outperforms the state-of-the-art models.

preprint 2014 arXiv

A solution of 2D QCD at Finite $N$ using a conformal basis

We study 2D QCD with a fundamental fermion at small-$N$ using the recently proposed conformal basis approach. We find that effective conformal dominance still holds, namely that the spectrum converges efficiently, with high scaling-dimension operators decoupling exponentially quickly from the stable single-particle states. Consequently, for these stable bound states, accurate, analytic expressions for wavefunctions and parton distribution functions can be given, even for $N=3$.

preprint 2014 arXiv

Seeking Lorentz Violation from the Higgs

The recently discovered Higgs particle with a mass near $126$ GeV presents new opportunities to explore Lorentz violation. Ultra-high-energy cosmic rays are one of the most sensitive testing grounds for Lorentz symmetry, and can be used to search for and limit departures from Lorentz invariance in the Higgs sector. If the Higgs were to have a super- or sub-luminal maximal speed, both Higgs and weak interaction physics would be modified. Consideration of such modifications allows us to constrain the Higgs maximal velocity to agree with that of other Standard Model particles to parts in $10^{14}$.

preprint 2014 arXiv

Solving 2D QCD with an adjoint fermion analytically

We present an analytic approach to solving 1+1 dimensional QCD with an adjoint Majorana fermion. In the UV this theory is described by a trivial CFT containing free fermions. The quasi-primary operators of this CFT lead to a discrete basis of states which is useful for diagonalizing the Hamiltonian of the full strongly interacting theory. Working at large-$N$, we find that the decoupling of high scaling-dimension quasi-primary operators from the low-energy spectrum occurs exponentially fast in their scaling-dimension. This suggests a scheme, whereby, truncating the basis to operators of dimension below $\Delta_{\max}$, one can calculate the low-energy spectrum, parametrically to an accuracy of $e^{-\Delta_{\max}}$ (although the precise accuracy depends on the state). Choosing $\Delta_{\max} = 9.5$ we find very good agreement with the known spectrum obtained earlier by numerical DLCQ methods. Specifically, below the first three-particle threshold, we are able to identify all six single-particle bound-states, as well as several two-particle thresholds.

preprint 2012 arXiv

Model Independent Direct Detection Analyses

Following the construction of the general effective theory for dark matter direct detection in 1203.3542, we perform an analysis of the experimental constraints on the full parameter space of elastically scattering dark matter. We review the prescription for calculating event rates in the general effective theory and discuss the sensitivity of various experiments to additional nuclear responses beyond the spin-independent (SI) and spin-dependent (SD) couplings: an angular-momentum-dependent (LD) and spin-and-angular-momentum-dependent (LSD) response, as well as a distinction between transverse and longitudinal spin-dependent responses. We consider the effect of interference between different operators and in particular look at directions in parameter space where such cancellations lead to holes in the sensitivity of individual experiments. We explore the complementarity of different experiments by looking at the improvement of bounds when experiments are combined. Finally, our scan through parameter space shows that within the assumptions on models and on the experiments' sensitivity that we make, no elastically scattering dark matter explanation of DAMA is consistent with all other experiments at 90%, though we find points in parameter space that are ruled out only by about a factor of 2 in the cross-section.

preprint 2012 arXiv

The Effective Field Theory of Dark Matter Direct Detection

We extend and explore the general non-relativistic effective theory of dark matter (DM) direct detection. We describe the basic non-relativistic building blocks of operators and discuss their symmetry properties, writing down all Galilean-invariant operators up to quadratic order in momentum transfer arising from exchange of particles of spin 1 or less. Any DM particle theory can be translated into the coefficients of an effective operator and any effective operator can be simply related to the most general description of the nuclear response. We find several operators which lead to novel nuclear responses. These responses differ significantly from the standard minimal WIMP cases in their relative coupling strengths to various elements, changing how the results from different experiments should be compared against each other. Response functions are evaluated for common DM targets (F, Na, Ge, I, and Xe) using standard shell model techniques. We point out that each of the nuclear responses is familiar from past studies of semi-leptonic electroweak interactions, and thus potentially testable in weak interaction studies. We provide tables of the full set of required matrix elements at finite momentum transfer for a range of common elements, making a careful and fully model-independent analysis possible. Finally, we discuss embedding non-relativistic effective theory operators into UV models of dark matter.
