Source author record

Zexin Pan

Zexin Pan appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.NA Numerical Analysis Computation math.ST Statistics Theory

Catalog footprint

What is connected

5works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Quasi-Monte Carlo with one categorical variable

We study randomized quasi-Monte Carlo (RQMC) estimation of a multivariate integral where one of the variables takes only a finite number of values. This problem arises when the variable of integration is drawn from a mixture distribution as is common in importance sampling and also arises in some recent work on transport maps. We find that when integration error decreases at an RQMC rate that it is then important to oversample the smallest mixture components instead of using a proportional allocation. This can even improve the rate of convergence. The optimal allocations depend on the possibly unknown convergence rate. Designing the sample with an incorrect assumption on the rate still attains that convergence rate, with an inferior implied constant. The penalty for using a pessimistic rate is typically higher than for using an optimistic one. We also find that for the most accurate RQMC sampling methods, it is advantageous to arrange that our $n=2^m$ randomized Sobol' points split into subsample sizes that are also powers of $2$.

preprint2024arXiv

Computable error bounds for quasi-Monte Carlo using points with non-negative local discrepancy

Let $f:[0,1]^d\to\mathbb{R}$ be a completely monotone integrand as defined by Aistleitner and Dick (2015) and let points $\boldsymbol{x}_0,\dots,\boldsymbol{x}_{n-1}\in[0,1]^d$ have a non-negative local discrepancy (NNLD) everywhere in $[0,1]^d$. We show how to use these properties to get a non-asymptotic and computable upper bound for the integral of $f$ over $[0,1]^d$. An analogous non-positive local discrepancy (NPLD) property provides a computable lower bound. It has been known since Gabai (1967) that the two dimensional Hammersley points in any base $b\ge2$ have non-negative local discrepancy. Using the probabilistic notion of associated random variables, we generalize Gabai's finding to digital nets in any base $b\ge2$ and any dimension $d\ge1$ when the generator matrices are permutation matrices. We show that permutation matrices cannot attain the best values of the digital net quality parameter when $d\ge3$. As a consequence the computable absolutely sure bounds we provide come with less accurate estimates than the usual digital net estimates do in high dimensions. We are also able to construct high dimensional rank one lattice rules that are NNLD. We show that those lattices do not have good discrepancy properties: any lattice rule with the NNLD property in dimension $d\ge2$ either fails to be projection regular or has all its points on the main diagonal. Complete monotonicity is a very strict requirement that for some integrands can be mitigated via a control variate.

preprint2022arXiv

Super-polynomial accuracy of multidimensional randomized nets using the median-of-means

We study approximate integration of a function $f$ over $[0,1]^s$ based on taking the median of $2r-1$ integral estimates derived from independently randomized $(t,m,s)$-nets in base $2$. The nets are randomized by Matousek's random linear scramble with a digital shift. If $f$ is analytic over $[0,1]^s$, then the probability that any one randomized net's estimate has an error larger than $2^{-cm^2/s}$ times a quantity depending on $f$ is $O(1/\sqrt{m})$ for any $c<3\log(2)/π^2\approx 0.21$. As a result the median of the distribution of these scrambled nets has an error that is $O(n^{-c\log(n)/s})$ for $n=2^m$ function evaluations. The sample median of $2r-1$ independent draws attains this rate too, so long as $r/m^2$ is bounded away from zero as $m\to\infty$. We include results for finite precision estimates and some non-asymptotic comparisons to taking the mean of $2r-1$ independent draws.

preprint2022arXiv

Super-polynomial accuracy of one dimensional randomized nets using the median-of-means

Let $f$ be analytic on $[0,1]$ with $|f^{(k)}(1/2)|\leq Aα^kk!$ for some constant $A$ and $α<2$. We show that the median estimate of $μ=\int_0^1f(x)\,\mathrm{d}x$ under random linear scrambling with $n=2^m$ points converges at the rate $O(n^{-c\log(n)})$ for any $c< 3\log(2)/π^2\approx 0.21$. We also get a super-polynomial convergence rate for the sample median of $2k-1$ random linearly scrambled estimates, when $k=Ω(m)$. When $f$ has a $p$'th derivative that satisfies a $λ$-Hölder condition then the median-of-means has error $O( n^{-(p+λ)+ε})$ for any $ε>0$, if $k\to\infty$ as $m\to\infty$.

preprint2022arXiv

Where are the logs?

The commonly quoted error rates for QMC integration with an infinite low discrepancy sequence is $O(n^{-1}\log(n)^r)$ with $r=d$ for extensible sequences and $r=d-1$ otherwise. Such rates hold uniformly over all $d$ dimensional integrands of Hardy-Krause variation one when using $n$ evaluation points. Implicit in those bounds is that for any sequence of QMC points, the integrand can be chosen to depend on $n$. In this paper we show that rates with any $r<(d-1)/2$ can hold when $f$ is held fixed as $n\to\infty$. This is accomplished following a suggestion of Erich Novak to use some unpublished results of Trojan from the 1980s as given in the information based complexity monograph of Traub, Wasilkowski and Woźniakowski. The proof is made by applying a technique of Roth with the theorem of Trojan. The proof is non constructive and we do not know of any integrand of bounded variation in the sense of Hardy and Krause for which the QMC error exceeds $(\log n)^{1+ε}/n$ for infinitely many $n$ when using a digital sequence such as one of Sobol's. An empirical search when $d=2$ for integrands designed to exploit known weaknesses in certain point sets showed no evidence that $r>1$ is needed. An example with $d=3$ and $n$ up to $2^{100}$ might possibly require $r>1$.

Zexin Pan

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

Quasi-Monte Carlo with one categorical variable

Computable error bounds for quasi-Monte Carlo using points with non-negative local discrepancy

Super-polynomial accuracy of multidimensional randomized nets using the median-of-means

Super-polynomial accuracy of one dimensional randomized nets using the median-of-means

Where are the logs?