Researcher profile

Van Vu

Van Vu contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging · Verification L1 · Unclaimed author
9 works
0 followers
12 topics
4 close collaborators

Published work

9 published item(s)

preprint · 2023 · arXiv

Matrices with Gaussian noise: optimal estimates for singular subspace perturbation

The Davis-Kahan-Wedin $\sin \Theta$ theorem describes how the singular subspaces of a matrix change when subjected to a small perturbation. This classic result is sharp in the worst-case scenario. In this paper, we prove a stochastic version of the Davis-Kahan-Wedin $\sin \Theta$ theorem when the perturbation is a Gaussian random matrix. Under certain structural assumptions, we obtain an optimal bound that significantly improves upon the classic Davis-Kahan-Wedin $\sin \Theta$ theorem. One of our key tools is a new perturbation bound for the singular values, which may be of independent interest.

preprint · 2023 · arXiv

Random perturbation of low rank matrices: Improving classical bounds

Matrix perturbation inequalities, such as Weyl's theorem (concerning the singular values) and the Davis-Kahan theorem (concerning the singular vectors), play essential roles in quantitative science; in particular, these bounds have found application in data analysis as well as related areas of engineering and computer science. In many situations, the perturbation is assumed to be random, and the original matrix has certain structural properties (such as having low rank). We show that, in this scenario, classical perturbation results, such as Weyl and Davis-Kahan, can be improved significantly. We believe many of our new bounds are close to optimal and also discuss some applications.
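For orientation, the two classical bounds named in this abstract are commonly quoted in the following form. The notation is an assumption, not taken from the abstract: $A$ is the original matrix, $E$ the perturbation, $\sigma_i$ the $i$-th largest singular value, $\|E\|$ the operator norm, $v_1$ and $v_1'$ the leading singular vectors of $A$ and $A+E$, and $\delta$ the gap between the two largest singular values of $A$. The second line is one frequently cited consequence of Wedin's theorem rather than its most general statement.

```latex
% Weyl's inequality for singular values
\max_{i} \left| \sigma_i(A + E) - \sigma_i(A) \right| \le \|E\|

% A commonly quoted Wedin-type bound for the leading singular vector
\sin \angle(v_1, v_1') \le \frac{2\,\|E\|}{\delta},
\qquad \delta = \sigma_1(A) - \sigma_2(A)
```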

preprint · 2022 · arXiv

VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations

Most of the existing chest X-ray datasets include labels from a list of findings without specifying their locations on the radiographs. This limits the development of machine learning algorithms for the detection and localization of chest abnormalities. In this work, we describe a dataset of more than 100,000 chest X-ray scans that were retrospectively collected from two major hospitals in Vietnam. Out of this raw data, we release 18,000 images that were manually annotated by a total of 17 experienced radiologists with 22 local labels of rectangles surrounding abnormalities and 6 global labels of suspected diseases. The released dataset is divided into a training set of 15,000 and a test set of 3,000. Each scan in the training set was independently labeled by 3 radiologists, while each scan in the test set was labeled by the consensus of 5 radiologists. We designed and built a labeling platform for DICOM images to facilitate these annotation procedures. All images are made publicly available (https://www.physionet.org/content/vindr-cxr/1.0.0/) in DICOM format along with the labels of both the training set and the test set.

preprint · 2020 · arXiv

Reaching a Consensus on Random Networks: The Power of Few

A community of $n$ individuals splits into two camps, Red and Blue. The individuals are connected by a social network, which influences their colors. Every day, each person changes his/her color according to the majority among his/her neighbors. Red (Blue) wins if everyone in the community becomes Red (Blue) at some point. We study this process when the underlying network is the random Erdős-Rényi graph $G(n, p)$. With a balanced initial state ($n/2$ persons in each camp), it is clear that each color wins with the same probability. Our study reveals that for any constants $p$ and $\varepsilon$, there is a constant $C$ such that if one camp has $n/2 + C$ individuals, then it wins with probability at least $1 - \varepsilon$. The surprising key fact here is that $C$ does not depend on $n$, the population of the community. When $p = 1/2$ and $\varepsilon = 0.1$, one can set $C$ as small as 6. If the aim of the process is to choose a candidate, then this means it takes only $6$ "defectors" to win an election unanimously with overwhelming odds.
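The process described in this abstract can be simulated in a few lines. The following is a minimal sketch, not the authors' code: the function names, the synchronous update, the cap on the number of days, and the tie rule (a person with equally many Red and Blue neighbors keeps the current color) are all assumptions made for illustration.

```python
import random

def random_graph(n, p, rng):
    """Sample an Erdos-Renyi graph G(n, p) as a list of adjacency lists."""
    nbrs = [[] for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p:
                nbrs[i].append(j)
                nbrs[j].append(i)
    return nbrs

def majority_dynamics(n, p, extra_red, seed=0, max_days=50):
    """Run majority dynamics starting with n//2 + extra_red Red individuals.

    Returns (winner, day), where winner is "Red", "Blue", or None if nobody
    has won within max_days (the cap is an assumption of this sketch).
    """
    rng = random.Random(seed)
    nbrs = random_graph(n, p, rng)
    n_red = n // 2 + extra_red
    colors = [1] * n_red + [-1] * (n - n_red)  # +1 is Red, -1 is Blue
    for day in range(1, max_days + 1):
        new_colors = []
        for i in range(n):
            balance = sum(colors[j] for j in nbrs[i])
            if balance == 0:
                new_colors.append(colors[i])  # tie: keep color (assumed rule)
            else:
                new_colors.append(1 if balance > 0 else -1)
        colors = new_colors  # everyone updates simultaneously
        total = sum(colors)
        if total == n:
            return "Red", day
        if total == -n:
            return "Blue", day
    return None, max_days

if __name__ == "__main__":
    # Empirical Red win frequency with a 6-person head start at p = 1/2.
    trials = 20
    wins = sum(majority_dynamics(200, 0.5, 6, seed=s)[0] == "Red"
               for s in range(trials))
    print(f"Red won {wins} of {trials} runs")
```

Varying `n` while holding `extra_red` fixed is one way to probe the abstract's claim that the required advantage does not grow with the population, although a small simulation like this can only suggest the behavior, not confirm it.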

preprint · 2010 · arXiv

Bulk universality for Wigner hermitian matrices with subexponential decay

We consider the ensemble of $n \times n$ Wigner Hermitian matrices $H = (h_{\ell k})_{1 \leq \ell,k \leq n}$ that generalize the Gaussian unitary ensemble (GUE). The matrix elements $h_{k\ell} = \bar h_{\ell k}$ are given by $h_{\ell k} = n^{-1/2} (x_{\ell k} + \sqrt{-1} y_{\ell k})$, where $x_{\ell k}, y_{\ell k}$ for $1 \leq \ell < k \leq n$ are i.i.d. random variables with mean zero and variance 1/2, $y_{\ell\ell}=0$ and $x_{\ell \ell}$ have mean zero and variance 1. We assume the distribution of $x_{\ell k}, y_{\ell k}$ to have subexponential decay. In a recent paper, four of the authors established that the gap distribution and averaged $k$-point correlation of these matrices were universal (and in particular, agreed with those for GUE) assuming additional regularity hypotheses on the $x_{\ell k}, y_{\ell k}$. In another recent paper, the other two authors, using a different method, established the same conclusion assuming instead some moment and support conditions on the $x_{\ell k}, y_{\ell k}$. In this short note we observe that the arguments of these two papers can be combined to establish universality of the gap distribution and averaged $k$-point correlations for all Wigner matrices (with subexponentially decaying entries), with no extra assumptions.
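The matrix model defined in this abstract can be sampled directly. The sketch below is illustrative only: the function name is hypothetical, and it assumes Gaussian entries, which reproduces the GUE special case; any other mean-zero distribution with the stated variances and subexponential decay could be substituted.

```python
import math
import random

def wigner_hermitian(n, seed=0):
    """Sample an n x n Wigner Hermitian matrix as a list of lists of complex.

    Off-diagonal: h_lk = n^{-1/2} (x + i*y), x and y i.i.d. with mean 0 and
    variance 1/2. Diagonal: h_ll = n^{-1/2} x with mean 0, variance 1, y = 0.
    Gaussian entries are an assumption of this sketch (the GUE case).
    """
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(n)
    sd_off = math.sqrt(0.5)  # standard deviation giving variance 1/2
    H = [[0j] * n for _ in range(n)]
    for l in range(n):
        H[l][l] = complex(scale * rng.gauss(0.0, 1.0), 0.0)
        for k in range(l + 1, n):
            x = rng.gauss(0.0, sd_off)
            y = rng.gauss(0.0, sd_off)
            H[l][k] = scale * complex(x, y)
            H[k][l] = H[l][k].conjugate()  # enforce h_kl = conj(h_lk)
    return H

if __name__ == "__main__":
    H = wigner_hermitian(4, seed=1)
    for row in H:
        print(["{:.3f}".format(z) for z in row])
```

Filling only the upper triangle and mirroring the conjugate keeps the matrix exactly Hermitian, so its eigenvalues are real and the local statistics discussed in the abstract are well defined.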

preprint · 2010 · arXiv

Random matrices: Universality of local eigenvalue statistics

In this paper, we consider the universality of the local eigenvalue statistics of random matrices. Our main result shows that these statistics are determined by the first four moments of the distribution of the entries. As a consequence, we derive the universality of eigenvalue gap distribution and $k$-point correlation and many other statistics (under some mild assumptions) for both Wigner Hermitian matrices and Wigner real symmetric matrices.

preprint · 2010 · arXiv

Singular vectors under random perturbation

Computing the first few singular vectors of a large matrix is a problem that frequently comes up in statistics and numerical analysis. Given the presence of noise, exact calculation is hard to achieve, and the following problem is of importance: How much does a small perturbation to the matrix change the singular vectors? Answering this question, classical theorems, such as those of Davis-Kahan and Wedin, give tight estimates for the worst-case scenario. In this paper, we show that if the perturbation (noise) is random and our matrix has low rank, then better estimates can be obtained. Our method relies on high dimensional geometry and is different from those used in earlier papers.
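The gap between worst-case and random behavior can be seen in a small numerical experiment. The sketch below is an illustration under stated assumptions, not the paper's method: it uses a rank-one matrix, i.i.d. standard Gaussian noise, plain power iteration in place of a full SVD, and compares the observed angle against the commonly quoted Wedin-type bound $2\|E\|/\delta$ (with $\delta$ equal to the single nonzero singular value here). All function names and parameter values are hypothetical.

```python
import math
import random

def matvec(M, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, x)) for row in M]

def normalize(x):
    """Return (unit vector, Euclidean norm) for a nonzero vector."""
    norm = math.sqrt(sum(a * a for a in x))
    return [a / norm for a in x], norm

def top_singular(M, iters=300, seed=0):
    """Estimate the largest singular value and its left singular vector
    by power iteration (the value is approached from below)."""
    rng = random.Random(seed)
    Mt = [list(col) for col in zip(*M)]
    u, _ = normalize([rng.gauss(0.0, 1.0) for _ in M])
    sigma = 0.0
    for _ in range(iters):
        v, _ = normalize(matvec(Mt, u))
        u, sigma = normalize(matvec(M, v))
    return sigma, u

def experiment(n=40, sigma=200.0, seed=1):
    """Perturb a rank-one n x n matrix with Gaussian noise and return
    (observed sin of the angle, Wedin-type bound 2*||E||/sigma)."""
    rng = random.Random(seed)
    u, _ = normalize([rng.gauss(0.0, 1.0) for _ in range(n)])
    v, _ = normalize([rng.gauss(0.0, 1.0) for _ in range(n)])
    A = [[sigma * ui * vj for vj in v] for ui in u]          # rank one
    E = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(n)]
    B = [[a + e for a, e in zip(ra, re)] for ra, re in zip(A, E)]
    _, u_hat = top_singular(B)
    norm_E, _ = top_singular(E)
    cos_theta = min(1.0, abs(sum(a * b for a, b in zip(u, u_hat))))
    sin_theta = math.sqrt(1.0 - cos_theta * cos_theta)
    return sin_theta, 2.0 * norm_E / sigma

if __name__ == "__main__":
    observed, bound = experiment()
    print(f"observed sin(angle) = {observed:.4f}, worst-case bound = {bound:.4f}")
```

In runs of this kind the observed angle typically sits well below the worst-case bound, which is the qualitative phenomenon the abstract describes for random noise and low-rank matrices.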