Source author record

Wang Miao

Wang Miao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Methodology math.ST Statistics Theory Applications eess.SP Networking and Internet Architecture Neural and Evolutionary Computing physics.ins-det physics.optics

Catalog footprint

What is connected

12works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A Selective Review of Negative Control Methods in Epidemiology

Purpose of Review: Negative controls are a powerful tool to detect and adjust for bias in epidemiological research. This paper introduces negative controls to a broader audience and provides guidance on principled design and causal analysis based on a formal negative control framework. Recent Findings: We review and summarize causal and statistical assumptions, practical strategies, and validation criteria that can be combined with subject matter knowledge to perform negative control analyses. We also review existing statistical methodologies for detection, reduction, and correction of confounding bias, and briefly discuss recent advances towards nonparametric identification of causal effects in a double negative control design. Summary: There is great potential for valid and accurate causal inference leveraging contemporary healthcare data in which negative controls are routinely available. Design and analysis of observational data leveraging negative controls is an area of growing interest in health and social sciences. Despite these developments, further effort is needed to disseminate these novel methods to ensure they are adopted by practicing epidemiologists.

preprint2022arXiv

Doubly Robust Proximal Causal Inference under Confounded Outcome-Dependent Sampling

Unmeasured confounding and selection bias are often of concern in observational studies and may invalidate a causal analysis if not appropriately accounted for. Under outcome-dependent sampling, a latent factor that has causal effects on the treatment, outcome, and sample selection process may cause both unmeasured confounding and selection bias, rendering standard causal parameters unidentifiable without additional assumptions. Under an odds ratio model for the treatment effect, Li et al. 2022 established both proximal identification and estimation of causal effects by leveraging a pair of negative control variables as proxies of latent factors at the source of both confounding and selection bias. However, their approach relies exclusively on the existence and correct specification of a so-called treatment confounding bridge function, a model that restricts the treatment assignment mechanism. In this article, we propose doubly robust estimation under the odds ratio model with respect to two nuisance functions -- a treatment confounding bridge function and an outcome confounding bridge function that restricts the outcome law, such that our estimator is consistent and asymptotically normal if either bridge function model is correctly specified, without knowing which one is. Thus, our proposed doubly robust estimator is potentially more robust than that of Li et al. 2022. Our simulations confirm that the proposed proximal estimators of an odds ratio causal effect can adequately account for both residual confounding and selection bias under stated conditions with well-calibrated confidence intervals in a wide range of scenarios, where standard methods generally fail to be consistent. In addition, the proposed doubly robust estimator is consistent if at least one confounding bridge function is correctly specified.

preprint2022arXiv

Identification and estimation of causal effects in the presence of confounded principal strata

The principal stratification has become a popular tool to address a broad class of causal inference questions, particularly in dealing with non-compliance and truncation-by-death problems. The causal effects within principal strata which are determined by joint potential values of the intermediate variable, also known as the principal causal effects, are often of interest in these studies. Analyses of principal causal effects from observed data in the literature mostly rely on ignorability of the treatment assignment, which requires practitioners to accurately measure as many as covariates so that all possible confounding sources are captured. However, collecting all potential confounders in observational studies is often difficult and costly, the ignorability assumption may thus be questionable. In this paper, by leveraging available negative controls that have been increasingly used to deal with uncontrolled confounding, we consider identification and estimation of causal effects when the treatment and principal strata are confounded by unobserved variables. Specifically, we show that the principal causal effects can be nonparametrically identified by invoking a pair of negative controls that are both required not to directly affect the outcome. We then relax this assumption and establish identification of principal causal effects under various semiparametric or parametric models. We also propose an estimation method of principal causal effects. Extensive simulation studies show good performance of the proposed approach and a real data application from the National Longitudinal Survey of Young Men is used for illustration.

preprint2022arXiv

Identifying effects of multiple treatments in the presence of unmeasured confounding

Identification of treatment effects in the presence of unmeasured confounding is a persistent problem in the social, biological, and medical sciences. The problem of unmeasured confounding in settings with multiple treatments is most common in statistical genetics and bioinformatics settings, where researchers have developed many successful statistical strategies without engaging deeply with the causal aspects of the problem. Recently there have been a number of attempts to bridge the gap between these statistical approaches and causal inference, but these attempts have either been shown to be flawed or have relied on fully parametric assumptions. In this paper, we propose two strategies for identifying and estimating causal effects of multiple treatments in the presence of unmeasured confounding. The auxiliary variables approach leverages variables that are not causally associated with the outcome; in the case of a univariate confounder, our method only requires one auxiliary variable, unlike existing instrumental variable methods that would require as many instruments as there are treatments. An alternative null treatments approach relies on the assumption that at least half of the confounded treatments have no causal effect on the outcome, but does not require a priori knowledge of which treatments are null. Our identification strategies do not impose parametric assumptions on the outcome model and do not rest on estimation of the confounder. This paper extends and generalizes existing work on unmeasured confounding with a single treatment and models commonly used in bioinformatics.

preprint2022arXiv

Nonparametric inference about mean functionals of nonignorable nonresponse data without identifying the joint distribution

We consider identification and inference about mean functionals of observed covariates and an outcome variable subject to nonignorable missingness. By leveraging a shadow variable, we establish a necessary and sufficient condition for identification of the mean functional even if the full data distribution is not identified. We further characterize a necessary condition for $\sqrt{n}$-estimability of the mean functional. This condition naturally strengthens the identifying condition, and it requires the existence of a function as a solution to a representer equation that connects the shadow variable to the mean functional. Solutions to the representer equation may not be unique, which presents substantial challenges for nonparametric estimation and standard theories for nonparametric sieve estimators are not applicable here. We construct a consistent estimator for the solution set and then adapt the theory of extremum estimators to find from the estimated set a consistent estimator for an appropriately chosen solution. The estimator is asymptotically normal, locally efficient and attains the semiparametric efficiency bound under certain regularity conditions. We illustrate the proposed approach via simulations and a real data application on home pricing.

preprint2022arXiv

Proximal Causal Inference for Complex Longitudinal Studies

A standard assumption for causal inference about the joint effects of time-varying treatment is that one has measured sufficient covariates to ensure that within covariate strata, subjects are exchangeable across observed treatment values, also known as "sequential randomization assumption (SRA)". SRA is often criticized as it requires one to accurately measure all confounders. Realistically, measured covariates can rarely capture all confounders with certainty. Often covariate measurements are at best proxies of confounders, thus invalidating inferences under SRA. In this paper, we extend the proximal causal inference (PCI) framework of Miao et al. (2018) to the longitudinal setting under a semiparametric marginal structural mean model (MSMM). PCI offers an opportunity to learn about joint causal effects in settings where SRA based on measured time-varying covariates fails, by formally accounting for the covariate measurements as imperfect proxies of underlying confounding mechanisms. We establish nonparametric identification with a pair of time-varying proxies and provide a corresponding characterization of regular and asymptotically linear estimators of the parameter indexing the MSMM, including a rich class of doubly robust estimators, and establish the corresponding semiparametric efficiency bound for the MSMM. Extensive simulation studies and a data application illustrate the finite sample behavior of proposed methods.

preprint2020arXiv

A DoA Estimation Based Robust Beam Forming Method for UAV-BS Communication

High data rate communication with Unmanned Aerial Vehicles (UAV) is of growing demand among industrial and commercial applications since the last decade. In this paper, we investigate enhancing beam forming performance based on signal Direction of Arrival (DoA) estimation to support UAV-cellular network communication. We first study UAV fast moving scenario where we found that drone's mobility cause degradation of beam forming algorithm performance. Then, we propose a DoA estimation algorithm and a steering vector adaptive receiving beam forming method. The DoA estimation algorithm is of high precision with low computational complexity. Also it enables a beam former to timely adjust steering vector value in calculating beam forming weight. Simulation results show higher SINR performance and more stability of proposed method than traditional method based on Multiple Signal Classification (MUSIC) DoA estimation algorithm.

preprint2020arXiv

Routing-Led Placement of VNFs in Arbitrary Networks

The ever increasing demand for computing resources has led to the creation of hyperscale datacentres with tens of thousands of servers. As demand continues to rise, new technologies must be incorporated to ensure high quality services can be provided without the damaging environmental impact of high energy consumption. Virtualisation technology such as network function virtualisation (NFV) allows for the creation of services by connecting component parts known as virtual network functions (VNFs). VNFs cam be used to maximally utilise available datacentre resources by optimising the placement and routes of VNFs, to maintain a high quality of service whilst minimising energy costs. Current research on this problem has focussed on placing VNFs and considered routing as a secondary concern. In this work we argue that the opposite approach, a routing-led approach is preferable. We propose a novel routing-led algorithm and analyse each of the component parts over a range of different topologies on problems with up to 16000 variables and compare its performance against a traditional placement based algorithm. Empirical results show that our routing-led algorithm can produce significantly better, faster solutions to large problem instances on a range of datacentre topologies.

preprint2016arXiv

Identification and Inference for Marginal Average Treatment Effect on the Treated With an Instrumental Variable

In observational studies, treatments are typically not randomized and therefore estimated treatment effects may be subject to confounding bias. The instrumental variable (IV) design plays the role of a quasi-experimental handle since the IV is associated with the treatment and only affects the outcome through the treatment. In this paper, we present a novel framework for identification and inference using an IV for the marginal average treatment effect amongst the treated (ETT) in the presence of unmeasured confounding. For inference, we propose three different semiparametric approaches: (i) inverse probability weighting (IPW), (ii) outcome regression (OR), and (iii) doubly robust (DR) estimation, which is consistent if either (i) or (ii) is consistent, but not necessarily both. A closed-form locally semiparametric efficient estimator is obtained in the simple case of binary IV and outcome and the efficiency bound is derived for the more general case.

preprint2016arXiv

On Varieties of Doubly Robust Estimators Under Missingness Not at Random With a Shadow Variable

Suppose we are interested in the mean of an outcome variable missing not at random. Suppose however that one has available a fully observed shadow variable, which is associated with the outcome but independent of the missingness process conditional on covariates and the possibly unobserved outcome. Such a variable may be a proxy or a mismeasured version of the outcome available for all individuals. We have previously established necessary and sufficient conditions for identification of the full data law in such a setting, and have described semiparametric estimators including a doubly robust estimator of the outcome mean. Here, we propose two alternative doubly robust estimators for the outcome mean, which may be viewed as extensions of analogous methods under missingness at random, but enjoy different properties. We assess correctness of the required working models via straightforward goodness-of-fit tests.

preprint2015arXiv

Identifiability of Normal and Normal Mixture Models With Nonignorable Missing Data

Missing data problems arise in many applied research studies. They may jeopardize statistical inference of the model of interest, if the missing mechanism is nonignorable, that is, the missing mechanism depends on the missing values themselves even conditional on the observed data. With a nonignorable missing mechanism, the model of interest is often not identifiable without imposing further assumptions. We find that even if the missing mechanism has a known parametric form, the model is not identifiable without specifying a parametric outcome distribution. Although it is fundamental for valid statistical inference, identifiability under nonignorable missing mechanisms is not established for many commonly-used models. In this paper, we first demonstrate identifiability of the normal distribution under monotone missing mechanisms. We then extend it to the normal mixture and $t$ mixture models with non-monotone missing mechanisms. We discover that models under the Logistic missing mechanism are less identifiable than those under the Probit missing mechanism. We give necessary and sufficient conditions for identifiability of models under the Logistic missing mechanism, which sometimes can be checked in real data analysis. We illustrate our methods using a series of simulations, and apply them to a real-life dataset.

preprint2012arXiv

High-precision Absolute Distance Measurements over a Long Range Based on Two Optoelectronic Oscillators

Absolute distance measurement (ADM) over a long range has been studied intensely over the last several decades, due to its important applications in large-scale manufacturing and outer space explorations [1-5]. Traditional absolute distance measurements utilize detection of time-of-flight information, detection of phase shift, or a combination of the two [6-17]. In this paper, we present a novel scheme for high-precision ADM over a long range based on frequency detection by using two optoelectronic oscillators (OEO) to convert distance information to frequency information. By taking advantage of accumulative magnification theory, the absolute error of the measured distance is magnified by about 2*10E5 times, which makes the precision of the measured distance significantly improved. In our experiments, the maximum error is 1.5 um at the emulated ~6 km distance, including the drift error of about 1 um in the air path due to the change in environmental conditions. In addition, the measurable distance using this scheme could be further extended. The highest relative measurement precision is 2*10E10 in our current system while the actual relative measurement precision of our experimental system is limited by the variation of atmospheric conditions and is about 4*10E9.

Wang Miao

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

A Selective Review of Negative Control Methods in Epidemiology

Doubly Robust Proximal Causal Inference under Confounded Outcome-Dependent Sampling

Identification and estimation of causal effects in the presence of confounded principal strata

Identifying effects of multiple treatments in the presence of unmeasured confounding

Nonparametric inference about mean functionals of nonignorable nonresponse data without identifying the joint distribution

Proximal Causal Inference for Complex Longitudinal Studies

A DoA Estimation Based Robust Beam Forming Method for UAV-BS Communication

Routing-Led Placement of VNFs in Arbitrary Networks

Identification and Inference for Marginal Average Treatment Effect on the Treated With an Instrumental Variable

On Varieties of Doubly Robust Estimators Under Missingness Not at Random With a Shadow Variable

Identifiability of Normal and Normal Mixture Models With Nonignorable Missing Data

High-precision Absolute Distance Measurements over a Long Range Based on Two Optoelectronic Oscillators