Source author record

Dave Zachariah

Dave Zachariah appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.ST Statistics Theory Machine Learning Information Theory math.IT eess.SP Networking and Internet Architecture Systems and Control Applications math.OC Multiagent Systems Robotics

Catalog footprint

What is connected

21works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Learning plug-in surrogate endpoints for randomized experiments

Surrogate endpoints are used in place of long-term outcomes in randomized experiments when observing the real outcome for a large enough cohort is prohibitively expensive or impractical. A short-term surrogate is good if the result of an experiment using the surrogate is predictive of the result of a hypothetical study using the real outcome. Much attention has been paid to formalizing this property in causal terms, but most criteria are unidentifiable and cannot be turned into practical algorithms for learning surrogate endpoints from data. To address this, we study plug-in composite surrogates, functions of post-treatment variables that may be substituted directly for the primary outcome in a randomized experiment. We propose two methods for learning plug-in surrogates that maximize effect predictiveness, and characterize the possibility of finding endpoints that yield unbiased effect estimates in representative scenarios. Finally, in both synthetic experiments with known effects and in data from a real-world experiment, we find that our method, based on directly modeling the surrogate effect, returns plug-in endpoints more predictive of the primary effect than established methods.

preprint2022arXiv

Robust Learning in Heterogeneous Contexts

We consider the problem of learning from training data obtained in different contexts, where the underlying context distribution is unknown and is estimated empirically. We develop a robust method that takes into account the uncertainty of the context distribution. Unlike the conventional and overly conservative minimax approach, we focus on excess risks and construct distribution sets with statistical coverage to achieve an appropriate trade-off between performance and robustness. The proposed method is computationally scalable and shown to interpolate between empirical risk minimization and minimax regret objectives. Using both real and synthetic data, we demonstrate its ability to provide robustness in worst-case scenarios without harming performance in the nominal scenario.

preprint2020arXiv

A latent variable approach to heat load prediction in thermal grids

In this paper a new method for heat load prediction in district energy systems is proposed. The method uses a nominal model for the prediction of the outdoor temperature dependent space heating load, and a data driven latent variable model to predict the time dependent residual heat load. The residual heat load arises mainly from time dependent operation of space heating and ventilation, and domestic hot water production. The resulting model is recursively updated on the basis of a hyper-parameter free implementation that results in a parsimonious model allowing for high computational performance. The approach is applied to a single multi-dwelling building in Lulea, Sweden, predicting the heat load using a relatively small number of model parameters and easily obtained measurements. The results are compared with predictions using an artificial neural network, showing that the proposed method achieves better prediction accuracy for the validation case. Additionally, the proposed methods exhibits explainable behavior through the use of an interpretable physical model.

preprint2020arXiv

Learning Robust Decision Policies from Observational Data

We address the problem of learning a decision policy from observational data of past decisions in contexts with features and associated outcomes. The past policy maybe unknown and in safety-critical applications, such as medical decision support, it is of interest to learn robust policies that reduce the risk of outcomes with high costs. In this paper, we develop a method for learning policies that reduce tails of the cost distribution at a specified level and, moreover, provide a statistically valid bound on the cost of each decision. These properties are valid under finite samples -- even in scenarios with uneven or no overlap between features for different decisions in the observed data -- by building on recent results in conformal prediction. The performance and statistical properties of the proposed method are illustrated using both real and synthetic data.

preprint2020arXiv

Prediction of Spatial Point Processes: Regularized Method with Out-of-Sample Guarantees

A spatial point process can be characterized by an intensity function which predicts the number of events that occur across space. In this paper, we develop a method to infer predictive intensity intervals by learning a spatial model using a regularized criterion. We prove that the proposed method exhibits out-of-sample prediction performance guarantees which, unlike standard estimators, are valid even when the spatial model is misspecified. The method is demonstrated using synthetic as well as real spatial data.

preprint2020arXiv

Robust Prediction when Features are Missing

Predictors are learned using past training data which may contain features that are unavailable at the time of prediction. We develop an approach that is robust against outlying missing features, based on the optimality properties of an oracle predictor which observes them. The robustness properties of the approach are demonstrated on both real and synthetic data.

preprint2020arXiv

Robust Risk Minimization for Statistical Learning

We consider a general statistical learning problem where an unknown fraction of the training data is corrupted. We develop a robust learning method that only requires specifying an upper bound on the corrupted data fraction. The method minimizes a risk function defined by a non-parametric distribution with unknown probability weights. We derive and analyse the optimal weights and show how they provide robustness against corrupted data. Furthermore, we give a computationally efficient coordinate descent algorithm to solve the risk minimization problem. We demonstrate the wide range applicability of the method, including regression, classification, unsupervised learning and classic parameter estimation, with state-of-the-art performance.

preprint2016arXiv

Pearson information-based lower bound on Fisher information

The Fisher information matrix (FIM) plays an important role in the analysis of parameter inference and system design problems. In a number of cases, however, the statistical data distribution and its associated information matrix are either unknown or intractable. For this reason, it is of interest to develop useful lower bounds on the FIM. In this lecture note, we derive such a bound based on moment constraints. We call this bound the Pearson information matrix (PIM) and relate it to properties of a misspecified data distribution. Finally, we show that the inverse PIM coincides with the asymptotic covariance matrix of the optimally weighted generalized method of moments.

preprint2015arXiv

Cramer-Rao bound analog of Bayes rule

In this lecture note, we show a general property of the Cramer-Rao bound (CRB) that quantifies the interdependencies between the parameters in a vector. The presented result is valid for more general models than the additive noise model and also generalizes previous results to vector parameters. The CRB analog to Bayes' rule will be illustrated via two examples.

preprint2015arXiv

Joint Ranging and Clock Parameter Estimation by Wireless Round Trip Time Measurements

In this paper we develop a new technique for estimating fine clock errors and range between two nodes simultaneously by two-way time-of-arrival measurements us- ing impulse-radio ultra-wideband signals. Estimators for clock parameters and the range are proposed that are robust with respect to outliers. They are analyzed numerically and by means of experimental measurement campaigns. The technique and derived estimators achieve accuracies below 1Hz for frequency estimation, below 1 ns for phase estimation and 20 cm for range estimation, at 4m distance using 100MHz clocks at both nodes. Therefore, we show that the proposed joint approach is practical and can simultaneously provide clock synchronization and positioning in an experimental system.

preprint2015arXiv

Online Hyperparameter-Free Sparse Estimation Method

In this paper we derive an online estimator for sparse parameter vectors which, unlike the LASSO approach, does not require the tuning of any hyperparameters. The algorithm is based on a covariance matching approach and is equivalent to a weighted version of the square-root LASSO. The computational complexity of the estimator is of the same order as that of the online versions of regularized least-squares (RLS) and LASSO. We provide a numerical comparison with feasible and infeasible implementations of the LASSO and RLS to illustrate the advantage of the proposed online hyperparameter-free estimator.

preprint2015arXiv

Robust Optimal Power Distribution for Hyperthermia Cancer Treatment

We consider an optimization problem for spatial power distribution generated by an array of transmitting elements. Using ultrasound hyperthermia cancer treatment as a motivating example, the signal design problem consists of optimizing the power distribution across the tumor and healthy tissue regions, respectively. The models used in the optimization problem are, however, invariably subject to errors. deposition as well as inefficient treatment. To combat such unknown model errors, we formulate a robust signal design framework that can take the uncertainty into account using a worst-case approach. This leads to a semi-infinite programming (SIP) robust design problem which we reformulate as a tractable convex problem, potentially has a wider range of applications.

preprint2014arXiv

Estimation for the Linear Model with Uncertain Covariance Matrices

We derive a maximum a posteriori estimator for the linear observation model, where the signal and noise covariance matrices are both uncertain. The uncertainties are treated probabilistically by modeling the covariance matrices with prior inverse-Wishart distributions. The nonconvex problem of jointly estimating the signal of interest and the covariance matrices is tackled by a computationally efficient fixed-point iteration as well as an approximate variational Bayes solution. The statistical performance of estimators is compared numerically to state-of-the-art estimators from the literature and shown to perform favorably.

preprint2014arXiv

Weighted SPICE: A Unifying Approach for Hyperparameter-Free Sparse Estimation

In this paper we present the SPICE approach for sparse parameter estimation in a framework that unifies it with other hyperparameter-free methods, namely LIKES, SLIM and IAA. Specifically, we show how the latter methods can be interpreted as variants of an adaptively reweighted SPICE method. Furthermore, we establish a connection between SPICE and the l1-penalized LAD estimator as well as the square-root LASSO method. We evaluate the four methods mentioned above in a generic sparse regression problem and in an array processing application.

preprint2013arXiv

Cooperative localization by dual foot-mounted inertial sensors and inter-agent ranging

The implementation challenges of cooperative localization by dual foot-mounted inertial sensors and inter-agent ranging are discussed and work on the subject is reviewed. System architecture and sensor fusion are identified as key challenges. A partially decentralized system architecture based on step-wise inertial navigation and step-wise dead reckoning is presented. This architecture is argued to reduce the computational cost and required communication bandwidth by around two orders of magnitude while only giving negligible information loss in comparison with a naive centralized implementation. This makes a joint global state estimation feasible for up to a platoon-sized group of agents. Furthermore, robust and low-cost sensor fusion for the considered setup, based on state space transformation and marginalization, is presented. The transformation and marginalization are used to give the necessary flexibility for presented sampling based updates for the inter-agent ranging and ranging free fusion of the two feet of an individual agent. Finally, characteristics of the suggested implementation are demonstrated with simulations and a real-time system implementation.

preprint2013arXiv

Line Spectrum Estimation with Probabilistic Priors

For line spectrum estimation, we derive the maximum a posteriori probability estimator where prior knowledge of frequencies is modeled probabilistically. Since the spectrum is periodic, an appropriate distribution is the circular von Mises distribution that can parameterize the entire range of prior certainty of the frequencies. An efficient alternating projections method is used to solve the resulting optimization problem. The estimator is evaluated numerically and compared with other estimators and the Cramér-Rao bound.

preprint2013arXiv

Self-Localization of Asynchronous Wireless Nodes With Parameter Uncertainties

We investigate a wireless network localization scenario in which the need for synchronized nodes is avoided. It consists of a set of fixed anchor nodes transmitting according to a given sequence and a self-localizing receiver node. The setup can accommodate additional nodes with unknown positions participating in the sequence. We propose a localization method which is robust with respect to uncertainty of the anchor positions and other system parameters. Further, we investigate the Cramér-Rao bound for the considered problem and show through numerical simulations that the proposed method attains the bound.

preprint2013arXiv

Utilization of Noise-Only Samples in Array Processing With Prior Knowledge

For array processing, we consider the problem of estimating signals of interest, and their directions of arrival (DOA), in unknown colored noise fields. We develop an estimator that efficiently utilizes a set of noise-only samples and, further, can incorporate prior knowledge of the DOAs with varying degrees of certainty. The estimator is compared with state of the art estimators that utilize noise-only samples, and the Cramér-Rao bound, exhibiting improved performance for smaller sample sets and in poor signal conditions.

preprint2012arXiv

Bayesian Estimation with Distance Bounds

We consider the problem of estimating a random state vector when there is information about the maximum distances between its subvectors. The estimation problem is posed in a Bayesian framework in which the minimum mean square error (MMSE) estimate of the state is given by the conditional mean. Since finding the conditional mean requires multidimensional integration, an approximate MMSE estimator is proposed. The performance of the proposed estimator is evaluated in a positioning problem. Finally, the application of the estimator in inequality constrained recursive filtering is illustrated by applying the estimator to a dead-reckoning problem. The MSE of the estimator is compared with two related posterior Cramér-Rao bounds.

preprint2012arXiv

Dynamic Iterative Pursuit

For compressive sensing of dynamic sparse signals, we develop an iterative pursuit algorithm. A dynamic sparse signal process is characterized by varying sparsity patterns over time/space. For such signals, the developed algorithm is able to incorporate sequential predictions, thereby providing better compressive sensing recovery performance, but not at the cost of high complexity. Through experimental evaluations, we observe that the new algorithm exhibits a graceful degradation at deteriorating signal conditions while capable of yielding substantial performance gains as conditions improve.

preprint2012arXiv

The Linear Model under Mixed Gaussian Inputs: Designing the Transfer Matrix

Suppose a linear model y = Hx + n, where inputs x, n are independent Gaussian mixtures. The problem is to design the transfer matrix H so as to minimize the mean square error (MSE) when estimating x from y. This problem has important applications, but faces at least three hurdles. Firstly, even for a fixed H, the minimum MSE (MMSE) has no analytical form. Secondly, the MMSE is generally not convex in H. Thirdly, derivatives of the MMSE w.r.t. H are hard to obtain. This paper casts the problem as a stochastic program and invokes gradient methods. The study is motivated by two applications in signal processing. One concerns the choice of error-reducing precoders; the other deals with selection of pilot matrices for channel estimation. In either setting, our numerical results indicate improved estimation accuracy - markedly better than those obtained by optimal design based on standard linear estimators. Some implications of the non-convexities of the MMSE are noteworthy, yet, to our knowledge, not well known. For example, there are cases in which more pilot power is detrimental for channel estimation. This paper explains why.

Dave Zachariah

What is connected

Connect this record

See the researcher in context

Building this map preview

21 published item(s)

Learning plug-in surrogate endpoints for randomized experiments

Robust Learning in Heterogeneous Contexts

A latent variable approach to heat load prediction in thermal grids

Learning Robust Decision Policies from Observational Data

Prediction of Spatial Point Processes: Regularized Method with Out-of-Sample Guarantees

Robust Prediction when Features are Missing

Robust Risk Minimization for Statistical Learning

Pearson information-based lower bound on Fisher information

Cramer-Rao bound analog of Bayes rule

Joint Ranging and Clock Parameter Estimation by Wireless Round Trip Time Measurements

Online Hyperparameter-Free Sparse Estimation Method

Robust Optimal Power Distribution for Hyperthermia Cancer Treatment

Estimation for the Linear Model with Uncertain Covariance Matrices

Weighted SPICE: A Unifying Approach for Hyperparameter-Free Sparse Estimation

Cooperative localization by dual foot-mounted inertial sensors and inter-agent ranging

Line Spectrum Estimation with Probabilistic Priors

Self-Localization of Asynchronous Wireless Nodes With Parameter Uncertainties

Utilization of Noise-Only Samples in Array Processing With Prior Knowledge

Bayesian Estimation with Distance Bounds

Dynamic Iterative Pursuit

The Linear Model under Mixed Gaussian Inputs: Designing the Transfer Matrix