Source author record

Ran Dai

Ran Dai appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Methodology Machine Learning math.OC math.ST Statistics Theory

Catalog footprint

What is connected

6works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2024arXiv

Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach

Sampling-based model predictive control (MPC) has found significant success in optimal control problems with non-smooth system dynamics and cost function. Many machine learning-based works proposed to improve MPC by a) learning or fine-tuning the dynamics/ cost function, or b) learning to optimize for the update of the MPC controllers. For the latter, imitation learning-based optimizers are trained to update the MPC controller by mimicking the expert demonstrations, which, however, are expensive or even unavailable. More significantly, many sequential decision-making problems are in non-stationary environments, requiring that an optimizer should be adaptable and generalizable to update the MPC controller for solving different tasks. To address those issues, we propose to learn an optimizer based on meta-reinforcement learning (RL) to update the controllers. This optimizer does not need expert demonstration and can enable fast adaptation (e.g., few-shots) when it is deployed in unseen control tasks. Experimental results validate the effectiveness of the learned optimizer regarding fast adaptation.

preprint2020arXiv

Convex and Non-convex Approaches for Statistical Inference with Class-Conditional Noisy Labels

We study the problem of estimation and testing in logistic regression with class-conditional noise in the observed labels, which has an important implication in the Positive-Unlabeled (PU) learning setting. With the key observation that the label noise problem belongs to a special sub-class of generalized linear models (GLM), we discuss convex and non-convex approaches that address this problem. A non-convex approach based on the maximum likelihood estimation produces an estimator with several optimal properties, but a convex approach has an obvious advantage in optimization. We demonstrate that in the low-dimensional setting, both estimators are consistent and asymptotically normal, where the asymptotic variance of the non-convex estimator is smaller than the convex counterpart. We also quantify the efficiency gap which provides insight into when the two methods are comparable. In the high-dimensional setting, we show that both estimation procedures achieve $\ell_2$-consistency at the minimax optimal $\sqrt{s\log p/n}$ rates under mild conditions. Finally, we propose an inference procedure using a de-biasing approach. We validate our theoretical findings through simulations and a real-data example.

preprint2020arXiv

The bias of isotonic regression

We study the bias of the isotonic regression estimator. While there is extensive work characterizing the mean squared error of the isotonic regression estimator, relatively little is known about the bias. In this paper, we provide a sharp characterization, proving that the bias scales as $O(n^{-β/3})$ up to log factors, where $1 \leq β\leq 2$ is the exponent corresponding to H{ö}lder smoothness of the underlying mean. Importantly, this result only requires a strictly monotone mean and that the noise distribution has subexponential tails, without relying on symmetric noise or other restrictive assumptions.

preprint2016arXiv

An Iterative Method for Nonconvex Quadratically Constrained Quadratic Programs

This paper examines the nonconvex quadratically constrained quadratic programming (QCQP) problems using an iterative method. One of the existing approaches for solving nonconvex QCQP problems relaxes the rank one constraint on the unknown matrix into semidefinite constraint to obtain the bound on the optimal value without finding the exact solution. By reconsidering the rank one matrix, an iterative rank minimization (IRM) method is proposed to gradually approach the rank one constraint. Each iteration of IRM is formulated as a convex problem with semidefinite constraints. An augmented Lagrangian method, named extended Uzawa algorithm, is developed to solve the subproblem at each iteration of IRM for improved scalability and computational efficiency. Simulation examples are presented using the proposed method and comparative results obtained from the other methods are provided and discussed.

preprint2016arXiv

Instrumental Variable with Competing Risk Model

In this paper, we discuss causal inference on the efficacy of a treatment or medication on a time-to-event outcome with competing risks. Although the treatment group can be randomized, there can be confoundings between the compliance and the outcome. Unmeasured confoundings may exist even after adjustment for measured co- variates. Instrumental variable (IV) methods are commonly used to yield consistent estimations of causal parameters in the presence of unmeasured confoundings. Based on a semi-parametric additive hazard model for the subdistribution hazard, we pro- pose an instrumental variable estimator to yield consistent estimation of efficacy in the presence of unmeasured confoundings for competing risk settings. We derived the asymptotic properties for the proposed estimator. The estimator is shown to be well per- formed under finite sample size according to simulation results. We applied our method to a real transplant data example and showed that the unmeasured confoundings lead to significant bias in the estimation of the effect (about 50% attenuated).

preprint2016arXiv

The knockoff filter for FDR control in group-sparse and multitask regression

We propose the group knockoff filter, a method for false discovery rate control in a linear regression setting where the features are grouped, and we would like to select a set of relevant groups which have a nonzero effect on the response. By considering the set of true and false discoveries at the group level, this method gains power relative to sparse regression methods. We also apply our method to the multitask regression problem where multiple response variables share similar sparsity patterns across the set of possible features. Empirically, the group knockoff filter successfully controls false discoveries at the group level in both settings, with substantially more discoveries made by leveraging the group structure.

Ran Dai

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach

Convex and Non-convex Approaches for Statistical Inference with Class-Conditional Noisy Labels

The bias of isotonic regression

An Iterative Method for Nonconvex Quadratically Constrained Quadratic Programs

Instrumental Variable with Competing Risk Model

The knockoff filter for FDR control in group-sparse and multitask regression