Source author record

Yongyi Guo

Yongyi Guo appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Biological Physics econ.EM math.OC Methodology Molecular Networks

Catalog footprint

What is connected

3works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Machine Learning for Variance Reduction in Online Experiments

We consider the problem of variance reduction in randomized controlled trials, through the use of covariates correlated with the outcome but independent of the treatment. We propose a machine learning regression-adjusted treatment effect estimator, which we call MLRATE. MLRATE uses machine learning predictors of the outcome to reduce estimator variance. It employs cross-fitting to avoid overfitting biases, and we prove consistency and asymptotic normality under general conditions. MLRATE is robust to poor predictions from the machine learning step: if the predictions are uncorrelated with the outcomes, the estimator performs asymptotically no worse than the standard difference-in-means estimator, while if predictions are highly correlated with outcomes, the efficiency gains are large. In A/A tests, for a set of 48 outcome metrics commonly monitored in Facebook experiments the estimator has over 70% lower variance than the simple difference-in-means estimator, and about 19% lower variance than the common univariate procedure which adjusts only for pre-experiment values of the outcome.

preprint2022arXiv

Policy Optimization Using Semi-parametric Models for Dynamic Pricing

In this paper, we study the contextual dynamic pricing problem where the market value of a product is linear in its observed features plus some market noise. Products are sold one at a time, and only a binary response indicating success or failure of a sale is observed. Our model setting is similar to Javanmard and Nazerzadeh [2019] except that we expand the demand curve to a semiparametric model and need to learn dynamically both parametric and nonparametric components. We propose a dynamic statistical learning and decision-making policy that combines semiparametric estimation from a generalized linear model with an unknown link and online decision-making to minimize regret (maximize revenue). Under mild conditions, we show that for a market noise c.d.f. $F(\cdot)$ with $m$-th order derivative ($m\geq 2$), our policy achieves a regret upper bound of $\tilde{O}_{d}(T^{\frac{2m+1}{4m-1}})$, where $T$ is time horizon and $\tilde{O}_{d}$ is the order that hides logarithmic terms and the dimensionality of feature $d$. The upper bound is further reduced to $\tilde{O}_{d}(\sqrt{T})$ if $F$ is super smooth whose Fourier transform decays exponentially. In terms of dependence on the horizon $T$, these upper bounds are close to $Ω(\sqrt{T})$, the lower bound where $F$ belongs to a parametric class. We further generalize these results to the case with dynamically dependent product features under the strong mixing condition.

preprint2015arXiv

Stochastic robustness and relative stability of multiple pathways in biological networks

Multiple dynamic pathways always exist in biological networks, but their robustness against internal fluctuations and relative stability have not been well recognized and carefully analyzed yet. Here we try to address these issues through an illustrative example, namely the Siah-1/beta-catenin/p14/19 ARF loop of protein p53 dynamics. Its deterministic Boolean network model predicts that two parallel pathways with comparable magnitudes of attractive basins should exist after the protein p53 is activated when a cell becomes harmfully disturbed. Once the low but non-neglectable intrinsic fluctuations are incorporated into the model, we show that a phase transition phenomenon is emerged: in one parameter region the probability weights of the normal pathway, reported in experimental literature, are comparable with the other pathway which is seemingly abnormal with the unknown functions, whereas, in some other parameter regions, the probability weight of the abnormal pathway can even dominate and become globally attractive. The theory of exponentially perturbed Markov chains is applied and further generalized in order to quantitatively explain such a phase transition phenomenon, in which the nonequilibrium "activation energy barriers" along each transiting trajectory between the parallel pathways and the number of "optimal transition paths" play a central part. Our theory can also determine how the transition time and the number of optimal transition paths between the parallel pathways depend on each interaction's strength, and help to identify those possibly more crucial interactions in the biological network.