Source author record

Yajing Liu

Yajing Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

math.OC Computer Vision math.GT Artificial Intelligence eess.SY Machine Learning Systems and Control eess.SP math.PR math.SG

Catalog footprint

What is connected

14 works

10 topics

4 close collaborators


preprint 2026 arXiv

A UCB Bandit Algorithm for General ML-Based Estimators

We present ML-UCB, a generalized upper confidence bound algorithm that integrates arbitrary machine learning models into multi-armed bandit frameworks. A fundamental challenge in deploying sophisticated ML models for sequential decision-making is the lack of tractable concentration inequalities required for principled exploration. We overcome this limitation by directly modeling the learning curve behavior of the underlying estimator. Specifically, assuming the Mean Squared Error decreases as a power law in the number of training samples, we derive a generalized concentration inequality and prove that ML-UCB achieves sublinear regret. This framework enables the principled integration of any ML model whose learning curve can be empirically characterized, eliminating the need for model-specific theoretical analysis. We validate our approach through experiments on a collaborative filtering recommendation system using online matrix factorization with synthetic data designed to simulate a simplified two-tower model, demonstrating substantial improvements over LinUCB.
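The idea of driving exploration from a learning curve can be illustrated with a minimal sketch. This is not the paper's ML-UCB: the bonus form `sqrt(c * n**(-alpha) * log t)`, the constants `c` and `alpha`, the Gaussian reward noise, and the use of a per-arm sample mean as a stand-in for the ML estimator are all assumptions made for illustration.

```python
import math
import random


def power_law_bonus(n_samples, t, c=1.0, alpha=1.0):
    """Exploration bonus under the ASSUMED learning curve MSE(n) <= c * n**(-alpha).

    The exact bonus used by ML-UCB is not given in the abstract; this form is hypothetical.
    """
    return math.sqrt(c * n_samples ** (-alpha) * math.log(max(t, 2)))


def run_ucb(arm_means, horizon, c=1.0, alpha=1.0, seed=0):
    """Run a UCB-style loop; a sample mean stands in for the ML estimator of each arm."""
    rng = random.Random(seed)
    k = len(arm_means)
    counts = [0] * k      # number of pulls (training samples) per arm
    sums = [0.0] * k      # running reward sums per arm
    best = max(arm_means)
    regret = 0.0
    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1   # pull every arm once so each estimate is defined
        else:
            scores = [
                sums[a] / counts[a] + power_law_bonus(counts[a], t, c, alpha)
                for a in range(k)
            ]
            arm = max(range(k), key=scores.__getitem__)
        reward = arm_means[arm] + rng.gauss(0.0, 0.1)  # assumed noise model
        counts[arm] += 1
        sums[arm] += reward
        regret += best - arm_means[arm]
    return counts, regret
```

Calling `run_ucb([0.2, 0.5, 0.8], horizon=2000)` returns the pull counts per arm and the cumulative pseudo-regret. A slower assumed learning curve (smaller `alpha`) keeps the bonus large for longer and so explores more.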

preprint 2022 arXiv

Multi-Area Distribution System State Estimation via Distributed Tensor Completion

This paper proposes a model-free distribution system state estimation method based on tensor completion using canonical polyadic decomposition. In particular, we consider a setting where the network is divided into multiple areas. The measured physical quantities at buses located in the same area are processed by an area controller. A three-way tensor is constructed to collect these measured quantities. The measurements are analyzed locally to recover the full state information of the network. A distributed closed-form iterative algorithm based on the alternating direction method of multipliers is developed to obtain the low-rank factors of the whole network state tensor where information exchange happens only between neighboring areas. The convergence properties of the distributed algorithm and the sufficient conditions on the number of samples for each smaller network that guarantee the identifiability of the factors of the state tensor are presented. To demonstrate the efficacy of the proposed algorithm and to check the identifiability conditions, numerical simulations are carried out using the IEEE 123-bus system.

preprint 2022 arXiv

Prompt Distribution Learning

We present prompt distribution learning for effectively adapting a pre-trained vision-language model to address downstream recognition tasks. Our method not only learns low-bias prompts from a few samples but also captures the distribution of diverse prompts to handle the varying visual representations. In this way, we provide high-quality task-related content for facilitating recognition. This prompt distribution learning is realized by an efficient approach that learns the output embeddings of prompts instead of the input embeddings. Thus, we can employ a Gaussian distribution to model them effectively and derive a surrogate loss for efficient training. Extensive experiments on 12 datasets demonstrate that our method consistently and significantly outperforms existing methods. For example, with 1 sample per category, it relatively improves the average result by 9.1% compared to human-crafted prompts.

preprint 2022 arXiv

Self-Supervision Can Be a Good Few-Shot Learner

Existing few-shot learning (FSL) methods rely on training with a large labeled dataset, which prevents them from leveraging abundant unlabeled data. From an information-theoretic perspective, we propose an effective unsupervised FSL method, learning representations with self-supervision. Following the InfoMax principle, our method learns comprehensive representations by capturing the intrinsic structure of the data. Specifically, we maximize the mutual information (MI) of instances and their representations with a low-bias MI estimator to perform self-supervised pre-training. Rather than supervised pre-training focusing on the discriminable features of the seen classes, our self-supervised model has less bias toward the seen classes, resulting in better generalization for unseen classes. We explain that supervised pre-training and self-supervised pre-training are actually maximizing different MI objectives. Extensive experiments are further conducted to analyze their FSL performance with various training settings. Surprisingly, the results show that self-supervised pre-training can outperform supervised pre-training under the appropriate conditions. Compared with state-of-the-art FSL methods, our approach achieves comparable performance on widely used FSL benchmarks without any labels of the base classes.

preprint 2022 arXiv

Source-Free Domain Adaptation for Real-world Image Dehazing

Deep learning-based source dehazing methods trained on synthetic datasets have achieved remarkable performance but suffer from dramatic performance degradation on real hazy images due to domain shift. Although certain Domain Adaptation (DA) dehazing methods have been presented, they inevitably require access to the source dataset to reduce the gap between the source synthetic and target real domains. To address these issues, we present a novel Source-Free Unsupervised Domain Adaptation (SFUDA) image dehazing paradigm, in which only a well-trained source model and an unlabeled target real hazy dataset are available. Specifically, we devise the Domain Representation Normalization (DRN) module to make the representation of real hazy domain features match that of the synthetic domain to bridge the gaps. With our plug-and-play DRN module, unlabeled real hazy images can adapt existing well-trained source networks. Besides, the unsupervised losses are applied to guide the learning of the DRN module, which consists of frequency losses and physical prior losses. Frequency losses provide structure and style constraints, while the prior loss explores the inherent statistic property of haze-free images. Equipped with our DRN module and unsupervised loss, existing source dehazing models are able to dehaze unlabeled real hazy images. Extensive experiments on multiple baselines demonstrate the validity and superiority of our method visually and quantitatively.

preprint 2020 arXiv

A General Framework for Bounding Approximate Dynamic Programming Schemes

For years, there has been interest in approximation methods for solving dynamic programming problems, because of the inherent complexity in computing optimal solutions characterized by Bellman's principle of optimality. A wide range of approximate dynamic programming (ADP) methods now exists. It is of great interest to guarantee that the performance of an ADP scheme be at least some known fraction, say $β$, of optimal. This paper introduces a general approach to bounding the performance of ADP methods, in this sense, in the stochastic setting. The approach is based on new results for bounding greedy solutions in string optimization problems, where one has to choose a string (ordered set) of actions to maximize an objective function. This bounding technique is inspired by submodularity theory, but submodularity is not required for establishing bounds. Instead, the bounding is based on quantifying certain notions of curvature of string functions; the smaller the curvatures the better the bound. The key insight is that any ADP scheme is a greedy scheme for some surrogate string objective function that coincides in its optimal solution and value with those of the original optimal control problem. The ADP scheme then yields to the bounding technique mentioned above, and the curvatures of the surrogate objective determine the value $β$ of the bound. The surrogate objective and its curvatures depend on the specific ADP.

preprint 2020 arXiv

Matrix Completion Using Alternating Minimization for Distribution System State Estimation

This paper examines the problem of state estimation in power distribution systems under low-observability conditions. The recently proposed constrained matrix completion method which combines the standard matrix completion method and power flow constraints has been shown to be effective in estimating voltage phasors under low-observability conditions using single-snapshot information. However, the method requires solving a semidefinite programming (SDP) problem, which becomes computationally infeasible for large systems and if multiple-snapshot (time-series) information is used. This paper proposes an efficient algorithm to solve the constrained matrix completion problem with time-series data. This algorithm is based on reformulating the matrix completion problem as a bilinear (non-convex) optimization problem, and applying the alternating minimization algorithm to solve this problem. This paper proves the summable convergence of the proposed algorithm, and demonstrates its efficacy and scalability via IEEE 123-bus system and a real utility feeder system. This paper also explores the value of adding more data from the history in terms of computation time and estimation accuracy.
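The bilinear reformulation can be sketched as follows. This is a generic alternating-minimization routine for plain low-rank matrix completion, not the paper's algorithm: it omits the power flow constraints and the time-series structure, and the function name `altmin_complete`, the ridge term `reg`, and the random initialization are assumptions for illustration.

```python
import numpy as np


def altmin_complete(M, mask, rank, n_iters=100, reg=1e-3, seed=0):
    """Fit M ~ U @ V.T on observed entries by alternating ridge least squares.

    M    : (m, n) array; only entries where mask is True are used.
    mask : (m, n) boolean array marking observed entries.
    """
    rng = np.random.default_rng(seed)
    m, n = M.shape
    U = rng.standard_normal((m, rank))
    V = rng.standard_normal((n, rank))
    ridge = reg * np.eye(rank)  # small ridge keeps each solve well posed
    for _ in range(n_iters):
        # Fix V and solve a small least-squares problem for each row of U.
        for i in range(m):
            obs = mask[i]
            A = V[obs]
            U[i] = np.linalg.solve(A.T @ A + ridge, A.T @ M[i, obs])
        # Fix U and solve for each row of V.
        for j in range(n):
            obs = mask[:, j]
            A = U[obs]
            V[j] = np.linalg.solve(A.T @ A + ridge, A.T @ M[obs, j])
    return U @ V.T
```

Each half-step is a convex problem even though the joint problem in `U` and `V` is non-convex, which is what makes this approach cheaper than solving a semidefinite program.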

preprint 2020 arXiv

Model-Free State Estimation Using Low-Rank Canonical Polyadic Decomposition

As electric grids experience high penetration levels of renewable generation, fundamental changes are required to address real-time situational awareness. This paper uses unique traits of tensors to devise a model-free situational awareness and energy forecasting framework for distribution networks. This work formulates the state of the network at multiple time instants as a three-way tensor; hence, recovering full state information of the network is tantamount to estimating all the values of the tensor. Given measurements received from micro-phasor measurement units and/or smart meters, the recovery of unobserved quantities is carried out using the low-rank canonical polyadic decomposition of the state tensor; that is, the state estimation task is posed as a tensor imputation problem utilizing observed patterns in measured quantities. Two structured sampling schemes are considered: slab sampling and fiber sampling. For both schemes, we present sufficient conditions on the number of sampled slabs and fibers that guarantee identifiability of the factors of the state tensor. Numerical results demonstrate the ability of the proposed framework to achieve high estimation accuracy in multiple sampling scenarios.

preprint 2016 arXiv

Bounding the Greedy Strategy in Finite-Horizon String Optimization

We consider an optimization problem where the decision variable is a string of bounded length. For some time there has been an interest in bounding the performance of the greedy strategy for this problem. Here, we provide weakened sufficient conditions for the greedy strategy to be bounded by a factor of $(1-(1-1/K)^K)$, where $K$ is the optimization horizon length. Specifically, we introduce the notions of $K$-submodularity and $K$-GO-concavity, which together are sufficient for this bound to hold. By introducing a notion of curvature $η\in(0,1]$, we prove an even tighter bound with the factor $(1/η)(1-e^{-η})$. Finally, we illustrate the strength of our results by considering two example applications. We show that our results provide weaker conditions on parameter values in these applications than in previous results.
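The two factors quoted in this abstract are easy to evaluate numerically. The sketch below only computes the formulas as stated; the function names `horizon_bound` and `curvature_bound` are chosen here for illustration and do not come from the paper.

```python
import math


def horizon_bound(K):
    """Factor 1 - (1 - 1/K)**K for an optimization horizon of length K >= 1."""
    if K < 1:
        raise ValueError("K must be a positive integer")
    return 1.0 - (1.0 - 1.0 / K) ** K


def curvature_bound(eta):
    """Factor (1/eta) * (1 - exp(-eta)) for curvature eta in (0, 1]."""
    if not 0.0 < eta <= 1.0:
        raise ValueError("eta must lie in (0, 1]")
    return (1.0 - math.exp(-eta)) / eta
```

For example, `horizon_bound(2)` is 0.75, and the factor decreases toward $1 - 1/e \approx 0.632$ as $K$ grows. `curvature_bound(1.0)` equals $1 - 1/e$, and the factor approaches 1 as $η$ tends to 0.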

preprint 2016 arXiv

Hyperbolic 3-manifolds admitting no fillable contact structures

In this paper, we find infinitely many hyperbolic 3-manifolds that admit no weakly symplectically fillable contact structures, using tools from Heegaard Floer theory. We also remark that some of these manifolds do admit tight contact structures.

preprint 2016 arXiv

Performance Bounds for the $k$-Batch Greedy Strategy in Optimization Problems with Curvature

The $k$-batch greedy strategy is an approximate algorithm to solve optimization problems where the optimal solution is hard to obtain. Starting with the empty set, the $k$-batch greedy strategy adds a batch of $k$ elements to the current solution set with the largest gain in the objective function while satisfying the constraints. In this paper, we bound the performance of the $k$-batch greedy strategy with respect to the optimal strategy by defining the total curvature $α_k$. We show that when the objective function is nondecreasing and submodular, the $k$-batch greedy strategy satisfies a harmonic bound $1/(1+α_k)$ for a general matroid constraint and an exponential bound $\left(1-(1-α_k/{t})^t\right)/α_k$ for a uniform matroid constraint, where $k$ divides the cardinality of the maximal set in the general matroid, $t=K/k$ is an integer, and $K$ is the rank of the uniform matroid. We also compare the performance of the $k$-batch greedy strategy with that of the $k_1$-batch greedy strategy when $k_1$ divides $k$. Specifically, we prove that when the objective function is nondecreasing and submodular, the $k$-batch greedy strategy has better harmonic and exponential bounds in terms of the total curvature. Finally, we illustrate our results by considering a task-assignment problem.
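The harmonic and exponential bounds stated above can be evaluated directly. This sketch implements only the two formulas from the abstract; the function names are illustrative, and the total curvature $α_k$ is taken as a given input in $(0, 1]$ rather than computed from an objective function.

```python
def harmonic_bound(alpha_k):
    """Harmonic bound 1 / (1 + alpha_k) for a general matroid constraint."""
    if not 0.0 < alpha_k <= 1.0:
        raise ValueError("alpha_k is assumed to lie in (0, 1]")
    return 1.0 / (1.0 + alpha_k)


def exponential_bound(alpha_k, K, k):
    """Exponential bound (1 - (1 - alpha_k/t)**t) / alpha_k with t = K / k.

    K is the rank of the uniform matroid and k the batch size; k must divide K.
    """
    if not 0.0 < alpha_k <= 1.0:
        raise ValueError("alpha_k is assumed to lie in (0, 1]")
    if K % k != 0:
        raise ValueError("k must divide K so that t = K / k is an integer")
    t = K // k
    return (1.0 - (1.0 - alpha_k / t) ** t) / alpha_k
```

With the same curvature value 1.0 and $K = 4$, `exponential_bound(1.0, 4, 2)` is 0.75 while `exponential_bound(1.0, 4, 1)` is 0.68359375. In the paper the curvature $α_k$ itself depends on the batch size $k$, so this comparison at a fixed curvature value is only a simplified illustration.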

preprint 2015 arXiv

$L$-space surgeries on links

An $L$-space link is a link in $S^3$ on which all large surgeries are $L$-spaces. In this paper, we initiate a general study of the definitions, properties, and examples of $L$-space links. In particular, we find many hyperbolic $L$-space links, including some chain links and two-bridge links; from them, we obtain many hyperbolic $L$-spaces by integral surgeries, including the Weeks manifold. We give bounds on the ranks of the link Floer homology of $L$-space links and on the coefficients in the multi-variable Alexander polynomials. We also describe the Floer homology of surgeries on any $L$-space link using the link surgery formula of Ozsváth and Manolescu. As applications, we compute the graded Heegaard Floer homology of surgeries on 2-component $L$-space links in terms of only the Alexander polynomial and the surgery framing, and give a fast algorithm to classify $L$-space surgeries among them.

preprint 2014 arXiv

Guaranteed Bounds for General Approximate Dynamic Programming

In this paper, we will develop a systematic approach to deriving guaranteed bounds for approximate dynamic programming (ADP) schemes in optimal control problems. Our approach is inspired by our recent results on bounding the performance of greedy strategies in optimization of string-submodular functions over a finite horizon. The approach is to derive a string-submodular optimization problem, for which the optimal strategy is the optimal control solution and the greedy strategy is the ADP solution. Using this approach, we show that any ADP solution achieves a performance that is at least a factor of $β$ of the performance of the optimal control solution, which satisfies Bellman's optimality principle. The factor $β$ depends on the specific ADP scheme, as we will explicitly characterize. To illustrate the applicability of our bounding technique, we present examples of ADP schemes, including the popular rollout method.

preprint 2014 arXiv

Heegaard Floer homology of surgeries on two-bridge links

We give an $O(p^{2})$ time algorithm to compute the generalized Heegaard Floer complexes $A_{s_{1},s_{2}}^{-}(\overrightarrow{L})$'s for a two-bridge link $\overrightarrow{L}=b(p,q)$ by using nice diagrams. Using the link surgery formula of Manolescu-Ozsváth, we also show that ${\bf HF}^{-}$ and their $d$-invariants of all integer surgeries on two-bridge links are determined by $A_{s_{1},s_{2}}^{-}(\overrightarrow{L})$'s. We obtain a polynomial time algorithm to compute ${\bf HF}^{-}$ of all the surgeries on two-bridge links, with $\mathbb{Z}/2\mathbb{Z}$ coefficients. In addition, we calculate some examples explicitly: ${\bf HF}^{-}$ and the $d$-invariants of all integer surgeries on a family of hyperbolic two-bridge links including the Whitehead link.
