Source author record

Donghao Li

Donghao Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning cond-mat.quant-gas Computer Vision math.DS Applications Artificial Intelligence math.OC

Catalog footprint

What is connected

8works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Breaking the Computational Barrier: Provably Efficient Actor-Critic for Low-Rank MDPs

Reinforcement learning (RL) is a fundamental framework for sequential decision-making, in which an agent learns an optimal policy through interactions with an unknown environment. In settings with function approximation, many existing RL algorithms achieve favorable sample complexity, but often rely on computationally intractable oracles. In this paper, we use supervised learning as a computational proxy to establish a clear hierarchy of commonly adopted RL oracles under low-rank Markov Decision Processes (MDPs). This hierarchy shows that policy evaluation is the most computationally efficient oracle, provided that supervised learning can be efficiently solved. Motivated by this observation, we propose a novel optimistic actor-critic algorithm that relies solely on the policy evaluation oracle. We prove that our algorithm outperforms the existing sample complexity guarantees for low-rank MDPs while avoiding computationally expensive planning or optimization oracles commonly assumed in prior works. We further extend our theoretical results to approximately low-rank MDPs and demonstrate that this setting captures a broad class of real-world environments. Finally, we validate our theoretical results with experiments on several standard Gym environments.

preprint2026arXiv

Efficient Multi-objective Prompt Optimization via Pure-exploration Bandits

Prompt engineering has become central to eliciting the capabilities of large language models (LLMs). At its core lies prompt selection -- efficiently identifying the most effective prompts. However, most prior investigations overlook a key challenge: the inherently multi-faceted nature of prompt performance, which cannot be captured by a single metric. To fill this gap, we study the multi-objective prompt selection problem under two practical settings: Pareto prompt set recovery and best feasible prompt identification. Casting the problem into the pure-exploration bandits framework, we adapt provably efficient algorithms from multi-objective bandits and further introduce a novel design for best feasible arm identification in structured bandits, with theoretical guarantees on the identification error in the linear case. Extensive experiments across multiple LLMs show that the bandit-based approaches yield significant improvements over baselines, establishing a principled and efficient framework for multi-objective prompt optimization.

preprint2022arXiv

Exploring Structural Sparsity of Deep Networks via Inverse Scale Spaces

The great success of deep neural networks is built upon their over-parameterization, which smooths the optimization landscape without degrading the generalization ability. Despite the benefits of over-parameterization, a huge amount of parameters makes deep networks cumbersome in daily life applications. Though techniques such as pruning and distillation are developed, they are expensive in fully training a dense network as backward selection methods, and there is still a void on systematically exploring forward selection methods for learning structural sparsity in deep networks. To fill in this gap, this paper proposes a new approach based on differential inclusions of inverse scale spaces, which generate a family of models from simple to complex ones along the dynamics via coupling a pair of parameters, such that over-parameterized deep models and their structural sparsity can be explored simultaneously. This kind of differential inclusion scheme has a simple discretization, dubbed Deep structure splitting Linearized Bregman Iteration (DessiLBI), whose global convergence in learning deep networks could be established under the Kurdyka-Lojasiewicz framework. Experimental evidence shows that our method achieves comparable and even better performance than the competitive optimizers in exploring the sparse structure of several widely used backbones on the benchmark datasets. Remarkably, with early stopping, our method unveils `winning tickets' in early epochs: the effective sparse network structures with comparable test accuracy to fully trained over-parameterized models, that are further transferable to similar alternative tasks. Furthermore, our method is able to grow networks efficiently with adaptive filter configurations, demonstrating a good performance with much less computational cost. Codes and models can be downloaded at {https://github.com/DessiLBI2020/DessiLBI}.

preprint2021arXiv

Particle filter re-detection for visual tracking via correlation filters

Most of the correlation filter based tracking algorithms can achieve good performance and maintain fast computational speed. However, in some complicated tracking scenes, there is a fatal defect that causes the object to be located inaccurately. In order to address this problem, we propose a particle filter redetection based tracking approach for accurate object localization. During the tracking process, the kernelized correlation filter (KCF) based tracker locates the object by relying on the maximum response value of the response map; when the response map becomes ambiguous, the KCF tracking result becomes unreliable. Our method can provide more candidates by particle resampling to detect the object accordingly. Additionally, we give a new object scale evaluation mechanism, which merely considers the differences between the maximum response values in consecutive frames. Extensive experiments on OTB2013 and OTB2015 datasets demonstrate that the proposed tracker performs favorably in relation to the state-of-the-art methods.

preprint2020arXiv

DessiLBI: Exploring Structural Sparsity of Deep Networks via Differential Inclusion Paths

Over-parameterization is ubiquitous nowadays in training neural networks to benefit both optimization in seeking global optima and generalization in reducing prediction error. However, compressive networks are desired in many real world applications and direct training of small networks may be trapped in local optima. In this paper, instead of pruning or distilling over-parameterized models to compressive ones, we propose a new approach based on differential inclusions of inverse scale spaces. Specifically, it generates a family of models from simple to complex ones that couples a pair of parameters to simultaneously train over-parameterized deep models and structural sparsity on weights of fully connected and convolutional layers. Such a differential inclusion scheme has a simple discretization, proposed as Deep structurally splitting Linearized Bregman Iteration (DessiLBI), whose global convergence analysis in deep learning is established that from any initializations, algorithmic iterations converge to a critical point of empirical risks. Experimental evidence shows that DessiLBI achieve comparable and even better performance than the competitive optimizers in exploring the structural sparsity of several widely used backbones on the benchmark datasets. Remarkably, with early stopping, DessiLBI unveils "winning tickets" in early epochs: the effective sparse structure with comparable test accuracy to fully trained over-parameterized models.

preprint2020arXiv

Experimental realization of spin-tensor momentum coupling in ultracold Fermi gases

We experimentally realize the spin-tensor momentum coupling (STMC) using the three ground Zeeman states coupled by three Raman laser beams in ultracold atomic system of $^{40}$K Fermi atoms. This new type of STMC consists of two bright-state bands as a regular spin-orbit coupled spin-1/2 system and one dark-state middle band. Using radio-frequency spin-injection spectroscopy, we investigate the energy band of STMC. It is demonstrated that the middle state is a dark state in the STMC system. The realized energy band of STMC may open the door for further exploring exotic quantum matters.

preprint2016arXiv

Experimental observation of a topological band gap opening in ultracold Fermi gases with two-dimensional spin-orbit coupling

The recent experimental realization of synthetic spin-orbit coupling (SOC) opens a new avenue for exploring novel quantum states with ultracold atoms. However, in experiments for generating two-dimensional SOC (e.g., Rashba type), a perpendicular Zeeman field, which opens a band gap at the Dirac point and induces many topological phenomena, is still lacking. Here we theoretically propose and experimentally realize a simple scheme for generating two-dimension SOC and a perpendicular Zeeman field simultaneously in ultracold Fermi gases by tuning the polarization of three Raman lasers that couple three hyperfine ground states of atoms. The resulting band gap opening at the Dirac point is probed using spin injection radio-frequency spectroscopy. Our observation may pave the way for exploring topological transport and topological superfluids with exotic Majorana and Weyl fermion excitations in ultracold atoms.

preprint2015arXiv

Experimental realization of a two-dimensional synthetic spin-orbit coupling in ultracold Fermi gases

Spin-orbit coupling (SOC) is central to many physical phenomena, including fine structures of atomic spectra and quantum topological matters. Whereas SOC is in general fixed in a physical system, atom-laser interaction provides physicists a unique means to create and control synthetic SOC for ultracold atoms \cite{Dalibard}. Though significant experimental progresses have been made, a bottleneck in current studies is the lack of a two-dimensional (2D) synthetic SOC, which is crucial for realizing high-dimensional topological matters. Here, we report the experimental realization of 2D SOC in ultracold $^{40}$K Fermi gases using three lasers, each of which dresses one atomic hyperfine spin state. Through spin injection radio-frequency (rf) spectroscopy, we probe the spin-resolved energy dispersions of dressed atoms, and observe a highly controllable Dirac point created by the 2D SOC. Our work paves the way for exploring high-dimensional topological matters in ultracold atoms using Raman schemes.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint