Source author record

Mingzhuo Li

Mingzhuo Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Computer Vision Machine Learning physics.optics

Catalog footprint

What is connected

3works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Difficulty-guided Sampling: Bridging the Target Gap between Dataset Distillation and Downstream Tasks

In this paper, we propose difficulty-guided sampling (DGS) to bridge the target gap between the distillation objective and the downstream task, therefore improving the performance of dataset distillation. Deep neural networks achieve remarkable performance but have time and storage-consuming training processes. Dataset distillation is proposed to generate compact, high-quality distilled datasets, enabling effective model training while maintaining downstream performance. Existing approaches typically focus on features extracted from the original dataset, overlooking task-specific information, which leads to a target gap between the distillation objective and the downstream task. We propose leveraging characteristics that benefit the downstream training into data distillation to bridge this gap. Focusing on the downstream task of image classification, we introduce the concept of difficulty and propose DGS as a plug-in post-stage sampling module. Following the specific target difficulty distribution, the final distilled dataset is sampled from image pools generated by existing methods. We also propose difficulty-aware guidance (DAG) to explore the effect of difficulty in the generation process. Extensive experiments across multiple settings demonstrate the effectiveness of the proposed methods. It also highlights the broader potential of difficulty for diverse downstream tasks.

preprint2026arXiv

SAS: Semantic-aware Sampling for Generative Dataset Distillation

Deep neural networks have achieved impressive performance across a wide range of tasks, but this success often comes with substantial computational and storage costs due to large-scale training data. Dataset distillation addresses this challenge by constructing compact yet informative datasets that enable efficient model training while maintaining downstream performance. However, most existing approaches primarily emphasize matching data distributions or downstream training statistics, with limited attention to preserving high-level semantic information in the distilled data. In this work, we introduce a semantic-aware perspective for dataset distillation by leveraging Contrastive Language-Image Pretraining (CLIP) as a semantic prior for post-sampling. Our goal is to obtain distilled datasets that are not only compact but also semantically class-discriminative and diverse. To this end, we design three semantic scoring functions that quantify class relevance, inter-class separability, and intra-set diversity in a pretrained semantic space. Based on image pools generated by existing distillation methods, we further develop a two-stage strategy for effective sampling: the first stage filters semantically discriminative samples to form a reliable candidate set, and the second stage performs a dynamic diversity-aware selection to reduce redundancy while preserving semantic coverage. Extensive experiments across multiple datasets, image pools, and downstream models demonstrate consistent performance gains, highlighting the effectiveness of incorporating semantic information into dataset distillation.

preprint2014arXiv

Plasmonic lasing of nanocavity embedding in metallic nanoantenna array

Plasmonic nanolasers have ultrahigh lasing thresholds, especially those devices for which all three dimensions are truly subwavelength. Because of a momentum mismatch between the propagating light and localized optical field of the subwavelength nanocavity, poor optical pumping efficiency is another important reason for the ultrahigh threshold but is normally always ignored. Based on a cavity-embedded nanoantenna array design, we demonstrate a room-temperature low-threshold plasmonic nanolaser that is robust, reproducible, and easy-to-fabricate using chemical-template lithography. The mode volume of the device is~0.22(λ/2n)3 (here,λ is resonant wavelength and n is the refractive index), and the experimental lasing threshold produced is ~2.70MW/mm2. The lasing polarization and the function of nanoantenna array are investigated in detail. Our work provides a new strategy to achieve room-temperature low-threshold plasmonic nanolasers of interest in applications to biological sensoring and information technology.