Source author record

Li Song

Li Song appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence Machine Learning physics.optics Computer Vision cond-mat.mes-hall cond-mat.mtrl-sci eess.IV math.ST Multimedia physics.acc-ph physics.app-ph quant-ph Software Engineering Sound Statistics Theory

Catalog footprint

What is connected

12works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

AuditRepairBench: A Paired-Execution Trace Corpus for Evaluator-Channel Ranking Instability in Agent Repair

Agent-repair leaderboards reorder under evaluator reconfiguration, and a measurable share of the reordering is produced by methods that consult evaluator-derived signal during internal selection of candidate repairs. We document this failure mode on a public leaderboard and release AuditRepairBench, a paired-execution trace corpus of 576,000 registered cells (96,000 executed) that operationalizes evaluator-channel-blocking ranking instability within a declared observability boundary. A modular screening architecture decides pathway-blocking through four interchangeable implementations, a learned influence proxy, a rule-based channel-exposure ratio that uses no trained model, a counterfactual sensitivity proxy, and a sparse human-audit proxy, combined into a screening posterior that feeds a cell-level flip functional, a set-valued label, a stratified system score, and a set-valued leaderboard. The resource is supported by mechanism-anchored validation on an 80-case source-level channel-surgery subset, an independent-discovery protocol under which two annotator groups separated from the pipeline developers discover coupling patterns blinded to the screening design and the frozen ensemble attains pooled AUROC 0.83 on their 79 cases, implementation robustness, uncertainty propagation that raises 95% coverage from 0.81 to 0.95, and forward transfer with pooled community-evaluator Spearman \r{ho} = 0.65. Screening-guided blinding patches reduce rank displacement by 55--74% (mean 62%) at fewer than 50 lines of code, whereas random channel blinding produces at most 7% reduction and generic retraining at most 13%. AuditRepairBench-Lite, a rule-only configuration on a 12,000-cell subset, preserves the leaderboard at Kendall τ = 0.88 under twenty-four GPU-hours and is the primary release artifact at 42 GB.

preprint2026arXiv

Maximizing Rollout Informativeness under a Fixed Budget: A Submodular View of Tree Search for Tool-Use Agentic Reinforcement Learning

We formalize Rollout Informativeness under a Fixed Budget (RIFB) as the expected non-vanishing policy-gradient mass that a tool-use rollout set injects into Group Relative Policy Optimization (GRPO). We prove that any budget-agnostic independent sampler suffers a collapse rate bounded away from zero for hard prompts regardless of the budget. Motivated by this, we recast intermediate state selection as a monotone submodular maximization problem, where a greedy one-step selector enjoys a 1 minus 1/e approximation guarantee. Our Uncertainty-aware Upper Confidence Bound (UUCB) terms arise as closed-form marginal gains of this objective. This turns the token-level entropy bonus from an empirical trick into an analytic consequence of the formulation. We present InfoTree, a training-time tree-search framework coupling UUCB with a learned Adaptive Budget Allocator (ABA) and an asynchronous Speculative Expansion scheme. ABA rescues prompts whose initial tree is wasted on uniform outcomes, lifting the mixed-outcome ratio from 58.1 percent to 76.3 percent with less than 5 percent budget overhead. Speculative Expansion reduces wall-clock overhead from 14.3 percent to 4.8 percent by tolerating bounded staleness in UUCB scores. Across nine benchmarks spanning math reasoning (AIME 2024 and 2025, MATH-500, OlympiadBench, USAMO), web-search agents (GAIA, HLE-100, BrowseComp-lite), and tool-rich coding and OS agents (APPS-verified, AgentBench-OS), InfoTree outperforms flat GRPO, DeepSearch, Tree-GRPO, AT2PO, CW-GRPO, and RC-GRPO. Head-to-head compositions with Tree-GRPO prefix sharing and CW-GRPO contribution weights deliver further gains, confirming that our selector operates orthogonally to rollout reuse and trajectory re-weighting. A 5 by 5 by 5 robustness grid reveals that over three quarters of the hyperparameter space lies on a performance plateau, confirming UUCB robustness.

preprint2023arXiv

On the use of deep learning for phase recovery

Phase recovery (PR) refers to calculating the phase of the light field from its intensity measurements. As exemplified from quantitative phase imaging and coherent diffraction imaging to adaptive optics, PR is essential for reconstructing the refractive index distribution or topography of an object and correcting the aberration of an imaging system. In recent years, deep learning (DL), often implemented through deep neural networks, has provided unprecedented support for computational imaging, leading to more efficient solutions for various PR problems. In this review, we first briefly introduce conventional methods for PR. Then, we review how DL provides support for PR from the following three stages, namely, pre-processing, in-processing, and post-processing. We also review how DL is used in phase image processing. Finally, we summarize the work in DL for PR and outlook on how to better use DL to improve the reliability and efficiency in PR. Furthermore, we present a live-updating resource (https://github.com/kqwang/phase-recovery) for readers to learn more about PR.

preprint2022arXiv

StableFace: Analyzing and Improving Motion Stability for Talking Face Generation

While previous speech-driven talking face generation methods have made significant progress in improving the visual quality and lip-sync quality of the synthesized videos, they pay less attention to lip motion jitters which greatly undermine the realness of talking face videos. What causes motion jitters, and how to mitigate the problem? In this paper, we conduct systematic analyses on the motion jittering problem based on a state-of-the-art pipeline that uses 3D face representations to bridge the input audio and output video, and improve the motion stability with a series of effective designs. We find that several issues can lead to jitters in synthesized talking face video: 1) jitters from the input 3D face representations; 2) training-inference mismatch; 3) lack of dependency modeling among video frames. Accordingly, we propose three effective solutions to address this issue: 1) we propose a gaussian-based adaptive smoothing module to smooth the 3D face representations to eliminate jitters in the input; 2) we add augmented erosions on the input data of the neural renderer in training to simulate the distortion in inference to reduce mismatch; 3) we develop an audio-fused transformer generator to model dependency among video frames. Besides, considering there is no off-the-shelf metric for measuring motion jitters in talking face video, we devise an objective metric (Motion Stability Index, MSI), to quantitatively measure the motion jitters by calculating the reciprocal of variance acceleration. Extensive experimental results show the superiority of our method on motion-stable face video generation, with better quality than previous systems.

preprint2021arXiv

IdentityDP: Differential Private Identification Protection for Face Images

Because of the explosive growth of face photos as well as their widespread dissemination and easy accessibility in social media, the security and privacy of personal identity information becomes an unprecedented challenge. Meanwhile, the convenience brought by advanced identity-agnostic computer vision technologies is attractive. Therefore, it is important to use face images while taking careful consideration in protecting people's identities. Given a face image, face de-identification, also known as face anonymization, refers to generating another image with similar appearance and the same background, while the real identity is hidden. Although extensive efforts have been made, existing face de-identification techniques are either insufficient in photo-reality or incapable of well-balancing privacy and utility. In this paper, we focus on tackling these challenges to improve face de-identification. We propose IdentityDP, a face anonymization framework that combines a data-driven deep neural network with a differential privacy (DP) mechanism. This framework encompasses three stages: facial representations disentanglement, $ε$-IdentityDP perturbation and image reconstruction. Our model can effectively obfuscate the identity-related information of faces, preserve significant visual similarity, and generate high-quality images that can be used for identity-agnostic computer vision tasks, such as detection, tracking, etc. Different from the previous methods, we can adjust the balance of privacy and utility through the privacy budget according to pratical demands and provide a diversity of results without pre-annotations. Extensive experiments demonstrate the effectiveness and generalization ability of our proposed anonymization framework.

preprint2020arXiv

Atomically Thin Boron Nitride as an Ideal Spacer for Metal-Enhanced Fluorescence

The metal-enhanced fluorescence (MEF) considerably enhances the luminescence for various applications, but its performance largely depends on the dielectric spacer between the fluorophore and plasmonic system. It is still challenging to produce a defect-free spacer having an optimized thickness with a subnanometer accuracy that enables reusability without affecting the enhancement. In this study, we demonstrate the use of atomically thin hexagonal boron nitride (BN) as an ideal MEF spacer owing to its multifold advantages over the traditional dielectric thin films. With rhodamine 6G as a representative fluorophore, it largely improves the enhancement factor (up to ~95+-5), sensitivity (10^-8 M), reproducibility, and reusability (~90% of the plasmonic activity is retained after 30 cycles of heating at 350 °C in air) of MEF. This can be attributed to its two-dimensional structure, thickness control at the atomic level, defect-free quality, high affinities to aromatic fluorophores, good thermal stability, and excellent impermeability. The atomically thin BN spacers could increase the use of MEF in different fields and industries.

preprint2016arXiv

An efficient Multiple Scattering method based on partitioning of scattering matrix by angular momentum and approximations of matrix elements

We present a numerically efficient and accurate Multiple Scattering formalism, which is a generalization of the Multiple Scattering method with a truncated basis set [X. -G. Zhang and W. H. Butler, Phys. Rev. B 46,7433 (1992)]. Compared to the latter method, we keep the phase shifts of high angular momenta but apply approximations in the elements of the scattering matrix which is the subtraction of the unit matrix and the product of transition operator matrix and structure constant matrix. The detailed behaviour of our formalism for different types of calculations, where not full information of Green's function is needed, are discussed. We apply our formalism to study density of states of fcc Cu and silicon and C K-edge X-ray absorption spectra of graphene, in order to check the efficiency and accuracy of our formalism. We find that compared to Zhang's method, the accuracy is greatly improved by our method.

preprint2015arXiv

Performance Analysis of Quantitative Phase Retrieval Method in Zernike Phase Contrast X-ray Microscopy

Since the invention of Zernike phase contrast method in 1930, it has been widely used in optical microscopy and more recently in X-ray microscopy. Considering the image contrast is a mixture of absorption and phase information, we recently have proposed and demonstrated a method for quantitative phase retrieval in Zernike phase contrast X-ray microscopy. In this contribution, we analyzed the performance of this method at different photon energies. Intensity images of PMMA samples are simulated at 2.5 keV and 6.2 keV, respectively, and phase retrieval is performed using the proposed method. The results demonstrated that the proposed phase retrieval method is applicable over a wide energy range. For weakly absorbing features, the optimal photon energy is 2.5 keV, from the point of view of image contrast and accuracy of phase retrieval. On the other hand, in the case of strong absorption objects, a higher photon energy is preferred to reduce the error of phase retrieval. Those results can be used as guidelines to perform quantitative phase retrieval in Zernike phase contrast X-ray microscopy with the proposed method.

preprint2013arXiv

Hard X-ray optics simulation using the coherent mode decomposition of Gaussian Schell model

The propagation of hard X ray beam from partially coherent synchrotron source is simulated by using the novel method based on the coherent mode decomposition of Gaussian Schell model and wave front propagation. We investigate how the coherency properties and intensity distributions of the beam are changed by propagation through optical elements. Here, we simulate and analyze the propagation of the partially coherent radiation transmitted through an ideal slit. We present the first simulations for focusing partially coherent synchrotron hard X ray beams using this novel method. And when compared with the traditional method which assumes the source is a totally coherent point source or completely incoherent, this method is proved to be more reasonable and can also demonstrate the coherence properties of the focusing beam. We also simulate the double slit experiment and the simulated results validate the academic analysis.

preprint2012arXiv

Catalytic sub-surface etching of nanoscale channels in graphite

Catalytic hydrogenation of graphite has recently attracted renewed attention, as a route for nano-patterning of graphene and to produce graphene nano-ribbons. These reports show that metallic nanoparticles etch surface layers of graphite, or graphene anisotropically along the crystallographic zigzag <11-20> or armchair <1010> directions. On graphene the etching direction can be influenced by external magnetic fields or the substrate. Here we report the sub-surface etching of highly oriented pyrolytic graphite (HOPG) by Ni nanoparticles, to form a network of tunnels, as seen by SEM and STM. In this new nanoporous form of graphite, the top layers bend inward on top of the tunnels, while their local density of states remains fundamentally unchanged. Engineered nanoporous tunnel networks in graphite allow further chemical modification and may find applications in storage or sensing.

preprint2012arXiv

Unit roots in moving averages beyond first order

The asymptotic theory of various estimators based on Gaussian likelihood has been developed for the unit root and near unit root cases of a first-order moving average model. Previous studies of the MA(1) unit root problem rely on the special autocovariance structure of the MA(1) process, in which case, the eigenvalues and eigenvectors of the covariance matrix of the data vector have known analytical forms. In this paper, we take a different approach to first consider the joint likelihood by including an augmented initial value as a parameter and then recover the exact likelihood by integrating out the initial value. This approach by-passes the difficulty of computing an explicit decomposition of the covariance matrix and can be used to study unit root behavior in moving averages beyond first order. The asymptotics of the generalized likelihood ratio (GLR) statistic for testing unit roots are also studied. The GLR test has operating characteristics that are competitive with the locally best invariant unbiased (LBIU) test of Tanaka for some local alternatives and dominates for all other alternatives.

preprint2011arXiv

Anomalous insulator metal transition in boron nitride-graphene hybrid atomic layers

The study of two-dimensional (2D) electronic systems is of great fundamental significance in physics. Atomic layers containing hybridized domains of graphene and hexagonal boron nitride (h-BNC) constitute a new kind of disordered 2D electronic system. Magneto-electric transport measurements performed at low temperature in vapor phase synthesized h-BNC atomic layers show a clear and anomalous transition from an insulating to a metallic behavior upon cooling. The observed insulator to metal transition can be modulated by electron and hole doping and by the application of an external magnetic field. These results supported by ab-initio calculations suggest that this transition in h-BNC has distinctly different characteristics when compared to other 2D electron systems and is the result of the coexistence between two distinct mechanisms, namely, percolation through metallic graphene networks and hopping conduction between edge states on randomly distributed insulating h-BN domains.

Li Song

What is connected

Connect this record

See the researcher in context

Building this map preview

12 published item(s)

AuditRepairBench: A Paired-Execution Trace Corpus for Evaluator-Channel Ranking Instability in Agent Repair

Maximizing Rollout Informativeness under a Fixed Budget: A Submodular View of Tree Search for Tool-Use Agentic Reinforcement Learning

On the use of deep learning for phase recovery

StableFace: Analyzing and Improving Motion Stability for Talking Face Generation

IdentityDP: Differential Private Identification Protection for Face Images

Atomically Thin Boron Nitride as an Ideal Spacer for Metal-Enhanced Fluorescence

An efficient Multiple Scattering method based on partitioning of scattering matrix by angular momentum and approximations of matrix elements

Performance Analysis of Quantitative Phase Retrieval Method in Zernike Phase Contrast X-ray Microscopy

Hard X-ray optics simulation using the coherent mode decomposition of Gaussian Schell model

Catalytic sub-surface etching of nanoscale channels in graphite

Unit roots in moving averages beyond first order

Anomalous insulator metal transition in boron nitride-graphene hybrid atomic layers