Researcher profile

Yibo Wang

Yibo Wang contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging · Verification L1 · Unclaimed author
12 works
0 followers
10 topics
4 close collaborators

Published work

12 published item(s)

preprint · 2026 · arXiv

A Study on the Triggering of Nucleonic Direct Urca Processes in Neutron Stars of Specific Masses and Their Hyperon Dependence

This work analyzes how hyperons affect the neutrino radiation properties of nucleonic direct Urca processes, with the aim of providing useful references for finding evidence of hyperons in astronomical observations. The analysis uses the GM1 and NL3 parameter sets under the SU(6) and SU(3) flavor symmetries in the relativistic mean field theory framework. Combined with the inferred mass and radius values of PSRs J1231-1411, J0030+0451, and J0740+6620, our results show that nucleonic direct Urca processes are absent in PSR J1231-1411 because momentum conservation cannot be satisfied. In hyperon-containing PSR J0030+0451 (NL3 parameter set), the nucleonic direct Urca processes involving $e^{-}$/$μ^{-}$ would occur. A large inferred mass span induces hyperon fraction variations, affecting neutrino emissivity. If the inferred mass of PSR J0030+0451 exceeds approximately 1.8 $M_{\odot}$, the neutrino luminosity of the nucleonic direct Urca processes under the SU(3) flavor symmetry remains nearly the same as that in npe$μ$ matter, without depending on hyperons. However, it exhibits an obvious hyperon dependence under the SU(6) spin-flavor symmetry. For hyperon-containing J0740+6620, the nucleonic direct Urca processes under the SU(3) flavor symmetry with the GM1 parameter set predict a faster decline of neutrino luminosity with hyperonic fraction than in npe$μ$ matter, and under the SU(6) spin-flavor symmetry with the NL3 parameter set the luminosity shows a monotonically decreasing trend. The research shows that the hyperonic fraction significantly affects the neutrino radiation properties of the nucleonic direct Urca processes in neutron stars. Pulsars of different masses (e.g., PSRs J1231-1411, J0030+0451, J0740+6620) exhibit distinct nucleonic direct Urca behaviors, depending on inferred masses/radii, parameter sets, and theoretical models.

preprint · 2026 · arXiv

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

Deep research systems are widely used for multi-step web research, analysis, and cross-source synthesis, yet their evaluation remains challenging. Existing benchmarks often require annotation-intensive task construction, rely on static evaluation dimensions, or fail to reliably verify facts when citations are missing. To bridge these gaps, we introduce DeepResearchEval, an automated framework for deep research task construction and agentic evaluation. For task construction, we propose a persona-driven pipeline that generates realistic, complex research tasks anchored in diverse user profiles and applies a two-stage filter (Task Qualification and Search Necessity) to retain only tasks requiring multi-source evidence integration and external retrieval. For evaluation, we propose an agentic pipeline with two components: an Adaptive Point-wise Quality Evaluation that dynamically derives task-specific evaluation dimensions, criteria, and weights conditioned on each generated task, and an Active Fact-Checking that autonomously extracts and verifies report statements via web search, even when citations are missing.

preprint · 2026 · arXiv

Dust-obscured radio-emitting tidal disruption event coincident with a high-energy neutrino event

Despite the growing number of high-energy neutrinos (TeV-PeV) detected by IceCube, their astrophysical origins remain largely unidentified. Recent observations have linked a few tidal disruption events (TDEs) to the production of high-energy neutrino emission, all of which display dust-reprocessed infrared flares, indicating a dust- and gas-rich environment. By cross-matching the neutrino events and a sample of mid-infrared outbursts in nearby galaxies with transient radio flares, we uncover an optically obscured TDE candidate, SDSS J151345.75+311125.2, which shows both spatial and temporal coincidence with the sub-PeV neutrino event IC170514B. Using a standard equipartition analysis of the synchrotron spectral evolution spanning 605 days post mid-infrared discovery, we find little evolution in the radio-emitting region, with a kinetic energy up to $10^{51}$ erg, depending on the outflow geometry and shock acceleration efficiency assumed. High-resolution European VLBI Network imaging reveals compact radio emission that is unresolved at a scale of $<$ 2.1 pc, with a brightness temperature of $T_b>5\times10^6$ K, suggesting that the observed late-time radio emission might originate from the interaction between a decelerating outflow and a dense circumnuclear medium. If the association is genuine, the neutrino production is possibly related to the acceleration of protons through pp collisions during the outflow expanding process, implying that the outflow-cloud interaction could provide a physical site with a high-density environment for producing the sub-PeV neutrinos. Such a scenario can be tested with future identifications of radio transients coincident with high-energy neutrinos.

preprint · 2026 · arXiv

Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation

Harmful fine-tuning attacks introduce significant security risks to fine-tuning services. Mainstream defenses aim to vaccinate the model such that a later harmful fine-tuning attack is less effective. However, our evaluation results show that such defenses are fragile: with a few fine-tuning steps, the model can still learn the harmful knowledge. To this end, we conduct further experiments and find that an embarrassingly simple solution, adding purely random perturbations to the fine-tuned model, can recover the model from harmful behaviors, though it leads to a degradation in the model's fine-tuning performance. To address the degradation of fine-tuning performance, we further propose Panacea, which optimizes an adaptive perturbation that will be applied to the model after fine-tuning. Panacea maintains the model's safety alignment performance without compromising downstream fine-tuning performance. Comprehensive experiments are conducted on different harmful ratios, fine-tuning tasks and mainstream LLMs, where the average harmful scores are reduced by up to 21.2%, while maintaining fine-tuning performance. As a by-product, we analyze the adaptive perturbation and show that different layers in various LLMs have distinct safety affinity, which coincides with findings from several previous studies. Source code available at https://github.com/w-yibo/Panacea.
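The random-perturbation baseline mentioned in this abstract can be illustrated with a minimal sketch. It is not the Panacea method and is not taken from the linked repository: the function name `perturb_weights`, the toy parameter dictionary, and the noise scale `sigma` are all hypothetical, and the weights are plain Python lists rather than real model tensors.

```python
import random


def perturb_weights(weights, sigma, seed=0):
    """Return a copy of `weights` with i.i.d. Gaussian noise N(0, sigma^2)
    added to every entry. `weights` maps a parameter name to a list of floats.
    (Hypothetical helper for illustration only.)"""
    rng = random.Random(seed)  # seeded so the perturbation is reproducible
    return {
        name: [w + rng.gauss(0.0, sigma) for w in values]
        for name, values in weights.items()
    }


# Hypothetical toy "fine-tuned model": two named parameter vectors.
finetuned = {"layer0": [0.5, -1.2, 0.3], "layer1": [2.0, 0.1]}

# Apply a small random perturbation after fine-tuning.
perturbed = perturb_weights(finetuned, sigma=0.01)
```

The abstract reports that purely random noise of this kind trades safety recovery against fine-tuning performance, which is the motivation it gives for optimizing an adaptive perturbation instead.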

preprint · 2026 · arXiv

QUITE: A Query Rewrite System Beyond Rules with LLM Agents

Query rewrite transforms SQL queries into semantically equivalent forms that run more efficiently. Existing approaches mainly rely on predefined rewrite rules, but they handle a limited subset of queries and can cause performance regressions. This limitation stems from three challenges of rule-based query rewrite: (1) it is hard to discover and verify new rules, (2) fixed rewrite rules do not generalize to new query patterns, and (3) some rewrite techniques cannot be expressed as fixed rules. Motivated by the fact that human experts exhibit significantly better rewrite ability but suffer from scalability, and Large Language Models (LLMs) have demonstrated nearly human-level semantic and reasoning abilities, we propose a new approach of using LLMs to rewrite SQL queries beyond rules. Due to the hallucination problems in LLMs, directly applying LLMs often leads to nonequivalent and suboptimal queries. To address this issue, we propose QUITE (query rewrite), a training-free and feedback-aware system based on LLM agents that rewrites SQL queries into semantically equivalent forms with significantly better performance, covering a broader range of query patterns and rewrite strategies compared to rule-based methods. Firstly, we design a multi-agent framework controlled by a finite state machine (FSM) to equip LLMs with the ability to use external tools and enhance the rewrite process with real-time database feedback. Secondly, we develop a rewrite middleware to enhance the ability of LLMs to generate optimized query equivalents. Finally, we employ a novel hint injection technique to improve execution plans for rewritten queries. Extensive experiments show that QUITE reduces query execution time by up to 35.8% over state-of-the-art approaches and produces 24.1% more rewrites than prior methods, covering query cases that earlier systems did not handle.

preprint · 2026 · arXiv

Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs

Recently, self-play fine-tuning (SPIN) has been proposed to adapt large language models to downstream applications with scarce expert-annotated data, by iteratively generating synthetic responses from the model itself. However, SPIN is designed to optimize the current reward advantages of annotated responses over synthetic responses at hand, which may gradually vanish during iterations, leading to unstable optimization. Moreover, the use of a reference policy induces a misalignment between the reward formulation for training and the metric for generation. To address these limitations, we propose a novel Triplet-based Self-Play fIne-tuNing (T-SPIN) method that integrates two key designs. First, beyond current advantages, T-SPIN additionally incorporates historical advantages between iteratively generated responses and proto-synthetic responses produced by the initial policy. Even if the current advantages diminish, historical advantages remain effective, stabilizing the overall optimization. Second, T-SPIN introduces an entropy constraint into the self-play framework, which is theoretically justified to support reference-free fine-tuning, eliminating the training-generation discrepancy. Empirical results on various tasks demonstrate not only the superior performance of T-SPIN over SPIN, but also its stable evolution during iterations. Remarkably, compared to supervised fine-tuning, T-SPIN achieves comparable or even better performance with only 25% of the samples, highlighting its effectiveness when faced with scarce annotated data.

preprint · 2022 · arXiv

Aper: Evolution-Aware Runtime Permission Misuse Detection for Android Apps

The Android platform introduced the runtime permission model in version 6.0. The new model greatly improves data privacy and user experience, but brings new challenges for app developers. First, it allows users to freely revoke granted permissions. Hence, developers cannot assume that the permissions granted to an app will remain granted. Instead, they should make their apps carefully check the permission status before invoking dangerous APIs. Second, the permission specification keeps evolving, bringing new types of compatibility issues into the ecosystem. To understand the impact of these challenges, we conducted an empirical study on 13,352 popular Google Play apps. We found that 86.0% of apps used dangerous APIs asynchronously after permission management and 61.2% of apps used evolving dangerous APIs. If an app does not properly handle permission revocations or platform differences, unexpected runtime issues may happen and even cause app crashes. We call such Android Runtime Permission issues ARP bugs. Unfortunately, existing runtime permission issue detection tools cannot effectively deal with the ARP bugs induced by asynchronous permission management and permission specification evolution. To fill the gap, we designed a static analyzer, Aper, that performs reaching definition and dominator analysis on Android apps to detect the two types of ARP bugs. To compare Aper with existing tools, we built a benchmark, ARPfix, from 60 real ARP bugs. Our experiment results show that Aper significantly outperforms two academic tools, ARPDroid and RevDroid, and an industrial tool, Lint, on ARPfix, with an average improvement of 46.3% in F1-score. In addition, Aper successfully found 34 ARP bugs in 214 open-source Android apps, most of which can result in abnormal app behaviors (such as app crashes) according to our manual validation.

preprint · 2022 · arXiv

Discovery of ATLAS17jrp as an Optical, X-ray and Infrared Bright TDE in a Star-forming Galaxy

We hereby report the discovery of ATLAS17jrp as an extraordinary TDE in the star-forming galaxy SDSSJ162034.99+240726.5 in our recent sample of mid-infrared outbursts in nearby galaxies. Its optical/UV light curves rise to a peak luminosity $\sim1.06\times10^{44}\rm\,erg\,s^{-1}$ in about a month and then decay as $\rm t^{-5/3}$ with a roughly constant temperature around 19000 K, and the optical spectra show a blue continuum and very broad Balmer lines with FWHM$\sim$15000 km/s which gradually narrowed to 1400 km/s within 4 years, all agreeing well with other optical TDEs. A delayed and rapidly rising X-ray flare with a peak luminosity $\rm \sim 1.27\times10^{43}\,erg\,s^{-1}$ was detected at $\rm \sim$ 170 days after the optical peak. The high MIR luminosity of ATLAS17jrp ($\sim2\times10^{43} \rm\,erg\,s^{-1}$) has revealed a distinctive dusty environment with a covering factor as high as $\sim0.2$, which is comparable with that of the torus in active galactic nuclei but at least one order of magnitude higher than in normal optical TDEs. Therefore, ATLAS17jrp turns out to be one of the rare unambiguous TDEs found in star-forming galaxies, and its high dust covering factor implies that dust extinction could play an important role in the absence of optical TDEs in star-forming galaxies.
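As a rough illustration of what a $t^{-5/3}$ decay implies, the sketch below evaluates a pure power law normalized to the quoted peak luminosity. The function name, the choice to measure time from an assumed disruption epoch, and the 30-day peak time (standing in for "about a month") are assumptions made for this example, not values fitted in the paper.

```python
def tde_decay_luminosity(t_days, l_peak, t_peak_days):
    """Luminosity on a pure power-law decay,
    L(t) = l_peak * (t / t_peak)^(-5/3), valid for t >= t_peak.
    Times are measured from an assumed disruption epoch (hypothetical)."""
    if t_days < t_peak_days:
        raise ValueError("the power-law decay only applies after the peak")
    return l_peak * (t_days / t_peak_days) ** (-5.0 / 3.0)


L_PEAK = 1.06e44      # erg/s, peak optical/UV luminosity quoted in the abstract
T_PEAK_DAYS = 30.0    # assumed time of peak ("about a month")

# After eight times the peak time the luminosity has dropped by 8^(5/3) = 32.
l_late = tde_decay_luminosity(8 * T_PEAK_DAYS, L_PEAK, T_PEAK_DAYS)
```

Under these assumptions the luminosity falls by a factor of 32 between 30 and 240 days, which shows how quickly such a decay fades compared with the years-long line evolution described above.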

preprint · 2022 · arXiv

Multi-block-Single-probe Variance Reduced Estimator for Coupled Compositional Optimization

Variance reduction techniques such as SPIDER/SARAH/STORM have been extensively studied to improve the convergence rates of stochastic non-convex optimization, which usually maintain and update a sequence of estimators for a single function across iterations. What if we need to track multiple functional mappings across iterations but only with access to stochastic samples of $\mathcal{O}(1)$ functional mappings at each iteration? There is an important application in solving an emerging family of coupled compositional optimization problems in the form of $\sum_{i=1}^m f_i(g_i(\mathbf{w}))$, where $g_i$ is accessible through a stochastic oracle. The key issue is to track and estimate a sequence of $\mathbf g(\mathbf{w})=(g_1(\mathbf{w}), \ldots, g_m(\mathbf{w}))$ across iterations, where $\mathbf g(\mathbf{w})$ has $m$ blocks and it is only allowed to probe $\mathcal{O}(1)$ blocks to attain their stochastic values and Jacobians. To improve the complexity for solving these problems, we propose a novel stochastic method named Multi-block-Single-probe Variance Reduced (MSVR) estimator to track the sequence of $\mathbf g(\mathbf{w})$. It is inspired by STORM but introduces a customized error correction term to alleviate the noise not only in stochastic samples for the selected blocks but also in those blocks that are not sampled. With the help of the MSVR estimator, we develop several algorithms for solving the aforementioned compositional problems with improved complexities across a spectrum of settings with non-convex/convex/strongly convex/Polyak-Łojasiewicz (PL) objectives. Our results improve upon prior ones in several aspects, including the order of sample complexities and dependence on the strong convexity parameter. Empirical studies on multi-task deep AUC maximization demonstrate the better performance of using the new estimator.

preprint · 2022 · arXiv

Tick-Tock: The Imminent Merger of a Supermassive Black Hole Binary

Supermassive black hole binaries (SMBHs) are a fascinating byproduct of galaxy mergers in the hierarchical universe. In the last stage of their orbital evolution, gravitational wave radiation drives the binary inspiral and produces the loudest siren awaiting detection by gravitational wave observatories. Periodically varying emission from active galactic nuclei has been proposed as a powerful approach to probe such systems, although none of the identified candidates are close to their final coalescence, so their observed periods stay constant in time. In this work, we report on the first system with rapidly decaying periods revealed by its optical and X-ray light curves, in which the period has decreased from about one year to one month in three years. Together with its optical hydrogen line spectroscopy, we propose that the system is an uneven mass-ratio, highly eccentric SMBH binary which will merge within three years, as predicted by the trajectory evolution model. If the interpretation is true, a coordinated, multi-band electromagnetic campaign should be planned for this first binary SMBH merger event observed in human history, together with possible neutrino measurements. Gravitational wave memory from this event may also be detectable by Pulsar Timing Arrays with an additional five to ten years of observation.

preprint · 2021 · arXiv

Mid-InfraRed Outbursts in Nearby Galaxies (MIRONG). II. Optical Spectroscopic Follow-up

Infrared echoes have proven to be an effective means to discover transient accretion events of supermassive black holes (SMBHs), such as tidal disruption events (TDEs) and changing-look active galactic nuclei (AGNs), in dusty circumnuclear environments. To explore the dusty populations of SMBH transient events, we have constructed a large sample of Mid-infrared Outbursts in Nearby Galaxies (MIRONG) and performed multiwavelength observations. Here we present the results of multiepoch spectroscopic follow-up observations of a subsample of 54 objects spanning a time scale of 4 yr. Emission-line variability was detected in 22 of them with either emergence or enhancement of broad Balmer emission lines in comparison with pre-outburst spectra. Coronal lines, HeIIλ4686 and the Bowen line NIIIλ4640 appeared in the spectra of nine, seven and two sources, respectively. These results suggest that MIRONG is a mixed bag of different transient sources. We have tentatively classified them into different subclasses according to their spectral evolution and light curves. Two sources have remained at a steady high broad Hα flux up to the latest observation and might be turn-on AGNs. Broad lines faded out in the remaining sources, indicating a transient ionizing source ignited by a TDE or sporadic gas accretion. Thirty-one sources do not show noticeable spectral change with respect to their pre-outburst spectra. They have a statistically redder MIR color and lower MIR luminosity of the outbursts, which are consistent with heavily obscured events.

preprint · 2020 · arXiv

Mid-InfraRed Outburst in Nearby Galaxies (MIRONG) I: Sample Selection and Characterization

Optical time-domain astronomy has grown rapidly in the past decade, but the dynamic infrared sky is rarely explored. Aiming to construct a sample of mid-infrared outbursts in nearby galaxies (MIRONG), we have conducted a systematic search of low-redshift ($z<0.35$) SDSS spectroscopic galaxies that have experienced recent MIR flares using their Wide-field Infrared Survey Explorer (WISE) light curves. A total of 137 galaxies have been selected by requiring a brightening amplitude of 0.5 magnitude in at least one WISE band with respect to their quiescent phases. Only a small fraction (10.9%) has corresponding optical flares. Except for the four supernovae (SNe) in our sample, the MIR luminosities of the remaining sources ($L_{\rm 4.6μm}>10^{42}~\rm erg~s^{-1}$) are markedly brighter than known SNe and their physical locations are very close to the galactic center (median <0.1"). Only four galaxies are radio-loud, indicating that synchrotron radiation from relativistic jets could contribute to MIR variability. We propose that these MIR outbursts are dominated by the dust echoes of transient accretion onto supermassive black holes, such as tidal disruption events (TDEs) and turn-on (changing-look) AGNs. Moreover, the inferred peak MIR luminosity function is generally consistent with the X-ray and optical TDEs at the high end, albeit with large uncertainties. Our results suggest that a large population of transients has been overlooked by optical surveys, probably due to dust obscuration or intrinsic optical weakness. Thus, a search in the infrared band is crucial for us to obtain a panoramic picture of nuclear outbursts. The multiwavelength follow-up observations of the MIRONG sample are in progress and will be presented in a series of subsequent papers.
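The brightening criterion described in this abstract (at least 0.5 magnitude in at least one WISE band relative to the quiescent phase) can be sketched as follows. This is only an illustration: the function names, the band labels, the example magnitudes, and the assumption that a single quiescent magnitude per band is already known are all hypothetical, and the paper's actual pipeline may differ.

```python
def brightening_amplitude(quiescent_mag, mags):
    """Largest brightening relative to the quiescent level, in magnitudes.
    Smaller magnitude means brighter, so the amplitude is
    quiescent minus the brightest (minimum) observed magnitude."""
    return quiescent_mag - min(mags)


def is_outburst_candidate(light_curves, quiescent, threshold=0.5):
    """True if any band brightens by at least `threshold` magnitudes."""
    return any(
        brightening_amplitude(quiescent[band], mags) >= threshold
        for band, mags in light_curves.items()
    )


def flux_ratio(delta_mag):
    """Flux ratio for a magnitude difference: F1 / F2 = 10^(0.4 * delta_mag)."""
    return 10 ** (0.4 * delta_mag)


# Hypothetical light curves (magnitudes per epoch) for one galaxy.
light_curves = {"W1": [14.02, 13.98, 13.40], "W2": [13.50, 13.40]}
quiescent = {"W1": 14.00, "W2": 13.50}

candidate = is_outburst_candidate(light_curves, quiescent)
```

For scale, a 0.5 magnitude brightening corresponds to a flux increase by a factor of about 1.58, since `flux_ratio(0.5)` is 10^0.2.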