Researcher profile

Hongbo Wang

Hongbo Wang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2026arXiv

BoostAPR: Boosting Automated Program Repair via Execution-Grounded Reinforcement Learning with Dual Reward Models

Reinforcement learning for program repair is hindered by sparse execution feedback and coarse sequence-level rewards that obscure which edits actually fix bugs. We present BoostAPR, a three-stage framework addressing these challenges: (1) supervised fine-tuning on execution-verified demonstrations with reasoning traces, (2) training dual reward models--a sequence-level assessor and a line-level credit allocator--from execution outcomes, and (3) PPO optimization where the line-level model redistributes rewards to critical edit regions. This line-level credit assignment operates at an intermediate granularity naturally suited to code changes. Trained on SWE-Gym and evaluated on four benchmarks, BoostAPR achieves 40.7% on SWE-bench Verified (+22.9pp over base model), 24.8% on Defects4J (Python-to-Java transfer), 84.5% on HumanEval-Java, and 95.0% on QuixBugs, achieving competitive results among open-source models with strong cross-language generalization.

preprint2026arXiv

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

With the advancement of multimodal large language models (MLLMs) and coding agents, the website development has shifted from manual programming to agent-based project-level code synthesis. Existing benchmarks rely on idealized assumptions, especially for well-structured, information-rich inputs and static execution settings. In contrast, real-world development is constrained by a critical bottleneck: the semantic misalignment between ambiguous, low-quality instructions from non-expert users and model understanding, which results in a failure mode that we term blind execution. To address this gap, we introduce InteractWeb-Bench, the first multimodal interactive benchmark for website generation under non-expert low-code user conditions. InteractWeb-Bench introduces four types of user agents and persona-driven instruction perturbations to systematically simulate diverse user behaviors, including ambiguity, redundancy, and contradiction, grounded in requirement engineering defect taxonomies. We develop an interactive execution environment for agents, featuring a unified action space comprising Clarify, Implement, Verify, and Submit, enabling iterative intent refinement, code synthesis, and visual feedback-based validation. Extensive experiments and analysis reveal that frontier MLLM-based agents remain trapped in blind execution, exposing limitations in intent recognition and adaptive interaction.

preprint2026arXiv

PatRe: A Full-Stage Office Action and Rebuttal Generation Benchmark for Patent Examination

Patent examination is a complex, multi-stage process requiring both technical expertise and legal reasoning, increasingly challenged by rising application volumes. Prior benchmarks predominantly view patent examination as discriminative classification or static extraction, failing to capture its inherently interactive and iterative nature, similar to the peer review and rebuttal process in academic publishing. In this paper, we introduce PatRe, the first benchmark that models the full patent examination lifecycle, including Office Action generation and applicant rebuttal. PatRe comprises 480 real-world cases and supports both oracle and retrieval-simulated evaluation settings. Our benchmark reframes patent examination as a dynamic, multi-turn process of justification and response. Extensive experiments across various LLMs reveal critical insights into model performance, including differences between proprietary and open-source models, as well as task asymmetries between examiner analysis and applicant-side rebuttal. These findings highlight both the potential and current limitations of LLMs in modeling complex, real-world legal reasoning and technical novelty judgment in patent examination. We release our code and dataset to facilitate future research on patent examination modeling.

preprint2026arXiv

Swarm Skills: A Portable, Self-Evolving Multi-Agent System Specification for Coordination Engineering

As artificial intelligence engineering paradigms shift from single-agent Prompt and Context Engineering toward multi-agent \textbf{Coordination Engineering}, the ability to codify and systematically improve how multiple agents collaborate has emerged as a critical bottleneck. While single-agent skills can now be distributed as portable assets, multi-agent coordination protocols remain locked within framework-internal code or static configurations, preventing them from being shared across systems or autonomously improved over time. We propose \textbf{Swarm Skills}, a portable specification that extends the Anthropic Skills standard with multi-agent semantics. Swarm Skills turns multi-agent workflows into first-class, distributable assets that consist of roles, workflows, execution bounds, and a built-in semantic structure for self-evolution. To operationalize the specification's evolving nature, we present a companion self-evolution algorithm that automatically distills successful execution trajectories into new Swarm Skills and continuously patches existing ones based on multi-dimensional scoring (Effectiveness, Utilization, and Freshness), eliminating the need for human-in-the-loop oversight during the refinement process. Through an architectural compatibility analysis and a comprehensive qualitative case study using the open-source JiuwenSwarm reference implementation, we demonstrate how Swarm Skills achieves zero-adapter cross-agent portability via progressive disclosure, enabling agent teams to self-evolve their coordination strategies without framework lock-in.

preprint2020arXiv

Experimental Syntheses of Sodalite-like Clathrate EuH$_6$ and EuH$_9$ at Extreme Pressures

The recent discovery of a class of sodalite-like clathrate superhydrides (e.g., YH6, YH9, ThH9, ThH10, and LaH10) at extreme pressures, which exhibit commonly a high-temperature superconductivity with the highest Tc approaching 260 K for LaH10, opened up a new era in search of high-temperature superconductors in metal superhydrides. There is a high interest towards the finding of alternative clathrate superhydrides that might witness the long-dreamed room-temperature superconductivity. Here, we target on the experimental synthesis of strongly-correlated europium (Eu) superhydrides where theory can fail for the prediction of superconductivity. We pressurized and laser-heated the mixture of metal Eu and ammonia borane (NH3BH3) in a diamond anvil cell and successfully synthesized the sodalite-like clathrate EuH6 and EuH9 at conditions of 152 GPa and 1,700 K, and 170 GPa and 2,800 K, respectively. Two non-clathrate structured phases of EuH5 and EuH6 were also synthesized that are not reported in lanthanide superhydrides. Calculated large H-derived electronic density of states at the Fermi level in clathrate EuH6 implies the potential of high temperature superconductivity. Our work created a model superhydride platform for subsequent investigation on how strongly-correlated effect in electronic structure can affect the superconductivity of superhydrides, a phenomenon that is not known thus far.

preprint2020arXiv

Picosecond-precision optical time transfer in free space using flexible binary offset carrier modulation

Free-space optical time transfer that features high precision and flexibility will act a crucial role in near-future ground-to-satellite/inter-satellite clock networks and outdoor timing services. Here we propose a free-space optical flexible-binary-offset-carrier-modulated (FlexBOC-modulated) time transfer method. The utilized FlexBOC modulation could yield a comparative precision, although its occupied bandwidth is tremendously reduced by at least 97.5% compared to optical binary phase modulation. Meanwhile, the adoption of optical techniques eliminates the multi-path effect that is major limit in the current microwave satellite time transfer system. What's more, the time interval measurement avoids a continuous link that may be routinely broken by physical obstructions. For verification, a time transfer experiment with our home-built system between two sites separated by a 30-m free-space path outside the laboratory was conducted. Over a 15 h period, the time deviation is 2.3 ps in a 1-s averaging time, and averages down to 1.0 ps until ~60 s. The fractional frequency instability exhibits 4.0E-12 at a gate time of 1 s, and approaches to 2.6E10-15 at 10000 s.

preprint2020arXiv

VI3: a 2D Ising ferromagnet

Two-dimensional (2D) magnetic materials are of great current interest for their promising applications in spintronics. Here we propose the van der Waals (vdW) material VI3 to be a 2D Ising ferromagnet (FM), using density functional calculations, crystal field level diagrams, superexchange model analyses, and Monte Carlo simulations. The $a_{1g}$$^1$$e'_{-}$$^1$ ground state in the trigonal crystal field gives rise to the 2D Ising FM due to a significant single ion anisotropy (SIA) and enhanced FM superexchange both associated with the $S_z$=1 and $L_z$=--1 state of V3+ ions. We find that a tensile strain on the VI3 monolayer further stabilizes the $a_{1g}$$^1$$e'_{-}$$^1$ ground state, and its Curie temperature ($T_{\rm C}$) would increase from 70 K to 90-110 K under a 2.5-5\% tensile strain. Moreover, we suggest a group of spin-orbital states with a strong SIA which may help to search more 2D Ising magnets.