Researcher profile

Chao Peng

Chao Peng contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
9 works
0 followers
10 topics
4 close collaborators

Published work

9 published items

preprint, 2025, arXiv

AdaGReS: Adaptive Greedy Context Selection via Redundancy-Aware Scoring for Token-Budgeted RAG

Retrieval-augmented generation (RAG) is highly sensitive to the quality of selected context, yet standard top-k retrieval often returns redundant or near-duplicate chunks that waste token budget and degrade downstream generation. We present AdaGReS, a redundancy-aware context selection framework for token-budgeted RAG that optimizes a set-level objective combining query-chunk relevance and intra-set redundancy penalties. AdaGReS performs greedy selection under a token-budget constraint using marginal gains derived from the objective, and introduces a closed-form, instance-adaptive calibration of the relevance-redundancy trade-off parameter to eliminate manual tuning and adapt to candidate-pool statistics and budget limits. We further provide a theoretical analysis showing that the proposed objective exhibits epsilon-approximate submodularity under practical embedding similarity conditions, yielding near-optimality guarantees for greedy selection. Experiments on open-domain question answering (Natural Questions) and a high-redundancy biomedical (drug) corpus demonstrate consistent improvements in redundancy control and context quality, translating to better end-to-end answer quality and robustness across settings.
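The greedy, budget-constrained selection described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the cosine similarity measure, the sum-of-pairwise-similarities penalty, the fixed trade-off weight `lam` (the paper instead calibrates this parameter in closed form per instance), and all names (`greedy_select`, `QUERY`, `CHUNKS`) are assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def greedy_select(query, chunks, budget, lam=0.5):
    """Greedily pick chunk indices under a token budget.

    chunks: list of (embedding, token_count) pairs.
    Assumed set objective: sum of relevances minus lam times the sum of
    pairwise similarities, so the marginal gain of a candidate is
    relevance(candidate) - lam * sum(similarity to already selected chunks).
    lam is a fixed, hypothetical weight here, not the paper's calibration.
    """
    selected, used = [], 0
    remaining = set(range(len(chunks)))
    while remaining:
        best, best_gain = None, 0.0
        for i in sorted(remaining):
            emb, tokens = chunks[i]
            if used + tokens > budget:
                continue  # chunk does not fit in the remaining budget
            relevance = cosine(query, emb)
            redundancy = sum(cosine(emb, chunks[j][0]) for j in selected)
            gain = relevance - lam * redundancy
            if gain > best_gain:
                best, best_gain = i, gain
        if best is None:
            break  # nothing fits, or no candidate has positive marginal gain
        selected.append(best)
        used += chunks[best][1]
        remaining.remove(best)
    return selected


# Toy data: chunks 0 and 1 are near-duplicates; chunk 2 is less relevant but distinct.
QUERY = [1.0, 0.4]
CHUNKS = [([1.0, 0.0], 10), ([1.0, 0.02], 10), ([0.3, 1.0], 10)]

print(greedy_select(QUERY, CHUNKS, budget=20, lam=0.5))  # redundancy-aware
print(greedy_select(QUERY, CHUNKS, budget=20, lam=0.0))  # relevance only
```

With `lam=0.0` the procedure reduces to relevance-ordered selection and keeps both near-duplicates; with a positive `lam` the second slot goes to the distinct chunk instead.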

preprint, 2025, arXiv

Benchmarking LLMs for Fine-Grained Code Review with Enriched Context in Practice

Code review is a cornerstone of software quality assurance, and recent advances in Large Language Models (LLMs) have shown promise in its automation. However, existing benchmarks for LLM-based code review face three major limitations. (1) Lack of semantic context: most benchmarks provide only code diffs without textual information such as issue descriptions, which are crucial for understanding developer intent. (2) Data quality issues: without rigorous validation, many samples are noisy (e.g., reviews on outdated or irrelevant code), reducing evaluation reliability. (3) Coarse granularity: most benchmarks operate at the file or commit level, overlooking the fine-grained, line-level reasoning essential for precise review. We introduce ContextCRBench, a high-quality, context-rich benchmark for fine-grained LLM evaluation in code review. Our construction pipeline comprises: Raw Data Crawling, collecting 153.7K issues and pull requests from top-tier repositories; Comprehensive Context Extraction, linking issue-PR pairs for textual context and extracting the full surrounding function or class for code context; and Multi-stage Data Filtering, combining rule-based and LLM-based validation to remove outdated, malformed, or low-value samples, resulting in 67,910 context-enriched entries. ContextCRBench supports three evaluation scenarios aligned with the review workflow: hunk-level quality assessment, line-level defect localization, and line-level comment generation. Evaluating eight leading LLMs (four closed-source and four open-source) reveals that textual context yields greater performance gains than code context alone, while current LLMs remain far from human-level review ability. Deployed at ByteDance, ContextCRBench drives a self-evolving code review system, improving performance by 61.98% and demonstrating its robustness and industrial utility. https://github.com/kinesiatricssxilm14/ContextCRBench.

preprint, 2022, arXiv

Low-threshold nanolasers based on miniaturized bound states in the continuum

The pursuit of compact lasers with low thresholds has imposed strict requirements on tight light confinement with minimized radiation losses. Bound states in the continuum (BICs) have recently been demonstrated as an effective mechanism to trap light along the out-of-plane direction, paving the way to low-threshold lasers. To date, most reported BIC lasers are still bulky due to the absence of in-plane light confinement. In this work, we combine BICs and photonic band gaps to realize three-dimensional (3D) light confinement, referred to as miniaturized (mini-) BICs. Together with the 3D carrier confinement provided by quantum dots (QDs) as optical gain materials, we have realized highly compact active BIC resonators with a record-high quality ($Q$) factor of up to 32500, which enables single-mode continuous-wave (CW) lasing with the lowest threshold, 80 W/cm$^{2}$, among the reported BIC lasers. In addition, our photon statistics measurements under both CW and pulsed excitation confirm the occurrence of the phase transition from spontaneous emission to stimulated emission, further suggesting that the conventional criteria of input-output and linewidth are not sufficient for claiming nanoscale lasing. Our work reveals a path towards compact BIC lasers with ultra-low power consumption and could boost applications in cavity quantum electrodynamics (QED), nonlinear optics and integrated photonics.

preprint, 2022, arXiv

Monolithic Active Pixel Sensors on CMOS technologies

Collider detectors have taken advantage of the resolution and accuracy of silicon detectors for at least four decades. Future colliders will need large areas of silicon sensors for low-mass trackers and sampling calorimetry. Monolithic Active Pixel Sensors (MAPS), in which Si diodes and readout circuitry are combined in the same pixels and which can be fabricated in some standard CMOS processes, are a promising technology for high-granularity and light detectors. In this paper we review 1) the requirements on MAPS for trackers and electromagnetic calorimeters (ECal) at future collider experiments, 2) the ongoing efforts towards dedicated MAPS for the Electron-Ion Collider (EIC) at BNL, for which the EIC Silicon Consortium was already instantiated, and 3) space-borne applications for MeV $\gamma$-ray experiments with MAPS-based trackers (AstroPix).

preprint, 2022, arXiv

Performance of photosensors in a high-rate environment for gas Cherenkov detectors

The solenoidal large intensity device (SoLID) at Jefferson Lab will push the boundaries of luminosity for a large-acceptance detector, which necessitates the use of a light-gas threshold Cherenkov counter for online event selection. Due to the high luminosity, the single-photon background rate in this counter can exceed 160 kHz/cm$^2$ at the photosensors. Therefore, it is essential to validate the high-rate limits of the planned photosensors and readout electronics in order to mitigate the risk of failure. We report on the design and an early set of studies carried out using a small telescopic Cherenkov device in a high-rate environment up to 60 kHz/cm$^2$, in Hall C at Jefferson Lab. Commercially available multi-anode photomultipliers (MaPMT) and low-cost large-area picosecond photodetectors (LAPPD) were tested using the JLab FADC250 modules for readout. The test beam results show that the MaPMT array and the internal stripline LAPPD can detect and identify single-electron and pair-production events in high-rate environments. Due to its higher quantum efficiency, the MaPMT array provided a better separation between the single-electron and the pair-production events compared to the internal stripline LAPPD. A GEANT4 simulation confirms the experimental performance of our telescopic device.

preprint, 2022, arXiv

Roadmap on Topological Photonics

Topological photonics seeks to control the behaviour of light through the design of protected topological modes in photonic structures. While this approach originated from studying the behaviour of electrons in solid-state materials, it has since blossomed into a field that is at the very forefront of the search for new topological types of matter. This can have real implications for future technologies by harnessing the robustness of topological photonics for applications in photonic devices. This Roadmap surveys some of the main emerging areas of research within topological photonics, with special attention to questions in fundamental science, which photonics is in an ideal position to address. Each section provides an overview of the current and future challenges within a part of the field, highlighting the most exciting opportunities for future research and developments.

preprint, 2021, arXiv

Advanced extraction of the deuteron charge radius from electron-deuteron scattering data

To extract the charge radius of the proton, $r_{p}$, from the electron scattering data, the PRad collaboration at Jefferson Lab has developed a rigorous framework for finding the best functional forms - the fitters - for a robust extraction of $r_{p}$ from a wide variety of sample functions for the range and uncertainties of the PRad data. In this paper we utilize and further develop this framework. Herein we discuss methods for searching for the best fitter candidates as well as a procedure for testing the robustness of extraction of the deuteron charge radius, $r_{d}$, from parametrizations based on elastic electron-deuteron scattering data. The ansatz proposed in this paper for the robust extraction of $r_{d}$, for the proposed low-$Q^{2}$ DRad experiment at Jefferson Lab, can be further improved once there are more data.

preprint, 2021, arXiv

CFNS Ad-Hoc meeting on Radiative Corrections Whitepaper

Current precision scattering experiments, and even more so many experiments planned for the Electron-Ion Collider, will be limited by systematics. From the theory side, a fundamental source of systematic uncertainty is the correct treatment of radiative effects. To gauge the current state of technique and knowledge, to help the cross-pollination between different directions of nuclear physics, and to give input to the yellow report process, the community met in an ad-hoc workshop hosted by the Center for Frontiers in Nuclear Science, Stony Brook University. This whitepaper is a collection of contributions to this workshop.

preprint, 2018, arXiv

Topologically Enabled Ultra-high-Q Guided Resonances Robust to Out-of-plane Scattering

Due to their ability to confine light, optical resonators are of great importance to science and technology, yet their performance is often limited by out-of-plane scattering losses from inevitable fabrication imperfections. Here, we theoretically propose and experimentally demonstrate a class of guided resonances in photonic crystal slabs, where out-of-plane scattering losses are strongly suppressed due to their topological nature. Specifically, these resonances arise when multiple bound states in the continuum - each carrying a topological charge - merge in momentum space and enhance the quality factors of all resonances nearby. We experimentally achieve quality factors as high as $4.9\times 10^5$ based on these resonances in the telecommunication regime, which is 12 times higher than ordinary designs. We further show this enhancement is robust across the samples we fabricated. Our work paves the way for future explorations of topological photonics in systems with open boundary conditions and their applications in improving optoelectronic devices in photonic integrated circuits.