Researcher profile

Pengyu Yang

Pengyu Yang contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 13 - Unverified
Verification L1
Unclaimed author
2 works
0 followers
4 topics
4 close collaborators

Published work

2 published items

preprint · 2026 · arXiv

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Constructing large-scale datasets for the GitHub issue resolution task is crucial for both training and evaluating the software engineering capabilities of Large Language Models (LLMs). However, the existing GitHub issue resolution data construction pipeline is challenging and labor-intensive. We identify three key limitations in existing pipelines: (1) test patches collected often omit binary file changes; (2) the manual construction of evaluation environments is labor-intensive; and (3) the fail2pass validation phase requires manual inspection of test logs and writing custom parsing code to extract test status from logs. In this paper, we propose SWE-Factory, a fully automated issue resolution data construction pipeline, to resolve these limitations. First, our pipeline automatically recovers missing binary test files and ensures the correctness of test patches. Second, we introduce SWE-Builder, an LLM-based multi-agent system that automates evaluation environment construction. Third, we introduce a standardized, exit-code-based log parsing method to automatically extract test status, enabling a fully automated fail2pass validation. Experiments on 671 real-world GitHub issues across four programming languages show that our method can effectively construct valid evaluation environments for GitHub issues at a reasonable cost. For example, with GPT-4.1 mini, our SWE-Builder constructs 337 valid task instances out of 671 issues, at $0.047 per instance. Our ablation study further shows the effectiveness of different components of SWE-Builder. We also demonstrate through manual inspection that our exit-code-based fail2pass validation method is highly accurate, achieving an F1 score of 0.99. Additionally, we conduct an exploratory experiment to investigate whether we can use SWE-Factory to enhance models' software engineering ability.
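
The exit-code-based fail2pass check mentioned in the abstract can be illustrated with a minimal sketch. This is not the SWE-Factory implementation; it assumes only that a test command exits with code 0 on success and non-zero on failure, and that an instance counts as fail2pass when the tests fail before the fix and pass after it. The function names `run_exit_code` and `is_fail2pass` and the stand-in commands are hypothetical.

```python
import subprocess
import sys


def run_exit_code(cmd: list[str]) -> int:
    """Run a test command and return only its exit code; logs are not parsed."""
    return subprocess.run(cmd, capture_output=True).returncode


def is_fail2pass(before_cmd: list[str], after_cmd: list[str]) -> bool:
    """Assumed criterion: non-zero exit before the fix, zero exit after it."""
    before = run_exit_code(before_cmd)
    after = run_exit_code(after_cmd)
    return before != 0 and after == 0


# Hypothetical stand-ins for "run the test suite" before and after a patch.
failing = [sys.executable, "-c", "import sys; sys.exit(1)"]
passing = [sys.executable, "-c", "import sys; sys.exit(0)"]

if __name__ == "__main__":
    print(is_fail2pass(failing, passing))
    print(is_fail2pass(passing, passing))
```

Relying on the exit code rather than on log text avoids writing a custom parser for each test framework, which is the manual step the abstract identifies as a limitation.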

preprint · 2022 · arXiv

Equidistribution in the space of 3-lattices and Dirichlet-improvable vectors on planar lines

Let $X=\text{SL}_3(\mathbb{R})/\text{SL}_3(\mathbb{Z})$, and $g_t=\text{diag}(e^{2t}, e^{-t}, e^{-t})$. Let $\nu$ denote the push-forward of the normalized Lebesgue measure on a segment of a straight line in the expanding horosphere of $\{g_t\}_{t>0}$, under the map $h\mapsto h\text{SL}_3(\mathbb{Z})$ from $\text{SL}_3(\mathbb{R})$ to $X$. We give explicit necessary and sufficient Diophantine conditions on the line for equidistribution of each of the following families of measures on $X$: (1) $g_t$-translates of $\nu$ as $t\to\infty$. (2) averages of $g_t$-translates of $\nu$ over $t\in[0,T]$ as $T\to\infty$. (3) $g_{t_i}$-translates of $\nu$ for some $t_i\to\infty$. We apply this dynamical result to show that Lebesgue-almost every point on the planar line $y=ax+b$ is not Dirichlet-improvable if and only if $(a,b)\notin\mathbb{Q}^2$.