Source author record

Pengyu Yang

Pengyu Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher Unclaimed source record

Artificial Intelligence math.DS math.NT Software Engineering

Catalog footprint

What is connected

2 works

4 topics

4 close collaborators

preprint 2026 arXiv

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

Constructing large-scale datasets for the GitHub issue resolution task is crucial for both training and evaluating the software engineering capabilities of Large Language Models (LLMs). However, the existing GitHub issue resolution data construction pipeline is challenging and labor-intensive. We identify three key limitations in existing pipelines: (1) test patches collected often omit binary file changes; (2) the manual construction of evaluation environments is labor-intensive; and (3) the fail2pass validation phase requires manual inspection of test logs and writing custom parsing code to extract test status from logs. In this paper, we propose SWE-Factory, a fully automated issue resolution data construction pipeline, to resolve these limitations. First, our pipeline automatically recovers missing binary test files and ensures the correctness of test patches. Second, we introduce SWE-Builder, an LLM-based multi-agent system that automates evaluation environment construction. Third, we introduce a standardized, exit-code-based log parsing method to automatically extract test status, enabling a fully automated fail2pass validation. Experiments on 671 real-world GitHub issues across four programming languages show that our method can effectively construct valid evaluation environments for GitHub issues at a reasonable cost. For example, with GPT-4.1 mini, our SWE-Builder constructs 337 valid task instances out of 671 issues, at $0.047 per instance. Our ablation study further shows the effectiveness of different components of SWE-Builder. We also demonstrate through manual inspection that our exit-code-based fail2pass validation method is highly accurate, achieving an F1 score of 0.99. Additionally, we conduct an exploratory experiment to investigate whether we can use SWE-Factory to enhance models' software engineering ability.
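
The exit-code-based fail2pass idea described in the abstract can be illustrated with a minimal sketch. It assumes the test runner appends its exit code to the log as a marker line, and that an instance counts as fail2pass when the tests fail before the fix and pass after it. The `EXIT_CODE=<n>` marker format and the function names `parse_exit_code` and `is_fail2pass` are hypothetical; the abstract does not specify the actual log format or implementation.

```python
import re
from typing import Optional

# Hypothetical marker line; the real log format is not given in the abstract.
EXIT_CODE_PATTERN = re.compile(r"^EXIT_CODE=(-?\d+)\s*$", re.MULTILINE)


def parse_exit_code(log: str) -> Optional[int]:
    """Return the last exit code recorded in a test log, or None if absent."""
    matches = EXIT_CODE_PATTERN.findall(log)
    return int(matches[-1]) if matches else None


def is_fail2pass(log_before_fix: str, log_after_fix: str) -> bool:
    """True when tests fail (non-zero exit) before the fix and pass (zero) after."""
    before = parse_exit_code(log_before_fix)
    after = parse_exit_code(log_after_fix)
    if before is None or after is None:
        # Without a marker in both logs the status cannot be determined.
        return False
    return before != 0 and after == 0
```

Relying on a single exit code avoids writing a separate parser for each test framework's log format, which is the manual step the abstract identifies as a limitation.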

preprint 2022 arXiv

Equidistribution in the space of 3-lattices and Dirichlet-improvable vectors on planar lines

Let $X=\text{SL}_3(\mathbb{R})/\text{SL}_3(\mathbb{Z})$, and $g_t=\text{diag}(e^{2t}, e^{-t}, e^{-t})$. Let $\nu$ denote the push-forward of the normalized Lebesgue measure on a segment of a straight line in the expanding horosphere of $\{g_t\}_{t>0}$, under the map $h\mapsto h\text{SL}_3(\mathbb{Z})$ from $\text{SL}_3(\mathbb{R})$ to $X$. We give explicit necessary and sufficient Diophantine conditions on the line for equidistribution of each of the following families of measures on $X$: (1) $g_t$-translates of $\nu$ as $t\to\infty$. (2) averages of $g_t$-translates of $\nu$ over $t\in[0,T]$ as $T\to\infty$. (3) $g_{t_i}$-translates of $\nu$ for some $t_i\to\infty$. We apply this dynamical result to show that Lebesgue-almost every point on the planar line $y=ax+b$ is not Dirichlet-improvable if and only if $(a,b)\notin\mathbb{Q}^2$.