Researcher profile

Haoyang Chen

Haoyang Chen contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

A neural network for modeling human concept formation, understanding and communication

A remarkable capability of the human brain is to form more abstract conceptual representations from sensorimotor experiences and flexibly apply them independent of direct sensory inputs. However, the computational mechanism underlying this ability remains poorly understood. Here, we present a dual-module neural network framework, the CATS Net, to bridge this gap. Our model consists of a concept-abstraction module that extracts low-dimensional conceptual representations, and a task-solving module that performs visual judgement tasks under the hierarchical gating control of the formed concepts. The system develops transferable semantic structure based on concept representations that enable cross-network knowledge transfer through conceptual communication. Model-brain fitting analyses reveal that these emergent concept spaces align with both neurocognitive semantic model and brain response structures in the human ventral occipitotemporal cortex, while the gating mechanisms mirror that in the semantic control brain network. This work establishes a unified computational framework that can offer mechanistic insights for understanding human conceptual cognition and engineering artificial systems with human-like conceptual intelligence.

preprint2026arXiv

CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging

Debugging CUDA programs has long been challenging because failures often arise from subtle interactions among hardware behavior, compiler decisions, memory hierarchy, and asynchronous execution. More importantly, with the rapid expansion of GPU usage across scientific computing, machine learning, graphics, and systems workloads, CUDA debugging has become more challenging than ever. Current evaluations of LLM-based CUDA programming largely miss this setting: a model can pass correctness tests with repair by degeneration, simplifying the CUDA code into a safer but slower program that abandons the original optimization structure. We introduce CUDABEAVER, a benchmark for CUDA debugging from real failing workspaces produced during LLM-based CUDA generation. Each task provides the broken candidate, native build/test commands, raw error evidence, and a single editable file. CUDABEAVER evaluates whether a fixer truly repairs the failing CUDA code or merely finds a slower test-passing replacement, reporting results by failure category, debugging trajectory, stagnation mode, and performance preservation. We further propose pass@k(M,C,A), a protocol-conditional CUDA debugging metric by making the fixer M, corpus C, and protocol axes Aexplicit. Using this metric across 213 tasks and seven frontier LLMs, we show that protocol-aware evaluation gives a more faithful view of CUDA debugging ability: when performance-loss tolerance is high, fixers appear much stronger, but even a minor stricter performance requirement can sharply reduce measured success, shifting scores by up to 40 percentage points.

preprint2025arXiv

RAJ-PGA: Reasoning-Activated Jailbreak and Principle-Guided Alignment Framework for Large Reasoning Models

Large Reasoning Models (LRMs) face a distinct safety vulnerability: their internal reasoning chains may generate harmful content even when the final output appears benign. To address this overlooked risk, we first propose a novel attack paradigm, Reasoning-Activated Jailbreak (RAJ) via Concretization, which demonstrates that refining malicious prompts to be more specific can trigger step-by-step logical reasoning that overrides the model's safety protocols. To systematically mitigate this vulnerability, we further develop a scalable framework for constructing high-quality safety alignment datasets. This framework first leverages the RAJ attack to elicit challenging harmful reasoning chains from LRMs, then transforms these high-risk traces into safe, constructive, and educational responses through a tailored Principle-Guided Alignment (PGA) mechanism. Then, we introduce the PGA dataset, a verified alignment dataset containing 3,989 samples using our proposed method. Extensive experiments show that fine-tuning LRMs with PGA dataset significantly enhances model safety, achieving up to a 29.5% improvement in defense success rates across multiple jailbreak benchmarks. Critically, our approach not only defends against sophisticated reasoning-based attacks but also preserves, even enhances, the model's general reasoning capabilities. This work provides a scalable and effective pathway for safety alignment in reasoning-intensive AI systems, addressing the core trade-off between safety and functional performance.

preprint2022arXiv

$H^{\frac{11}{4}}(\mathbb{R}^2)$ ill-posedness for 2D Elastic Wave system

In this paper, we prove that for the 2D elastic wave equations, a physical system with multiple wave-speeds, its Cauchy problem fails to be locally well-posed in $H^{\frac{11}{4}}(\mathbb{R}^2)$. The ill-posedness here is driven by instantaneous shock formation. In 2D Smith-Tataru showed that the Cauchy problem for a single quasilinear wave equation is locally well-posed in $H^s$ with $s>\frac{11}{4}$. Hence our $H^{\frac{11}{4}}$ ill-posedness obtained here is a desired result. Our proof relies on combining a geometric method and an algebraic wave-decomposition approach, equipped with detailed analysis of the corresponding hyperbolic system.

preprint2022arXiv

Quantitative blow-up estimates for spacelike singularities in gravitational-collapse cosmological spacetimes

Under spherical symmetry, with double-null coordinates $(u,v)$, we study the gravitational collapse of the Einstein--scalar field system with a positive cosmological constant. The spacetime singularities arise when area radius $r$ vanishes and they are spacelike. We derive new quantitative estimates, obtain polynomial blow-up rates $O(1/r^N)$ for various quantities, and extend the results in [5] by the first author and Zhang and the arguments in [3] by the first author and Gajic to the cosmological settings. In particular, we sharpen the estimates of $r\partial_u r$ and $r\partial_v r$ in [5] and prove that the spacelike singularities where $r(u,v)=0$ are $C^{1,1/3}$ in $(u,v)$ coordinates. As an application, these estimates also give quantitative blow-up upper bounds of fluid velocity and density for the hard-phase model of the Einstein-Euler system under irrotational assumption. Near the timelike infinity, we also generalize the theorems in [3] by linking the precise blow-up rates of the Kretschmann scalar to the exponential Price's law along the event horizon. In cosmological settings, this further reveals the mass-inflation phenomena along the spacelike singularities for the first time.

preprint2020arXiv

Low regularity ill-posedness for elastic waves driven by shock formation

In this paper, we construct counterexamples to the local existence of low-regularity solutions to elastic wave equations in three spatial dimensions (3D). Inspired by the recent works of Christodoulou, we generalize Lindblad's classic results on the scalar wave equation by showing that the Cauchy problem for 3D elastic waves, a physical system with multiple wave-speeds, is ill-posed in $H^3(\mathbb{R}^3)$. We further prove that the ill-posedness is caused by instantaneous shock formation, which is characterized by the vanishing of the inverse foliation density. The main difficulties of the 3D case come from the multiple wave-speeds and its associated non-strict hyperbolicity. We obtain the desired results by designing and combining a geometric approach and an algebraic approach, equipped with detailed studies and calculations of the structures and coefficients of the corresponding non-strictly hyperbolic system. Moreover, the ill-posedness we depict also applies to 2D elastic waves, which corresponds to a strictly hyperbolic case.