Source author record

Haoyang Chen

Haoyang Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

math-ph, math.AP, math.MP, Artificial Intelligence, Computation and Language, Cryptography and Security, gr-qc, Machine Learning, math.DG, Neurons and Cognition, physics.flu-dyn, Programming Languages, Software Engineering

Catalog footprint

What is connected

6 works

13 topics

4 close collaborators

preprint, 2026, arXiv

A neural network for modeling human concept formation, understanding and communication

A remarkable capability of the human brain is to form more abstract conceptual representations from sensorimotor experiences and flexibly apply them independent of direct sensory inputs. However, the computational mechanism underlying this ability remains poorly understood. Here, we present a dual-module neural network framework, the CATS Net, to bridge this gap. Our model consists of a concept-abstraction module that extracts low-dimensional conceptual representations, and a task-solving module that performs visual judgement tasks under the hierarchical gating control of the formed concepts. The system develops transferable semantic structure based on concept representations that enable cross-network knowledge transfer through conceptual communication. Model-brain fitting analyses reveal that these emergent concept spaces align with both a neurocognitive semantic model and brain response structures in the human ventral occipitotemporal cortex, while the gating mechanisms mirror those in the semantic control brain network. This work establishes a unified computational framework that can offer mechanistic insights for understanding human conceptual cognition and engineering artificial systems with human-like conceptual intelligence.

preprint, 2026, arXiv

CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging

Debugging CUDA programs has long been challenging because failures often arise from subtle interactions among hardware behavior, compiler decisions, memory hierarchy, and asynchronous execution. More importantly, with the rapid expansion of GPU usage across scientific computing, machine learning, graphics, and systems workloads, CUDA debugging has become more challenging than ever. Current evaluations of LLM-based CUDA programming largely miss this setting: a model can pass correctness tests with repair by degeneration, simplifying the CUDA code into a safer but slower program that abandons the original optimization structure. We introduce CUDABEAVER, a benchmark for CUDA debugging from real failing workspaces produced during LLM-based CUDA generation. Each task provides the broken candidate, native build/test commands, raw error evidence, and a single editable file. CUDABEAVER evaluates whether a fixer truly repairs the failing CUDA code or merely finds a slower test-passing replacement, reporting results by failure category, debugging trajectory, stagnation mode, and performance preservation. We further propose pass@k(M,C,A), a protocol-conditional CUDA debugging metric that makes the fixer M, corpus C, and protocol axes A explicit. Using this metric across 213 tasks and seven frontier LLMs, we show that protocol-aware evaluation gives a more faithful view of CUDA debugging ability: when performance-loss tolerance is high, fixers appear much stronger, but even a slightly stricter performance requirement can sharply reduce measured success, shifting scores by up to 40 percentage points.
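The abstract does not spell out how pass@k(M,C,A) is computed, so the following is only a minimal sketch of how a protocol-conditional score might behave. It assumes the commonly used unbiased pass@k estimator and a single hypothetical protocol axis, a slowdown tolerance `max_slowdown`; the function names, the attempt format, and the toy data are illustrative assumptions, not the benchmark's actual definitions.

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Standard unbiased pass@k estimate: probability that at least one of
    k attempts drawn without replacement from n attempts (c successful)
    is a success. Assumed here; the abstract gives no formula."""
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        return 1.0  # every size-k draw must contain a success
    return 1.0 - comb(n - c, k) / comb(n, k)

def count_successes(attempts, max_slowdown: float) -> int:
    """Count attempts that pass the tests AND stay within the tolerance.
    Each attempt is a hypothetical (passed_tests, slowdown) pair, where
    slowdown = repaired runtime / reference runtime."""
    return sum(1 for passed, slowdown in attempts
               if passed and slowdown <= max_slowdown)

def protocol_pass_at_k(tasks, k: int, max_slowdown: float) -> float:
    """Mean pass@k over a corpus of tasks under one protocol setting."""
    scores = [pass_at_k(len(a), count_successes(a, max_slowdown), k)
              for a in tasks]
    return sum(scores) / len(scores)

# Toy corpus (made-up data): two tasks, four fixer attempts each.
tasks = [
    [(True, 1.0), (True, 3.0), (False, 1.0), (False, 1.0)],
    [(True, 2.5), (False, 1.0), (False, 1.0), (False, 1.0)],
]

lenient = protocol_pass_at_k(tasks, k=1, max_slowdown=5.0)
strict = protocol_pass_at_k(tasks, k=1, max_slowdown=1.1)
print(f"lenient={lenient:.3f} strict={strict:.3f}")
```

In this toy data the same attempts score lower once slower test-passing repairs stop counting as successes, which is the kind of protocol sensitivity the abstract describes.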

preprint, 2025, arXiv

RAJ-PGA: Reasoning-Activated Jailbreak and Principle-Guided Alignment Framework for Large Reasoning Models

Large Reasoning Models (LRMs) face a distinct safety vulnerability: their internal reasoning chains may generate harmful content even when the final output appears benign. To address this overlooked risk, we first propose a novel attack paradigm, Reasoning-Activated Jailbreak (RAJ) via Concretization, which demonstrates that refining malicious prompts to be more specific can trigger step-by-step logical reasoning that overrides the model's safety protocols. To systematically mitigate this vulnerability, we further develop a scalable framework for constructing high-quality safety alignment datasets. This framework first leverages the RAJ attack to elicit challenging harmful reasoning chains from LRMs, then transforms these high-risk traces into safe, constructive, and educational responses through a tailored Principle-Guided Alignment (PGA) mechanism. We then introduce the PGA dataset, a verified alignment dataset of 3,989 samples constructed with our proposed method. Extensive experiments show that fine-tuning LRMs with the PGA dataset significantly enhances model safety, achieving up to a 29.5% improvement in defense success rates across multiple jailbreak benchmarks. Critically, our approach not only defends against sophisticated reasoning-based attacks but also preserves, and even enhances, the model's general reasoning capabilities. This work provides a scalable and effective pathway for safety alignment in reasoning-intensive AI systems, addressing the core trade-off between safety and functional performance.

preprint, 2022, arXiv

$H^{\frac{11}{4}}(\mathbb{R}^2)$ ill-posedness for 2D Elastic Wave system

In this paper, we prove that for the 2D elastic wave equations, a physical system with multiple wave-speeds, the Cauchy problem fails to be locally well-posed in $H^{\frac{11}{4}}(\mathbb{R}^2)$. The ill-posedness here is driven by instantaneous shock formation. In 2D, Smith and Tataru showed that the Cauchy problem for a single quasilinear wave equation is locally well-posed in $H^s$ with $s>\frac{11}{4}$. Hence our $H^{\frac{11}{4}}$ ill-posedness obtained here is a desired result. Our proof relies on combining a geometric method and an algebraic wave-decomposition approach, equipped with detailed analysis of the corresponding hyperbolic system.

preprint, 2022, arXiv

Quantitative blow-up estimates for spacelike singularities in gravitational-collapse cosmological spacetimes

Under spherical symmetry, with double-null coordinates $(u,v)$, we study the gravitational collapse of the Einstein--scalar field system with a positive cosmological constant. The spacetime singularities arise when area radius $r$ vanishes and they are spacelike. We derive new quantitative estimates, obtain polynomial blow-up rates $O(1/r^N)$ for various quantities, and extend the results in [5] by the first author and Zhang and the arguments in [3] by the first author and Gajic to the cosmological settings. In particular, we sharpen the estimates of $r\partial_u r$ and $r\partial_v r$ in [5] and prove that the spacelike singularities where $r(u,v)=0$ are $C^{1,1/3}$ in $(u,v)$ coordinates. As an application, these estimates also give quantitative blow-up upper bounds of fluid velocity and density for the hard-phase model of the Einstein-Euler system under irrotational assumption. Near the timelike infinity, we also generalize the theorems in [3] by linking the precise blow-up rates of the Kretschmann scalar to the exponential Price's law along the event horizon. In cosmological settings, this further reveals the mass-inflation phenomena along the spacelike singularities for the first time.

preprint, 2020, arXiv

Low regularity ill-posedness for elastic waves driven by shock formation

In this paper, we construct counterexamples to the local existence of low-regularity solutions to elastic wave equations in three spatial dimensions (3D). Inspired by the recent works of Christodoulou, we generalize Lindblad's classic results on the scalar wave equation by showing that the Cauchy problem for 3D elastic waves, a physical system with multiple wave-speeds, is ill-posed in $H^3(\mathbb{R}^3)$. We further prove that the ill-posedness is caused by instantaneous shock formation, which is characterized by the vanishing of the inverse foliation density. The main difficulties of the 3D case come from the multiple wave-speeds and their associated non-strict hyperbolicity. We obtain the desired results by designing and combining a geometric approach and an algebraic approach, equipped with detailed studies and calculations of the structures and coefficients of the corresponding non-strictly hyperbolic system. Moreover, the ill-posedness we depict also applies to 2D elastic waves, which correspond to a strictly hyperbolic case.