Source author record

Xiangdong Chen

Xiangdong Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Robotics math.OC physics.optics quant-ph

Catalog footprint

What is connected

4works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

C-CoT: Counterfactual Chain-of-Thought with Vision-Language Models for Safe Autonomous Driving

Safety-critical planning in complex environments, particularly at urban intersections, remains a fundamental challenge for autonomous driving. Existing methods, whether rule-based or data-driven, frequently struggle to capture complex scene semantics, infer potential risks, and make reliable decisions in rare, high-risk situations. While vision-language models (VLMs) offer promising approaches for safe decision-making in these environments, most current approaches lack reflective and causal reasoning, thereby limiting their overall robustness. To address this, we propose a counterfactual chain-of-thought (C-CoT) framework that leverages VLMs to decompose driving decisions into five sequential stages: scene description, critical object identification, risk prediction, counterfactual risk reasoning, and final action planning. Within the counterfactual reasoning stage, we introduce a structured meta-action evaluation tree to explicitly assess the potential consequences of alternative action combinations. This self-reflective reasoning establishes causal links between action choices and safety outcomes, improving robustness in long-tail and out-of-distribution scenarios. To validate our approach, we construct the DeepAccident-CCoT dataset based on the DeepAccident benchmark and fine-tune a Qwen2.5-VL (7B) model using low-rank adaptation. Our model achieves a risk prediction recall of 81.9%, reduces the collision rate to 3.52%, and lowers L2 error to 1.98 m. Ablation studies further confirm the critical role of counterfactual reasoning and the meta-action evaluation tree in enhancing safety and interpretability.

preprint2026arXiv

SafeAlign-VLA: A Negative-Enhanced Safe Alignment Framework for Risk-Aware Autonomous Driving

End-to-end autonomous driving systems excel in common scenarios but struggle with safety-critical long-tail cases. Vision-Language-Action (VLA) models are promising due to their strong reasoning capabilities. However, most VLA-based approaches rely on positive expert demonstrations, rarely exploiting negative samples, leading to insufficient understanding of risky behaviors and safety boundaries. To address this limitation, we propose SafeAlign-VLA, a unified negative-enhanced safe alignment framework that incorporates negative data into supervised learning and reinforcement learning. First, we develop a counterfactual safety pairing paradigm to generate structured safety labels and counterfactual positive trajectories from risky scenarios via counterfactual reasoning. Then, a two-stage training strategy is adopted: negative-enhanced supervised fine-tuning for failure feedback and trajectory correction, followed by anchor-based group relative policy optimization that uses positive and negative trajectories as contrastive anchors to steer sampling and penalize high-risk behaviors via group-relative advantages. Experiments on NAVSIM and DeepAccident validate the proposed framework. SafeAlign-VLA achieves 89.1 PDMS on the NAVSIM v1 testset, improving over the baseline without negative data by 1.3%. On DeepAccident, it reduces the collision rate to 3.36%, while achieving 84.2% language accuracy and 85.8% risk prediction accuracy. These results demonstrate the effectiveness of the proposed negative-enhanced safe alignment framework for safe and robust autonomous driving.

preprint2021arXiv

Network-level rhythmic control of heterogeneous automated traffic with buses

Guaranteeing the quality of transit service is of great importance to promote the attractiveness of buses and alleviate urban traffic issues such as congestion and pollution. Emerging technologies of automated driving and V2X communication have the potential to enable the accurate control of vehicles and the efficient organization of traffic to enhance both the schedule adherence of buses and the overall network mobility. This study proposes an innovative network-level control scheme for heterogeneous automated traffic composed of buses and private cars under a full connected and automated environment. Inheriting the idea of network-level rhythmic control proposed by Lin et al. (2020), an augmented rhythmic control scheme for heterogeneous traffic, i.e., RC-H, is established to organize the mixed traffic in a rhythmic manner. Realized virtual platoons are designed for accommodating vehicles to pass through the network, including dedicated virtual platoons for buses to provide exclusive right-of-ways (ROWs) on their trips and regular virtual platoons for private cars along with an optimal assignment plan to minimize the total travel cost. A mixed-integer linear program (MILP) is formulated to optimize the RC-H scheme and a bilevel heuristic solution method is designed to relieve the computational burden of MILP. Numerical examples and simulation experiments are conducted to evaluate the performance of the RC-H scheme under different scenarios. The results show that the bus operation can be guaranteed and the travel delay can be minimized under various demand levels with transit priority. Moreover, compared with traffic signal control strategies, the RC-H scheme has significant advantages in handling massive traffic demand, in terms of both vehicle delay and network throughput.

preprint2014arXiv

Sub-diffraction optical manipulation of the chargestate of nitrogen vacancy center in diamond

As a potential candidate for quantum computation and metrology, the nitrogen vacancy (NV)center in diamond presented both challenges and opportunities resulted from charge state conversion. By utilizing different lasers for the photon-induced charge state conversion, we achieved the sub-diffraction charge state manipulation. The charge state depletion (CSD) microscopy resolution was improved to 4.1 nm by optimizing the laser pulse sequences. Subsequently, the electron spin state dynamics of adjacent NV centers were selectively detected via the CSD. The experimental results demonstrated that the CSD can improve the spatial resolution of the measurement of NV centers for nanoscale sensing and quantum information.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint