Source author record

Pengcheng Li

Pengcheng Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

math.AT; nucl-th; Artificial Intelligence; Distributed, Parallel, and Cluster Computing; gr-qc; Machine Learning; astro-ph.IM; Computer Vision; cond-mat.mes-hall; cond-mat.mtrl-sci; Cryptography and Security; hep-th; math.GT

Catalog footprint

What is connected

15 works

13 topics

4 close collaborators

preprint · 2026 · arXiv

AIVD: Adaptive Edge-Cloud Collaboration for Accurate and Efficient Industrial Visual Detection

Multimodal large language models (MLLMs) demonstrate exceptional capabilities in semantic understanding and visual reasoning, yet they still face challenges in precise object localization and resource-constrained edge-cloud deployment. To address this, this paper proposes the AIVD framework, which achieves unified precise localization and high-quality semantic generation through the collaboration between lightweight edge detectors and cloud-based MLLMs. To enhance the cloud MLLM's robustness against edge cropped-box noise and scenario variations, we design an efficient fine-tuning strategy with visual-semantic collaborative augmentation, significantly improving classification accuracy and semantic consistency. Furthermore, to maintain high throughput and low latency across heterogeneous edge devices and dynamic network conditions, we propose a heterogeneous resource-aware dynamic scheduling algorithm. Experimental results demonstrate that AIVD substantially reduces resource consumption while improving MLLM classification performance and semantic generation quality. The proposed scheduling strategy also achieves higher throughput and lower latency across diverse scenarios.

preprint · 2026 · arXiv

Mitigating Many-shot Jailbreak Attacks with One Single Demonstration

Many-shot jailbreaking (MSJ) causes safety-aligned language models to answer harmful queries by preceding them with many harmful question-answer demonstrations. We study why this attack becomes stronger as the number of demonstrations increases. Empirically, we find that MSJ induces a progressive activation drift: the representation of a fixed harmful query moves step by step away from the safety-aligned region as more harmful demonstrations are added. Theoretically, we show that this drift can be interpreted as implicit malicious fine-tuning: conditioning on N harmful demonstrations induces SGD-style updates equivalent to optimizing on the corresponding N harmful samples. This view turns the attack mechanism into a defense principle. We append a fixed one-shot safety demonstration at inference time, which induces a counteracting safety-oriented update and restores refusal behavior. The resulting method improves the model's robustness to MSJ without modifying its parameters or requiring white-box access at deployment. Code is available at https://github.com/Thecommonirin/SafeEnd.

preprint · 2026 · arXiv

ThinkDrive: Chain-of-Thought Guided Progressive Reinforcement Learning Fine-Tuning for Autonomous Driving

With the rapid advancement of large language model (LLM) technologies, their application to autonomous driving has become increasingly widespread. However, existing methods suffer from unstructured reasoning, poor generalization, and misalignment with human driving intent. While Chain-of-Thought (CoT) reasoning enhances decision transparency, conventional supervised fine-tuning (SFT) fails to fully exploit its potential, and reinforcement learning (RL) approaches face instability and suboptimal reasoning depth. We propose ThinkDrive, a CoT-guided progressive RL fine-tuning framework for autonomous driving that synergizes explicit reasoning with difficulty-aware adaptive policy optimization. Our method employs a two-stage training strategy. First, we perform SFT using CoT explanations. Then, we apply progressive RL with a difficulty-aware adaptive policy optimizer that dynamically adjusts learning intensity based on sample complexity. We evaluate our approach on a public dataset. The results show that ThinkDrive outperforms strong RL baselines by 1.45%, 1.95%, and 1.01% on exam, easy-exam, and accuracy, respectively. Moreover, a 2B-parameter model trained with our method surpasses the much larger GPT-4o by 3.28% on the exam metric.

preprint · 2022 · arXiv

Accessing the in-medium effects on nucleon-nucleon elastic cross section with collective flows and nuclear stopping

A systematic study of the in-medium correction factor ($F$) on the nucleon-nucleon elastic cross section is performed within the Ultra-relativistic Quantum Molecular Dynamics (UrQMD) model. The effects of the beam energy dependence of $F$ on the directed flow, elliptic flow, and nuclear stopping in $^{197}$Au+$^{197}$Au collisions with energy ranging from $0.09$ to $0.8A$ GeV are explored. It is found that the directed flow, elliptic flow, and nuclear stopping at relatively low energies are very sensitive to $F$, and the sensitivity gradually weakens with increasing beam energy. The beam-energy-dependent in-medium correction factor $F$ is deduced by comparing the calculated excitation functions of the directed flow, elliptic flow, and nuclear stopping with the FOPI experimental data.

preprint · 2022 · arXiv

On Modular Cohomotopy Groups

Let $p$ be a prime and let $\pi^n(X;\mathbb{Z}/p^r)=[X,M_n(\mathbb{Z}/p^r)]$ be the set of homotopy classes of based maps from CW-complexes $X$ into the mod $p^r$ Moore spaces $M_n(\mathbb{Z}/p^r)$ of degree $n$, where $\mathbb{Z}/p^r$ denotes the integers mod $p^r$. In this paper we firstly determine the modular cohomotopy groups $\pi^n(X;\mathbb{Z}/p^r)$ up to extensions by classical methods of primary cohomology operations and give conditions for the splitness of the extensions. Secondly we utilize some unstable homotopy theory of Moore spaces to study the modular cohomotopy groups; especially, the group $\pi^3(X;\mathbb{Z}_{(2)})$ with $\dim(X)\leq 6$ is determined.

preprint · 2022 · arXiv

Self-closeness numbers of product spaces

The self-closeness number of a CW-complex $X$ is a homotopy invariant defined as the minimal number $n$ such that every self-map of $X$ that induces automorphisms on the first $n$ homotopy groups of $X$ is a homotopy equivalence. In this article we study the self-closeness numbers of finite Cartesian products, and prove that under certain conditions (called reducibility), the self-closeness number of a product space equals the maximum of the self-closeness numbers of the factors. A series of criteria for reducibility are investigated, and the results are used to determine the self-closeness numbers of products of some special spaces, such as Moore spaces, Eilenberg-MacLane spaces or atomic spaces.
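The definition and the product formula stated in this abstract can be written in symbols as follows. The notation $\mathcal{A}_\#^n(X)$, $\mathcal{E}(X)$ and $N\mathcal{E}(X)$ is an assumption introduced here for illustration and may differ from the paper's; the precise reducibility hypothesis is not spelled out in the abstract and is left abstract below.

```latex
% A_#^n(X): self-maps inducing automorphisms on the first n homotopy groups.
% E(X): the group of self-homotopy equivalences of X.
\[
  \mathcal{A}_\#^n(X) = \bigl\{\, f \in [X,X] \;:\; f_\# \colon \pi_k(X) \to \pi_k(X)
  \text{ is an isomorphism for all } k \le n \,\bigr\},
\]
\[
  N\mathcal{E}(X) = \min\bigl\{\, n \;:\; \mathcal{A}_\#^n(X) = \mathcal{E}(X) \,\bigr\}.
\]
% Product formula claimed in the abstract, valid under the reducibility conditions:
\[
  N\mathcal{E}(X_1 \times \cdots \times X_m) = \max_{1 \le i \le m} N\mathcal{E}(X_i).
\]
```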

preprint · 2021 · arXiv

Proton correlations and apparent intermittency in the UrQMD model with hadronic potentials

It is shown that the inclusion of hadronic interactions, and in particular nuclear potentials, in simulations of heavy ion collisions at the SPS energy range can lead to obvious correlations of protons. These correlations contribute significantly to an intermittency analysis as performed at the NA61 experiment. The beam energy and system size dependence is studied by comparing the resulting intermittency index for heavy ion collisions of different nuclei at beam energies of $40A$, $80A$ and $150A$ GeV. The resulting intermittency index from our simulations is similar to the reported values of the NA61 collaboration, if nuclear interactions are included. The observed apparent intermittency signal is the result of the correlated proton pairs with small relative transverse momentum $\Delta p_{t}$, which would be enhanced by hadronic potentials, and this correlation between the protons is slightly influenced by the coalescence parameters and the relative invariant four-momentum $q_{inv}$ cut.

preprint · 2020 · arXiv

(Co)Homology Self-closeness Numbers of Simply-connected Spaces

The (co)homology self-closeness number of a simply-connected based CW-complex $X$ is the minimal number $k$ such that any self-map $f$ of $X$ inducing an automorphism of the (co)homology groups in dimensions $\leq k$ is a self-homotopy equivalence. These two numbers are homotopy invariants and have a close relation with the group of self-homotopy equivalences. In this paper, we compare the (co)homology self-closeness numbers of spaces in certain cofibrations, define the mod $p$ (co)homology self-closeness number of simply-connected $p$-local spaces with finitely generated homologies and study some properties of the (mod $p$) (co)homology self-closeness numbers.

preprint · 2020 · arXiv

DaSGD: Squeezing SGD Parallelization Performance in Distributed Training Using Delayed Averaging

State-of-the-art deep learning algorithms rely on distributed training systems to handle the increasing sizes of models and training data sets. The minibatch stochastic gradient descent (SGD) algorithm requires workers to halt forward/back propagation, wait for gradients aggregated from all workers, and receive weight updates before the next batch of tasks. This synchronous execution model exposes the overheads of gradient/weight communication among a large number of workers in a distributed training system. We propose a new SGD algorithm, DaSGD (Local SGD with Delayed Averaging), which parallelizes SGD and forward/back propagation to hide 100% of the communication overhead. By adjusting the gradient update scheme, this algorithm uses hardware resources more efficiently and reduces the reliance on low-latency, high-throughput interconnects. The theoretical analysis and the experimental results show that its convergence rate is $O(1/\sqrt{K})$, the same as SGD. The performance evaluation demonstrates that it enables linear performance scale-up with the cluster size.
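To make the idea of overlapping communication with computation concrete, here is a toy sketch of local SGD in which an averaging round is started without blocking and its result is applied a few steps later. It is not the paper's algorithm: the merge rule (average of the snapshots plus each worker's local progress since the snapshot), the scalar quadratic objective, and all names and parameters are assumptions chosen for illustration.

```python
import random


def delayed_averaging_sgd(num_workers=4, steps=200, lr=0.1, sync_every=5,
                          delay=2, target=3.0, noise=0.1, seed=0):
    """Toy local SGD on f(w) = 0.5 * (w - target)^2 with delayed averaging.

    Assumed merge rule (hypothetical, not taken from the paper): when the
    average of an earlier snapshot arrives, each worker keeps the progress
    it made since that snapshot and adds it to the average.
    Requires delay < sync_every so only one round is in flight at a time.
    """
    rng = random.Random(seed)
    weights = [0.0] * num_workers
    pending = None  # (step at which the average arrives, snapshot of weights)

    for step in range(1, steps + 1):
        # Every worker takes a local step on a noisy gradient; nobody waits.
        for k in range(num_workers):
            grad = (weights[k] - target) + rng.gauss(0.0, noise)
            weights[k] -= lr * grad

        # Start an averaging round in the background.
        if step % sync_every == 0 and pending is None:
            pending = (step + delay, list(weights))

        # The averaged snapshot arrives after `delay` further local steps.
        if pending is not None and step == pending[0]:
            snapshot = pending[1]
            average = sum(snapshot) / num_workers
            weights = [average + (w - s) for w, s in zip(weights, snapshot)]
            pending = None

    return weights


final_weights = delayed_averaging_sgd()
```

In this toy setting all workers end up close to the minimizer `target`, even though no worker ever pauses for communication; whether the same merge rule matches the paper's update scheme would need to be checked against the paper itself.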

preprint · 2020 · arXiv

Elliptic flow splitting between protons and antiprotons from hadronic potentials

The difference in elliptic flow $v_{2}$ between protons and antiprotons, produced in $^{197}\text{Au}+^{197}\text{Au}$ collisions at center-of-mass energies $\sqrt{s_{NN}}=5-12~\text{GeV}$, is studied within a modified version of the ultrarelativistic quantum molecular dynamics (UrQMD) model. Two different model scenarios are compared: the cascade mode and the mean field mode which includes potential interactions for both formed and pre-formed hadrons. The model results for the elliptic flow of protons and the relative $v_{2}$ difference between protons and antiprotons obtained from the mean field mode agree with the available experimental data, while the $v_{2}$ difference is near zero for the cascade mode. Our results show that the elliptic flow splitting, observed for particles and antiparticles, can be explained by the inclusion of proper hadronic interactions. In addition, the difference in $v_{2}$ between protons and antiprotons depends on the centrality and the rapidity window. With smaller centrality and/or rapidity acceptance, the observed elliptic flow splitting is more sensitive to the beam energy, indicating a strong net baryon density dependence of the effect. We propose to confirm this splitting at the upcoming experiments from Beam Energy Scan (BES) Phase-II at Relativistic Heavy Ion Collider (RHIC), the Compressed Baryonic Matter (CBM) at Facility for Antiproton and Ion Research (FAIR), High Intensity heavy ion Accelerator Facility (HIAF) and Nuclotron-based Ion Collider fAcility (NICA).
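For readers unfamiliar with the observable, the sketch below computes flow coefficients from transverse momenta using the standard definition $v_n = \langle \cos(n\varphi) \rangle$, with the azimuthal angle $\varphi$ measured from the reaction plane. It assumes the reaction plane lies along the x axis, and it returns the absolute difference in $v_2$ rather than the relative difference quoted in the abstract. The function names are hypothetical and are not taken from UrQMD or any analysis package.

```python
import math


def flow_coefficient(momenta, order):
    """Average cos(order * phi) over particles given as (px, py) pairs.

    phi is the azimuthal angle relative to the x axis, which is assumed
    here to coincide with the reaction plane. Particles with zero
    transverse momentum have no defined angle and are skipped.
    """
    total = 0.0
    count = 0
    for px, py in momenta:
        if px == 0.0 and py == 0.0:
            continue
        total += math.cos(order * math.atan2(py, px))
        count += 1
    if count == 0:
        raise ValueError("no particles with non-zero transverse momentum")
    return total / count


def v2_splitting(protons, antiprotons):
    """Absolute difference v2(protons) - v2(antiprotons)."""
    return flow_coefficient(protons, 2) - flow_coefficient(antiprotons, 2)


# Tiny hand-made samples: emission along x, along y, and evenly in four directions.
in_plane = [(1.0, 0.0), (-1.0, 0.0)]
out_of_plane = [(0.0, 1.0), (0.0, -1.0)]
isotropic = in_plane + out_of_plane
```

Positive $v_2$ corresponds to preferential in-plane emission and negative $v_2$ to out-of-plane emission, so a sample emitted evenly in the four directions above has $v_2 = 0$.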

preprint · 2020 · arXiv

Learning Forward Reuse Distance

Caching techniques are widely used in the era of cloud computing, from applications such as Web caches, to infrastructure such as Memcached, to memory caches in computer architectures. Prediction of cached data can greatly help improve cache management and performance. The recent advancement of deep learning techniques enables the design of novel intelligent cache replacement policies. In this work, we propose a learning-aided approach to predict future data accesses. We find that a powerful LSTM-based recurrent neural network model can provide high prediction accuracy based on only a cache trace as input. The high accuracy results from a carefully crafted locality-driven feature design. Inspired by the high prediction accuracy, we propose a pseudo OPT policy and evaluate it on 13 real-world storage workloads from Microsoft Research. Results demonstrate that the new cache policy improves on state-of-the-art practical policies by up to 19.2% and incurs only a 2.3% higher miss ratio than OPT on average.
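The quantity being predicted can be illustrated on a known trace. The sketch below computes, for every access, how many accesses later the same key is referenced again, then uses those exact values to drive a furthest-next-use eviction policy in the spirit of OPT. It measures distance in accesses rather than in distinct keys, which is a simplifying assumption, and it uses exact future knowledge where the paper uses an LSTM predictor. All names are hypothetical.

```python
from typing import Dict, Hashable, List, Optional, Sequence


def forward_reuse_distances(trace: Sequence[Hashable]) -> List[Optional[int]]:
    """For each access, the number of accesses until the same key recurs.

    Returns None for accesses whose key is never referenced again.
    """
    next_index: Dict[Hashable, int] = {}
    distances: List[Optional[int]] = [None] * len(trace)
    # Scan backwards so the next occurrence of each key is already known.
    for i in range(len(trace) - 1, -1, -1):
        key = trace[i]
        if key in next_index:
            distances[i] = next_index[key] - i
        next_index[key] = i
    return distances


def misses_with_furthest_next_use(trace: Sequence[Hashable], capacity: int) -> int:
    """Count misses when evicting the resident key whose next use is furthest away."""
    if capacity < 1:
        raise ValueError("capacity must be at least 1")
    distances = forward_reuse_distances(trace)
    next_use: Dict[Hashable, float] = {}  # resident key -> index of its next access
    misses = 0
    for i, key in enumerate(trace):
        distance = distances[i]
        upcoming = float("inf") if distance is None else i + distance
        if key not in next_use:
            misses += 1
            if len(next_use) >= capacity:
                victim = max(next_use, key=next_use.get)
                del next_use[victim]
        next_use[key] = upcoming
    return misses


example_trace = ["a", "b", "a", "c", "b", "a"]
example_distances = forward_reuse_distances(example_trace)
example_misses = misses_with_furthest_next_use(example_trace, capacity=2)
```

A learned policy would replace `forward_reuse_distances` with model predictions made at access time; the eviction rule itself stays the same.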

preprint · 2020 · arXiv

The TianQin project: current progress on science and technology

TianQin is a planned space-based gravitational wave (GW) observatory consisting of three Earth-orbiting satellites with an orbital radius of about $10^5~{\rm km}$. The satellites will form an equilateral triangle constellation, the plane of which is nearly perpendicular to the ecliptic plane. TianQin aims to detect GWs between $10^{-4}~{\rm Hz}$ and $1~{\rm Hz}$ that can be generated by a wide variety of important astrophysical and cosmological sources, including the inspiral of Galactic ultra-compact binaries, the inspiral of stellar-mass black hole binaries, extreme mass ratio inspirals, the merger of massive black hole binaries, and possibly the energetic processes in the very early universe or exotic sources such as cosmic strings. In order to start science operations around 2035, a roadmap called the 0123 plan is being used to bring the key technologies of TianQin to maturity, supported by the construction of a series of research facilities on the ground. Two major projects of the 0123 plan are being carried out. In this process, the team has created a new-generation $17~{\rm cm}$ single-body hollow corner-cube retro-reflector, which was launched with the QueQiao satellite on 21 May 2018; a new laser ranging station equipped with a $1.2~{\rm m}$ telescope has been constructed and the station has successfully ranged to all five retro-reflectors on the Moon; and the TianQin-1 experimental satellite was launched on 20 December 2019, and the first-round result shows that the satellite has exceeded all of its mission requirements.
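The quoted orbital radius fixes the orbital period through Kepler's third law, $T = 2\pi\sqrt{a^3/GM}$. The short calculation below assumes a circular orbit, takes the radius as measured from Earth's center, and uses the standard value of Earth's gravitational parameter; the function name is hypothetical and the figure it produces is an estimate, not a number stated in the abstract.

```python
import math

# Standard gravitational parameter of Earth, in m^3 / s^2.
GM_EARTH = 3.986004418e14


def circular_orbit_period_days(radius_m, gm=GM_EARTH):
    """Period of a circular orbit of the given radius, from Kepler's third law."""
    period_s = 2.0 * math.pi * math.sqrt(radius_m ** 3 / gm)
    return period_s / 86400.0


# Orbital radius of about 10^5 km, taken as geocentric (an assumption).
tianqin_period_days = circular_orbit_period_days(1.0e8)
```

Under these assumptions the period comes out at a little over three and a half days.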

preprint · 2015 · arXiv

A Review of Proximity Effect Correction in Electron-beam Lithography

I review the work of proximity effect correction (PEC) in electron-beam (e-beam) lithography with emphasis on dose modification and shape modification PEC techniques.

preprint · 2015 · arXiv

Optimization of Spatial Dose Distribution for Controlling Sidewall Shape in Electron-beam Lithography

Electron-beam (e-beam) lithography is widely employed in fabrication of 2-D patterns and 3-D structures. A certain type or shape of the sidewall in the remaining resist profile may be desired in an application, e.g., an undercut for lift-off and a vertical sidewall for etching, or required for a device. Also, as the feature size is decreased well below a micron, a small variation of the sidewall slope can lead to a significant (relative) CD error in certain layers of resist. Therefore, it is important to understand effects of spatial dose distribution on sidewall shape and be able to achieve the desired shape. In this study, via simulation, the relationship among the total dose, spatial distribution of dose, developing time and sidewall shape, and performance of the method developed to optimize the dose distribution for a target sidewall shape have been analyzed. The simulation results have been verified through experiments.

preprint · 2015 · arXiv

Quasinormal modes of Gauss-Bonnet black holes at large D

Einstein's General Relativity theory simplifies dramatically in the limit that the spacetime dimension D is very large. This could still be true in gravity theories with higher derivative terms. In this paper, as the first step to study gravity with a Gauss-Bonnet (GB) term, we compute the quasinormal modes of the spherically symmetric GB black hole in the large D limit. When the GB parameter is small, we find that the non-decoupling modes are the same as in the Schwarzschild case and the decoupled modes are slightly modified by the GB term. However, when the GB parameter is large, we find some novel features. We notice that there is another set of non-decoupling modes due to the appearance of a new plateau in the effective radial potential. Moreover, the effective radial potential for the decoupled vector-type and scalar-type modes becomes more complicated. Nevertheless, we manage to compute the frequencies of these decoupled modes analytically. When the GB parameter is neither very large nor very small, though analytic computation is not possible, the problem is much simplified in the large D expansion and can be treated numerically. We study numerically the vector-type quasinormal modes in this case.
