Researcher profile

Pengcheng Li

Pengcheng Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2026arXiv

AIVD: Adaptive Edge-Cloud Collaboration for Accurate and Efficient Industrial Visual Detection

Multimodal large language models (MLLMs) demonstrate exceptional capabilities in semantic understanding and visual reasoning, yet they still face challenges in precise object localization and resource-constrained edge-cloud deployment. To address this, this paper proposes the AIVD framework, which achieves unified precise localization and high-quality semantic generation through the collaboration between lightweight edge detectors and cloud-based MLLMs. To enhance the cloud MLLM's robustness against edge cropped-box noise and scenario variations, we design an efficient fine-tuning strategy with visual-semantic collaborative augmentation, significantly improving classification accuracy and semantic consistency. Furthermore, to maintain high throughput and low latency across heterogeneous edge devices and dynamic network conditions, we propose a heterogeneous resource-aware dynamic scheduling algorithm. Experimental results demonstrate that AIVD substantially reduces resource consumption while improving MLLM classification performance and semantic generation quality. The proposed scheduling strategy also achieves higher throughput and lower latency across diverse scenarios.

preprint2026arXiv

Mitigating Many-shot Jailbreak Attacks with One Single Demonstration

Many-shot jailbreaking (MSJ) causes safety-aligned language models to answer harmful queries by preceding them with many harmful question-answer demonstrations. We study why this attack becomes stronger as the number of demonstrations increases. Empirically, we find that MSJ induces a progressive activation drift: the representation of a fixed harmful query moves step by step away from the safety-aligned region as more harmful demonstrations are added. Theoretically, we show that this drift can be interpreted as implicit malicious fine-tuning: conditioning on N harmful demonstrations induces SGD-style updates equivalent to optimizing on the corresponding N harmful samples. This view turns the attack mechanism into a defense principle. We append a fixed one-shot safety demonstration at inference time, which induces a counteracting safety-oriented update and restores refusal behavior. The resulting method improves the model's robustness to MSJ without modifying its parameters or requiring white-box access at deployment. Code is available at https://github.com/Thecommonirin/SafeEnd.

preprint2026arXiv

ThinkDrive: Chain-of-Thought Guided Progressive Reinforcement Learning Fine-Tuning for Autonomous Driving

With the rapid advancement of large language models (LLMs) technologies, their application in the domain of autonomous driving has become increasingly widespread. However, existing methods suffer from unstructured reasoning, poor generalization, and misalignment with human driving intent. While Chain-of-Thought (CoT) reasoning enhances decision transparency, conventional supervised fine-tuning (SFT) fails to fully exploit its potential, and reinforcement learning (RL) approaches face instability and suboptimal reasoning depth. We propose ThinkDrive, a CoT guided progressive RL fine-tuning framework for autonomous driving that synergizes explicit reasoning with difficulty-aware adaptive policy optimization. Our method employs a two-stage training strategy. First, we perform SFT using CoT explanations. Then, we apply progressive RL with a difficulty-aware adaptive policy optimizer that dynamically adjusts learning intensity based on sample complexity. We evaluate our approach on a public dataset. The results show that ThinkDrive outperforms strong RL baselines by 1.45%, 1.95%, and 1.01% on exam, easy-exam, and accuracy, respectively. Moreover, a 2B-parameter model trained with our method surpasses the much larger GPT-4o by 3.28% on the exam metric.

preprint2022arXiv

Accessing the in-medium effects on nucleon-nucleon elastic cross section with collective flows and nuclear stopping

A systematic study of the in-medium correction factor ($F$) on nucleon-nucleon elastic cross section is performed within the Ultra-relativistic Quantum Molecular Dynamics (UrQMD) model. The effects of the beam energy dependence of $F$ on the directed, elliptic flow and nuclear stopping in $^{197}$Au+$^{197}$Au collisions with energy ranging from $0.09$ to $0.8A$ GeV are explored. It is found that the directed, elliptic flow and nuclear stopping at relatively low energies are very sensitive to $F$, and the sensitivity gradually weakens with increasing beam energy. The beam energy dependent in-medium correction factor $F$ is deduced from the comparison of the excitation functions of the directed, elliptic flow and nuclear stopping between the calculated results and the FOPI experimental data.

preprint2022arXiv

On Modular Cohomotopy Groups

Let $p$ be a prime and let $π^n(X;\mathbb{Z}/p^r)=[X,M_n(\mathbb{Z}/p^r)]$ be the set of homotopy classes of based maps from CW-complexes $X$ into the mod $p^r$ Moore spaces $M_n(\mathbb{Z}/p^r)$ of degree $n$, where $\mathbb{Z}/p^r$ denotes the integers mod $p^r$. In this paper we firstly determine the modular cohomotopy groups $π^n(X;\mathbb{Z}/p^r)$ up to extensions by classical methods of primary cohomology operations and give conditions for the splitness of the extensions. Secondly we utilize some unstable homotopy theory of Moore spaces to study the modular cohomotopy groups; especially, the group $π^3(X;\mathbb{Z}_{(2)})$ with $\dim(X)\leq 6$ is determined.

preprint2022arXiv

Self-closeness numbers of product spaces

The self-closeness number of a CW-complex is a homotopy invariant defined by the minimal number $n$ such that every self-maps of $X$ which induces automorphisms on the first $n$ homotopy groups of $X$ is a homotopy equivalence. In this article we study the self-closeness numbers of finite Cartesian products, and prove that under certain conditions (called reducibility), the self-closeness number of product spaces equals to the maximum of self-closeness numbers of the factors. A series of criteria for the reducibility are investigated, and the results are used to determine self-closeness numbers of product spaces of some special spaces, such as Moore spaces, Eilenberg-MacLane spaces or atomic spaces.

preprint2021arXiv

Proton correlations and apparent intermittency in the UrQMD model with hadronic potentials

It is shown that the inclusion of hadronic interactions, and in particular nuclear potentials, in simulations of heavy ion collisions at the SPS energy range can lead to obvious correlations of protons. These correlations contribute significantly to an intermittency analysis as performed at the NA61 experiment. The beam energy and system size dependence is studied by comparing the resulting intermittency index for heavy ion collisions of different nuclei at beam energies of $40A$, $80A$ and $150A$ GeV. The resulting intermittency index from our simulations is similar to the reported values of the NA61 collaboration, if nuclear interactions are included. The observed apparent intermittency signal is the result of the correlated proton pairs with small relative transverse momentum $Δp_{t}$, which would be enhanced by hadronic potentials, and this correlation between the protons is slightly influenced by the coalescence parameters and the relative invariant four-momentum $q_{inv}$ cut.

preprint2020arXiv

(Co)Homology Self-closeness Numbers of Simply-connected Spaces

The (co)homology self-closeness number of a simply-connected based CW-complexes $X$ is the minimal number $k$ such that any self-map $f$ of $X$ inducing an automorphism of the (co)homology groups for dimensions$\leq k$ is a self-homotopy equivalence. These two numbers are homotopy invariants and have a close relation with the group of self-homotopy equivalences. In this paper, we compare the (co)homology self-closeness numbers of spaces in certain cofibrations, define the mod $p$ (co)homology self-closeness number of simply-connected $p$-local spaces with finitely generated homologies and study some properties of the (mod $p$) (co)homology self-closeness numbers.

preprint2020arXiv

DaSGD: Squeezing SGD Parallelization Performance in Distributed Training Using Delayed Averaging

The state-of-the-art deep learning algorithms rely on distributed training systems to tackle the increasing sizes of models and training data sets. Minibatch stochastic gradient descent (SGD) algorithm requires workers to halt forward/back propagations, to wait for gradients aggregated from all workers, and to receive weight updates before the next batch of tasks. This synchronous execution model exposes the overheads of gradient/weight communication among a large number of workers in a distributed training system. We propose a new SGD algorithm, DaSGD (Local SGD with Delayed Averaging), which parallelizes SGD and forward/back propagations to hide 100% of the communication overhead. By adjusting the gradient update scheme, this algorithm uses hardware resources more efficiently and reduces the reliance on the low-latency and high-throughput inter-connects. The theoretical analysis and the experimental results show its convergence rate O(1/sqrt(K)), the same as SGD. The performance evaluation demonstrates it enables a linear performance scale-up with the cluster size.

preprint2020arXiv

Elliptic flow splitting between protons and antiprotons from hadronic potentials

The difference in elliptic flow $v_{2}$ between protons and antiprotons, produced in $^{197}\text{Au}+^{197}\text{Au}$ collisions at center-of-mass energies $\sqrt{s_{NN}}=5-12~\text{GeV}$, is studied within a modified version of the ultrarelativistic quantum molecular dynamics (UrQMD) model. Two different model scenarios are compared: the cascade mode and the mean field mode which includes potential interactions for both formed and pre-formed hadrons. The model results for the elliptic flow of protons and the relative $v_{2}$ difference between protons and antiprotons obtained from the mean field mode agree with the available experimental data, while the $v_{2}$ difference is near zero for the cascade mode. Our results show that the elliptic flow splitting, observed for particles and antiparticles, can be explained by the inclusion of proper hadronic interactions. In addition, the difference in $v_{2}$ between protons and antiprotons depends on the centrality and the rapidity window. With smaller centrality and/or rapidity acceptance, the observed elliptic flow splitting is more sensitive to the beam energy, indicating a strong net baryon density dependence of the effect. We propose to confirm this splitting at the upcoming experiments from Beam Energy Scan (BES) Phase-\Rmnum{2} at Relativistic Heavy Ion Collider (RHIC), the Compressed Baryonic Matter (CBM) at Facility for Antiproton and Ion Research (FAIR), High Intensity heavy ion Accelerator Facility (HIAF) and Nuclotron-based Ion Collider fAcility (NICA).

preprint2020arXiv

Learning Forward Reuse Distance

Caching techniques are widely used in the era of cloud computing from applications, such as Web caches to infrastructures, Memcached and memory caches in computer architectures. Prediction of cached data can greatly help improve cache management and performance. The recent advancement of deep learning techniques enables the design of novel intelligent cache replacement policies. In this work, we propose a learning-aided approach to predict future data accesses. We find that a powerful LSTM-based recurrent neural network model can provide high prediction accuracy based on only a cache trace as input. The high accuracy results from a carefully crafted locality-driven feature design. Inspired by the high prediction accuracy, we propose a pseudo OPT policy and evaluate it upon 13 real-world storage workloads from Microsoft Research. Results demonstrate that the new cache policy improves state-of-art practical policies by up to 19.2% and incurs only 2.3% higher miss ratio than OPT on average.

preprint2020arXiv

The TianQin project: current progress on science and technology

TianQin is a planned space-based gravitational wave (GW) observatory consisting of three earth orbiting satellites with an orbital radius of about $10^5~{\rm km}$. The satellites will form a equilateral triangle constellation the plane of which is nearly perpendicular to the ecliptic plane. TianQin aims to detect GWs between $10^{-4}~{\rm Hz}$ and $1~{\rm Hz}$ that can be generated by a wide variety of important astrophysical and cosmological sources, including the inspiral of Galactic ultra-compact binaries, the inspiral of stellar-mass black hole binaries, extreme mass ratio inspirals, the merger of massive black hole binaries, and possibly the energetic processes in the very early universe or exotic sources such as cosmic strings. In order to start science operations around 2035, a roadmap called the 0123 plan is being used to bring the key technologies of TianQin to maturity, supported by the construction of a series of research facilities on the ground. Two major projects of the 0123 plan are being carried out. In this process, the team has created a new generation $17~{\rm cm}$ single-body hollow corner-cube retro-reflector which has been launched with the QueQiao satellite on 21 May 2018; a new laser ranging station equipped with a $1.2~{\rm m}$ telescope has been constructed and the station has successfully ranged to all the five retro-reflectors on the Moon; and the TianQin-1 experimental satellite has been launched on 20 December 2019 and the first round result shows that the satellite has exceeded all of its mission requirements.