Source author record

Shiyang Lai

Shiyang Lai appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Computation and Language, physics.soc-ph, cs.CY, Machine Learning, Populations and Evolution, q-fin.ST

Catalog footprint

What is connected

4 works

6 topics

4 close collaborators


preprint · 2026 · arXiv

Can Editing LLMs Inject Harm?

Large Language Models (LLMs) have emerged as a new information channel. Meanwhile, one critical but under-explored question is: Is it possible to bypass the safety alignment and inject harmful information into LLMs stealthily? In this paper, we propose to reformulate knowledge editing as a new type of safety threat for LLMs, namely Editing Attack, and conduct a systematic investigation with a newly constructed dataset EditAttack. Specifically, we focus on two typical safety risks of Editing Attack: Misinformation Injection and Bias Injection. For the first risk, we find that editing attacks can inject both commonsense and long-tail misinformation into LLMs, and that effectiveness is particularly high for the former. For the second risk, we discover that not only can biased sentences be injected into LLMs with high effectiveness, but a single biased-sentence injection can also degrade the overall fairness. Then, we further illustrate the high stealthiness of editing attacks. Our discoveries demonstrate the emerging misuse risks of knowledge editing techniques on compromising the safety alignment of LLMs and the feasibility of disseminating misinformation or bias with LLMs as new channels.

preprint · 2026 · arXiv

Reasoning Models Generate Societies of Thought

Large language models have achieved remarkable capabilities across domains, yet mechanisms underlying sophisticated reasoning remain elusive. Recent reasoning models outperform comparable instruction-tuned models on complex cognitive tasks, attributed to extended computation through longer chains of thought. Here we show that enhanced reasoning emerges not from extended computation alone, but from simulating multi-agent-like interactions -- a society of thought -- which enables diversification and debate among internal cognitive perspectives characterized by distinct personality traits and domain expertise. Through quantitative analysis and mechanistic interpretability methods applied to reasoning traces, we find that reasoning models like DeepSeek-R1 and QwQ-32B exhibit much greater perspective diversity than instruction-tuned models, activating broader conflict between heterogeneous personality- and expertise-related features during reasoning. This multi-agent structure manifests in conversational behaviors, including question-answering, perspective shifts, and the reconciliation of conflicting views, and in socio-emotional roles that characterize sharp back-and-forth conversations, together accounting for the accuracy advantage in reasoning tasks. Controlled reinforcement learning experiments reveal that base models increase conversational behaviors when rewarded solely for reasoning accuracy, and fine-tuning models with conversational scaffolding accelerates reasoning improvement over base models. These findings indicate that the social organization of thought enables effective exploration of solution spaces. We suggest that reasoning models establish a computational parallel to collective intelligence in human groups, where diversity enables superior problem-solving when systematically structured, which suggests new opportunities for agent organization to harness the wisdom of crowds.

preprint · 2020 · arXiv

Inferring incubation period distribution of COVID-19 based on SEAIR Model

To reduce the biases of traditional survey-based methods, this paper proposes an epidemic-model-based approach to infer the incubation period distribution of COVID-19 using publicly reported confirmed case numbers. We construct an epidemic model, namely SEAIR, and take advantage of the dynamic transmission process depicted by SEAIR to estimate the daily onset probability of exposed individuals in eight impacted countries. Based on these estimates, the general incubation probability distribution of COVID-19 is obtained. The proposed method can avoid several biases of traditional survey-based methods. However, because the method relies on a mathematical model, the inference results are somewhat sensitive to the parameter settings. Therefore, this method should be applied with care and with some prior understanding of the studied epidemic.

preprint · 2020 · arXiv

Using detrended deconvolution foreign exchange network to identify currency status

This article proposes a hybrid detrended deconvolution foreign exchange network (DDFEN) construction method, which combines the detrended cross-correlation analysis coefficient (DCCC) with the network deconvolution method. DDFEN is designed to reveal the 'true' correlation between currencies by filtering out indirect effects in foreign exchange networks (FXNs). The empirical results show that DDFEN reflects long-term changes in currency status and is more stable than traditional network construction methods.
