Researcher profile

Hsuan-Wei Lee

Hsuan-Wei Lee contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2026arXiv

How Exploration Breaks Cooperation in Shared-Policy Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning in dynamic social dilemmas commonly relies on parameter sharing to enable scalability. We show that in shared-policy Deep Q-Network learning, standard exploration can induce a robust and systematic collapse of cooperation even in environments where fully cooperative equilibria are stable and payoff dominant. Through controlled experiments, we demonstrate that shared DQN converges to stable but persistently low-cooperation regimes. This collapse is not caused by reward misalignment, noise, or insufficient training, but by a representational failure arising from partial observability combined with parameter coupling across heterogeneous agent states. Exploration-driven updates bias the shared representation toward locally dominant defection responses, which then propagate across agents and suppress cooperative learning. We confirm that the failure persists across network sizes, exploration schedules, and payoff structures, and disappears when parameter sharing is removed or when agents maintain independent representations. These results identify a fundamental failure mode of shared-policy MARL and establish structural conditions under which scalable learning architectures can systematically undermine cooperation. Our findings provide concrete guidance for the design of multi-agent learning systems in social and economic environments where collective behavior is critical.

preprint2022arXiv

When costly migration helps to improve cooperation

Motion is a typical reaction among animals and humans trying to reach better conditions in a changing world. This aspect has been studied intensively in social dilemmas where competing players' individual and collective interests are in conflict. Starting from the traditional public goods game model, where players are locally fixed and unconditional cooperators or defectors are present, we introduce two additional strategies through which agents can change their positions of dependence on the local cooperation level. More importantly, these so-called sophisticated players should bear an extra cost to maintain their permanent capacity to evaluate their neighborhood and react accordingly. Hence, four strategies compete, and the most successful one can be imitated by its neighbors. Crucially, the introduction of costly movement has a highly biased consequence on the competing main strategies. In the majority of parameter space, it is harmful to defectors and provides a significantly higher cooperation level when the population is rare. At an intermediate population density, which would be otherwise optimal for a system of immobile players, the presence of mobile actors could be detrimental if the interaction pattern changes slightly, thereby blocking the optimal percolation of information flow. In this parameter space, sophisticated cooperators can also show the co-called Moor effect by first avoiding the harmful vicinity of defectors; they subsequentially transform into an immobile cooperator state. Hence, paradoxically, the additional cost of movement could be advantageous to reach a higher general income, especially for a rare population when subgroups would be isolated otherwise.

preprint2020arXiv

Status hierarchy and group cooperation: A generalized model

In a refreshing mathematical investigation, Mark (2018) shows that status hierarchy may facilitate the emergence of cooperation in groups. Despite the contribution, the present paper notes that there are limitations in Mark's model that makes it less realistic than it could in explaining real-world experiences. Consequently, we present a more generalized modified framework in which his model is a special case, by developing and introducing a new hierarchy measure into the model to estimate the cooperation level in a set of hierarchical structures omitted in Mark's work yet common in everyday life--those with multiple leaders. We derived the conditions under which cooperation can emerge in these groups, and verified our analytical predictions in agent-based computer simulations. In so doing, not only does our model elaborate on its predecessor and support Mark's general prediction. For theory, our work further reveals two novel phenomena of group cooperation: Both the relative number of cooperators to defectors in groups and the assortativity among these different roles can backfire; they are not always the higher, the better for cooperation to thrive. For methodology, the hierarchy measure developed and our model using the measure may also be applied in future research on a wide range of related topics.