Source author record

Hsuan-Wei Lee

Hsuan-Wei Lee appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.soc-ph Applications Computer Science and Game Theory econ.GN math.OC Multiagent Systems nlin.AO q-fin.EC

Catalog footprint

What is connected

5works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

How Exploration Breaks Cooperation in Shared-Policy Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning in dynamic social dilemmas commonly relies on parameter sharing to enable scalability. We show that in shared-policy Deep Q-Network learning, standard exploration can induce a robust and systematic collapse of cooperation even in environments where fully cooperative equilibria are stable and payoff dominant. Through controlled experiments, we demonstrate that shared DQN converges to stable but persistently low-cooperation regimes. This collapse is not caused by reward misalignment, noise, or insufficient training, but by a representational failure arising from partial observability combined with parameter coupling across heterogeneous agent states. Exploration-driven updates bias the shared representation toward locally dominant defection responses, which then propagate across agents and suppress cooperative learning. We confirm that the failure persists across network sizes, exploration schedules, and payoff structures, and disappears when parameter sharing is removed or when agents maintain independent representations. These results identify a fundamental failure mode of shared-policy MARL and establish structural conditions under which scalable learning architectures can systematically undermine cooperation. Our findings provide concrete guidance for the design of multi-agent learning systems in social and economic environments where collective behavior is critical.

preprint2022arXiv

When costly migration helps to improve cooperation

Motion is a typical reaction among animals and humans trying to reach better conditions in a changing world. This aspect has been studied intensively in social dilemmas where competing players' individual and collective interests are in conflict. Starting from the traditional public goods game model, where players are locally fixed and unconditional cooperators or defectors are present, we introduce two additional strategies through which agents can change their positions of dependence on the local cooperation level. More importantly, these so-called sophisticated players should bear an extra cost to maintain their permanent capacity to evaluate their neighborhood and react accordingly. Hence, four strategies compete, and the most successful one can be imitated by its neighbors. Crucially, the introduction of costly movement has a highly biased consequence on the competing main strategies. In the majority of parameter space, it is harmful to defectors and provides a significantly higher cooperation level when the population is rare. At an intermediate population density, which would be otherwise optimal for a system of immobile players, the presence of mobile actors could be detrimental if the interaction pattern changes slightly, thereby blocking the optimal percolation of information flow. In this parameter space, sophisticated cooperators can also show the co-called Moor effect by first avoiding the harmful vicinity of defectors; they subsequentially transform into an immobile cooperator state. Hence, paradoxically, the additional cost of movement could be advantageous to reach a higher general income, especially for a rare population when subgroups would be isolated otherwise.

preprint2020arXiv

Status hierarchy and group cooperation: A generalized model

In a refreshing mathematical investigation, Mark (2018) shows that status hierarchy may facilitate the emergence of cooperation in groups. Despite the contribution, the present paper notes that there are limitations in Mark's model that makes it less realistic than it could in explaining real-world experiences. Consequently, we present a more generalized modified framework in which his model is a special case, by developing and introducing a new hierarchy measure into the model to estimate the cooperation level in a set of hierarchical structures omitted in Mark's work yet common in everyday life--those with multiple leaders. We derived the conditions under which cooperation can emerge in these groups, and verified our analytical predictions in agent-based computer simulations. In so doing, not only does our model elaborate on its predecessor and support Mark's general prediction. For theory, our work further reveals two novel phenomena of group cooperation: Both the relative number of cooperators to defectors in groups and the assortativity among these different roles can backfire; they are not always the higher, the better for cooperation to thrive. For methodology, the hierarchy measure developed and our model using the measure may also be applied in future research on a wide range of related topics.

preprint2016arXiv

Prediction and Optimal Scheduling of Advertisements in Linear Television

Advertising is a crucial component of marketing and an important way for companies to raise awareness of goods and services in the marketplace. Advertising campaigns are designed to convey a marketing image or message to an audience of potential consumers and television commercials can be an effective way of transmitting these messages to a large audience. In order to meet the requirements for a typical advertising order, television content providers must provide advertisers with a predetermined number of "impressions" in the target demographic. However, because the number of impressions for a given program is not known a priori and because there are a limited number of time slots available for commercials, scheduling advertisements efficiently can be a challenging computational problem. In this case study, we compare a variety of methods for estimating future viewership patterns in a target demographic from past data. We also present a method for using those predictions to generate an optimal advertising schedule that satisfies campaign requirements while maximizing advertising revenue.

preprint2016arXiv

Transitivity reinforcement in the coevolving voter model

One of the fundamental structural properties of many networks is triangle closure. Whereas the influence of this transitivity on a variety of contagion dynamics has been previously explored, existing models of coevolving or adaptive network systems use rewiring rules that randomize away this important property. In contrast, we study here a modified coevolving voter model dynamics that explicitly reinforces and maintains such clustering. Employing extensive numerical simulations, we establish that the transitions and dynamical states observed in coevolving voter model networks without clustering are altered by reinforcing transitivity in the model. We then use a semi-analytical framework in terms of approximate master equations to predict the dynamical behaviors of the model for a variety of parameter settings.

Hsuan-Wei Lee

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

How Exploration Breaks Cooperation in Shared-Policy Multi-Agent Reinforcement Learning

When costly migration helps to improve cooperation

Status hierarchy and group cooperation: A generalized model

Prediction and Optimal Scheduling of Advertisements in Linear Television

Transitivity reinforcement in the coevolving voter model