Researcher profile

Feng Fu

Feng Fu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

Dynamics of Multi-Agent Actor-Critic Learning in Stochastic Games: from Multistability and Chaos to Stable Cooperation

Achieving robust coordination and cooperation is a central challenge in multi-agent reinforcement learning (MARL). Uncovering the mechanisms underlying such emergent behaviors calls for a dynamical understanding of learn processes. In this work, we investigate the dynamics of actor-critic agents in stochastic games, focusing on the impact of entropy regularization. By leveraging time-scale separation, we derive the system's evolution equations, which are then formally analyzed using dynamical systems theory. We find that in the constant-sum game of Matching Pennies, the system exhibits chaotic behavior. Entropy regularization mitigates this chaos and drives the dynamics toward convergence to fair cooperation. In contrast, in the general-sum game of the Prisoner's Dilemma, the system displays multistability. Interestingly, the three stable equilibria of the system correspond to the well-known ALLC (Always Cooperate), ALLD (Always Defect), and GRIM (Grim Trigger) strategies from evolutionary game theory (EGT). Entropy regularization strengthens system resilience by enlarging the basin of attraction of the cooperative equilibrium. Our findings reveal a close link between the mechanism of direct reciprocity in EGT and how cooperation emerges in MARL, offering insights for designing more robust and collaborative multi-agent systems.

preprint2026arXiv

Strategies of cooperation and defection in five large language models

Large language models (LLMs) are increasingly deployed to support human decision-making. This use of LLMs has concerning implications, especially when their prescriptions affect the welfare of others. To gauge how LLMs make social decisions, we explore whether five leading models produce sensible strategies in the repeated prisoner's dilemma, which is the main metaphor of reciprocal cooperation. First, we measure the propensity of LLMs to cooperate in a neutral setting, without using language reminiscent of how this game is usually presented. We record to what extent LLMs implement Nash equilibria or other well-known strategy classes. Thereafter, we explore how LLMs adapt their strategies to changes in parameter values. We vary the game's continuation probability, the payoff values, and whether the total number of rounds is commonly known. We also study the effect of different framings. In each case, we test whether the adaptations of the LLMs are in line with basic intuition, theoretical predictions of evolutionary game theory, and experimental evidence from human participants. While all LLMs perform well in many of the tasks, none of them exhibit full consistency over all tasks. We also conduct tournaments between the inferred LLM strategies and study direct interaction between LLMs in games over ten rounds with a known or unknown last round. Our experiments shed light on how current LLMs instantiate reciprocal cooperation.

preprint2022arXiv

Eco-Evolutionary Dynamics of Bimatrix Games

Feedbacks between strategies and the environment are common in social-ecological, evolutionary-ecological, and even psychological-economic systems. Utilizing common resources is always a dilemma for community members, like tragedy of the commons. Here we consider replicator dynamics with feedback-evolving games, where the payoffs switch between two different matrices. Although each payoff matrix on its own represents an environment where cooperators and defectors can't coexist stably, we show that it's possible to design appropriate switching control laws and achieve persistent oscillations of strategy abundance. This result should help guide the widespread problem of population state control in microbial experiments and other social problems with eco-evolutionary feedback loops.

preprint2022arXiv

Highly coordinated nationwide massive travel restrictions are central to effective mitigation and control of COVID-19 outbreaks in China

The COVID-19, the disease caused by the novel coronavirus 2019 (SARS-CoV-2), has caused graving woes across the globe since first reported in the epicenter Wuhan, Hubei, China, December 2019. The spread of COVID-19 in China has been successfully curtailed by massive travel restrictions that put more than 900 million people housebound for more than two months since the lockdown of Wuhan on 23 January 2020 when other provinces in China followed suit. Here, we assess the impact of China's massive lockdowns and travel restrictions reflected by the changes in mobility patterns before and during the lockdown period. We quantify the synchrony of mobility patterns across provinces and within provinces. Using these mobility data, we calibrate movement flow between provinces in combination with an epidemiological compartment model to quantify the effectiveness of lockdowns and reductions in disease transmission. Our analysis demonstrates that the onset and phase of local community transmission in other provinces depends on the cumulative population outflow received from the epicenter Hubei. As such, infections can propagate further into other interconnected places both near and far, thereby necessitating synchronous lockdowns. Moreover, our data-driven modeling analysis shows that lockdowns and consequently reduced mobility lag a certain time to elicit an actual impact on slowing down the spreading and ultimately putting the epidemic under check. In spite of the vastly heterogeneous demographics and epidemiological characteristics across China, mobility data shows that massive travel restrictions have been applied consistently via a top-down approach along with high levels of compliance from the bottom up.

preprint2022arXiv

Outlearning Extortioners by Fair-minded Unbending Strategies

Recent theory shows that extortioners taking advantage of the zero-determinant (ZD) strategy can unilaterally claim an unfair share of the payoffs in the Iterated Prisoner's Dilemma. It is thus suggested that against a fixed extortioner, any adapting co-player should be subdued with full cooperation as their best response. In contrast, recent experiments demonstrate that human players often choose not to accede to extortion out of concern for fairness, actually causing extortioners to suffer more loss than themselves. In light of this, here we reveal fair-minded strategies that are unbending to extortion such that any payoff-maximizing extortioner ultimately will concede in their own interest by offering a fair split in head-to-head matches. We find and characterize multiple general classes of such unbending strategies, including generous zero-determinant strategies and Win-Stay, Lose-Shift as particular examples. When against fixed unbending players, extortioners are forced with consequentially increasing losses whenever intending to demand more unfair share. Our analysis also pivots to the importance of payoff structure in determining the superiority of zero-determinant strategies and in particular their extortion ability. We show that an extortionate ZD player can be even outperformed by, for example, Win-Stay Lose-Shift, if the total payoff of unilateral cooperation is smaller than that of mutual defection. Unbending strategies can be used to outlearn evolutionary extortioners and catalyze the evolution of Tit-for-Tat-like strategies out of ZD players. Our work has implications for promoting fairness and resisting extortion so as to uphold a just and cooperative society.

preprint2022arXiv

Spatial Games of Fake News

To curb the spread of fake news on social media platforms, recent studies have considered an online crowdsourcing fact-checking approach as one possible intervention method to reduce misinformation. However, it remains unclear under what conditions crowdsourcing fact-checking efforts deter the spread of misinformation. To address this issue, we model such distributed fact-checking as `peer policing' that will reduce the perceived payoff to share or disseminate false information (fake news) and also reward the spread of trustworthy information (real news). By simulating our model on synthetic square lattices and small-world networks, we show that the presence of social network structure enables fake news spreaders to be self-organized into echo chambers, thereby providing a boost to the efficacy of fake news and thus its resistance to fact-checking efforts. Additionally, to study our model in a more realistic setting, we utilize a Twitter network dataset and study the effectiveness of deliberately choosing specific individuals to be fact-checkers. We find that targeted fact-checking efforts can be highly effective, seeing the same level of success with as little as a fifth of the number of fact-checkers, but it depends on the structure of the network in question. In the limit of weak selection, we obtain closed-form analytical conditions for critical threshold of crowdsourced fact-checking in terms of the payoff values in our fact-checker/fake news game. Our work has practical implications for developing model-based mitigation strategies for controlling the spread of misinformation that interferes with the political discourse.

preprint2022arXiv

The Geometry of Zero-Determinant Strategies

The advent of Zero-Determinant (ZD) strategies has reshaped the study of reciprocity and cooperation in the iterated Prisoner's Dilemma games. The ramification of ZD strategies has been demonstrated through their ability to unilaterally enforce a linear relationship between their own average payoff and that of their co-player. Common practice conveniently represents this relationship by a straight line in the parametric plot of pairwise payoffs. Yet little attention has been paid to studying the actual geometry of the strategy space of all admissible ZD strategies. Here, our work offers intuitive geometric relationships between different classes of ZD strategies as well as nontrivial geometric interpretations of their specific parameterizations. Adaptive dynamics of ZD strategies further reveals the unforeseen connection between general ZD strategies and the so-called equalizers that can set any co-player's payoff to a fixed value. We show that the class of equalizers forming a hyperplane is the critical equilibrium manifold, only part of which is stable. The same hyperplane is also a separatrix of the cooperation-enhancing region where the optimum response is to increase cooperation for each of the four payoff outcomes. Our results shed light on the simple but elegant geometry of ZD strategies that is previously overlooked.

preprint2022arXiv

The Point of No Return: Evolution of Excess Mutation Rate is Possible Even for Simple Mutation Models

Under constant selection, each trait has a fixed fitness, and small mutation rates allow populations to efficiently exploit the optimal trait. Therefore it is reasonable to expect mutation rates will evolve downwards. However, we find this need not be the case, examining several models of mutation. While upwards evolution of mutation rate has been found with frequency or time dependent fitness, we demonstrate its possibility in a much simpler context. This work uses adaptive dynamics to study the evolution of mutation rate, and the replicator-mutator equation to model trait evolution. Our approach differs from previous studies by considering a wide variety of methods to represent mutation. We use a finite string approach inspired by genetics, as well as a model of local mutation on a discretization of the unit intervals, handling mutation beyond the endpoints in three ways. The main contribution of this work is a demonstration that the evolution of mutation rate can be significantly more complicated than what is usually expected in relatively simple models.

preprint2020arXiv

Asymmetric Partisan Voter Turnout Games

Since Downs proposed that the act of voting is irrational in 1957, myriad models have been proposed to explain voting and account for observed turnout patterns. We propose a model in which partisans consider both the instrumental and expressive benefits of their vote when deciding whether or not to abstain in an election, introducing an asymmetry that most other models do not consider. Allowing learning processes within our electorate, we analyze what turnout states are rationalizable under various conditions. Our model predicts comparative statics that are consistent with voter behavior. Furthermore, relaxing some of our preliminary assumptions eliminates some of the discrepancies between our model and empirical voter behavior.

preprint2020arXiv

Eco-evolutionary dynamics with environmental feedback: cooperation in a changing world

Eco-evolutionary game dynamics which characterizes the mutual interactions and the coupled evolutions of strategies and environments has been of growing interests in very recent years. Since such feedback loops widely exist in a range of coevolutionary systems, such as microbial systems, social-ecological system and psychological-economic system, recent modeling frameworks that unveil the oscillating dynamics of social dilemmas have great potential for practical applications. In this perspective article, we overview the latest progress of evolutionary game theory in this direction. We describe both mathematical methods and interdisciplinary applications across different fields. The ideas worthy of further consideration are discussed in prospects, with the central role of promoting cooperations in a changing world.

preprint2020arXiv

Elitism in Mathematics and Inequality

The Fields Medal, often referred as the Nobel Prize of mathematics, is awarded to no more than four mathematician under the age of 40, every four years. In recent years, its conferral has come under scrutiny of math historians, for rewarding the existing elite rather than its original goal of elevating mathematicians from under-represented communities. Prior studies of elitism focus on citational practices and sub-fields; the structural forces that prevent equitable access remain unclear. Here we show the flow of elite mathematicians between countries and lingo-ethnic identity, using network analysis and natural language processing on 240,000 mathematicians and their advisor-advisee relationships. We found that the Fields Medal helped integrate Japan after WWII, through analysis of the elite circle formed around Fields Medalists. Arabic, African, and East Asian identities remain under-represented at the elite level. Through analysis of inflow and outflow, we rebuts the myth that minority communities create their own barriers to entry. Our results demonstrate concerted efforts by international academic committees, such as prize-giving, are a powerful force to give equal access. We anticipate our methodology of academic genealogical analysis can serve as a useful diagnostic for equality within academic fields.

preprint2020arXiv

Evolutionary Kuramoto Dynamics

Common models of synchronizable oscillatory systems consist of a collection of coupled oscillators governed by a collection of differential equations. The ubiquitous Kuramoto models rely on an {\em a priori} fixed connectivity pattern facilitates mutual communication and influence between oscillators. In biological synchronizable systems, like the mammalian suprachaismatic nucleus, enabling communication comes at a cost -- the organism expends energy creating and maintaining the system -- linking their development to evolutionary selection. Here, we introduce and analyze a new evolutionary game theoretic framework modeling the behavior and evolution of systems of coupled oscillators. Each oscillator in our model is characterized by a pair of dynamic behavioral traits: an oscillatory phase and whether they connect and communicate to other oscillators or not. Evolution of the system occurs along these dimensions, allowing oscillators to change their phases and/or their communication strategies. We measure success of mutations by comparing the benefit of phase synchronization to the organism balanced against the cost of creating and maintaining connections between the oscillators. Despite such a simple setup, this system exhibits a wealth of nontrivial behaviors, mimicking different classical games -- the Prisoner's Dilemma, the snowdrift game, and coordination games -- as the landscape of the oscillators changes over time. Despite such complexity, we find a surprisingly simple characterization of synchronization through connectivity and communication: if the benefit of synchronization $B(0)$ is greater than twice the cost $c$, $B(0) > 2c$, the organism will evolve towards complete communication and phase synchronization. Taken together, our model demonstrates possible evolutionary constraints on both the existence of a synchronized oscillatory system and its overall connectivity.

preprint2020arXiv

Public discourse and social network echo chambers driven by socio-cognitive biases

In recent years, social media has increasingly become an important platform for political campaigns, especially elections. It remains elusive how exactly public discourse is driven by the intricate interplay between individual socio-cognitive biases, dueling campaign efforts, and social media platforms. We examine this complex socio-political process by integrating observed retweet networks from the 2016 political networks with an agent-based model of political opinion formation and network structure. Here we show that the range of political viewpoints individuals are willing to consider is a key determinant in the formation of polarized networks and the emergence of echo chambers. We also find that winning majority support in public discourse is determined by both the effort exerted by campaigns and the relative ideological positioning of opposing campaigns. Our results demonstrate how public discourse and political polarization can be modeled as an interactive process of shifting individual opinions, evolving social networks, and political campaigns.

preprint2020arXiv

Understanding Gambling Behavior and Risk Attitudes Using Cryptocurrency-based Casino Blockchain Data

The statistical concept of Gambler's Ruin suggests that gambling has a large amount of risk. Nevertheless, gambling at casinos and gambling on the Internet are both hugely popular activities. In recent years, both prospect theory and lab-controlled experiments have been used to improve our understanding of risk attitudes associated with gambling. Despite theoretical progress, collecting real-life gambling data, which is essential to validate predictions and experimental findings, remains a challenge. To address this issue, we collect publicly available betting data from a \emph{DApp} (decentralized application) on the Ethereum Blockchain, which instantly publishes the outcome of every single bet (consisting of each bet's timestamp, wager, probability of winning, userID, and profit). This online casino is a simple dice game that allows gamblers to tune their own winning probabilities. Thus the dataset is well suited for studying gambling strategies and the complex dynamic of risk attitudes involved in betting decisions. We analyze the dataset through the lens of current probability-theoretic models and discover empirical examples of gambling systems. Our results shed light on understanding the role of risk preferences in human financial behavior and decision-makings beyond gambling.

preprint2019arXiv

Mathematically Modeling Spillover Dynamics of Emerging Zoonoses with Intermediate Hosts

The World Health Organization describes zoonotic diseases as a major pandemic threat, and modeling the behavior of such diseases is a key component of their control. Many emerging zoonoses, such as SARS, Nipah, and Hendra, mutated from their wild type while circulating in an intermediate host population, usually a domestic species, to become more transmissible among humans, and moreover, this transmission route will only become more likely as agriculture and trade intensifies around the world. Passage through an intermediate host enables many otherwise rare diseases to become better adapted to humans, and so understanding this process with mathematical epidemiological models is necessary to prevent epidemics of emerging zoonoses, guide policy interventions in public health, and predict the behavior of an epidemic. In this paper, we account for spillovers of a zoonotic disease mutating in an intermediate host by means of modeling transmission dynamics within and between three host species, namely, wild reservoir, intermediate domestic animals, and humans. We calculate the basic reproductive number of the pathogen, present critical conditions for the emergence dynamics of zoonosis, and perform stability analysis of admissible disease equilibria. Our analytical results agree well with long-term simulations of the system. We find that in the presence of biologically realistic interspecies transmission parameters, a zoonotic disease can establish itself in humans even if it fails to persist in its reservoir and intermediate host species. Our model and results can be used to understand the dynamic behavior of any zoonosis with intermediate hosts and assist efforts to protect public health.

preprint2019arXiv

Steering Eco-Evolutionary Games Dynamics with Manifold Control

Feedback loops between population dynamics of individuals and their ecological environment are ubiquitously found in nature, and have shown profound effects on the resulting eco-evolutionary dynamics. Incorporating linear environmental feedback law into replicator dynamics of two-player games, recent theoretical studies shed light on understanding the oscillating dynamics of social dilemma. However, detailed effects of more general nonlinear feedback loops in multi-player games, which is more common especially in microbial systems, remain unclear. Here, we focus on ecological public goods games with environmental feedbacks driven by nonlinear selection gradient. Unlike previous models, multiple segments of stable and unstable equilibrium manifolds can emerge from the population dynamical systems. We find that a larger relative asymmetrical feedback speed for group interactions centered on cooperators not only accelerates the convergence of stable manifolds, but also increases the attraction basin of these stable manifolds. Furthermore, our work offers an innovative manifold control approach: by designing appropriate switching control laws, we are able to steer the eco-evolutionary dynamics to any desired population states. Our mathematical framework is an important generalization and complement to coevolutionary game dynamics, and also fills the theoretical gap in guiding the widespread problem of population state control in microbial experiments.