Researcher profile

J. Doyne Farmer

J. Doyne Farmer contributes to research discovery and scholarly infrastructure.

Researcher. Affiliation not imported. Open to collaborate.

Trust snapshot

Quick read

Trust 21 - Emerging. Verification L1. Unclaimed author.
13 works
0 followers
15 topics
4 close collaborators

Published work

13 published items

Preprint, 2022, arXiv

Calibrating Agent-based Models to Microdata with Graph Neural Networks

Calibrating agent-based models (ABMs) to data is among the most fundamental requirements to ensure the model fulfils its desired purpose. In recent years, simulation-based inference methods have emerged as powerful tools for performing this task when the model likelihood function is intractable, as is often the case for ABMs. In some real-world use cases of ABMs, both the observed data and the ABM output consist of the agents' states and their interactions over time. In such cases, there is a tension between the desire to make full use of the rich information content of such granular data on the one hand, and the need to reduce the dimensionality of the data to prevent difficulties associated with high-dimensional learning tasks on the other. A possible resolution is to construct lower-dimensional time-series through the use of summary statistics describing the macrostate of the system at each time point. However, a poor choice of summary statistics can result in an unacceptable loss of information from the original dataset, dramatically reducing the quality of the resulting calibration. In this work, we instead propose to learn parameter posteriors associated with granular microdata directly using temporal graph neural networks. We will demonstrate that such an approach offers highly compelling inductive biases for Bayesian inference using the raw ABM microstates as output.

Preprint, 2022, arXiv

Estimating initial conditions for dynamical systems with incomplete information

In this paper we study the problem of inferring the initial conditions of a dynamical system under incomplete information. Studying several model systems, we infer the latent microstates that best reproduce an observed time series when the observations are sparse, noisy and aggregated under a (possibly) nonlinear observation operator. This is done by minimizing the least-squares distance between the observed time series and a model-simulated time series using gradient-based methods. We validate this method for the Lorenz and Mackey-Glass systems by making out-of-sample predictions. Finally, we analyze the predictive power of our method as a function of the number of observations available. We find a critical transition for the Mackey-Glass system, beyond which it can be initialized with arbitrary precision.
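The least-squares idea in this abstract can be sketched on a toy system. The example below is an illustration under stated assumptions, not the paper's method: it swaps in the logistic map for the Lorenz and Mackey-Glass systems, uses a finite-difference gradient in place of exact gradients, and marks missing observations with `None`. All function names are hypothetical.

```python
def simulate(x0, steps, r=3.7):
    """Iterate the logistic map x -> r*x*(1-x) (a toy stand-in system)."""
    xs = [x0]
    for _ in range(steps):
        xs.append(r * xs[-1] * (1.0 - xs[-1]))
    return xs


def loss(x0, observed, r=3.7):
    """Sum of squared errors over the observed times; None means unobserved."""
    sim = simulate(x0, len(observed) - 1, r)
    total = 0.0
    for s, o in zip(sim, observed):
        if o is not None:
            d = s - o
            total += d * d
    return total


def fit_initial_condition(observed, guess, iters=200, eps=1e-6, r=3.7):
    """Gradient descent on x0 with a finite-difference gradient.

    A step is accepted only if it lowers the loss, so the returned loss
    is never larger than the loss at the starting guess.
    """
    x, step = guess, 0.01
    best = loss(x, observed, r)
    for _ in range(iters):
        grad = (loss(x + eps, observed, r) - loss(x - eps, observed, r)) / (2 * eps)
        # Keep the candidate inside [0, 1], where the map stays bounded.
        candidate = min(1.0, max(0.0, x - step * grad))
        cand_loss = loss(candidate, observed, r)
        if cand_loss < best:
            x, best = candidate, cand_loss
            step *= 1.2   # accepted: try a slightly bolder step next
        else:
            step *= 0.5   # rejected: shrink the step
    return x, best


# Sparse observations of a trajectory started at x0 = 0.3.
truth = simulate(0.3, 5)
observed = [truth[0], None, truth[2], None, None, truth[5]]
estimate, final_loss = fit_initial_condition(observed, guess=0.32)
```

Because the loss surface of a chaotic map is rugged, a sketch like this is only expected to work from a starting guess close to the truth and over short horizons.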

Preprint, 2022, arXiv

Measuring productivity dispersion: a parametric approach using the Lévy alpha-stable distribution

It is well known that value added per worker is extremely heterogeneous among firms, but relatively little has been done to characterize this heterogeneity more precisely. Here we show that the distribution of value added per worker exhibits heavy tails, a very large support, and consistently features a proportion of negative values, which prevents log transformation. We propose to model the distribution of value added per worker using the four-parameter Lévy stable distribution, a natural candidate deriving from the Generalised Central Limit Theorem, and we show that it is a better fit than key alternatives. Fitting a distribution allows us to capture dispersion through the tail exponent and scale parameters separately. We show that these parametric measures of dispersion are at least as useful as interquantile ratios, through case studies on the evolution of dispersion in recent years and the correlation between dispersion and intangible capital intensity.
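To see why a stable law produces both heavy tails and negative values, the sketch below draws samples with the Chambers-Mallows-Stuck construction. It is a simplification: it covers only the symmetric case (skewness zero), whereas the abstract refers to the full four-parameter family, and the function names and the default scale and location are assumptions for illustration.

```python
import math
import random


def symmetric_stable(alpha, v, w):
    """Map v in (-pi/2, pi/2) and w > 0 to a standard symmetric stable value.

    Chambers-Mallows-Stuck formula with skewness beta = 0.
    alpha = 1 reduces to tan(v), the Cauchy case.
    """
    a = math.sin(alpha * v) / math.cos(v) ** (1.0 / alpha)
    b = (math.cos((1.0 - alpha) * v) / w) ** ((1.0 - alpha) / alpha)
    return a * b


def sample_symmetric_stable(alpha, n, rng, scale=1.0, location=0.0):
    """Draw n samples; alpha in (0, 2] is the tail exponent."""
    out = []
    for _ in range(n):
        v = rng.uniform(-math.pi / 2, math.pi / 2)  # uniform angle
        w = rng.expovariate(1.0)                    # unit-mean exponential
        out.append(location + scale * symmetric_stable(alpha, v, w))
    return out


rng = random.Random(0)
draws = sample_symmetric_stable(alpha=1.5, n=1000, rng=rng)
share_negative = sum(x < 0 for x in draws) / len(draws)
```

Samples centred near zero include many negative values, so a log transformation is not available, which is the practical obstacle the abstract mentions.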

Preprint, 2021, arXiv

How production networks amplify economic growth

Technological improvement is the most important cause of long-term economic growth. We study the effects of technology improvement in the setting of a production network, in which each producer buys input goods and converts them to other goods, selling the product to households or other producers. We show how this network amplifies the effects of technological improvements as they propagate along chains of production. Longer production chains for an industry bias it towards faster price reduction, and longer production chains for a country bias it towards faster GDP growth. These predictions are in good agreement with data and improve with the passage of time, demonstrating a key influence of production chains in price change and output growth over the long term.
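One way to make "length of production chain" concrete is sketched below. It assumes, as an illustration that the abstract does not spell out, that chain length is measured by an output-multiplier-style quantity L_i = 1 + sum_j a_ij L_j, where a_ij is the share of industry i's spending that goes to inputs from industry j. The matrix values and function name are hypothetical.

```python
def chain_lengths(input_shares, iters=500):
    """Solve L_i = 1 + sum_j a_ij * L_j by fixed-point iteration.

    input_shares[i][j] is the (assumed) share of industry i's spending
    on inputs bought from industry j. The iteration converges when every
    row sums to less than 1, i.e. some spending goes to non-produced
    inputs such as labor.
    """
    n = len(input_shares)
    lengths = [1.0] * n
    for _ in range(iters):
        lengths = [
            1.0 + sum(input_shares[i][j] * lengths[j] for j in range(n))
            for i in range(n)
        ]
    return lengths


# A three-stage chain: industry 0 buys from 1, which buys from 2.
shares = [
    [0.0, 0.5, 0.0],
    [0.0, 0.0, 0.5],
    [0.0, 0.0, 0.0],
]
lengths = chain_lengths(shares)  # [1.75, 1.5, 1.0]
```

Under this measure the most downstream industry has the longest chain, so an improvement anywhere upstream reaches its price through more steps.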

Preprint, 2021, arXiv

In and out of lockdown: Propagation of supply and demand shocks in a dynamic input-output model

Economic shocks due to Covid-19 were exceptional in their severity, suddenness and heterogeneity across industries. To study the upstream and downstream propagation of these industry-specific demand and supply shocks, we build a dynamic input-output model inspired by previous work on the economic response to natural disasters. We argue that standard production functions, at least in their most parsimonious parametrizations, are not adequate to model input substitutability in the context of Covid-19 shocks. We use a survey of industry analysts to evaluate, for each industry, which inputs were absolutely necessary for production over a short time period. We calibrate our model on the UK economy and study the economic effects of the lockdown that was imposed at the end of March and gradually released in May. Looking back at predictions that we released in May, we show that the model predicted aggregate dynamics very well, and sectoral dynamics to a large extent. We discuss the relative extent to which the model's dynamics and performance were due to the choice of the production function or the choice of an exogenous shock scenario. To further explore the behavior of the model, we use simpler scenarios with only demand or supply shocks, and find that popular metrics used to predict a priori the impact of shocks, such as output multipliers, are only mildly useful.
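The role of "absolutely necessary" inputs can be illustrated with a small sketch. It assumes a Leontief-style rule restricted to critical inputs: output is capped by capacity, by demand, and by the stock of each critical input, while non-critical inputs never bind. This is a simplified reading of the abstract, and every name and number below is hypothetical.

```python
def feasible_output(capacity, demand, inventories, input_per_unit, critical):
    """Output allowed by capacity, demand and stocks of critical inputs.

    inventories[name]    : stock of input `name` on hand
    input_per_unit[name] : amount of `name` needed per unit of output
    critical             : names of inputs that production cannot do without
    """
    limits = [capacity, demand]
    for name in critical:
        need = input_per_unit[name]
        if need > 0:
            limits.append(inventories[name] / need)
    return max(0.0, min(limits))


inventories = {"steel": 40.0, "catering": 0.0}
input_per_unit = {"steel": 0.5, "catering": 0.1}

# Only steel is critical: the empty catering stock does not stop production.
partial = feasible_output(100.0, 90.0, inventories, input_per_unit, {"steel"})

# Treating every input as critical (a pure Leontief rule) halts production.
strict = feasible_output(100.0, 90.0, inventories, input_per_unit,
                         {"steel", "catering"})
```

Here `partial` is 80.0 and `strict` is 0.0, which shows in miniature how strongly results can depend on the choice of production function.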

Preprint, 2020, arXiv

Automation and occupational mobility: A data-driven network model

The potential impact of automation on the labor market is a topic that has generated significant interest and concern amongst scholars, policymakers, and the broader public. A number of studies have estimated occupation-specific risk profiles by examining the automatability of associated skills and tasks. However, relatively little work has sought to take a more holistic view on the process of labor reallocation and how employment prospects are impacted as displaced workers transition into new jobs. In this paper, we develop a new data-driven model to analyze how workers move through an empirically derived occupational mobility network in response to automation scenarios which increase labor demand for some occupations and decrease it for others. At the macro level, our model reproduces a key stylized fact in the labor market known as the Beveridge curve and provides new insights for explaining the curve's counter-clockwise cyclicality. At the micro level, our model provides occupation-specific estimates of changes in short and long-term unemployment corresponding to a given automation shock. We find that the network structure plays an important role in determining unemployment levels, with occupations in particular areas of the network having very few job transition opportunities. Such insights could be fruitfully applied to help design more efficient and effective policies aimed at helping workers adapt to the changing nature of the labor market.

Preprint, 2020, arXiv

Production networks and epidemic spreading: How to restart the UK economy?

We analyse the economics and epidemiology of different scenarios for a phased restart of the UK economy. Our economic model is designed to address the unique features of the COVID-19 pandemic. Social distancing measures affect both supply and demand, and input-output constraints play a key role in restricting economic output. Standard models for production functions are not adequate to model the short-term effects of lockdown. A survey of industry analysts conducted by IHS Markit allows us to evaluate which inputs for each industry are absolutely necessary for production over a two-month period. Our model also includes inventory dynamics and feedback between unemployment and consumption. We demonstrate that economic outcomes are very sensitive to the choice of production function, show how supply constraints cause strong network effects, and find some counter-intuitive effects, such as that reopening only a few industries can actually lower aggregate output. Occupation-specific data and contact surveys allow us to estimate how different industries affect the transmission rate of the disease. We investigate six different re-opening scenarios, presenting our best estimates for the increase in R0 and the increase in GDP. Our results suggest that there is a reasonable compromise that yields a relatively small increase in R0 and delivers a substantial boost in economic output. This corresponds to a situation in which all non-consumer-facing industries reopen, schools are open only for workers who need childcare, and everyone who can work from home continues to work from home.

Preprint, 2020, arXiv

Statistical analysis and stochastic interest rate modelling for valuing the future with implications in climate change mitigation

High future discount rates favor inaction on present spending, while lower rates call for more immediate political action. One approach to this key issue in the global economy is to take historical time series for nominal interest rates and inflation, construct real interest rates from them, and finally obtain the resulting discount rate according to a specific stochastic model. Extended periods of negative real interest rates, in which inflation dominates over nominal rates, are commonly observed, occurring in many epochs and in all countries. This feature leads us to choose a well-known model in statistical physics, the Ornstein-Uhlenbeck model, as a basic dynamical tool in which real interest rates randomly fluctuate and can become negative, even if they tend to revert to a positive mean value. By covering 14 countries over hundreds of years we suggest different scenarios and include an error analysis in order to consider the impact of statistical uncertainty on our results. We find that only four of the countries have positive long-run discount rates while the other ten have negative rates. Even if one rejects the countries where hyperinflation has occurred, our results support the need to consider low discount rates. The results provided by these 14 countries significantly increase the priority of global actions such as climate change mitigation. We finally extend the analysis by first allowing for fluctuations of the mean level in the Ornstein-Uhlenbeck model and secondly by considering modified versions of the Feller and lognormal models. In both cases, results remain basically unchanged, thus demonstrating the robustness of the results presented.
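A minimal sketch of the Ornstein-Uhlenbeck setup may help show how a positive mean rate can still give a negative long-run discount rate. It assumes the textbook form dr = -alpha (r - m) dt + k dW and the standard result that the long-run rate is m - k^2 / (2 alpha^2), which follows because the time integral of r is Gaussian. The symbols, parameter values and function names are assumptions, not taken from the paper.

```python
import math
import random


def simulate_ou(r0, mean, alpha, k, dt, steps, rng):
    """Euler-Maruyama path of dr = -alpha*(r - mean)*dt + k*dW."""
    path = [r0]
    for _ in range(steps):
        r = path[-1]
        shock = k * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        path.append(r - alpha * (r - mean) * dt + shock)
    return path


def long_run_discount_rate(mean, alpha, k):
    """Asymptotic rate of decay of E[exp(-integral of r)] for the OU model."""
    return mean - k ** 2 / (2.0 * alpha ** 2)


# Illustrative values: 2% mean real rate, slow reversion, sizeable noise.
rng = random.Random(1)
path = simulate_ou(r0=0.02, mean=0.02, alpha=0.1, k=0.05, dt=1.0,
                   steps=200, rng=rng)
rate = long_run_discount_rate(mean=0.02, alpha=0.1, k=0.05)  # about -0.105
```

With these illustrative values the volatility term outweighs the positive mean, so the long-run rate is negative even though the real rate reverts to 2%.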

Preprint, 2020, arXiv

Technological interdependencies predict innovation dynamics

We propose a simple model where the innovation rate of a technological domain depends on the innovation rate of the technological domains it relies on. Using data on US patents from 1836 to 2017, we make out-of-sample predictions and find that the predictability of innovation rates can be boosted substantially when network effects are taken into account. In the case where the future innovation rates of a technology's neighborhood are known, the average predictability gain is 28% compared to simpler time series models that do not incorporate network effects. Even when nothing is known about the future, we find positive average predictability gains of 20%. The results have important policy implications, suggesting that the effective support of a given technology must take into account the technological ecosystem surrounding the targeted technology.

Preprint, 2010, arXiv

An empirical study of the tails of mutual fund size

The mutual fund industry manages about a quarter of the assets in the U.S. stock market and thus plays an important role in the U.S. economy. The question of how much control is concentrated in the hands of the largest players is best quantitatively discussed in terms of the tail behavior of the mutual fund size distribution. We study the distribution empirically and show that the tail is much better described by a log-normal than a power law, indicating less concentration than, for example, personal income. The results are highly statistically significant and are consistent across fifteen years. This contradicts a recent theory concerning the origin of the power law tails of the trading volume distribution. Based on the analysis in a companion paper, the log-normality is to be expected, and indicates that the distribution of mutual funds remains perpetually out of equilibrium.

Preprint, 2010, arXiv

Leverage Causes Fat Tails and Clustered Volatility

We build a simple model of leveraged asset purchases with margin calls. Investment funds use what is perhaps the most basic financial strategy, called "value investing", i.e. systematically attempting to buy underpriced assets. When funds do not borrow, the price fluctuations of the asset are normally distributed and uncorrelated across time. All this changes when the funds are allowed to leverage, i.e. borrow from a bank, to purchase more assets than their wealth would otherwise permit. During good times competition drives investors to funds that use more leverage, because they have higher profits. As leverage increases price fluctuations become heavy tailed and display clustered volatility, similar to what is observed in real markets. Previous explanations of fat tails and clustered volatility depended on "irrational behavior", such as trend following. Here instead this comes from the fact that leverage limits cause funds to sell into a falling market: A prudent bank makes itself locally safer by putting a limit to leverage, so when a fund exceeds its leverage limit, it must partially repay its loan by selling the asset. Unfortunately this sometimes happens to all the funds simultaneously when the price is already falling. The resulting nonlinear feedback amplifies large downward price movements. At the extreme this causes crashes, but the effect is seen at every time scale, producing a power law of price disturbances. A standard (supposedly more sophisticated) risk control policy in which individual banks base leverage limits on volatility causes leverage to rise during periods of low volatility, and to contract more quickly when volatility gets high, making these extreme fluctuations even worse.
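The margin-call mechanism can be followed with simple arithmetic. The sketch below assumes leverage is defined as asset value divided by the fund's own wealth, that sale proceeds go entirely to repaying the loan, and that the sale itself does not move the price. The function name and numbers are hypothetical.

```python
def margin_call_sale(asset_value, wealth, max_leverage):
    """Value of assets a fund must sell to get back under its leverage cap.

    Leverage is asset_value / wealth. Selling assets to repay debt leaves
    wealth unchanged (ignoring price impact), so the post-sale asset value
    must equal max_leverage * wealth.
    """
    if wealth <= 0:
        return asset_value  # insolvent fund: everything is liquidated
    return max(0.0, asset_value - max_leverage * wealth)


# A fund holds 100 of assets, financed by 20 of wealth and an 80 loan:
# leverage is 5, exactly at the bank's limit, so nothing must be sold.
before = margin_call_sale(asset_value=100.0, wealth=20.0, max_leverage=5.0)

# The price falls 5%: assets are worth 95, wealth drops to 15, and
# leverage jumps to about 6.3. The fund must sell 20 to return to 5.
after = margin_call_sale(asset_value=95.0, wealth=15.0, max_leverage=5.0)
```

In this example a 5% price fall forces the fund to sell about 21% of its holdings. When many funds face the same constraint at once, that forced selling is the source of the feedback the abstract describes.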

Preprint, 2010, arXiv

The cause of universality in growth fluctuations

Phenomena as diverse as breeding bird populations, the size of U.S. firms, money invested in mutual funds, the GDP of individual countries and the scientific output of universities all show unusual but remarkably similar growth fluctuations. The fluctuations display characteristic features, including double exponential scaling in the body of the distribution and power law scaling of the standard deviation as a function of size. To explain this we propose a remarkably simple additive replication model: At each step each individual is replaced by a new number of individuals drawn from the same replication distribution. If the replication distribution is sufficiently heavy tailed then the growth fluctuations are Lévy distributed. We analyze the data from bird populations, firms, and mutual funds and show that our predictions match the data well, in several respects: Our theory results in a much better collapse of the individual distributions onto a single curve and also correctly predicts the scaling of the standard deviation with size. To illustrate how this can emerge from collective microscopic dynamics we propose a model based on stochastic influence dynamics over a scale-free contact network and show that it produces results similar to those observed. We also extend the model to deal with correlations between individual elements. Our main conclusion is that the universality of growth fluctuations is driven by the additivity of growth processes and the action of the generalized central limit theorem.
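The additive replication step described in this abstract is short enough to write down directly. The sketch below is a minimal reading of that one sentence: the choice of an integer-truncated Pareto draw as the heavy-tailed replication distribution, and all names, are assumptions for illustration.

```python
import random


def replicate(size, draw):
    """Replace each of `size` individuals by draw() new individuals."""
    return sum(draw() for _ in range(size))


def growth_path(n0, steps, draw):
    """Population sizes over time under repeated additive replication."""
    sizes = [n0]
    for _ in range(steps):
        sizes.append(replicate(sizes[-1], draw))
    return sizes


# Deterministic check: if every individual is replaced by exactly two,
# the population doubles at each step.
doubling = growth_path(3, 4, lambda: 2)  # [3, 6, 12, 24, 48]

# A heavy-tailed replication distribution (assumed here: truncated Pareto
# with shape 1.5, which has infinite variance).
rng = random.Random(42)
heavy = growth_path(50, 3, lambda: int(rng.paretovariate(1.5)))
```

Because the new size is a sum of independent draws, the generalized central limit theorem governs its fluctuations, which is the additivity argument behind the abstract's main conclusion.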

Preprint, 2010, arXiv

Tick size and price diffusion

A tick size is the smallest increment of a security price. It is clear that at the shortest time scale on which individual orders are placed the tick size has a major role which affects where limit orders can be placed, the bid-ask spread, etc. This is the realm of market microstructure and there is a vast literature on the role of tick size on market microstructure. However, tick size can also affect price properties at longer time scales, and relatively less is known about the effect of tick size on the statistical properties of prices. The present paper is divided into two parts. In the first we review the effect of tick size change on the market microstructure and the diffusion properties of prices. The second part presents original results obtained by investigating the tick size changes occurring at the New York Stock Exchange (NYSE). We show that tick size change has three effects on price diffusion. First, as already shown in the literature, tick size affects price return distribution at an aggregate time scale. Second, reducing the tick size typically leads to an increase of volatility clustering. We give a possible mechanistic explanation for this effect, but clearly more investigation is needed to understand the origin of this relation. Third, we explicitly show that the ability of the subordination hypothesis to explain fat tails of returns and volatility clustering is strongly dependent on tick size. While for large tick sizes the subordination hypothesis has significant explanatory power, for small tick sizes we show that subordination is not the main driver of these two important stylized facts of financial markets.
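The constraint that a tick size places on quoted prices can be shown with a short sketch. It assumes prices are simply rounded to the nearest multiple of the tick, with halves rounded up; the function name and the example tick values are hypothetical and are not taken from the paper's data.

```python
from decimal import Decimal, ROUND_HALF_UP


def snap_to_tick(price, tick):
    """Round a price to the nearest multiple of the tick size.

    Prices and ticks are passed as strings so Decimal keeps them exact.
    """
    p, t = Decimal(price), Decimal(tick)
    ticks = (p / t).quantize(Decimal("1"), rounding=ROUND_HALF_UP)
    return ticks * t


# The same desired price lands on different grid points as the tick shrinks.
coarse = snap_to_tick("10.07", "0.125")  # Decimal('10.125')
fine = snap_to_tick("10.07", "0.01")     # Decimal('10.07')
```

With a coarse grid the quoted price can sit several cents from the desired one, and the bid-ask spread cannot be narrower than one tick.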