Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
38works
0followers
39topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

38 published item(s)

preprint2026arXiv

A disease-spread model on hypergraphs with distinct droplet and aerosol transmission modes

We examine the spread of an infectious disease, such as one that is caused by a respiratory virus, with two distinct modes of transmission. To do this, we consider a susceptible--infected--susceptible (SIS) disease on a hypergraph, which allows us to incorporate the effects of both dyadic (i.e., pairwise) and polyadic (i.e., group) interactions on disease propagation. This disease can spread either via large droplets through direct social contacts, which we associate with edges (i.e., hyperedges of size 2), or via infected aerosols in the environment through hyperedges of size at least 3 (i.e., polyadic interactions). We derive mean-field approximations of our model for two types of hypergraphs, and we obtain threshold conditions that characterize whether the disease dies out or becomes endemic. Additionally, we numerically simulate our model and a mean-field approximation of it to examine the impact of various factors, such as hyperedge size (when the size is uniform), hyperedge-size distribution (when the sizes are nonuniform), and hyperedge-recovery rates (when the sizes are nonuniform) on the disease dynamics.

preprint2026arXiv

Graph energy as a measure of community detectability in networks

A key challenge in network science is the detection of communities, which are sets of nodes in a network that are densely connected internally but sparsely connected to the rest of the network. A fundamental result in community detection is the existence of a nontrivial threshold for community detectability on sparse graphs that are generated by the planted partition model (PPM). Below this so-called ``detectability limit'', no community-detection method can perform better than random chance. Spectral methods for community detection fail before this detectability limit because the eigenvalues corresponding to the eigenvectors that are relevant for community detection can be absorbed by the bulk of the spectrum. One can bypass the detectability problem by using special matrices, like the non-backtracking matrix, but this requires one to consider higher-dimensional matrices. In this paper, we show that the difference in graph energy between a PPM and an Erdős--Rényi (ER) network has a distinct transition at the detectability threshold even for the adjacency matrices of the underlying networks. The graph energy is based on the full spectrum of an adjacency matrix, so our result suggests that standard graph matrices still allow one to separate the parameter regions with detectable and undetectable communities.

preprint2022arXiv

A Majority-Vote Model On Multiplex Networks with Community Structure

We investigate a majority-vote model on two-layer multiplex networks with community structure. In our majority-vote model, the edges on each layer encode one type of social relationship and an individual changes their opinion based on the majority opinions of their neighbors in each layer. To capture the fact that different relationships often have different levels of importance, we introduce a layer-preference parameter, which determines the probability of a node to adopt an opinion when the node's neighborhoods on the two layers have different majority opinions. We construct our networks so that each node is a member of one community on each layer, and we consider situations in which nodes tend to have more connections with nodes from the same community than with nodes from different communities. We study the influence of the layer-preference parameter, the intralayer communities, and interlayer membership correlation on the steady-state behavior of our model using both direct numerical simulations and a mean-field approximation. We find three different types of steady-state behavior: a fully-mixed state, consensus states, and polarized states. We demonstrate that a stronger interlayer community correlation makes polarized steady states reachable for wider ranges of the other model parameters. We also show that different values of the layer-preference parameter result in qualitatively different phase diagrams for the mean opinions at steady states.

preprint2022arXiv

Analysis of Spatial and Spatiotemporal Anomalies Using Persistent Homology: Case Studies with COVID-19 Data

We develop a method for analyzing spatial and spatiotemporal anomalies in geospatial data using topological data analysis (TDA). To do this, we use persistent homology (PH), which allows one to algorithmically detect geometric voids in a data set and quantify the persistence of such voids. We construct an efficient filtered simplicial complex (FSC) such that the voids in our FSC are in one-to-one correspondence with the anomalies. Our approach goes beyond simply identifying anomalies; it also encodes information about the relationships between anomalies. We use vineyards, which one can interpret as time-varying persistence diagrams (which are an approach for visualizing PH), to track how the locations of the anomalies change with time. We conduct two case studies using spatially heterogeneous COVID-19 data. First, we examine vaccination rates in New York City by zip code at a single point in time. Second, we study a year-long data set of COVID-19 case rates in neighborhoods of the city of Los Angeles.

preprint2022arXiv

Lonely individuals process the world in idiosyncratic ways

Loneliness is detrimental to well-being and is often accompanied by self-reported feelings of not being understood by others. What contributes to such feelings in lonely people? We used functional magnetic resonance imaging (fMRI) of 66 participants to unobtrusively measure the relative alignment of people's mental processing of naturalistic stimuli and tested whether or not lonely people actually process the world in idiosyncratic ways. We found evidence for such idiosyncrasy: lonely individuals' neural responses were dissimilar to their peers, particularly in regions of the default-mode network in which similar responses have been associated with shared perspectives and subjective understanding. These relationships persisted when controlling for demographic similarities, objective social isolation, and participants' friendships with each other. Our findings suggest the possibility that being surrounded by people who see the world differently from oneself, even if one is friends with them, may be a risk factor for loneliness.

preprint2022arXiv

Mixed Logit Models and Network Formation

The study of network formation is pervasive in economics, sociology, and many other fields. In this paper, we model network formation as a `choice' that is made by nodes in a network to connect to other nodes. We study these `choices' using discrete-choice models, in which an agent chooses between two or more discrete alternatives. We employ the `repeated-choice' (RC) model to study network formation. We argue that the RC model overcomes important limitations of the multinomial logit (MNL) model, which gives one framework for studying network formation, and that it is well-suited to study network formation. We also illustrate how to use the RC model to accurately study network formation using both synthetic and real-world networks. Using edge-independent synthetic networks, we also compare the performance of the MNL model and the RC model. We find that the RC model estimates the data-generation process of our synthetic networks more accurately than the MNL model. In a patent citation network, which forms sequentially, we present a case study of a qualitatively interesting scenario -- the fact that new patents are more likely to cite older, more cited, and similar patents -- for which employing the RC model yields interesting insights.

preprint2021arXiv

Counterparty Credit Limits: The Impact of a Risk-Mitigation Measure on Everyday Trading

A counterparty credit limit (CCL) is a limit that is imposed by a financial institution to cap its maximum possible exposure to a specified counterparty. CCLs help institutions to mitigate counterparty credit risk via selective diversification of their exposures. In this paper, we analyze how CCLs impact the prices that institutions pay for their trades during everyday trading. We study a high-quality data set from a large electronic trading platform in the foreign exchange spot market, which enables institutions to apply CCLs. We find empirically that CCLs had little impact on the vast majority of trades in this data. We also study the impact of CCLs using a new model of trading. By simulating our model with different underlying CCL networks, we highlight that CCLs can have a major impact in some situations.

preprint2021arXiv

Tie-decay networks in continuous time and eigenvector-based centralities

Network theory is a useful framework for studying interconnected systems of interacting entities. Many networked systems evolve continuously in time, but most existing methods for the analysis of time-dependent networks rely on discrete or discretized time. In this paper, we propose an approach for studying networks that evolve in continuous time by distinguishing between \emph{interactions}, which we model as discrete contacts, and \emph{ties}, which encode the strengths of relationships as functions of time. To illustrate our tie-decay network formalism, we adapt the well-known PageRank centrality score to our tie-decay framework in a mathematically tractable and computationally efficient way. We apply this framework to a synthetic example and then use it to study a network of retweets during the 2012 National Health Service controversy in the United Kingdom. Our work also provides guidance for similar generalizations of other tools from network theory to continuous-time networks with tie decay, including for applications to streaming data.

preprint2020arXiv

A Model for the Influence of Media on the Ideology of Content in Online Social Networks

Many people rely on online social networks as sources of news and information, and the spread of media content with ideologies across the political spectrum influences online discussions and impacts actions offline. To examine the impact of media in online social networks, we generalize bounded-confidence models of opinion dynamics by incorporating media accounts as influencers in a network. We quantify partisanship of content with a continuous parameter on an interval, and we formulate higher-dimensional generalizations to incorporate content quality and increasingly nuanced political positions. We simulate our model with one and two ideological dimensions, and we use the results of our simulations to quantify the "entrainment" of content from non-media accounts to the ideologies of media accounts in a network. We maximize media impact in a social network by tuning the number of media accounts that promote the content and the number of followers of the accounts. Using numerical computations, we find that the entrainment of the ideology of content spread by non-media accounts to media ideology depends on a network's structural features, including its size, the mean number of followers of its nodes, and the receptiveness of its nodes to different opinions. We then introduce content quality --- a key novel contribution of our work --- into our model. We incorporate multiple media sources with ideological biases and quality-level estimates that we draw from real media sources and demonstrate that our model can produce distinct communities ("echo chambers") that are polarized in both ideology and quality. Our model provides a step toward understanding content quality and ideology in spreading dynamics, with ramifications for how to mitigate the spread of undesired content and promote the spread of desired content.

preprint2020arXiv

A unified framework for equivalences in social networks

A key concern in network analysis is the study of social positions and roles of actors in a network. The notion of "position" refers to an equivalence class of nodes that have similar ties to other nodes, whereas a "role" is an equivalence class of compound relations that connect the same pairs of nodes. An open question in network science is whether it is possible to simultaneously perform role and positional analysis. Motivated by the principle of functoriality in category theory we propose a new method that allows to tie role and positional analysis together. We illustrate our methods on two well-studied data sets in network science.

preprint2020arXiv

Connecting the Dots: Discovering the "Shape" of Data

Scientists use a mathematical subject called 'topology' to study the shapes of objects. An important part of topology is counting the numbers of pieces and holes in objects, and people use this information to group objects into different types. For example, a doughnut has the same number of holes and the same number of pieces as a teacup with one handle, but it is different from a ball. In studies that resemble activities like "connect the dots", scientists use ideas from topology to study the shape of data. Data can take many possible forms: a picture made of dots, a large collection of numbers from a scientific experiment, or something else. The approach in these studies is called 'topological data analysis', and it has been used to study the branching structures of veins in leaves, how people vote in elections, flight patterns in models of bird flocking, and more. Scientists can take data on the way veins branch on leaves and use topological data analysis to divide the leaves into different groups and discover patterns that may otherwise be hard to find.

preprint2020arXiv

Disease Detectives: Using Mathematics to Forecast the Spread of Infectious Diseases

The COVID-19 pandemic has led to significant changes in how people are currently living their lives. To determine how to best reduce the effects of the pandemic and start reopening societies, governments have drawn insights from mathematical models of the spread of infectious diseases. In this article, we give an introduction to a family of mathematical models (called "compartmental models") and discuss how the results of analyzing these models influence government policies and human behavior, such as encouraging mask wearing and physical distancing to help slow the spread of the disease.

preprint2020arXiv

Fitting In and Breaking Up: A Nonlinear Version of Coevolving Voter Models

We investigate a nonlinear version of coevolving voter models, in which node states and network structure update as a coupled stochastic dynamical process. Most prior work on coevolving voter models has focused on linear update rules with fixed and homogeneous rewiring and adopting probabilities. By contrast, in our nonlinear version, the probability that a node rewires or adopts is a function of how well it "fits in" within its neighborhood. To explore this idea, we incorporate a parameter $σ$ that represents the fraction of neighbors of an updating node that share its opinion state. In an update, with probability $σ^q$ (for some nonlinearity parameter $q$), the updating node rewires; with complementary probability $1-σ^q$, the updating node adopts a new opinion state. We study this mechanism using three rewiring schemes: after an updating node deletes a discordant edge, it then either (1) "rewires-to-random" by choosing a new neighbor in a random process; (2) "rewires-to-same" by choosing a new neighbor in a random process from nodes that share its state; or (3) "rewires-to-none" by not rewiring at all (akin to "unfriending" on social media). We compare our nonlinear coevolving model to several existing linear models, and we find in our model that initial network topology plays a larger role in the dynamics and the choice of rewiring mechanism plays a smaller role. A particularly interesting feature of our model is that, under certain conditions, the opinion state that is held initially by a minority of the nodes can effectively spread to almost every node in a network if the minority nodes view themselves as the majority. In light of this observation, we relate our results to recent work on the majority illusion in social networks.

preprint2020arXiv

Forecasting elections using compartmental models of infection

Forecasting elections -- a challenging, high-stakes problem -- is the subject of much uncertainty, subjectivity, and media scrutiny. To shed light on this process, we develop a method for forecasting elections from the perspective of dynamical systems. Our model borrows ideas from epidemiology, and we use polling data from United States elections to determine its parameters. Surprisingly, our general model performs as well as popular forecasters for the 2012 and 2016 U.S. races for president, senators, and governors. Although contagion and voting dynamics differ, our work suggests a valuable approach to elucidate how elections are related across states. It also illustrates the effect of accounting for uncertainty in different ways, provides an example of data-driven forecasting using dynamical systems, and suggests avenues for future research on political elections. We conclude with our forecasts for the senatorial and gubernatorial races on 6~November 2018, which we posted on 5 November 2018.

preprint2020arXiv

Inference of Edge Correlations in Multilayer Networks

Many recent developments in network analysis have focused on multilayer networks, which one can use to encode time-dependent interactions, multiple types of interactions, and other complications that arise in complex systems. Like their monolayer counterparts, multilayer networks in applications often have mesoscale features, such as community structure. A prominent type of method for inferring such structures is the employment of multilayer stochastic block models (SBMs). A common (but {potentially} inadequate) assumption of these models is the sampling of edges in different layers independently, conditioned on the community labels of the nodes. In this paper, we relax this assumption of independence by incorporating edge correlations into an SBM-like model. We derive maximum-likelihood estimates of the key parameters of our model, and we propose a measure of layer correlation that reflects the similarity between connectivity patterns in different layers. Finally, we explain how to use correlated models for edge "prediction" (i.e., inference) in multilayer networks. By taking into account edge correlations, prediction accuracy improves both in synthetic networks and in a temporal network of shoppers who are connected to previously-purchased grocery products.

preprint2020arXiv

Migration Networks: Applications of Network Analysis to Macroscale Migration Patterns

An emerging area of research is the study of macroscale migration patterns as a network of nodes that represent places (e.g., countries, cities, and rural areas) and edges that encode migration ties that connect those places. In this chapter, we first review advances in the study of migration networks and recent work that has employed network analysis to examine such networks at different geographical scales. In our discussion, we focus in particular on global scale migration networks. We then propose ways to leverage network analysis in concert with digital technologies and online geolocated data to examine the structure and dynamics of migration networks. The implementation of such approaches for studying migration networks faces many challenges, including ethical ones, methodological ones, socio-technological ones (e.g., data availability and reuse), and research reproducibility. We detail these challenges, and we then consider possible ways of linking digital geolocated data to administrative and survey data as a way of harnessing new technologies to construct increasingly realistic migration networks (e.g., using multiplex networks). We also briefly discuss new methods (e.g., multilayer network analysis) in network analysis and adjacent fields (e.g., machine learning) that can help advance understanding of macroscale patterns of migration.

preprint2020arXiv

Random walks and diffusion on networks

Random walks are ubiquitous in the sciences, and they are interesting from both theoretical and practical perspectives. They are one of the most fundamental types of stochastic processes; can be used to model numerous phenomena, including diffusion, interactions, and opinions among humans and animals; and can be used to extract information about important entities or dense groups of entities in a network. Random walks have been studied for many decades on both regular lattices and (especially in the last couple of decades) on networks with a variety of structures. In the present article, we survey the theory and applications of random walks on networks, restricting ourselves to simple cases of single and non-adaptive random walkers. We distinguish three main types of random walks: discrete-time random walks, node-centric continuous-time random walks, and edge-centric continuous-time random walks. We first briefly survey random walks on a line, and then we consider random walks on various types of networks. We extensively discuss applications of random walks, including ranking of nodes (e.g., PageRank), community detection, respondent-driven sampling, and opinion models such as voter models.

preprint2020arXiv

Social Network Analysis for Social Neuroscientists

Although social neuroscience is concerned with understanding how the brain interacts with its social environment, prevailing research in the field has primarily considered the human brain in isolation, deprived of its rich social context. Emerging work in social neuroscience that leverages tools from network analysis has begun to pursue this issue, advancing knowledge of how the human brain influences and is influenced by the structures of its social environment. In this paper, we provide an overview of key theory and methods in network analysis (especially for social systems) as an introduction for social neuroscientists who are interested in relating individual cognition to the structures of an individual's social environments. We also highlight some exciting new work as examples of how to productively use these tools to investigate questions of relevance to social neuroscientists. We include tutorials to help with practical implementation of the concepts that we discuss. We conclude by highlighting a broad range of exciting research opportunities for social neuroscientists who are interested in using network analysis to study social systems.

preprint2020arXiv

Spatial Applications of Topological Data Analysis: Cities, Snowflakes, Random Structures, and Spiders Spinning Under the Influence

Spatial networks are ubiquitous in social, geographical, physical, and biological applications. To understand the large-scale structure of networks, it is important to develop methods that allow one to directly probe the effects of space on structure and dynamics. Historically, algebraic topology has provided one framework for rigorously and quantitatively describing the global structure of a space, and recent advances in topological data analysis (TDA) have given scholars a new lens for analyzing network data. In this paper, we study a variety of spatial networks -- including both synthetic and natural ones -- using novel topological methods that we recently developed for analyzing spatial networks. We demonstrate that our methods are able to capture meaningful quantities, with specifics that depend on context, in spatial networks and thereby provide useful insights into the structure of those networks, including a novel approach for characterizing them based on their topological structures. We illustrate these ideas with examples of synthetic networks and dynamics on them, street networks in cities, snowflakes, and webs spun by spiders under the influence of various psychotropic substances.

preprint2020arXiv

Spatial Strength Centrality and the Effect of Spatial Embeddings on Network Architecture

For many networks, it is useful to think of their nodes as being embedded in a latent space, and such embeddings can affect the probabilities for nodes to be adjacent to each other. In this paper, we extend existing models of synthetic networks to spatial network models by first embedding nodes in Euclidean space and then modifying the models so that progressively longer edges occur with progressively smaller probabilities. We start by extending a geographical fitness model by employing Gaussian-distributed fitnesses, and we then develop spatial versions of preferential attachment and configuration models. We define a notion of "spatial strength centrality" to help characterize how strongly a spatial embedding affects network structure, and we examine spatial strength centrality on a variety of real and synthetic networks.

preprint2020arXiv

Topological Data Analysis of Task-Based fMRI Data from Experiments on Schizophrenia

We use methods from computational algebraic topology to study functional brain networks, in which nodes represent brain regions and weighted edges encode the similarity of fMRI time series from each region. With these tools, which allow one to characterize topological invariants such as loops in high-dimensional data, we are able to gain understanding into low-dimensional structures in networks in a way that complements traditional approaches that are based on pairwise interactions. In the present paper, we use persistent homology to analyze networks that we construct from task-based fMRI data from schizophrenia patients, healthy controls, and healthy siblings of schizophrenia patients. We thereby explore the persistence of topological structures such as loops at different scales in these networks. We use persistence landscapes and persistence images to create output summaries from our persistent-homology calculations, and we study the persistence landscapes and images using $k$-means clustering and community detection. Based on our analysis of persistence landscapes, we find that the members of the sibling cohort have topological features (specifically, their 1-dimensional loops) that are distinct from the other two cohorts. From the persistence images, we are able to distinguish all three subject groups and to determine the brain regions in the loops (with four or more edges) that allow us to make these distinctions.

preprint2020arXiv

Tunable Eigenvector-Based Centralities for Multiplex and Temporal Networks

Characterizing the importances (i.e., centralities) of nodes in social, biological, and technological networks is a core topic in both network science and data science. We present a linear-algebraic framework that generalizes eigenvector-based centralities, including PageRank and hub/authority scores, to provide a common framework for two popular classes of multilayer networks: multiplex networks (which have layers that encode different types of relationships) and temporal networks (in which the relationships change over time). Our approach involves the study of joint, marginal, and conditional "supracentralities" that one can calculate from the dominant eigenvector of a supracentrality matrix [Taylor et al., 2017], which couples centrality matrices that are associated with individual network layers. We extend this prior work (which was restricted to temporal networks with layers that are coupled by adjacent-in-time coupling) by allowing the layers to be coupled through a (possibly asymmetric) interlayer-adjacency matrix $\tilde{\bf A}$, where the entry $\tilde{A}_{tt'} \geq 0$ encodes the coupling between layers $t$ and $t'$. Our framework provides a unifying foundation for centrality analysis of multiplex and temporal networks; it also illustrates a complicated dependency of the supracentralities on the topology and weights of interlayer coupling. By scaling $\tilde{\bf A}$ by an interlayer-coupling strength $ω\ge0$ and developing a singular perturbation theory for the limits of weak ($ω\to0^+$) and strong coupling ($ω\to\infty$), we also reveal an interesting dependence of supracentralities on the dominant left and right eigenvectors of $\tilde{\bf A}$.

preprint2019arXiv

A Framework for the Construction of Generative Models for Mesoscale Structure in Multilayer Networks

Multilayer networks allow one to represent diverse and coupled connectivity patterns --- e.g., time-dependence, multiple subsystems, or both --- that arise in many applications and which are difficult or awkward to incorporate into standard network representations. In the study of multilayer networks, it is important to investigate mesoscale (i.e., intermediate-scale) structures, such as dense sets of nodes known as communities, to discover network features that are not apparent at the microscale or the macroscale. The ill-defined nature of mesoscale structure and its ubiquity in empirical networks make it crucial to develop generative models that can produce the features that one encounters in empirical networks. Key purposes of such generative models include generating synthetic networks with empirical properties of interest, benchmarking mesoscale-detection methods and algorithms, and inferring structure in empirical multilayer networks. In this paper, we introduce a framework for the construction of generative models for mesoscale structures in multilayer networks. Our framework provides a standardized set of generative models, together with an associated set of principles from which they are derived, for studies of mesoscale structures in multilayer networks. It unifies and generalizes many existing models for mesoscale structures in fully-ordered (e.g., temporal) and unordered (e.g., multiplex) multilayer networks. One can also use it to construct generative models for mesoscale structures in partially-ordered multilayer networks (e.g., networks that are both temporal and multiplex). Our framework has the ability to produce many features of empirical multilayer networks, and it explicitly incorporates a user-specified dependency structure between layers.

preprint2013arXiv

Dark Solitary Waves in a Class of Collisionally Inhomogeneous Bose-Einstein Condensates

We study the structure, stability, and dynamics of dark solitary waves in parabolically trapped, collisionally inhomogeneous Bose-Einstein condensates (BECs) with spatially periodic variations of the scattering length. This collisional inhomogeneity yields a nonlinear lattice, which we tune from a small-amplitude, approximately sinusoidal structure to a periodic sequence of densely spaced spikes. We start by investigating time-independent inhomogeneities, and we subsequently examine the dynamical response when one starts with a collisionally homogeneous BEC and then switches on an inhomogeneity either adiabatically or nonadiabatically. Using Bogoliubov-de Gennes linearization as well as direct numerical simulations of the Gross-Pitaevskii equation, we observe dark solitary waves, which can become unstable through oscillatory or exponential instabilities. We find a critical wavelength of the nonlinear lattice that is comparable to the healing length. Near this value, the fundamental eigenmode responsible for the stability of the dark solitary wave changes its direction of movement as a function of the strength of the nonlinearity. When it increases, it collides with other eigenmodes, leading to oscillatory instabilities; when it decreases, it collides with the origin and becomes imaginary, illustrating that the instability mechanism is fundamentally different in wide-well versus narrow-well lattices. When starting from a collisionally homogeneous setup and switching on inhomogeneities, we find that dark solitary waves are preserved generically for aligned lattices. We briefly examine the time scales for the onset of solitary-wave oscillations in this scenario.

preprint2013arXiv

Two-Particle Circular Billiards Versus Randomly Perturbed One-Particle Circular Billiards

We study a two-particle circular billiard containing two finite-size circular particles that collide elastically with the billiard boundary and with each other. Such a two-particle circular billiard provides a clean example of an "intermittent" system. This billiard system behaves chaotically, but the time scale on which chaos manifests can become arbitrarily long as the sizes of the confined particles become smaller. The finite-time dynamics of this system depends on the relative frequencies of (chaotic) particle-particle collisions versus (integrable) particle-boundary collisions, and investigating these dynamics is computationally intensive because of the long time scales involved. To help improve understanding of such two-particle dynamics, we compare the results of diagnostics used to measure chaotic dynamics for a two-particle circular billiard with those computed for two types of one-particle circular billiards in which a confined particle undergoes random perturbations. Importantly, such one-particle approximations are much less computationally demanding than the original two-particle system, and we expect them to yield reasonable estimates of the extent of chaotic behavior in the two-particle system when the sizes of confined particles are small. Our computations of recurrence-rate coefficients, finite-time Lyapunov exponents, and autocorrelation coefficients support this hypothesis and suggest that studying randomly perturbed one-particle billiards has the potential to yield insights into the aggregate properties of two-particle billiards, which are difficult to investigate directly without enormous computation times (especially when the sizes of the confined particles are small).

preprint2012arXiv

Taxonomies of Networks

The study of networks has grown into a substantial interdisciplinary endeavour that encompasses myriad disciplines in the natural, social, and information sciences. Here we introduce a framework for constructing taxonomies of networks based on their structural similarities. These networks can arise from any of numerous sources: they can be empirical or synthetic, they can arise from multiple realizations of a single process, empirical or synthetic, or they can represent entirely different systems in different disciplines. Since the mesoscopic properties of networks are hypothesized to be important for network function, we base our comparisons on summaries of network community structures. While we use a specific method for uncovering network communities, much of the introduced framework is independent of that choice. After introducing the framework, we apply it to construct a taxonomy for 746 individual networks and demonstrate that our approach usefully identifies similar networks. We also construct taxonomies within individual categories of networks, and in each case we expose non-trivial structure. For example we create taxonomies for similarity networks constructed from both political voting data and financial data. We also construct network taxonomies to compare the social structures of 100 Facebook networks and the growth structures produced by different types of fungi.

preprint2012arXiv

The Extraordinary SVD

The singular value decomposition (SVD) is a popular matrix factorization that has been used widely in applications ever since an efficient algorithm for its computation was developed in the 1970s. In recent years, the SVD has become even more prominent due to a surge in applications and increased computational memory and speed. To illustrate the vitality of the SVD in data analysis, we highlight three of its lesser-known yet fascinating applications: the SVD can be used to characterize political positions of Congressmen, measure the growth rate of crystals in igneous rock, and examine entanglement in quantum computation. We also discuss higher-dimensional generalizations of the SVD, which have become increasingly crucial with the newfound wealth of multidimensional data and have launched new research initiatives in both theoretical and applied mathematics. With its bountiful theory and applications, the SVD is truly extraordinary.

preprint2011arXiv

Party Polarization in Congress: A Network Science Approach

We measure polarization in the United States Congress using the network science concept of modularity. Modularity provides a conceptually-clear measure of polarization that reveals both the number of relevant groups and the strength of inter-group divisions without making restrictive assumptions about the structure of the party system or the shape of legislator utilities. We show that party influence on Congressional blocs varies widely throughout history, and that existing measures underestimate polarization in periods with weak party structures. We demonstrate that modularity is a significant predictor of changes in majority party and that turnover is more prevalent at medium levels of modularity. We show that two variables related to modularity, called `divisiveness' and `solidarity,' are significant predictors of reelection success for individual House members. Our results suggest that modularity can serve as an early warning of changing group dynamics, which are reflected only later by changes in party labels.

preprint2010arXiv

Community Structure in the United Nations General Assembly

We study the community structure of networks representing voting on resolutions in the United Nations General Assembly. We construct networks from the voting records of the separate annual sessions between 1946 and 2008 in three different ways: (1) by considering voting similarities as weighted unipartite networks; (2) by considering voting similarities as weighted, signed unipartite networks; and (3) by examining signed bipartite networks in which countries are connected to resolutions. For each formulation, we detect communities by optimizing network modularity using an appropriate null model. We compare and contrast the results that we obtain for these three different network representations. In so doing, we illustrate the need to consider multiple resolution parameters and explore the effectiveness of each network representation for identifying voting groups amidst the large amount of agreement typical in General Assembly votes.

preprint2010arXiv

Community Structure in Time-Dependent, Multiscale, and Multiplex Networks

Network science is an interdisciplinary endeavor, with methods and applications drawn from across the natural, social, and information sciences. A prominent problem in network science is the algorithmic detection of tightly-connected groups of nodes known as communities. We developed a generalized framework of network quality functions that allowed us to study the community structure of arbitrary multislice networks, which are combinations of individual networks coupled through links that connect each node in one network slice to itself in other slices. This framework allows one to study community structure in a very general setting encompassing networks that evolve over time, have multiple types of links (multiplexity), and have multiple scales.

preprint2010arXiv

Comparing Community Structure to Characteristics in Online Collegiate Social Networks

We study the structure of social networks of students by examining the graphs of Facebook "friendships" at five American universities at a single point in time. We investigate each single-institution network's community structure and employ graphical and quantitative tools, including standardized pair-counting methods, to measure the correlations between the network communities and a set of self-identified user characteristics (residence, class year, major, and high school). We review the basic properties and statistics of the pair-counting indices employed and recall, in simplified notation, a useful analytical formula for the z-score of the Rand coefficient. Our study illustrates how to examine different instances of social networks constructed in similar environments, emphasizes the array of social forces that combine to form "communities," and leads to comparative observations about online social lives that can be used to infer comparisons about offline social structures. In our illustration of this methodology, we calculate the relative contributions of different characteristics to the community structure of individual universities and subsequently compare these relative contributions at different universities, measuring for example the importance of common high school affiliation to large state universities and the varying degrees of influence common major can have on the social structure at different universities. The heterogeneity of communities that we observe indicates that these networks typically have multiple organizing factors rather than a single dominant one.

preprint2010arXiv

Dynamical Clustering of Exchange Rates

We use techniques from network science to study correlations in the foreign exchange (FX) market over the period 1991--2008. We consider an FX market network in which each node represents an exchange rate and each weighted edge represents a time-dependent correlation between the rates. To provide insights into the clustering of the exchange rate time series, we investigate dynamic communities in the network. We show that there is a relationship between an exchange rate's functional role within the market and its position within its community and use a node-centric community analysis to track the time dynamics of this role. This reveals which exchange rates dominate the market at particular times and also identifies exchange rates that experienced significant changes in market role. We also use the community dynamics to uncover major structural changes that occurred in the FX market. Our techniques are general and will be similarly useful for investigating correlations in other markets.

preprint2010arXiv

Intrinsic Energy Localization through Discrete Gap Breathers in One-Dimensional Diatomic Granular Crystals

We present a systematic study of the existence and stability of discrete breathers that are spatially localized in the bulk of a one-dimensional chain of compressed elastic beads that interact via Hertzian contact. The chain is diatomic, consisting of a periodic arrangement of heavy and light spherical particles. We examine two families of discrete gap breathers: (1) an unstable discrete gap breather that is centered on a heavy particle and characterized by a symmetric spatial energy profile and (2) a potentially stable discrete gap breather that is centered on a light particle and is characterized by an asymmetric spatial energy profile. We investigate their existence, structure, and stability throughout the band gap of the linear spectrum and classify them into four regimes: a regime near the lower optical band edge of the linear spectrum, a moderately discrete regime, a strongly discrete regime that lies deep within the band gap of the linearized version of the system, and a regime near the upper acoustic band edge. We contrast discrete breathers in anharmonic FPU-type diatomic chains with those in diatomic granular crystals, which have a tensionless interaction potential between adjacent particles, and highlight in that the asymmetric nature of the latter interaction potential may lead to a form of hybrid bulk-surface localized solutions.

preprint2010arXiv

Revisiting Date and Party Hubs: Novel Approaches to Role Assignment in Protein Interaction Networks

The idea of 'date' and 'party' hubs has been influential in the study of protein-protein interaction networks. Date hubs display low co-expression with their partners, whilst party hubs have high co-expression. It was proposed that party hubs are local coordinators whereas date hubs are global connectors. Here we show that the reported importance of date hubs to network connectivity can in fact be attributed to a tiny subset of them. Crucially, these few, extremely central, hubs do not display particularly low expression correlation, undermining the idea of a link between this quantity and hub function. The date/party distinction was originally motivated by an approximately bimodal distribution of hub co-expression; we show that this feature is not always robust to methodological changes. Additionally, topological properties of hubs do not in general correlate with co-expression. Thus, we suggest that a date/party dichotomy is not meaningful and it might be more useful to conceive of roles for protein-protein interactions rather than individual proteins. We find significant correlations between interaction centrality and the functional similarity of the interacting proteins.

preprint2009arXiv

Localized Breathing Modes in Granular Crystals with Defects

We investigate nonlinear localized modes at light-mass impurities in a one-dimensional, strongly-compressed chain of beads under Hertzian contacts. Focusing on the case of one or two such "defects", we analyze the problem's linear limit to identify the system eigenfrequencies and the linear defect modes. We then examine the bifurcation of nonlinear defect modes from their linear counterparts and study their linear stability in detail. We identify intriguing differences between the case of impurities in contact and ones that are not in contact. We find that the former bears similarities to the single defect case, whereas the latter features symmetry-breaking bifurcations with interesting static and dynamic implications.

preprint2008arXiv

Dissipative Solitary Waves in Granular Crystals

We provide a quantitative characterization of dissipative effects in one-dimensional granular crystals. We use the propagation of highly nonlinear solitary waves as a diagnostic tool and develop optimization schemes that allow one to compute the relevant exponents and prefactors of the dissipative terms in the equations of motion. We thereby propose a quantitatively-accurate extension of the Hertzian model that encompasses dissipative effects via a discrete Laplacian of the velocities. Experiments and computations with steel, brass, and polytetrafluoroethylene reveal a {\em common} dissipation exponent with a material-dependent prefactor.

preprint2006arXiv

Random Walker Ranking for NCAA Division I-A Football

Each December, college football fans and pundits across America debate which two teams should meet in the NCAA Division I-A National Championship game. The Bowl Championship Series (BCS) standings employed to select the teams invited to this game are intended to provide an unequivocal #1 v. #2 game for the championship; however, this selection process has itself been highly controversial in recent years. The computer algorithms that constitute one part of the BCS standings often act as lightning rods for the controversy, in part because they are inadequately explained to the public. We present an alternative algorithm that is simply explained yet remains effective at ranking the best teams. We define a ranking in terms of biased random walkers on the graph formed by the schedule of games played, with two teams (vertices) connected by an edge if they played each other. Each random walker moves from team to team by selecting a game and "voting" for its winner with probability p, tracing out a never-ending path motivated by the "my team beat your team" argument. We study the statistical properties of a collection of such walkers, relate the rankings to the community structure of the underlying network, and demonstrate the results for recent NCAA Division I-A seasons. We also discuss the algorithm's asymptotic behavior, illustrated with some analytically tractable cases for round-robin tournaments, and discuss possible generalizations.