Source author record

Carson C. Chow

Carson C. Chow appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

chao-dyn Genomics nlin.CD Populations and Evolution Tissues and Organs Applications Computation cond-mat cond-mat.dis-nn cond-mat.soft cond-mat.stat-mech math.DS math.SP Neurons and Cognition nlin.AO nlin.PS patt-sol Quantitative Methods

Catalog footprint

What is connected

10works

18topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2015arXiv

The Universality of Cancer

Cancer has been characterized as a constellation of hundreds of diseases differing in underlying mutations and depending on cellular environments. Carcinogenesis as a stochastic physical process has been studied for over sixty years, but there is no accepted standard model. We show that the hazard rates of all cancers are characterized by a simple dynamic stochastic process on a half-line, with a universal linear restoring force balancing a universal simple Brownian motion starting from a universal initial distribution. Only a critical radius defining the transition from normal to tumorigenic genomes distinguishes between different cancer types when time is measured in cell--cycle units. Reparametrizing to chronological time units introduces two additional parameters: the onset of cellular senescence with age and the time interval over which this cessation in replication takes place. This universality implies that there may exist a finite separation between normal cells and tumorigenic cells in all tissue types that may be a viable target for both early detection and preventive therapy.

preprint2014arXiv

Application of compressed sensing to genome wide association studies and genomic selection

We show that the signal-processing paradigm known as compressed sensing (CS) is applicable to genome-wide association studies (GWAS) and genomic selection (GS). The aim of GWAS is to isolate trait-associated loci, whereas GS attempts to predict the phenotypic values of new individuals on the basis of training data. CS addresses a problem common to both endeavors, namely that the number of genotyped markers often greatly exceeds the sample size. We show using CS methods and theory that all loci of nonzero effect can be identified (selected) using an efficient algorithm, provided that they are sufficiently few in number (sparse) relative to sample size. For heritability h2 = 1, there is a sharp phase transition to complete selection as the sample size is increased. For heritability values less than one, complete selection can still occur although the transition is smoothed. The transition boundary is only weakly dependent on the total number of genotyped markers. The crossing of a transition boundary provides an objective means to determine when true effects are being recovered; we discuss practical methods for detecting the boundary. For h2 = 0.5, we find that a sample size that is thirty times the number of nonzero loci is sufficient for good recovery.

preprint2014arXiv

Second-generation PLINK: rising to the challenge of larger and richer datasets

PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic calls, phase information, and/or multiallelic variants, none of which can be represented by PLINK 1's primary data format. To address these issues, we are developing a second-generation codebase for PLINK. The first major release from this codebase, PLINK 1.9, introduces extensive use of bit-level parallelism, O(sqrt(n))-time/constant-space Hardy-Weinberg equilibrium and Fisher's exact tests, and many other algorithmic improvements. In combination, these changes accelerate most operations by 1-4 orders of magnitude, and allow the program to handle datasets too large to fit in RAM. This will be followed by PLINK 2.0, which will introduce (a) a new data format capable of efficiently representing probabilities, phase, and multiallelic variants, and (b) extensions of many functions to account for the new types of information. The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility. For the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.

preprint2013arXiv

Generalized activity equations for spiking neural network dynamics

Much progress has been made in uncovering the computational capabilities of spiking neural networks. However, spiking neurons will always be more expensive to simulate compared to rate neurons because of the inherent disparity in time scales - the spike duration time is much shorter than the inter-spike time, which is much shorter than any learning time scale. In numerical analysis, this is a classic stiff problem. Spiking neurons are also much more difficult to study analytically. One possible approach to making spiking networks more tractable is to augment mean field activity models with some information about spiking correlations. For example, such a generalized activity model could carry information about spiking rates and correlations between spikes self-consistently. Here, we will show how this can be accomplished by constructing a complete formal probabilistic description of the network and then expanding around a small parameter such as the inverse of the number of neurons in the network. The mean field theory of the system gives a rate-like description. The first order terms in the perturbation expansion keep track of covariances.

preprint2013arXiv

The causal meaning of Fisher's average effect

In order to formulate the Fundamental Theorem of Natural Selection, Fisher defined the average excess and average effect of a gene substitution. Finding these notions to be somewhat opaque, some authors have recommended reformulating Fisher's ideas in terms of covariance and regression, which are classical concepts of statistics. We argue that Fisher intended his two averages to express a distinction between correlation and causation. On this view the average effect is a specific weighted average of the actual phenotypic changes that result from physically changing the allelic states of homologous genes. We show that the statistical and causal conceptions of the average effect, perceived as inconsistent by Falconer, can be reconciled if certain relationships between the genotype frequencies and non-additive residuals are conserved. There are certain theory-internal considerations favoring Fisher's original formulation in terms of causality; for example, the frequency-weighted mean of the average effects equaling zero at each locus becomes a derivable consequence rather than an arbitrary constraint. More broadly, Fisher's distinction between correlation and causation is of critical importance to gene-trait mapping studies and the foundations of evolutionary biology.

preprint2012arXiv

Path Integral Methods for Stochastic Differential Equations

We give a pedagogical review of the application of field theoretic and path integral methods to calculate moments of the probability density function of stochastic differential equations perturbatively.

preprint2008arXiv

Competition between transients in the rate of approach to a fixed point

Dynamical systems studies of differential equations often focus on the behavior of solutions near critical points and on invariant manifolds, to elucidate the organization of the associated flow. In addition, effective methods, such as the use of Poincare maps and phase resetting curves, have been developed for the study of periodic orbits. However, the analysis of transient dynamics associated with solutions on their way to an attracting fixed point has not received much rigorous attention. This paper introduces methods for the study of such transient dynamics. In particular, we focus on the analysis of whether one component of a solution to a system of differential equations can overtake the corresponding component of a reference solution, given that both solutions approach the same stable node. We call this phenomenon tolerance, which derives from a certain biological effect. Here, we establish certain general conditions, based on the initial conditions associated with the two solutions and the properties of the vector field, that guarantee that tolerance does or does not occur in two-dimensional systems. We illustrate these conditions in particular examples, and we derive and demonstrate additional techniques that can be used on a case by case basis to check for tolerance. Finally, we give a full rigorous analysis of tolerance in two-dimensional linear systems.

preprint2008arXiv

The dynamics of human body weight change

An imbalance between energy intake and energy expenditure will lead to a change in body weight (mass) and body composition (fat and lean masses). A quantitative understanding of the processes involved, which currently remains lacking, will be useful in determining the etiology and treatment of obesity and other conditions resulting from prolonged energy imbalance. Here, we show that the long-term dynamics of human weight change can be captured by a mathematical model of the macronutrient flux balances and all previous models are special cases of this model. We show that the generic dynamical behavior of body composition for a clamped diet can be divided into two classes. In the first class, the body composition and mass are determined uniquely. In the second class, the body composition can exist at an infinite number of possible states. Surprisingly, perturbations of dietary energy intake or energy expenditure can give identical responses in both model classes and existing data are insufficient to distinguish between these two possibilities. However, this distinction is important for the efficacy of clinical interventions that alter body composition and mass.

preprint1999arXiv

Hydrodynamics of the Kuramoto-Sivashinsky Equation in Two Dimensions

The large scale properties of spatiotemporal chaos in the 2d Kuramoto-Sivashinsky equation are studied using an explicit coarse graining scheme. A set of intermediate equations are obtained. They describe interactions between the small scale (e.g., cellular) structures and the hydrodynamic degrees of freedom. Possible forms of the effective large scale hydrodynamics are constructed and examined. Although a number of different universality classes are allowed by symmetry, numerical results support the simplest scenario, that being the KPZ universality class.

preprint1994arXiv

Defect-Mediated Stability: An Effective Hydrodynamic Theory of Spatio-Temporal Chaos

Spatiotemporal chaos (STC) exhibited by the Kuramoto-Sivashinsky (KS) equation is investigated analytically and numerically. An effective stochastic equation belonging to the KPZ universality class is constructed by incorporating the chaotic dynamics of the small KS system in a coarse-graining procedure. The bare parameters of the effective theory are computed approximately. Stability of the system is shown to be mediated by space-time defects that are accompanied by stochasticity. The method of analysis and the mechanism of stability may be relevant to a class of STC problems.