Source author record

Henry D. I. Abarbanel

Henry D. I. Abarbanel appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

nlin.CD physics.data-an Biological Physics Neurons and Cognition Machine Learning Neural and Evolutionary Computing physics.comp-ph physics.geo-ph Artificial Intelligence math.DS math.OC physics.ao-ph

Catalog footprint

What is connected

13works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A Systematic Exploration of Reservoir Computing for Forecasting Complex Spatiotemporal Dynamics

A reservoir computer (RC) is a type of simplified recurrent neural network architecture that has demonstrated success in the prediction of spatiotemporally chaotic dynamical systems. A further advantage of RC is that it reproduces intrinsic dynamical quantities essential for its incorporation into numerical forecasting routines such as the ensemble Kalman filter -- used in numerical weather prediction to compensate for sparse and noisy data. We explore here the architecture and design choices for a "best in class" RC for a number of characteristic dynamical systems, and then show the application of these choices in scaling up to larger models using localization. Our analysis points to the importance of large scale parameter optimization. We also note in particular the importance of including input bias in the RC design, which has a significant impact on the forecast skill of the trained RC model. In our tests, the the use of a nonlinear readout operator does not affect the forecast time or the stability of the forecast. The effects of the reservoir dimension, spinup time, amount of training data, normalization, noise, and the RC time step are also investigated. While we are not aware of a generally accepted best reported mean forecast time for different models in the literature, we report over a factor of 2 increase in the mean forecast time compared to the best performing RC model of Vlachas et.al (2020) for the 40 dimensional spatiotemporally chaotic Lorenz 1996 dynamics, and we are able to accomplish this using a smaller reservoir size.

preprint2021arXiv

Integrating Recurrent Neural Networks with Data Assimilation for Scalable Data-Driven State Estimation

Data assimilation (DA) is integrated with machine learning in order to perform entirely data-driven online state estimation. To achieve this, recurrent neural networks (RNNs) are implemented as surrogate models to replace key components of the DA cycle in numerical weather prediction (NWP), including the conventional numerical forecast model, the forecast error covariance matrix, and the tangent linear and adjoint models. It is shown how these RNNs can be initialized using DA methods to directly update the hidden/reservoir state with observations of the target system. The results indicate that these techniques can be applied to estimate the state of a system for the repeated initialization of short-term forecasts, even in the absence of a traditional numerical forecast model. Further, it is demonstrated how these integrated RNN-DA methods can scale to higher dimensions by applying domain localization and parallelization, providing a path for practical applications in NWP.

preprint2021arXiv

Robust Forecasting using Predictive Generalized Synchronization in Reservoir Computing

Reservoir computers (RC) are a form of recurrent neural network (RNN) used for forecasting timeseries data. As with all RNNs, selecting the hyperparameters presents a challenge when training onnew inputs. We present a method based on generalized synchronization (GS) that gives direction in designing and evaluating the architecture and hyperparameters of an RC. The 'auxiliary method' for detecting GS provides a computationally efficient pre-training test that guides hyperparameterselection. Furthermore, we provide a metric for RC using the reproduction of the input system's Lyapunov exponentsthat demonstrates robustness in prediction.

preprint2020arXiv

Machine Learning Classification Informed by a Functional Biophysical System

We present a novel machine learning architecture for classification suggested by experiments on olfactory systems. The network separates input stimuli, represented as spatially distinct currents, via winnerless competition---a process based on the intrinsic sequential dynamics of the neural system---then uses a support vector machine (SVM) to provide precision to the space-time separation of the output. The combined network uses biophysical models of neurons and shows high discrimination among inputs and robustness to noise. While using the SVM alone does not permit determination of the components of mixtures of classified inputs, the combined network is able to tell the precise concentrations of the constituent parts.

preprint2020arXiv

Precision annealing Monte Carlo methods for statistical data assimilation and machine learning

In statistical data assimilation (SDA) and supervised machine learning (ML), we wish to transfer information from observations to a model of the processes underlying those observations. For SDA, the model consists of a set of differential equations that describe the dynamics of a physical system. For ML, the model is usually constructed using other strategies. In this paper, we develop a systematic formulation based on Monte Carlo sampling to achieve such information transfer. Following the derivation of an appropriate target distribution, we present the formulation based on the standard Metropolis-Hasting (MH) procedure and the Hamiltonian Monte Carlo (HMC) method for performing the high dimensional integrals that appear. To the extensive literature on MH and HMC, we add (1) an annealing method using a hyperparameter that governs the precision of the model to identify and explore the highest probability regions of phase space dominating those integrals, and (2) a strategy for initializing the state space search. The efficacy of the proposed formulation is demonstrated using a nonlinear dynamical model with chaotic solutions widely used in geophysics.

preprint2016arXiv

Model of the Songbird Nucleus HVC as a Network of Central Pattern Generators

We propose a functional architecture of the adult songbird nucleus HVC in which the core element is a "functional syllable unit" (FSU). In this model, HVC is organized into FSUs, each of which provides the basis for the production of one syllable in vocalization. Within each FSU, the inhibitory neuron population takes one of two operational states: (A) simultaneous firing wherein all inhibitory neurons fire simultaneously, and (B) competitive firing of the inhibitory neurons. Switching between these basic modes of activity is accomplished via changes in the synaptic strengths among the inhibitory neurons. The inhibitory neurons connect to excitatory projection neurons such that during state (A) the activity of projection neurons is suppressed, while during state (B) patterns of sequential firing of projection neurons can occur. The latter state is stabilized by feedback from the projection to the inhibitory neurons. Song composition for specific species is distinguished by the manner in which different FSUs are functionally connected to each other. Ours is a computational model built with biophysically based neurons. We illustrate that many observations of HVC activity are explained by the dynamics of the proposed population of FSUs, and we identify aspects of the model that are currently testable experimentally. In addition, and standing apart from the core features of an FSU, we propose that the transition between modes may be governed by the biophysical mechanism of neuromodulation.

preprint2015arXiv

Basin structure of optimization based state and parameter estimation

Most data based state and parameter estimation methods require suitable initial values or guesses to achieve convergence to the desired solution, which typically is a global minimum of some cost function. Unfortunately, however, other stable solutions (e.g., local minima) may exist and provide suboptimal or even wrong estimates. Here we demonstrate for a 9-dimensional Lorenz-96 model how to characterize the basin size of the global minimum when applying some particular optimization based estimation algorithm. We compare three different strategies for generating suitable initial guesses and we investigate the dependence of the solution on the given trajectory segment (underlying the measured time series). To address the question of how many state variables have to be measured for optimal performance, different types of multivariate time series are considered consisting of 1, 2, or 3 variables. Based on these time series the local observability of state variables and parameters of the Lorenz-96 model is investigated and confirmed using delay coordinates. This result is in good agreement with the observation that correct state and parameter estimation results are obtained if the optimization algorithm is initialized with initial guesses close to the true solution. In contrast, initialization with other exact solutions of the model equations (different from the true solution used to generate the time series) typically fails, i.e. the optimization procedure ends up in local minima different from the true solution. Initialization using random values in a box around the attractor exhibits success rates depending on the number of observables and the available time series (trajectory segment).

preprint2014arXiv

Accurately Estimating the State of a Geophysical System with Sparse Observations: Predicting the Weather

Utilizing the information in observations of a complex system to make accurate predictions through a quantitative model when observations are completed at time $T$, requires an accurate estimate of the full state of the model at time $T$. When the number of measurements $L$ at each observation time within the observation window is larger than a sufficient minimum value $L_s$, the impediments in the estimation procedure are removed. As the number of available observations is typically such that $L \ll L_s$, additional information from the observations must be presented to the model. We show how, using the time delays of the measurements at each observation time, one can augment the information transferred from the data to the model, removing the impediments to accurate estimation and permitting dependable prediction. We do this in a core geophysical fluid dynamics model, the shallow water equations, at the heart of numerical weather prediction. The method is quite general, however, and can be utilized in the analysis of a broad spectrum of complex systems where measurements are sparse. When the model of the complex system has errors, the method still enables accurate estimation of the state of the model and thus evaluation of the model errors in a manner separated from uncertainties in the data assimilation procedure.

preprint2011arXiv

Data Assimilation using a GPU Accelerated Path Integral Monte Carlo Approach

The answers to data assimilation questions can be expressed as path integrals over all possible state and parameter histories. We show how these path integrals can be evaluated numerically using a Markov Chain Monte Carlo method designed to run in parallel on a Graphics Processing Unit (GPU). We demonstrate the application of the method to an example with a transmembrane voltage time series of a simulated neuron as an input, and using a Hodgkin-Huxley neuron model. By taking advantage of GPU computing, we gain a parallel speedup factor of up to about 300, compared to an equivalent serial computation on a CPU, with performance increasing as the length of the observation time used for data assimilation increases.

preprint2010arXiv

Self-Consistent Stochastic Model Errors in Data Assimilation

In using data assimilation to import information from observations to estimate parameters and state variables of a model, one must assume a distribution for the noise in the measurements and in the model errors. Using the path integral formulation of data assimilation~ cite{abar2009}, we introduce the idea of self consistency of the distribution of stochastic model errors: the distribution of model errors from the path integral with observed data should be consistent with the assumption made in formulating the the path integral. The path integral setting for data assimilation is discussed to provide the setting for the consistency test. Using two examples drawn from the 1996 Lorenz model, for $D = 100$ and for $D = 20$ we show how one can test for this inconsistency with essential no additional effort than that expended in extracting answers to interesting questions from data assimilation itself. \end{abstract}

preprint2009arXiv

Effective Actions for Ensemble Data Assimilation

Ensemble data assimilation is a problem in determining the most likely phase space trajectory of a model of an observed dynamical sys- tem as it receives inputs from measurements passing information to the model. Using methods developed in statistical physics, we present effective actions and equations of motion for the mean orbits associ- ated with the temporal development of a dynamical model when it has errors, there is uncertainty in its initial state, and it receives informa- tion from measurements. If there are correlations among errors in the measurements they are naturally included in this approach.

preprint2009arXiv

State and parameter estimation using Monte Carlo evaluation of path integrals

Transferring information from observations of a dynamical system to estimate the fixed parameters and unobserved states of a system model can be formulated as the evaluation of a discrete time path integral in model state space. The observations serve as a guiding potential working with the dynamical rules of the model to direct system orbits in state space. The path integral representation permits direct numerical evaluation of the conditional mean path through the state space as well as conditional moments about this mean. Using a Monte Carlo method for selecting paths through state space we show how these moments can be evaluated and demonstrate in an interesting model system the explicit influence of the role of transfer of information from the observations. We address the question of how many observations are required to estimate the unobserved state variables, and we examine the assumptions of Gaussianity of the underlying conditional probability.

preprint2007arXiv

Measuring spike train synchrony

Estimating the degree of synchrony or reliability between two or more spike trains is a frequent task in both experimental and computational neuroscience. In recent years, many different methods have been proposed that typically compare the timing of spikes on a certain time scale to be fixed beforehand. Here, we propose the ISI-distance, a simple complementary approach that extracts information from the interspike intervals by evaluating the ratio of the instantaneous frequencies. The method is parameter free, time scale independent and easy to visualize as illustrated by an application to real neuronal spike trains obtained in vitro from rat slices. In a comparison with existing approaches on spike trains extracted from a simulated Hindemarsh-Rose network, the ISI-distance performs as well as the best time-scale-optimized measure based on spike timing.

Henry D. I. Abarbanel

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

A Systematic Exploration of Reservoir Computing for Forecasting Complex Spatiotemporal Dynamics

Integrating Recurrent Neural Networks with Data Assimilation for Scalable Data-Driven State Estimation

Robust Forecasting using Predictive Generalized Synchronization in Reservoir Computing

Machine Learning Classification Informed by a Functional Biophysical System

Precision annealing Monte Carlo methods for statistical data assimilation and machine learning

Model of the Songbird Nucleus HVC as a Network of Central Pattern Generators

Basin structure of optimization based state and parameter estimation

Accurately Estimating the State of a Geophysical System with Sparse Observations: Predicting the Weather

Data Assimilation using a GPU Accelerated Path Integral Monte Carlo Approach

Self-Consistent Stochastic Model Errors in Data Assimilation

Effective Actions for Ensemble Data Assimilation

State and parameter estimation using Monte Carlo evaluation of path integrals

Measuring spike train synchrony