Source author record

Marc Bocquet

Marc Bocquet appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Emerging Technologies, physics.ao-ph, astro-ph.IM, Machine Learning, Methodology, Applications, cond-mat.dis-nn, cond-mat.mes-hall, eess.SP, math.OC, physics.data-an

Catalog footprint

What is connected

12 works

11 topics

4 close collaborators

preprint, 2022, arXiv

A Tutorial on Bayesian Data Assimilation

This tutorial provides a broad introduction to Bayesian data assimilation that will be useful to practitioners, in interpreting algorithms and results, and for theoretical studies developing novel schemes with an understanding of the rich history of geophysical data assimilation and its current directions. The simple case of data assimilation in a 'perfect' model is primarily discussed for pedagogical purposes. Some mathematical results are derived at a high level in order to illustrate key ideas about different estimators. However, the focus of this work is on the intuition behind these methods; more formal and detailed treatments of the data assimilation problem can be found in the various references. In surveying a variety of widely used data assimilation schemes, the key message of this tutorial is how the Bayesian analysis provides a consistent framework for the estimation problem and how this allows one to formulate its solution in a variety of ways to exploit the operational challenges in the geosciences.

preprint, 2020, arXiv

A Review of Innovation-Based Methods to Jointly Estimate Model and Observation Error Covariance Matrices in Ensemble Data Assimilation

Data assimilation combines forecasts from a numerical model with observations. Most of the current data assimilation algorithms consider the model and observation error terms as additive Gaussian noise, specified by their covariance matrices Q and R, respectively. These error covariances, and specifically their respective amplitudes, determine the weights given to the background (i.e., the model forecasts) and to the observations in the solution of data assimilation algorithms (i.e., the analysis). Consequently, Q and R matrices significantly impact the accuracy of the analysis. This review aims to present and to discuss, with a unified framework, different methods to jointly estimate the Q and R matrices using ensemble-based data assimilation techniques. Most of the methodologies developed to date use the innovations, defined as differences between the observations and the projection of the forecasts onto the observation space. These methodologies are based on two main statistical criteria: (i) the method of moments, in which the theoretical and empirical moments of the innovations are assumed to be equal, and (ii) methods that use the likelihood of the observations, themselves contained in the innovations. The reviewed methods assume that innovations are Gaussian random variables, although extension to other distributions is possible for likelihood-based methods. The methods also show some differences in terms of levels of complexity and applicability to high-dimensional systems. The conclusion of the review discusses the key challenges to further develop estimation methods for Q and R. These challenges include taking into account time-varying error covariances, using limited observational coverage, estimating additional deterministic error terms, or accounting for correlated noises.
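As a rough illustration of how the amplitudes of Q and R set the weights in the analysis, the sketch below works through the scalar linear-Gaussian case. It is not taken from the review: the function names, the scalar model coefficient `m`, and the observation operator `h` are assumptions made for this example only.

```python
def forecast_variance(p_a, q, m=1.0):
    """Propagate the analysis error variance through a scalar linear model.

    p_a: previous analysis error variance
    q:   model error variance (scalar stand-in for Q)
    m:   scalar model coefficient (hypothetical, for illustration)
    """
    return m * p_a * m + q


def scalar_analysis(x_b, p_b, y, r, h=1.0):
    """One scalar Kalman analysis step.

    x_b: background (model forecast)
    p_b: background error variance
    y:   observation
    r:   observation error variance (scalar stand-in for R)
    h:   scalar observation operator (hypothetical, for illustration)
    """
    # Innovation: observation minus the forecast projected onto observation space.
    innovation = y - h * x_b
    # Gain: the weight given to the observation relative to the background.
    gain = p_b * h / (h * p_b * h + r)
    x_a = x_b + gain * innovation
    p_a = (1.0 - gain * h) * p_b
    return x_a, p_a, innovation, gain


# A larger q inflates the background variance and pulls the analysis toward
# the observation; a larger r pulls it back toward the background.
p_b = forecast_variance(p_a=0.5, q=0.5)
x_a, p_a, innovation, gain = scalar_analysis(x_b=1.0, p_b=p_b, y=3.0, r=1.0)
```

With equal background and observation error variances the gain is one half, so the analysis sits midway between the background and the observation.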

preprint, 2020, arXiv

Bayesian inference of chaotic dynamics by merging data assimilation, machine learning and expectation-maximization

The reconstruction from observations of high-dimensional chaotic dynamics such as geophysical flows is hampered by (i) the partial and noisy observations that can realistically be obtained, (ii) the need to learn from long time series of data, and (iii) the unstable nature of the dynamics. To achieve such inference from the observations over long time series, it has been suggested to combine data assimilation and machine learning in several ways. We show how to unify these approaches from a Bayesian perspective using expectation-maximization and coordinate descents. In doing so, the model, the state trajectory and model error statistics are estimated all together. Implementations and approximations of these methods are discussed. Finally, we numerically and successfully test the approach on two relevant low-order chaotic models with distinct identifiability.

preprint, 2020, arXiv

Combining data assimilation and machine learning to emulate a dynamical model from sparse and noisy observations: a case study with the Lorenz 96 model

A novel method based on the combination of data assimilation and machine learning is introduced. The new hybrid approach is designed for a two-fold scope: (i) emulating hidden, possibly chaotic, dynamics and (ii) predicting their future states. The method consists of iteratively applying a data assimilation step, here an ensemble Kalman filter, and a neural network. Data assimilation is used to optimally combine a surrogate model with sparse noisy data. The output analysis is spatially complete and is used as a training set by the neural network to update the surrogate model. The two steps are then repeated iteratively. Numerical experiments have been carried out using the chaotic 40-variable Lorenz 96 model, proving both convergence and statistical skill of the proposed hybrid approach. The surrogate model shows short-term forecast skill up to two Lyapunov times, the retrieval of positive Lyapunov exponents as well as the more energetic frequencies of the power density spectrum. The sensitivity of the method to critical setup parameters is also presented: the forecast skill decreases smoothly with increased observational noise but drops abruptly if less than half of the model domain is observed. The successful synergy between data assimilation and machine learning, proven here with a low-dimensional system, encourages further investigation of such hybrids with more sophisticated dynamics.
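The abstract does not restate the test-bed equations. For orientation, the sketch below implements the standard cyclic Lorenz 96 tendency, dx_i/dt = (x_{i+1} - x_{i-2}) x_{i-1} - x_i + F, with a fourth-order Runge-Kutta step. The forcing F = 8 and the time step 0.05 are common choices assumed here, not values quoted from the paper, and the code covers only the reference model, not the ensemble Kalman filter or the neural network surrogate.

```python
def lorenz96_tendency(x, forcing=8.0):
    """Time derivative of the Lorenz 96 state x on a periodic domain."""
    n = len(x)
    # Negative indices wrap around in Python, which gives the periodic
    # boundary for x[i - 1] and x[i - 2]; the modulo handles x[i + 1].
    return [
        (x[(i + 1) % n] - x[i - 2]) * x[i - 1] - x[i] + forcing
        for i in range(n)
    ]


def rk4_step(x, dt=0.05, forcing=8.0):
    """Advance the state by one fourth-order Runge-Kutta step."""

    def shifted(state, slope, scale):
        return [s + scale * k for s, k in zip(state, slope)]

    k1 = lorenz96_tendency(x, forcing)
    k2 = lorenz96_tendency(shifted(x, k1, dt / 2), forcing)
    k3 = lorenz96_tendency(shifted(x, k2, dt / 2), forcing)
    k4 = lorenz96_tendency(shifted(x, k3, dt), forcing)
    return [
        xi + dt / 6 * (a + 2 * b + 2 * c + d)
        for xi, a, b, c, d in zip(x, k1, k2, k3, k4)
    ]


# 40-variable state: the uniform state x_i = F is a fixed point, so a small
# perturbation is added to one variable to start the chaotic evolution.
state = [8.0] * 40
state[19] += 0.01
for _ in range(100):
    state = rk4_step(state)
```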

preprint, 2020, arXiv

Embracing the Unreliability of Memory Devices for Neuromorphic Computing

The emergence of resistive non-volatile memories opens the way to highly energy-efficient computation near- or in-memory. However, this type of computation is not compatible with conventional ECC, and has to deal with device unreliability. Inspired by the architecture of animal brains, we present a manufactured differential hybrid CMOS/RRAM memory architecture suitable for neural network implementation that functions without formal ECC. We also show that using low-energy but error-prone programming conditions only slightly reduces network accuracy.

preprint, 2020, arXiv

In-Memory Resistive RAM Implementation of Binarized Neural Networks for Medical Applications

The advent of deep learning has considerably accelerated machine learning development. The deployment of deep neural networks at the edge is, however, limited by their high memory and energy consumption requirements. With new memory technology available, emerging Binarized Neural Networks (BNNs) promise to reduce the energy impact of the forthcoming machine learning hardware generation, enabling machine learning on edge devices and avoiding data transfer over the network. In this work, after presenting our implementation employing a hybrid CMOS - hafnium oxide resistive memory technology, we suggest strategies to apply BNNs to biomedical signals such as electrocardiography and electroencephalography, maintaining accuracy and reducing memory requirements. We investigate the memory-accuracy trade-off when binarizing the whole network and when binarizing solely the classifier part. We also discuss how these results translate to the edge-oriented MobileNet V1 neural network on the ImageNet task. The final goal of this research is to enable smart autonomous healthcare devices.

preprint, 2020, arXiv

Low Power In-Memory Implementation of Ternary Neural Networks with Resistive RAM-Based Synapse

The design of systems implementing low precision neural networks with emerging memories such as resistive random access memory (RRAM) is a major lead for reducing the energy consumption of artificial intelligence (AI). Multiple works have, for example, proposed in-memory architectures to implement low power binarized neural networks. These simple neural networks, where synaptic weights and neuronal activations assume binary values, can indeed approach state-of-the-art performance on vision tasks. In this work, we revisit one of these architectures where synapses are implemented in a differential fashion to reduce bit errors, and synaptic weights are read using precharge sense amplifiers. Based on experimental measurements on a hybrid 130 nm CMOS/RRAM chip and on circuit simulation, we show that the same memory array architecture can be used to implement ternary weights instead of binary weights, and that this technique is particularly appropriate if the sense amplifier is operated in the near-threshold regime. We also show, based on neural network simulation on the CIFAR-10 image recognition task, that going from binary to ternary neural networks significantly increases neural network performance. These results highlight that the function of AI circuits may sometimes be revisited when they are operated in low power regimes.

preprint, 2020, arXiv

On temporal scale separation in coupled data assimilation with the ensemble Kalman filter

Coupled data assimilation (CDA) distinctively appears as a main concern in numerical weather and climate prediction, with major efforts put forward worldwide. The core issue is the scale separation acting as a barrier that hampers the propagation of information across model components. We provide a brief survey of CDA, and then focus on CDA using the ensemble Kalman filter (EnKF). We first consider coupled equations with temporal scale difference and deduce that (i) cross-component effects are strong from the slow to the fast scale, but (ii) intra-component effects are much stronger in the fast scale. While observing the slow scale is desirable and benefits the fast, the latter must be observed with high frequency, otherwise the error will affect the slow scale. Experiments are performed using the atmosphere-ocean model MAOOAM. Six configurations are considered, differing in the strength of the atmosphere-ocean coupling and/or the number of model modes. A comprehensive dynamical characterisation of the model configurations is provided by examining the Lyapunov spectrum, Kolmogorov entropy and Kaplan-Yorke attractor dimension. We also compute the covariant Lyapunov vectors and use them to explain how model instabilities act on different model modes according to the coupling strength. The experiments confirm the importance of observing the fast scale, but also show that, despite its slow temporal scale, frequent observations in the ocean are beneficial. The relation between the ensemble size and the unstable subspace dimension has been studied. Results largely ratify what is known for uncoupled systems: the condition N > n0 is necessary for the EnKF to converge. But the quasi-degeneracy of the Lyapunov spectrum of MAOOAM, with many near-zero exponents, is potentially the cause of the smooth gradual reduction of the analysis error observed for some model configurations, even when N > n0.
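The dynamical diagnostics named in this abstract follow from a Lyapunov spectrum by standard formulas, sketched below. Several points are assumptions of this example rather than statements from the abstract: n0 is taken to be the number of non-negative exponents, the Kolmogorov entropy is estimated as the sum of the positive exponents, and the three-exponent spectrum is made up for illustration and is not a MAOOAM result.

```python
def kaplan_yorke_dimension(exponents):
    """Kaplan-Yorke dimension: k + (sum of the k largest exponents) / |lambda_{k+1}|,
    where k is the largest count whose partial sum is still non-negative."""
    ordered = sorted(exponents, reverse=True)
    partial = 0.0
    for k, value in enumerate(ordered):
        if partial + value < 0.0:
            return k + partial / abs(value)
        partial += value
    # The full sum is non-negative: the dimension saturates at the state size.
    return float(len(ordered))


def kolmogorov_entropy(exponents):
    """Entropy estimate as the sum of the positive exponents (assumed here)."""
    return sum(value for value in exponents if value > 0.0)


def unstable_neutral_dimension(exponents):
    """n0, assumed here to count the non-negative exponents."""
    return sum(1 for value in exponents if value >= 0.0)


# Hypothetical spectrum, for illustration only.
spectrum = [1.0, 0.0, -2.0]
n0 = unstable_neutral_dimension(spectrum)
ensemble_size = 3
meets_condition = ensemble_size > n0  # the N > n0 condition from the abstract
```

For this made-up spectrum the Kaplan-Yorke dimension is 2.5 and n0 is 2, so an ensemble of three members satisfies N > n0.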

preprint, 2015, arXiv

DADA: Data Assimilation for the Detection and Attribution of Weather- and Climate-related Events

We describe a new approach allowing for systematic causal attribution of weather and climate-related events, in near-real time. The method is purposely designed to facilitate its implementation at meteorological centers by relying on data treatments that are routinely performed when numerically forecasting the weather. Namely, we show that causal attribution can be obtained as a by-product of so-called data assimilation procedures that are run on a daily basis to update the meteorological model with new atmospheric observations; hence, the proposed methodology can take advantage of the powerful computational and observational capacity of weather forecasting centers. We explain the theoretical rationale of this approach and sketch the most prominent features of a "data assimilation-based detection and attribution" (DADA) procedure. The proposal is illustrated in the context of the classical three-variable Lorenz model with additional forcing. Several theoretical and practical research questions that need to be addressed to make the proposal readily operational within weather forecasting centers are finally laid out.

preprint, 2014, arXiv

Local ensemble transform Kalman filter, a fast non-stationary control law for adaptive optics on ELTs: theoretical aspects and first simulation results

We propose a new algorithm for an adaptive optics system control law, based on the Linear Quadratic Gaussian approach and a Kalman Filter adaptation with localizations. It handles non-stationary behaviors, achieves performance close to the optimum defined by the residual phase variance minimization criterion, and reduces the computational burden through an intrinsically parallel implementation on Extremely Large Telescopes (ELTs).

preprint, 2014, arXiv

Local Ensemble Transform Kalman Filter: a non-stationary control law for complex adaptive optics systems on ELTs

We propose a new algorithm for an adaptive optics system control law that reduces the computational burden in the case of an Extremely Large Telescope (ELT) and deals with non-stationary behaviors of the turbulence. This approach, which uses the Ensemble Transform Kalman Filter with localizations by domain decomposition, is called the local ETKF: the pupil of the telescope is split up into various local domains, and calculations for the update estimate of the turbulent phase on each domain are performed independently. This data assimilation scheme enables parallel computation on markedly less data during this update step. It adapts the Kalman Filter to large-scale systems with a non-stationary turbulence model when the explicit storage and manipulation of extremely large covariance matrices are impossible. First simulation results are given in order to assess the theoretical analysis and to demonstrate the potential of this new control law for complex adaptive optics systems on ELTs.

preprint, 2002, arXiv

Network models for localisation problems belonging to the chiral symmetry classes

We consider localisation problems belonging to the chiral symmetry classes, in which sublattice symmetry is responsible for singular behaviour at a band centre. We formulate models which have the relevant symmetries and which are generalisations of the network model introduced previously in the context of the integer quantum Hall plateau transition. We show that the generalisations required can be re-expressed as corresponding to the introduction of absorption and amplification into either the original network model, or the variants of it that represent disordered superconductors. In addition, we demonstrate that by imposing appropriate constraints on disorder, a lattice version of the Dirac equation with a random vector potential can be obtained, as well as new types of critical behaviour. These models represent a convenient starting point for analytic discussions and computational studies, and we investigate in detail a two-dimensional example without time-reversal invariance. It exhibits both localised and critical phases, and band-centre singularities in the critical phase approach more closely in small systems the expected asymptotic form than in other known realisations of the symmetry class.
