Source author record

Pedram Hassanzadeh

Pedram Hassanzadeh appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.ao-ph physics.flu-dyn Machine Learning nlin.CD astro-ph.SR physics.comp-ph physics.geo-ph astro-ph.EP astro-ph.GA Computation Computer Vision math-ph math.DS math.MP math.NA math.OC Numerical Analysis physics.data-an

Catalog footprint

What is connected

17works

18topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

An Analytical and AI-discovered Stable, Accurate, and Generalizable Subgrid-scale Closure for Geophysical Turbulence

By combining AI and fluid physics, we discover a closed-form closure for 2D turbulence from small direct numerical simulation (DNS) data. Large-eddy simulation (LES) with this closure is accurate and stable, reproducing DNS statistics including those of extremes. We also show that the new closure could be derived from a 4th-order truncated Taylor expansion. Prior analytical and AI-based work only found the 2nd-order expansion, which led to unstable LES. The additional terms emerge only when inter-scale energy transfer is considered alongside standard reconstruction criterion in the sparse-equation discovery.

preprint2023arXiv

Using Neural Networks to Learn the Jet Stream Forced Response from Natural Variability

Two distinct features of anthropogenic climate change, warming in the tropical upper troposphere and warming at the Arctic surface, have competing effects on the mid-latitude jet stream's latitudinal position, often referred to as a "tug-of-war". Studies that investigate the jet's response to these thermal forcings show that it is sensitive to model type, season, initial atmospheric conditions, and the shape and magnitude of the forcing. Much of this past work focuses on studying a simulation's response to external manipulation. In contrast, we explore the potential to train a convolutional neural network (CNN) on internal variability alone and then use it to examine possible nonlinear responses of the jet to tropospheric thermal forcing that more closely resemble anthropogenic climate change. Our approach leverages the idea behind the fluctuation-dissipation theorem, which relates the internal variability of a system to its forced response but so far has been only used to quantify linear responses. We train a CNN on data from a long control run of the CESM dry dynamical core and show that it is able to skillfully predict the nonlinear response of the jet to sustained external forcing. The trained CNN provides a quick method for exploring the jet stream sensitivity to a wide range of tropospheric temperature tendencies and, considering that this method can likely be applied to any model with a long control run, could lend itself useful for early stage experiment design.

preprint2022arXiv

Discovery of interpretable structural model errors by combining Bayesian sparse regression and data assimilation: A chaotic Kuramoto-Sivashinsky test case

Models of many engineering and natural systems are imperfect. The discrepancy between the mathematical representations of a true physical system and its imperfect model is called the model error. These model errors can lead to substantial differences between the numerical solutions of the model and the state of the system, particularly in those involving nonlinear, multi-scale phenomena. Thus, there is increasing interest in reducing model errors, particularly by leveraging the rapidly growing observational data to understand their physics and sources. Here, we introduce a framework named MEDIDA: Model Error Discovery with Interpretability and Data Assimilation. MEDIDA only requires a working numerical solver of the model and a small number of noise-free or noisy sporadic observations of the system. In MEDIDA, first the model error is estimated from differences between the observed states and model-predicted states (the latter are obtained from a number of one-time-step numerical integrations from the previous observed states). If observations are noisy, a data assimilation (DA) technique such as ensemble Kalman filter (EnKF) is employed to provide the analysis state of the system, which is then used to estimate the model error. Finally, an equation-discovery technique, here the relevance vector machine (RVM), a sparsity-promoting Bayesian method, is used to identify an interpretable, parsimonious, and closed-form representation of the model error. Using the chaotic Kuramoto-Sivashinsky (KS) system as the test case, we demonstrate the excellent performance of MEDIDA in discovering different types of structural/parametric model errors, representing different types of missing physics, using noise-free and noisy observations.

preprint2022arXiv

FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators

FourCastNet, short for Fourier Forecasting Neural Network, is a global data-driven weather forecasting model that provides accurate short to medium-range global predictions at $0.25^{\circ}$ resolution. FourCastNet accurately forecasts high-resolution, fast-timescale variables such as the surface wind speed, precipitation, and atmospheric water vapor. It has important implications for planning wind energy resources, predicting extreme weather events such as tropical cyclones, extra-tropical cyclones, and atmospheric rivers. FourCastNet matches the forecasting accuracy of the ECMWF Integrated Forecasting System (IFS), a state-of-the-art Numerical Weather Prediction (NWP) model, at short lead times for large-scale variables, while outperforming IFS for variables with complex fine-scale structure, including precipitation. FourCastNet generates a week-long forecast in less than 2 seconds, orders of magnitude faster than IFS. The speed of FourCastNet enables the creation of rapid and inexpensive large-ensemble forecasts with thousands of ensemble-members for improving probabilistic forecasting. We discuss how data-driven deep learning models such as FourCastNet are a valuable addition to the meteorology toolkit to aid and augment NWP models.

preprint2022arXiv

Long-term stability and generalization of observationally-constrained stochastic data-driven models for geophysical turbulence

Recent years have seen a surge in interest in building deep learning-based fully data-driven models for weather prediction. Such deep learning models if trained on observations can mitigate certain biases in current state-of-the-art weather models, some of which stem from inaccurate representation of subgrid-scale processes. However, these data-driven models, being over-parameterized, require a lot of training data which may not be available from reanalysis (observational data) products. Moreover, an accurate, noise-free, initial condition to start forecasting with a data-driven weather model is not available in realistic scenarios. Finally, deterministic data-driven forecasting models suffer from issues with long-term stability and unphysical climate drift, which makes these data-driven models unsuitable for computing climate statistics. Given these challenges, previous studies have tried to pre-train deep learning-based weather forecasting models on a large amount of imperfect long-term climate model simulations and then re-train them on available observational data. In this paper, we propose a convolutional variational autoencoder-based stochastic data-driven model that is pre-trained on an imperfect climate model simulation from a 2-layer quasi-geostrophic flow and re-trained, using transfer learning, on a small number of noisy observations from a perfect simulation. This re-trained model then performs stochastic forecasting with a noisy initial condition sampled from the perfect simulation. We show that our ensemble-based stochastic data-driven model outperforms a baseline deterministic encoder-decoder-based convolutional model in terms of short-term skills while remaining stable for long-term climate simulations yielding accurate climatology.

preprint2021arXiv

Stable a posteriori LES of 2D turbulence using convolutional neural networks: Backscattering analysis and generalization to higher Re via transfer learning

There is a growing interest in developing data-driven subgrid-scale (SGS) models for large-eddy simulation (LES) using machine learning (ML). In a priori (offline) tests, some recent studies have found ML-based data-driven SGS models that are trained on high-fidelity data (e.g., from direct numerical simulation, DNS) to outperform baseline physics-based models and accurately capture the inter-scale transfers, both forward (diffusion) and backscatter. While promising, instabilities in a posteriori (online) tests and inabilities to generalize to a different flow (e.g., with a higher Reynolds number, Re) remain as major obstacles in broadening the applications of such data-driven SGS models. For example, many of the same aforementioned studies have found instabilities that required often ad-hoc remedies to stabilize the LES at the expense of reducing accuracy. Here, using 2D decaying turbulence as the testbed, we show that deep fully convolutional neural networks (CNNs) can accurately predict the SGS forcing terms and the inter-scale transfers in a priori tests, and if trained with enough samples, lead to stable and accurate a posteriori LES-CNN. Further analysis attributes these instabilities to the disproportionally lower accuracy of the CNNs in capturing backscattering when the training set is small. We also show that transfer learning, which involves re-training the CNN with a small amount of data (e.g., 1%) from the new flow, enables accurate and stable a posteriori LES-CNN for flows with 16x higher Re (as well as higher grid resolution if needed). These results show the promise of CNNs with transfer learning to provide stable, accurate, and generalizable LES for practical use.

preprint2020arXiv

An Eddy-Zonal Flow Feedback Model for Propagating Annular Modes

The variability of the zonal-mean large-scale extratropical circulation is often studied using individual modes obtained from empirical orthogonal function (EOF) analyses. The prevailing reduced-order model of the leading EOF (EOF1) of zonal-mean zonal wind, called the annular mode, consists of an eddy-mean flow interaction mechanism that results in a positive feedback of EOF1 onto itself. However, a few studies have pointed out that under some circumstances in observations and GCMs, strong couplings exist between EOF1 and EOF2 at some lag times, resulting in decaying-oscillatory, or propagating, annular modes. Here, we introduce a reduced-order model for coupled EOF1 and EOF2 that accounts for potential cross-EOF eddy-zonal flow feedbacks. Using the analytical solution of this model, we derive conditions for the existence of the propagating regime based on the feedback strengths. Using this model, and idealized GCMs and stochastic prototypes, we show that cross-EOF feedbacks play an important role in controlling the persistence of the annular modes by setting the frequency of the oscillation. We find that stronger cross-EOF feedbacks lead to less persistent annular modes. Applying the coupled-EOF model to the Southern Hemisphere reanalysis data shows the existence of strong cross-EOF feedbacks. The results highlight the importance of considering the coupling of EOFs and cross-EOF feedbacks to fully understand the natural and forced variability of the zonal-mean large-scale circulation.

preprint2020arXiv

Analog forecasting of extreme-causing weather patterns using deep learning

Numerical weather prediction (NWP) models require ever-growing computing time/resources, but still, have difficulties with predicting weather extremes. Here we introduce a data-driven framework that is based on analog forecasting (prediction using past similar patterns) and employs a novel deep learning pattern-recognition technique (capsule neural networks, CapsNets) and impact-based auto-labeling strategy. CapsNets are trained on mid-tropospheric large-scale circulation patterns (Z500) labeled $0-4$ depending on the existence and geographical region of surface temperature extremes over North America several days ahead. The trained networks predict the occurrence/region of cold or heat waves, only using Z500, with accuracies (recalls) of $69\%-45\%$ $(77\%-48\%)$ or $62\%-41\%$ $(73\%-47\%)$ $1-5$ days ahead. CapsNets outperform simpler techniques such as convolutional neural networks and logistic regression. Using both temperature and Z500, accuracies (recalls) with CapsNets increase to $\sim 80\%$ $(88\%)$, showing the promises of multi-modal data-driven frameworks for accurate/fast extreme weather predictions, which can augment NWP efforts in providing early warnings.

preprint2020arXiv

Data-driven subgrid-scale modeling of forced Burgers turbulence using deep learning with generalization to higher Reynolds numbers via transfer learning

Developing data-driven subgrid-scale (SGS) models for large eddy simulations (LES) has received substantial attention recently. Despite some success, particularly in a priori (offline) tests, challenges have been identified that include numerical instabilities in a posteriori (online) tests and generalization (i.e., extrapolation) of trained data-driven SGS models, for example to higher Reynolds numbers. Here, using the stochastically forced Burgers turbulence as the test-bed, we show that deep neural networks trained using properly pre-conditioned (augmented) data yield stable and accurate a posteriori LES models. Furthermore, we show that transfer learning enables accurate/stable generalization to a flow with 10x higher Reynolds number.

preprint2019arXiv

Data-driven prediction of a multi-scale Lorenz 96 chaotic system using deep learning methods: Reservoir computing, ANN, and RNN-LSTM

In this paper, the performance of three deep learning methods for predicting short-term evolution and for reproducing the long-term statistics of a multi-scale spatio-temporal Lorenz 96 system is examined. The methods are: echo state network (a type of reservoir computing, RC-ESN), deep feed-forward artificial neural network (ANN), and recurrent neural network with long short-term memory (RNN-LSTM). This Lorenz 96 system has three tiers of nonlinearly interacting variables representing slow/large-scale ($X$), intermediate ($Y$), and fast/small-scale ($Z$) processes. For training or testing, only $X$ is available; $Y$ and $Z$ are never known or used. We show that RC-ESN substantially outperforms ANN and RNN-LSTM for short-term prediction, e.g., accurately forecasting the chaotic trajectories for hundreds of numerical solver's time steps, equivalent to several Lyapunov timescales. The RNN-LSTM and ANN show some prediction skills as well; RNN-LSTM bests ANN. Furthermore, even after losing the trajectory, data predicted by RC-ESN and RNN-LSTM have probability density functions (PDFs) that closely match the true PDF, even at the tails. The PDF of the data predicted using ANN, however, deviates from the true PDF. Implications, caveats, and applications to data-driven and data-assisted surrogate modeling of complex nonlinear dynamical systems such as weather/climate are discussed.

preprint2018arXiv

A test case for application of convolutional neural networks to spatio-temporal climate data: Re-identifying clustered weather patterns

Convolutional neural networks (CNNs) can potentially provide powerful tools for classifying and identifying patterns in climate and environmental data. However, because of the inherent complexities of such data, which are often spatio-temporal, chaotic, and non-stationary, the CNN algorithms must be designed/evaluated for each specific dataset and application. Yet to start, CNN, a supervised technique, requires a large labeled dataset. Labeling demands (human) expert time, which combined with the limited number of relevant examples in this area, can discourage using CNNs for new problems. To address these challenges, here we (1) Propose an effective auto-labeling strategy based on using an unsupervised clustering algorithm and evaluating the performance of CNNs in re-identifying these clusters; (2) Use this approach to label thousands of daily large-scale weather patterns over North America in the outputs of a fully-coupled climate model and show the capabilities of CNNs in re-identifying the 4 clustered regimes. The deep CNN trained with $1000$ samples or more per cluster has an accuracy of $90\%$ or better. Accuracy scales monotonically but nonlinearly with the size of the training set, e.g. reaching $94\%$ with $3000$ training samples per cluster. Effects of architecture and hyperparameters on the performance of CNNs are examined and discussed.

preprint2016arXiv

The linear response function of an idealized atmosphere. Part 1: Construction using Green's functions and applications

A linear response function (LRF) determines the mean-response of a nonlinear climate system to weak imposed forcings, and an eddy flux matrix (EFM) determines the eddy momentum and heat flux responses to mean-flow changes. Neither LRF nor EFM can be calculated from first principles due the lack of a complete theory for turbulent eddies. Here the LRF and EFM for an idealized dry atmosphere are computed by applying numerous localized weak forcings, one at a time, to a GCM with Held-Suarez physics and calculating the mean-responses. The LRF and EFM for zonally-averaged responses are then constructed using these forcings and responses through matrix inversion. Tests demonstrate that LRF and EFM are fairly accurate. Spectral analysis of the LRF shows that the most excitable dynamical mode, the neutral vector, strongly resembles the model's Annular Mode. The framework described here can be employed to compute the LRF/EFM for zonally-asymmetric responses and more complex GCMs. The potential applications of the LRF/EFM constructed here are i) forcing a specified mean-flow for hypothesis-testing, ii) isolating/quantifying the eddy-feedbacks in complex eddy-mean flow interaction problems, and iii) evaluating/improving more generally-applicable methods currently used to construct LRFs or diagnose eddy-feedbacks in comprehensive GCMs or observations. As an example for iii, in Part 2, the LRF is also computed using the fluctuation-dissipation theorem (FDT), and the previously-calculated LRF is exploited to investigate why FDT performs poorly in some cases. It is shown that dimension-reduction using leading EOFs, which is commonly used to construct LRFs from the FDT, can significantly degrade the accuracy due to the non-normality of the operator.

preprint2016arXiv

The linear response function of an idealized atmosphere. Part 2: Implications for the practical use of the Fluctuation-Dissipation Theorem and the role of operator's non-normality

A linear response function (LRF) relates the mean-response of a nonlinear system to weak external forcings and vice versa. Even for simple models of the general circulation, such as the dry dynamical core, the LRF cannot be calculated from first principles due to the lack of a complete theory for eddy-mean flow feedbacks. According to the Fluctuation-Dissipation Theorem (FDT), the LRF can be calculated using only the covariance and lag-covariance matrices of the unforced system. However, efforts in calculating the LRFs for GCMs using FDT have produced mixed results, and the reason(s) behind the poor performance of the FDT remains unclear. In Part 1 of this study, the LRF of an idealized GCM, the dry dynamical core with Held-Suarez physics, is accurately calculated using Green's functions. In this paper (Part 2), the LRF of the same model is computed using FDT, which is found to perform poorly for some of the test cases. The accurate LRF of Part 1 is used with a linear stochastic equation to show that dimension-reduction by projecting the data onto leading EOFs, which is commonly used for FDT, can alone be a significant source of error. Simplified equations and examples of 2 x 2 matrices are then used to demonstrate that this error arises because of the non-normality of the operator. These results suggest that errors caused by dimension-reduction are a major, if not the main, contributor to the poor performance of the LRF calculated using FDT, and that further investigations of dimension-reduction strategies with a focus on non-normality are needed.

preprint2015arXiv

Zombie Vortex Instability I: A Purely Hydrodynamic Instability to Resurrect the Dead Zones of Protoplanetary Disks

There is considerable interest in hydrodynamic instabilities in dead zones of protoplanetary disks as a mechanism for driving angular momentum transport and as a source of particle-trapping vortices to mix chondrules and incubate planetesimal formation. We present simulations with a pseudo-spectral anelastic code and with the compressible code Athena, showing that stably stratified flows in a shearing, rotating box are violently unstable and produce space-filling, sustained turbulence dominated by large vortices with Rossby numbers of order 0.2-0.3. This Zombie Vortex Instability (ZVI) is observed in both codes and is triggered by Kolmogorov turbulence with Mach numbers less than 0.01. It is a common view that if a given constant density flow is stable, then stable vertical stratification should make the flow even more stable. Yet, we show that sufficient vertical stratification can be unstable to ZVI. ZVI is robust and requires no special tuning of boundary conditions, or initial radial entropy or vortensity gradients (though we have studied ZVI only in the limit of infinite cooling time). The resolution of this paradox is that stable stratification allows for a new avenue to instability: baroclinic critical layers. ZVI has not been seen in previous studies of flows in rotating, shearing boxes because those calculations frequently lacked vertical density stratification and/or sufficient numerical resolution. Although we do not expect appreciable angular momentum transport from ZVI in the small domains in this study, we hypothesize that ZVI in larger domains with compressible equations may lead to angular transport via spiral density waves.

preprint2014arXiv

Wall to Wall Optimal Transport

The calculus of variations is employed to find steady divergence-free velocity fields that maximize transport of a tracer between two parallel walls held at fixed concentration for one of two constraints on flow strength: a fixed value of the kinetic energy or a fixed value of the enstrophy. The optimizing flows consist of an array of (convection) cells of a particular aspect ratio Gamma. We solve the nonlinear Euler-Lagrange equations analytically for weak flows and numerically (and via matched asymptotic analysis in the fixed energy case) for strong flows. We report the results in terms of the Nusselt number Nu, a dimensionless measure of the tracer transport, as a function of the Peclet number Pe, a dimensionless measure of the energy or enstrophy of the flow. For both constraints the maximum transport Nu_{MAX}(Pe) is realized in cells of decreasing aspect ratio Gamma_{opt}(Pe) as Pe increases. For the fixed energy problem, Nu_{MAX} \sim Pe and Gamma_{opt} \sim Pe^{-1/2}, while for the fixed enstrophy scenario, Nu_{MAX} \sim Pe^{10/17} and Gamma_{opt} \sim Pe^{-0.36}. We also interpret our results in the context of certain buoyancy-driven Rayleigh-Benard convection problems that satisfy one of the two intensity constraints, enabling us to investigate how the transport scalings compare with upper bounds on Nu expressed as a function of the Rayleigh number \Ra. For steady convection in porous media, corresponding to the fixed energy problem, we find Nu_{MAX} \sim \Ra and Gamma_{opt} \sim Ra^{-1/2}$, while for steady convection in a pure fluid layer between free-slip isothermal walls, corresponding to fixed enstrophy transport, Nu_{MAX} \sim Ra^{5/12} and Gamma_{opt} \sim Ra^{-1/4}.

preprint2013arXiv

Self-Replicating Three-Dimensional Vortices in Neutrally-Stable Stratified Rotating Shear Flows

A previously unknown instability creates space-filling lattices of 3D vortices in linearly-stable, rotating, stratified shear flows. The instability starts from an easily-excited critical layer. The layer intensifies by drawing energy from the background shear and rolls-up into vortices that excite new critical layers and vortices. The vortices self-similarly replicate to create lattices of turbulent vortices. The vortices persist for all time. This self-replication occurs in stratified Couette flows and in the dead zones of protoplanetary disks where it can de-stabilize Keplerian flows.

preprint2012arXiv

The Universal Aspect Ratio of Vortices in Rotating Stratified Flows: Theory and Simulation

We derive a relationship for the vortex aspect ratio $α$ (vertical half-thickness over horizontal length scale) for steady and slowly evolving vortices in rotating stratified fluids, as a function of the Brunt-Vaisala frequencies within the vortex $N_c$ and in the background fluid outside the vortex $\bar{N}$, the Coriolis parameter $f$, and the Rossby number $Ro$ of the vortex: $α^2 = Ro(1+Ro) f^2/(N_c^2-\bar{N}^2)$. This relation is valid for cyclones and anticyclones in either the cyclostrophic or geostrophic regimes; it works with vortices in Boussinesq fluids or ideal gases, and the background density gradient need not be uniform. Our relation for $α$ has many consequences for equilibrium vortices in rotating stratified flows. For example, cyclones must have $N_c^2 > \bar{N}^2$; weak anticyclones (with $|Ro| < 1$) must have $N_c^2 < \bar{N}^2; and strong anticyclones must have $N_c^2 > \bar{N}^2$. We verify our relation for $α$ with numerical simulations of the three-dimensional Boussinesq equations for a wide variety of vortices, including: vortices that are initially in (dissipationless) equilibrium and then evolve due to an imposed weak viscous dissipation or density radiation; anticyclones created by the geostrophic adjustment of a patch of locally mixed density; cyclones created by fluid suction from a small localised region; vortices created from the remnants of the violent breakups of columnar vortices; and weakly non-axisymmetric vortices. The values of the aspect ratios of our numerically-computed vortices validate our relationship for $α$, and generally they differ significantly from the values obtained from the much-cited conjecture that $α= f/\bar{N}$ in quasi-geostrophic vortices.

Pedram Hassanzadeh

What is connected

Connect this record

See the researcher in context

Building this map preview

17 published item(s)

An Analytical and AI-discovered Stable, Accurate, and Generalizable Subgrid-scale Closure for Geophysical Turbulence

Using Neural Networks to Learn the Jet Stream Forced Response from Natural Variability

Discovery of interpretable structural model errors by combining Bayesian sparse regression and data assimilation: A chaotic Kuramoto-Sivashinsky test case

FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators

Long-term stability and generalization of observationally-constrained stochastic data-driven models for geophysical turbulence

Stable a posteriori LES of 2D turbulence using convolutional neural networks: Backscattering analysis and generalization to higher Re via transfer learning

An Eddy-Zonal Flow Feedback Model for Propagating Annular Modes

Analog forecasting of extreme-causing weather patterns using deep learning

Data-driven subgrid-scale modeling of forced Burgers turbulence using deep learning with generalization to higher Reynolds numbers via transfer learning

Data-driven prediction of a multi-scale Lorenz 96 chaotic system using deep learning methods: Reservoir computing, ANN, and RNN-LSTM

A test case for application of convolutional neural networks to spatio-temporal climate data: Re-identifying clustered weather patterns

The linear response function of an idealized atmosphere. Part 1: Construction using Green's functions and applications

The linear response function of an idealized atmosphere. Part 2: Implications for the practical use of the Fluctuation-Dissipation Theorem and the role of operator's non-normality

Zombie Vortex Instability I: A Purely Hydrodynamic Instability to Resurrect the Dead Zones of Protoplanetary Disks

Wall to Wall Optimal Transport

Self-Replicating Three-Dimensional Vortices in Neutrally-Stable Stratified Rotating Shear Flows

The Universal Aspect Ratio of Vortices in Rotating Stratified Flows: Theory and Simulation