Researcher profile

Karthik Duraisamy

Karthik Duraisamy contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2022arXiv

A Non-intrusive Approach for Physics-constrained Learning with Application to Fuel Cell Modeling

A data-driven model augmentation framework, referred to as Weakly-coupled Integrated Inference and Machine Learning (IIML), is presented to improve the predictive accuracy of physical models. In contrast to parameter calibration, this work seeks corrections to the structure of the model by a) inferring augmentation fields that are consistent with the underlying model, and b) transforming these fields into corrective model forms. The proposed approach couples the inference and learning steps in a weak sense via an alternating optimization approach. This coupling ensures that the augmentation fields remain learnable and maintain consistent functional relationships with local modeled quantities across the training dataset. An iterative solution procedure is presented in this paper, removing the need to embed the augmentation function during the inference process. This framework is used to infer an augmentation introduced within a Polymer electrolyte membrane fuel cell (PEMFC) model using a small amount of training data (from only 14 training cases.) These training cases belong to a dataset consisting of high-fidelity simulation data obtained from a high-fidelity model of a first generation Toyota Mirai. All cases in this dataset are characterized by different inflow and outflow conditions on the same geometry. When tested on 1224 different configurations, the inferred augmentation significantly improves the predictive accuracy for a wide range of physical conditions. Predictions and available data for the current density distribution are also compared to demonstrate the predictive capability of the model for quantities of interest which were not involved in the inference process. The results demonstrate that the weakly-coupled IIML framework offers sophisticated and robust model augmentation capabilities without requiring extensive changes to the numerical solver.

preprint2022arXiv

Discretization-independent surrogate modeling over complex geometries using hypernetworks and implicit representations

Numerical solutions of partial differential equations (PDEs) require expensive simulations, limiting their application in design optimization, model-based control, and large-scale inverse problems. Surrogate modeling techniques seek to decrease the computational expense while retaining dominant solution features and behavior. Traditional Convolutional Neural Network-based frameworks for surrogate modeling require lossy pixelization and data-preprocessing, and generally are not effective in realistic engineering applications. We propose alternative deep-learning based surrogate models for discretization-independent, continuous representations of PDE solutions, which can be used for learning and prediction over domains with complex, variable geometry and mesh topology. Three methods are proposed and compared; design-variable-coded multi-layer perceptron (DV-MLP), design-variable hypernetworks (DV-Hnet), and non-linear independent dual system (NIDS). Each method utilizes a main network which consumes pointwise spatial information to provide a continuous representation, allowing predictions at any location in the domain. Input features include a minimum-distance function evaluation to implicitly encode the problem geometry. The geometric design variables, which define and distinguish problem instances, are used differently by each method, appearing as additional main-network input features (DV-MLP), or as hypernetwork inputs (DV-Hnet and NIDS). The methods are applied to predict solutions around complex, parametrically-defined geometries on non-parametrically-defined meshes with model predictions obtained many orders of magnitude faster than the full order models. Test cases include a vehicle-aerodynamics problem with complex geometry and limited training data, with a design-variable hypernetwork performing best, with a competitive time-to-best-model despite a much greater parameter count.

preprint2022arXiv

Entropy-Stable Schemes in the Low-Mach-Number Regime: Flux-Preconditioning, Entropy Breakdowns, and Entropy Transfers

Entropy-Stable (ES) schemes, specifically those built from [Tadmor \textit{Math. Comput.} 49 (1987) 91], have been gaining interest over the past decade, especially in the context of under-resolved simulations of compressible turbulent flows using high-order methods. These schemes are attractive because they can provide stability in a global and nonlinear sense (consistency with thermodynamics). However, fully realizing the potential of ES schemes requires a better grasp of their local behavior. Entropy-stability itself does not imply good local behavior [Gouasmi \textit{et al.} \textit{J. Sci. Comp.} 78 (2019) 971, Gouasmi \textit{et al.} \textit{Comput. Methd. Appl. M.} 363 (2020) 112912]. In this spirit, we studied ES schemes in problems where \textit{global stability is not the core issue}. In the present work, we consider the accuracy degradation issues typically encountered by upwind-type schemes in the low-Mach-number regime [Turkel \textit{Annu. Rev. Fluid Mech.} 31 (1999) 285] and their treatment using \textit{Flux-Preconditioning} [Turkel \textit{J. Comput. Phys.} 72 (1987) 277, Miczek \textit{et al.} \textit{A \& A} 576 (2015) A50]. ES schemes suffer from the same issues and Flux-Preconditioning can improve their behavior without interfering with entropy-stability. This is first demonstrated analytically: using similarity and congruence transforms we were able to establish conditions for a preconditioned flux to be ES, and introduce the ES variants of the Miczek's and Turkel's preconditioned fluxes. This is then demonstrated numerically through first-order simulations of two simple test problems representative of the incompressible and acoustic limits, the Gresho Vortex and a right-moving acoustic wave. The results are overall consistent with previous studies [...]

preprint2021arXiv

Non-intrusive Balancing Transformation of Highly Stiff Systems with Lightly-damped Impulse Response

Balanced truncation (BT) is a model reduction method that utilizes a coordinate transformation to retain eigen-directions that are highly observable and reachable. To address realizability and scalability of BT applied to highly stiff and lightly-damped systems, a non-intrusive data-driven method is developed for balancing discrete-time systems via the eigensystem realization algorithm (ERA). The advantage of ERA for balancing transformation makes full-state outputs tractable. Further, ERA enables balancing despite stiffness, by eliminating computation of balancing modes and adjoint simulations. As a demonstrative example, we create balanced ROMs for a one-dimensional reactive flow with pressure forcing, where the stiffness introduced by the chemical source term is extreme (condition number $10^{13}$), preventing analytical implementation of BT. We investigate the performance of ROMs in prediction of dynamics with unseen forcing inputs and demonstrate stability and accuracy of balanced ROMs in truly predictive scenarios whereas without ERA, POD-Galerkin and Least-squares Petrov-Galerkin projections fail to represent the true dynamics. We show that after the initial transients under unit impulse forcing, the system undergoes lightly-damped oscillations, which magnifies the influence of sampling properties on predictive performance of the balanced ROMs. We propose an output domain decomposition approach and couple it with tangential interpolation to resolve sharp gradients at reduced computational costs.

preprint2021arXiv

Sub-grid scale characterization and asymptotic behavior of multi-dimensional upwind schemes for the vorticity transport equations

We study the sub-grid scale characteristics of a vorticity-transport-based approach for large-eddy simulations. In particular, we consider a multi-dimensional upwind scheme for the vorticity transport equations and establish its properties in the under-resolved regime. The asymptotic behavior of key turbulence statistics of velocity gradients, vorticity, and invariants is studied in detail. Modified equation analysis indicates that dissipation can be controlled locally via non-linear limiting of the gradient employed for the vorticity reconstruction on the cell face such that low numerical diffusion is obtained in well-resolved regimes and high numerical diffusion is realized in under-resolved regions. The enstrophy budget highlights the remarkable ability of the truncation terms to mimic the true sub-grid scale dissipation and diffusion. The modified equation also reveals diffusive terms that are similar to several commonly employed sub-grid scale models including tensor-gradient and hyper-viscosity models. Investigations on several canonical turbulence flow cases show that large-scale features are adequately represented and remain consistent in terms of spectral energy over a range of grid resolutions. Numerical dissipation in under-resolved simulations is consistent and can be characterized by diffusion terms discovered in the modified equation analysis. A minimum state of scale separation necessary to obtain asymptotic behavior is characterized using metrics such as effective Reynolds number and effective grid spacing. Temporally-evolving jet simulations, characterized by large-scale vortical structures, demonstrate that high Reynolds number vortex-dominated flows are captured when criteria is met and necessitate diffusive non-linear limiting of vorticity reconstruction be employed to realize accuracy in under-resolved simulations.

preprint2020arXiv

Formulation of Entropy-Stable schemes for the multicomponent compressible Euler equations

In this work, Entropy-Stable (ES) schemes are formulated for the multicomponent compressible Euler equations. Entropy-conservative (EC) and ES fluxes are derived. Particular attention is paid to the limit case of zero partial densities where the structure required by ES schemes breaks down (the entropy variables are no longer defined). It is shown that while an EC flux is well-defined in this limit, a well-defined upwind ES flux requires appropriately averaged partial densities in the dissipation matrix. A similar result holds for the high-order TecNO reconstruction. However, this does not prevent the numerical solution from developing negative partial densities or internal energy. Numerical experiments were performed on one-dimensional and two-dimensional interface and shock-interface problems. The present scheme exactly preserves stationary interfaces. On moving interfaces, it produces pressure anomalies typically observed with conservative schemes [Karni, \textit{J. Comput. Phys.}, 112 (1994) 1]. We find that these anomalies, which are not present in the single-component case, violate neither entropy stability nor a minimum principle of the specific entropy. Finally, we show that the scheme is able to reproduce the physical mechanisms of the two-dimensional shock-bubble interaction problem [Haas \& Sturtevant, J. Fluid Mech. 181 (1987) 41, Quirk \& Karni, J. Fluid Mech. 318 (1996) 129].

preprint2020arXiv

Multi-level Convolutional Autoencoder Networks for Parametric Prediction of Spatio-temporal Dynamics

A data-driven framework is proposed towards the end of predictive modeling of complex spatio-temporal dynamics, leveraging nested non-linear manifolds. Three levels of neural networks are used, with the goal of predicting the future state of a system of interest in a parametric setting. A convolutional autoencoder is used as the top level to encode the high dimensional input data along spatial dimensions into a sequence of latent variables. A temporal convolutional autoencoder (TCAE) serves as the second level, which further encodes the output sequence from the first level along the temporal dimension, and outputs a set of latent variables that encapsulate the spatio-temporal evolution of the dynamics. The use of dilated temporal convolutions grows the receptive field exponentially with network depth, allowing for efficient processing of long temporal sequences typical of scientific computations. A fully-connected network is used as the third level to learn the mapping between these latent variables and the global parameters from training data, and predict them for new parameters. For future state predictions, the second level uses a temporal convolutional network to predict subsequent steps of the output sequence from the top level. Latent variables at the bottom-most level are decoded to obtain the dynamics in physical space at new global parameters and/or at a future time. Predictive capabilities are evaluated on a range of problems involving discontinuities, wave propagation, strong transients, and coherent structures. The sensitivity of the results to different modeling choices is assessed. The results suggest that given adequate data and careful training, effective data-driven predictive models can be constructed. Perspectives are provided on the present approach and its place in the landscape of model reduction.

preprint2020arXiv

On the Structure of Time-delay Embedding in Linear Models of Non-linear Dynamical Systems

This work addresses fundamental issues related to the structure and conditioning of linear time-delayed models of non-linear dynamics on an attractor. While this approach has been well-studied in the asymptotic sense (e.g. for infinite number of delays), the non-asymptotic setting is not well-understood. First, we show that the minimal time-delays required for perfect signal recovery are solely determined by the sparsity in the Fourier spectrum for scalar systems. For the vector case, we provide a rank test and a geometric interpretation for the necessary and sufficient conditions for the existence of an accurate linear time delayed model. Further, we prove that the output controllability index of a linear system induced by the Fourier spectrum serves as a tight upper bound on the minimal number of time delays required. An explicit expression for the exact linear model in the spectral domain is also provided. From a numerical perspective, the effect of the sampling rate and the number of time delays on numerical conditioning is examined. An upper bound on the condition number is derived, with the implication that conditioning can be improved with additional time delays and/or decreasing sampling rates. Moreover, it is explicitly shown that the underlying dynamics can be accurately recovered using only a partial period of the attractor. Our analysis is first validated in simple periodic and quasi-periodic systems, and sensitivity to noise is also investigated. Finally, issues and practical strategies of choosing time delays in large-scale chaotic systems are discussed and demonstrated on 3D turbulent Rayleigh-Bénard convection.

preprint2020arXiv

Physics-Informed Probabilistic Learning of Linear Embeddings of Non-linear Dynamics With Guaranteed Stability

The Koopman operator has emerged as a powerful tool for the analysis of nonlinear dynamical systems as it provides coordinate transformations to globally linearize the dynamics. While recent deep learning approaches have been useful in extracting the Koopman operator from a data-driven perspective, several challenges remain. In this work, we formalize the problem of learning the continuous-time Koopman operator with deep neural networks in a measure-theoretic framework. Our approach induces two types of models: differential and recurrent form, the choice of which depends on the availability of the governing equations and data. We then enforce a structural parameterization that renders the realization of the Koopman operator provably stable. A new autoencoder architecture is constructed, such that only the residual of the dynamic mode decomposition is learned. Finally, we employ mean-field variational inference (MFVI) on the aforementioned framework in a hierarchical Bayesian setting to quantify uncertainties in the characterization and prediction of the dynamics of observables. The framework is evaluated on a simple polynomial system, the Duffing oscillator, and an unstable cylinder wake flow with noisy measurements.

preprint2020arXiv

Variational Multiscale Closures for Finite Element Discretizations Using the Mori-Zwanzig Approach

Simulation of multiscale problems remains a challenge due to the disparate range of spatial and temporal scales and the complex interaction between the resolved and unresolved scales. This work develops a coarse-grained modeling approach for the Continuous Galerkin discretizations by combining the Variational Multiscale decomposition and the Mori-Zwanzig (M-Z) formalism. An appeal of the M-Z formalism is that - akin to Greens functions for linear problems - the impact of unresolved dynamics on resolved scales can be formally represented as a convolution (or memory) integral in a non-linear setting. To ensure tractable and efficient models, Markovian closures are developed for the M-Z memory integral. The resulting sub-scale model has some similarities to adjoint stabilization and orthogonal subscale models. The model is made parameter free by adaptively determining the memory length during the simulation. To illustrate the generalizablity of this model, it is employed in coarse-grained simulations for the one-dimensional Burgers equation and in incompressible turbulence problems.

preprint2018arXiv

Data-driven Discovery of Closure Models

Derivation of reduced order representations of dynamical systems requires the modeling of the truncated dynamics on the retained dynamics. In its most general form, this so-called closure model has to account for memory effects. In this work, we present a framework of operator inference to extract the governing dynamics of closure from data in a compact, non-Markovian form. We employ sparse polynomial regression and artificial neural networks to extract the underlying operator. For a special class of non-linear systems, observability of the closure in terms of the resolved dynamics is analyzed and theoretical results are presented on the compactness of the memory. The proposed framework is evaluated on examples consisting of linear to nonlinear systems with and without chaotic dynamics, with an emphasis on predictive performance on unseen data.

preprint2018arXiv

Long-time predictive modeling of nonlinear dynamical systems using neural networks

We study the use of feedforward neural networks (FNN) to develop models of nonlinear dynamical systems from data. Emphasis is placed on predictions at long times, with limited data availability. Inspired by global stability analysis, and the observation of the strong correlation between the local error and the maximum singular value of the Jacobian of the ANN, we introduce Jacobian regularization in the loss function. This regularization suppresses the sensitivity of the prediction to the local error and is shown to improve accuracy and robustness. Comparison between the proposed approach and sparse polynomial regression is presented in numerical examples ranging from simple ODE systems to nonlinear PDE systems including vortex shedding behind a cylinder, and instability-driven buoyant mixing flow. Furthermore, limitations of feedforward neural networks are highlighted, especially when the training data does not include a low dimensional attractor. Strategies of data augmentation are presented as remedies to address these issues to a certain extent.