Source author record

Victor M. Zavala

Victor M. Zavala appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC eess.SY Systems and Control Machine Learning Applications Computer Vision Distributed, Parallel, and Cluster Computing econ.TH math.DS q-fin.EC

Catalog footprint

What is connected

22works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Graph-Based Modeling and Decomposition of Hierarchical Optimization Problems

We present a graph-theoretic modeling approach for hierarchical optimization that leverages the OptiGraph abstraction implemented in the Julia package Plasmo.jl. We show that the abstraction is flexible and can effectively capture complex hierarchical connectivity that arises from decision-making over multiple spatial and temporal scales (e.g., integration of planning, scheduling, and operations in manufacturing and infrastructures). We also show that the graph abstraction facilitates the conceptualization and implementation of decomposition and approximation schemes. Specifically, we propose a graph-based Benders decomposition (gBD) framework that enables the exploitation of hierarchical (nested) structures and that uses graph aggregation/partitioning procedures to discover such structures. In addition, we provide a Julia implementation of gBD, which we call PlasmoBenders.jl. We illustrate the capabilities using examples arising in the context of energy and power systems.

preprint2022arXiv

Data Analysis using Riemannian Geometry and Applications to Chemical Engineering

We explore the use of tools from Riemannian geometry for the analysis of symmetric positive definite matrices (SPD). An SPD matrix is a versatile data representation that is commonly used in chemical engineering (e.g., covariance/correlation/Hessian matrices and images) and powerful techniques are available for its analysis (e.g., principal component analysis). A key observation that motivates this work is that SPD matrices live on a Riemannian manifold and that implementing techniques that exploit this basic property can yield significant benefits in data-centric tasks such classification and dimensionality reduction. We demonstrate this via a couple of case studies that conduct anomaly detection in the context of process monitoring and image analysis.

preprint2022arXiv

Pricing and Remunerating Electricity Storage Flexibility Using Virtual Links

Ambitious renewable portfolio standards motivate the incorporation of energy storage resources (ESR) as sources of flexibility. While the United States government aims to promote ESR participation in electricity markets, work on market designs for properly remunerating ESRs is still lacking. In this paper, we propose a new energy market clearing framework that incorporates ESR systems. The new market design is computationally attractive in that it avoids mixed-integer formulations and formulations with complementarity constraints. Moreover, compared to previous market designs, our market decomposes the operations of ESRs using the concept of virtual links, which capture the transfer of energy across time. The virtual link representation reveals economic incentives available for ESR operations and sheds light into how electricity markets should remunerate ESRs. We also explore the role of ESR physical parameters on market behavior; we show that, while energy and power capacity defines the amount of flexibility each ESR can provide, storage charge/discharge efficiencies play a fundamental role in ESR remuneration and in mitigating market price volatility. We use our proposed framework to analyze the interplay between ESRs and independent system operators (ISOs) and to provide insights into optimal deployment strategies of ESRs in power grids.

preprint2022arXiv

Random Field Optimization

We present a new modeling paradigm for optimization that we call random field optimization. Random fields are a powerful modeling abstraction that aims to capture the behavior of random variables that live on infinite-dimensional spaces (e.g., space and time) such as stochastic processes (e.g., time series, Gaussian processes, and Markov processes), random matrices, and random spatial fields. This paradigm involves sophisticated mathematical objects (e.g., stochastic differential equations and space-time kernel functions) and has been widely used in neuroscience, geoscience, physics, civil engineering, and computer graphics. Despite of this, however, random fields have seen limited use in optimization; specifically, existing optimization paradigms that involve uncertainty (e.g., stochastic programming and robust optimization) mostly focus on the use of finite random variables. This trend is rapidly changing with the advent of statistical optimization (e.g., Bayesian optimization) and multi-scale optimization (e.g., integration of molecular sciences and process engineering). Our work extends a recently-proposed abstraction for infinite-dimensional optimization problems by capturing more general uncertainty representations. Moreover, we discuss solution paradigms for this new class of problems based on finite transformations and sampling, and identify open questions and challenges.

preprint2022arXiv

SAFE-OCC: A Novelty Detection Framework for Convolutional Neural Network Sensors and its Application in Process Control

We present a novelty detection framework for Convolutional Neural Network (CNN) sensors that we call Sensor-Activated Feature Extraction One-Class Classification (SAFE-OCC). We show that this framework enables the safe use of computer vision sensors in process control architectures. Emergent control applications use CNN models to map visual data to a state signal that can be interpreted by the controller. Incorporating such sensors introduces a significant system operation vulnerability because CNN sensors can exhibit high prediction errors when exposed to novel (abnormal) visual data. Unfortunately, identifying such novelties in real-time is nontrivial. To address this issue, the SAFE-OCC framework leverages the convolutional blocks of the CNN to create an effective feature space to conduct novelty detection using a desired one-class classification technique. This approach engenders a feature space that directly corresponds to that used by the CNN sensor and avoids the need to derive an independent latent space. We demonstrate the effectiveness of SAFE-OCC via simulated control environments.

preprint2021arXiv

Mitigating Investment Risk Using Modular Technologies

We study logistical investment flexibility provided by modular processing technologies for mitigating risk. Specifically, we propose a multi-stage stochastic programming formulation that determines optimal capacity expansion plans that mitigate demand uncertainty. The formulation accounts for multi-product dependencies between small/large units and for trade-offs between expected profit and risk. The formulation uses a cumulative risk measure to avoid timeconsistency issues of traditional, per-stage risk-minimization formulations and we argue that this approach is more compatible with typical investment metrics such as the net present value. Case studies of different complexity are presented to illustrate the developments. Our studies reveal that the Pareto frontier of a flexible setting (allowing for deployment of small units) dominates the Pareto frontier of an inflexible setting (allowing only for deployment of large units). Notably, this dominance is prevalent despite benefits arising from economies of scale of large processing units.

preprint2020arXiv

A Hierarchical Optimization Architecture for Large-Scale Power Networks

We present a hierarchical optimization architecture for large-scale power networks that overcomes limitations of fully centralized and fully decentralized architectures. The architecture leverages principles of multigrid computing schemes, which are widely used in the solution of partial differential equations on massively parallel computers. The top layer of the architecture uses a coarse representation of the entire network while the bottom layer is composed of a family of decentralized optimization agents each operating on a network subdomain at full resolution. We use an alternating direction method of multipliers (ADMM) framework to drive coordination of the decentralized agents. We show that state and dual information obtained from the top layer can be used to accelerate the coordination of the decentralized optimization agents and to recover optimality for the entire system. We demonstrate that the hierarchical architecture can be used to manage large collections of microgrids.

preprint2020arXiv

A Sigmoidal Approximation for Chance-Constrained Nonlinear Programs

We propose a sigmoidal approximation for the value-at-risk (that we call SigVaR) and we use this approximation to tackle nonlinear programs (NLPs) with chance constraints. We prove that the approximation is conservative and that the level of conservatism can be made arbitrarily small for limiting parameter values. The SigVar approximation brings scalability benefits over exact mixed-integer reformulations because its sample average approximation can be cast as a standard NLP. We also establish explicit connections between SigVaR and other smooth sigmoidal approximations recently reported in the literature. We show that a key benefit of SigVaR over such approximations is that one can establish an explicit connection with the conditional value at risk (CVaR) approximation and exploit this connection to obtain initial guesses for the approximation parameters. We present small- and large-scale numerical studies to illustrate the developments.

preprint2020arXiv

Characterizing the Predictive Accuracy of Dynamic Mode Decomposition for Data-Driven Control

Dynamic mode decomposition (DMD) is a versatile approach that enables the construction of low-order models from data. Controller design tasks based on such models require estimates and guarantees on predictive accuracy. In this work, we provide a theoretical analysis of DMD model errors that reveals impact of model order and data availability. The analysis also establishes conditions under which DMD models can be made asymptotically exact. We verify our results using a 2D diffusion system.

preprint2020arXiv

Computing Economic-Optimal and Stable Equilibria for Droop-Controlled Microgrids

We consider the problem of computing equilibria (steady-states) for droop-controlled, islanded, AC microgrids that are both economic-optimal and dynamically stable. This work is motivated by the observation that classical optimal power flow (OPF) formulations used for economic optimization might provide equilibria that are not reachable by low-level controllers (i.e., closed-loop unstable). This arises because OPF problems only enforce steady-state conditions and do not capture the dynamics. We explain this behavior by using a port-Hamiltonian microgrid representation. To overcome the limitations of OPF, the port-Hamiltonian representation can be exploited to derive a bilevel OPF formulation that seeks to optimize economics while enforcing stability. Unfortunately, bilevel optimization with a nonconvex inner problem is difficult to solve in general. As such, we propose an alternative approach (that we call probing OPF), which identifies an economic-optimal and stable equilibrium by probing a neighborhood of equilibria using random perturbations. The probing OPF is advantageous in that it is formulated as a standard nonlinear program, in that it is compatible with existing OPF frameworks, and in that it is applicable to diverse microgrid models. Experiments with the IEEE 118-bus system reveal that few probing points are required to enforce stability.

preprint2020arXiv

Decentralized Schemes with Overlap for Solving Graph-Structured Optimization Problems

We present a new algorithmic paradigm for the decentralized solution of graph-structured optimization problems that arise in the estimation and control of network systems. A key and novel design concept of the proposed approach is that it uses overlapping subdomains to promote and accelerate convergence. We show that the algorithm converges if the size of the overlap is sufficiently large and that the convergence rate improves exponentially with the size of the overlap. The proposed approach provides a bridge between fully decentralized and centralized architectures and is flexible in that it enables the implementation of asynchronous schemes, handling of constraints, and balancing of computing, communication, and data privacy needs. The proposed scheme is tested in an estimation problem for a 9241-node power network and we show that it outperforms the alternating direction method of multipliers.

preprint2020arXiv

Dual Dynamic Programming for Multi-Scale Mixed-Integer MPC

We propose a dual dynamic integer programming (DDIP) framework for solving multi-scale mixed-integer model predictive control (MPC) problems. Such problems arise in applications that involve long horizons and/or fine temporal discretizations as well as mixed-integer states and controls (e.g., scheduling logic and discrete actuators). The approach uses a nested cutting-plane scheme that performs forward and backward sweeps along the time horizon to adaptively approximate cost-to-go functions. The DDIP scheme proposed can handle general MPC formulations with mixed-integer controls and states and can perform forward-backward sweeps over block time partitions. We demonstrate the performance of the proposed scheme by solving mixed-integer MPC problems that arise in the scheduling of central heating, ventilation, and air-conditioning (HVAC) plants. We show that the proposed scheme is scalable and dramatically outperforms state-of-the-art mixed-integer solvers.

preprint2020arXiv

Economic Properties of Multi-Product Supply Chains

We interpret multi-product supply chains (SCs) as coordinated markets; under this interpretation, a SC optimization problem is a market clearing problem that allocates resources and associated economic values (prices) to different stakeholders that bid into the market (suppliers, consumers, transportation, and processing technologies). The market interpretation allows us to establish fundamental properties that explain how physical resources (primal variables) and associated economic values (dual variables) flow in the SC. We use duality theory to explain why incentivizing markets by forcing stakeholder participation (e.g., by imposing demand satisfaction or service provision constraints) yields artificial price behavior, inefficient allocations, and economic losses. To overcome these issues, we explore market incentive mechanisms that use bids; here, we introduce the concept of a stakeholder graph (a product-based representation of a supply chain) and show that this representation allows us to naturally determine minimum bids that activate the market. These results provide guidelines to design SC formulations that properly remunerate stakeholders and to design policy that foster market transactions. The results are illustrated using an urban waste management problem for a city of 100,000 residents.

preprint2020arXiv

Multi-Grid Schemes for Multi-Scale Coordination of Energy Systems

We discuss how multi-grid computing schemes can be used to design hierarchical coordination architectures for energy systems. These hierarchical architectures can be used to manage multiple temporal and spatial scales and mitigate fundamental limitations of centralized and decentralized architectures. We present the basic elements of a multi-grid scheme, which includes a smoothing operator (a high-resolution decentralized coordination layer that targets phenomena at high frequencies) and a coarsening operator (a low-resolution centralized coordination layer that targets phenomena at low frequencies). For smoothing, we extend existing convergence results for Gauss-Seidel schemes by applying them to systems that cover unstructured domains. This allows us to target problems with multiple timescales and arbitrary networks. The proposed coordination schemes can be used to guide transactions in decentralized electricity markets. We present a storage control example and a power flow diffusion example to illustrate the developments.

preprint2020arXiv

Multiscale Model Predictive Control of Battery Systems for Frequency Regulation Markets using Physics-Based Models

We propose a multiscale model predictive control (MPC) framework for stationary battery systems that exploits high-fidelity models to trade-off short-term economic incentives provided by energy and frequency regulation (FR) markets and long-term degradation effects. We find that the MPC framework can drastically reduce long-term degradation while properly responding to FR and energy market signals (compared to MPC formulations that use low-fidelity models). Our results also provide evidence that sophisticated battery models can be embedded within closedloop MPC simulations by using modern nonlinear programming solvers (we provide an efficient and easy-to-use implementation in Julia). We use insights obtained with our simulations to design a low-complexity MPC formulation that matches the behavior obtained with high-fidelity models. This is done by designing a suitable terminal penalty term that implicitly captures longterm degradation. The results suggest that complex degradation behavior can be accounted for in low-complexity MPC formulations by properly designing the cost function. We believe that our proof-of-concept results can be of industrial relevance, as battery vendors are seeking to participate in fast-changing electricity markets while maintaining asset integrity.

preprint2020arXiv

On the Convergence of the Dynamic Inner PCA Algorithm

Dynamic inner principal component analysis (DiPCA) is a powerful method for the analysis of time-dependent multivariate data. DiPCA extracts dynamic latent variables that capture the most dominant temporal trends by solving a large-scale, dense, and nonconvex nonlinear program (NLP). A scalable decomposition algorithm has been recently proposed in the literature to solve these challenging NLPs. The decomposition algorithm performs well in practice but its convergence properties are not well understood. In this work, we show that this algorithm is a specialized variant of a coordinate maximization algorithm. This observation allows us to explain why the decomposition algorithm might work (or not) in practice and can guide improvements. We compare the performance of the decomposition strategies with that of the off-the-shelf solver Ipopt. The results show that decomposition is more scalable and, surprisingly, delivers higher quality solutions.

preprint2020arXiv

Overlapping Schwarz Decomposition for Constrained Quadratic Programs

We present an overlapping Schwarz decomposition algorithm for constrained quadratic programs (QPs). Schwarz algorithms have been traditionally used to solve linear algebra systems arising from partial differential equations, but we have recently shown that they are also effective at solving structured optimization problems. In the proposed scheme, we consider QPs whose algebraic structure can be represented by graphs. The graph domain is partitioned into overlapping subdomains (yielding a set of coupled subproblems), solutions for the subproblems are computed in parallel, and convergence is enforced by updating primal-dual information in the overlapping regions. We show that convergence is guaranteed if the overlap is sufficiently large and that the convergence rate improves exponentially with the size of the overlap. Convergence results rely on a key property of graph-structured problems that is known as exponential decay of sensitivity. Here, we establish conditions under which this property holds for constrained QPs (as those found in network optimization and optimal control), thus extending existing work that addresses unconstrained QPs. The numerical behavior of the Schwarz scheme is demonstrated by using a DC optimal power flow problem defined over a network with 9,241 nodes.

preprint2020arXiv

Stochastic Model Predictive Control for Central HVAC Plants

We present a stochastic model predictive control (MPC) framework for central heating, ventilation, and air conditioning (HVAC) plants. The framework uses real data to forecast and quantify uncertainty of disturbances affecting the system over multiple timescales (electrical loads, heating/cooling loads, and energy prices). We conduct detailed closed-loop simulations and systematic benchmarks for the central HVAC plant of a typical university campus. Results demonstrate that deterministic MPC fails to properly capture disturbances and that this translates into economic penalties associated with peak demand charges and constraint violations in thermal storage capacity (overflow and/or depletion). Our results also demonstrate that stochastic MPC provides a more systematic approach to mitigate uncertainties and that this ultimately leads to cost savings of up to 7.5% and to mitigation of storage constraint violations. Benchmark results also indicate that these savings are close to ideal savings (9.6%) obtained under MPC with perfect information.

preprint2020arXiv

Unifying Theorems for Subspace Identification and Dynamic Mode Decomposition

This paper presents unifying results for subspace identification (SID) and dynamic mode decomposition (DMD) for autonomous dynamical systems. We observe that SID seeks to solve an optimization problem to estimate an extended observability matrix and a state sequence that minimizes the prediction error for the state-space model. Moreover, we observe that DMD seeks to solve a rank-constrained matrix regression problem that minimizes the prediction error of an extended autoregressive model. We prove that existence conditions for perfect (error-free) state-space and low-rank extended autoregressive models are equivalent and that the SID and DMD optimization problems are equivalent. We exploit these results to propose a SID-DMD algorithm that delivers a provably optimal model and that is easy to implement. We demonstrate our developments using a case study that aims to build dynamical models directly from video data.

preprint2019arXiv

A Parallel Decomposition Scheme for Solving Long-Horizon Optimal Control Problems

We present a temporal decomposition scheme for solving long-horizon optimal control problems. In the proposed scheme, the time domain is decomposed into a set of subdomains with partially overlapping regions. Subproblems associated with the subdomains are solved in parallel to obtain local primal-dual trajectories that are assembled to obtain the global trajectories. We provide a sufficient condition that guarantees convergence of the proposed scheme. This condition states that the effect of perturbations on the boundary conditions (i.e., initial state and terminal dual/adjoint variable) should decay asymptotically as one moves away from the boundaries. This condition also reveals that the scheme converges if the size of the overlap is sufficiently large and that the convergence rate improves with the size of the overlap. We prove that linear quadratic problems satisfy the asymptotic decay condition, and we discuss numerical strategies to determine if the condition holds in more general cases. We draw upon a non-convex optimal control problem to illustrate the performance of the proposed scheme.

preprint2016arXiv

Data Centers as Dispatchable Loads to Harness Stranded Power

We analyze how both traditional data center integration and dispatchable load integration affect power grid efficiency. We use detailed network models, parallel optimization solvers, and thousands of renewable generation scenarios to perform our analysis. Our analysis reveals that significant spillage and stranded power will be observed in power grids as wind power levels are increased. A counter-intuitive finding is that collocating data centers with inflexible loads next to wind farms has limited impacts on renewable portfolio standard (RPS) goals because it provides limited system-level flexibility and can in fact increase stranded power and fossil-fueled generation. In contrast, optimally placing data centers that are dispatchable (with flexible loads) provides system-wide flexibility, reduces stranded power, and improves efficiency. In short, optimally placed dispatchable computing loads can enable better scaling to high RPS. We show that these dispatchable computing loads are powered to 60~80\% of their requested capacity, indicating that there are significant economic incentives provided by stranded power.

preprint2015arXiv

A Stochastic Electricity Market Clearing Formulation with Consistent Pricing Properties

We argue that deterministic market clearing formulations introduce arbitrary distortions between day-ahead and expected real-time prices that bias economic incentives and block diversification. We extend and analyze the stochastic clearing formulation proposed by Pritchard et al. (2010) in which the social surplus function induces penalties between day-ahead and real-time quantities. We prove that the formulation yields price distortions that are bounded by the bid prices, and we show that adding a similar penalty term to transmission flows and phase angles ensures boundedness throughout the network. We prove that when the price distortions are zero, day-ahead quantities converge to the quantile of real-time counterparts. The undesired effects of price distortions suggest that stochastic settings provide significant benefits over deterministic ones that go beyond social surplus improvements. We propose additional metrics to evaluate these benefits.

Victor M. Zavala

What is connected

Connect this record

See the researcher in context

Building this map preview

22 published item(s)

Graph-Based Modeling and Decomposition of Hierarchical Optimization Problems

Data Analysis using Riemannian Geometry and Applications to Chemical Engineering

Pricing and Remunerating Electricity Storage Flexibility Using Virtual Links

Random Field Optimization

SAFE-OCC: A Novelty Detection Framework for Convolutional Neural Network Sensors and its Application in Process Control

Mitigating Investment Risk Using Modular Technologies

A Hierarchical Optimization Architecture for Large-Scale Power Networks

A Sigmoidal Approximation for Chance-Constrained Nonlinear Programs

Characterizing the Predictive Accuracy of Dynamic Mode Decomposition for Data-Driven Control

Computing Economic-Optimal and Stable Equilibria for Droop-Controlled Microgrids

Decentralized Schemes with Overlap for Solving Graph-Structured Optimization Problems

Dual Dynamic Programming for Multi-Scale Mixed-Integer MPC

Economic Properties of Multi-Product Supply Chains

Multi-Grid Schemes for Multi-Scale Coordination of Energy Systems

Multiscale Model Predictive Control of Battery Systems for Frequency Regulation Markets using Physics-Based Models

On the Convergence of the Dynamic Inner PCA Algorithm

Overlapping Schwarz Decomposition for Constrained Quadratic Programs

Stochastic Model Predictive Control for Central HVAC Plants

Unifying Theorems for Subspace Identification and Dynamic Mode Decomposition

A Parallel Decomposition Scheme for Solving Long-Horizon Optimal Control Problems

Data Centers as Dispatchable Loads to Harness Stranded Power

A Stochastic Electricity Market Clearing Formulation with Consistent Pricing Properties