Researcher profile

Sebastian Jaimungal

Sebastian Jaimungal contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
16topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2026arXiv

Model Combination in Risk Sharing under Ambiguity

We consider the problem of an agent who faces losses in continuous time over a finite time horizon and may choose to share some of these losses with a counterparty. The agent is uncertain about the true loss distribution and has multiple models for the losses. Their goal is to optimize a mean-variance type criterion with model combination under ambiguity through risk sharing. We construct such a criterion using the chi-squared divergence, adapting the monotone mean-variance preferences of Maccheroni et al. (2009) to the model combination setting and exploit a dual representation to expand the state space, yielding a time consistent problem. Assuming a Cramér-Lundberg loss model, we fully characterize the optimal risk sharing contract and the agent's wealth process under the optimal strategy. Furthermore, we prove that the strategy we obtain is admissible and that the value function satisfies the appropriate verification conditions. Finally, we apply the optimal strategy to an insurance setting using data from a Spanish automobile insurance portfolio, where we obtain differing models using cross-validation and provide numerical illustrations of the results.

preprint2024arXiv

Exploratory Control with Tsallis Entropy for Latent Factor Models

We study optimal control in models with latent factors where the agent controls the distribution over actions, rather than actions themselves, in both discrete and continuous time. To encourage exploration of the state space, we reward exploration with Tsallis Entropy and derive the optimal distribution over states - which we prove is $q$-Gaussian distributed with location characterized through the solution of an FBS$Δ$E and FBSDE in discrete and continuous time, respectively. We discuss the relation between the solutions of the optimal exploration problems and the standard dynamic optimal control solution. Finally, we develop the optimal policy in a model-agnostic setting along the lines of soft $Q$-learning. The approach may be applied in, e.g., developing more robust statistical arbitrage trading strategies.

preprint2022arXiv

Arbitrage-Free Implied Volatility Surface Generation with Variational Autoencoders

We propose a hybrid method for generating arbitrage-free implied volatility (IV) surfaces consistent with historical data by combining model-free Variational Autoencoders (VAEs) with continuous time stochastic differential equation (SDE) driven models. We focus on two classes of SDE models: regime switching models and Lévy additive processes. By projecting historical surfaces onto the space of SDE model parameters, we obtain a distribution on the parameter subspace faithful to the data on which we then train a VAE. Arbitrage-free IV surfaces are then generated by sampling from the posterior distribution on the latent space, decoding to obtain SDE model parameters, and finally mapping those parameters to IV surfaces. We further refine the VAE model by including conditional features and demonstrate its superior generative out-of-sample performance.

preprint2022arXiv

Functional Data Analysis for Extracting the Intrinsic Dimensionality of Spectra: Application to Chemical Homogeneity in the Open Cluster M67

High-resolution spectroscopic surveys of the Milky Way have entered the Big Data regime and have opened avenues for solving outstanding questions in Galactic archaeology. However, exploiting their full potential is limited by complex systematics, whose characterization has not received much attention in modern spectroscopic analyses. In this work, we present a novel method to disentangle the component of spectral data space intrinsic to the stars from that due to systematics. Using functional principal component analysis on a sample of $18,933$ giant spectra from APOGEE, we find that the intrinsic structure above the level of observational uncertainties requires ${\approx}$10 functional principal components (FPCs). Our FPCs can reduce the dimensionality of spectra, remove systematics, and impute masked wavelengths, thereby enabling accurate studies of stellar populations. To demonstrate the applicability of our FPCs, we use them to infer stellar parameters and abundances of 28 giants in the open cluster M67. We employ Sequential Neural Likelihood, a simulation-based Bayesian inference method that learns likelihood functions using neural density estimators, to incorporate non-Gaussian effects in spectral likelihoods. By hierarchically combining the inferred abundances, we limit the spread of the following elements in M67: $\mathrm{Fe} \lesssim 0.02$ dex; $\mathrm{C} \lesssim 0.03$ dex; $\mathrm{O}, \mathrm{Mg}, \mathrm{Si}, \mathrm{Ni} \lesssim 0.04$ dex; $\mathrm{Ca} \lesssim 0.05$ dex; $\mathrm{N}, \mathrm{Al} \lesssim 0.07$ dex (at 68% confidence). Our constraints suggest a lack of self-pollution by core-collapse supernovae in M67, which has promising implications for the future of chemical tagging to understand the star formation history and dynamical evolution of the Milky Way.

preprint2022arXiv

Minimal Kullback-Leibler Divergence for Constrained Lévy-Itô Processes

Given an n-dimensional stochastic process X driven by P-Brownian motions and Poisson random measures, we seek the probability measure Q, with minimal relative entropy to P, such that the Q-expectations of some terminal and running costs are constrained. We prove existence and uniqueness of the optimal probability measure, derive the explicit form of the measure change, and characterise the optimal drift and compensator adjustments under the optimal measure. We provide an analytical solution for Value-at-Risk (quantile) constraints, discuss how to perturb a Brownian motion to have arbitrary variance, and show that pinned measures arise as a limiting case of optimal measures. The results are illustrated in a risk management setting -- including an algorithm to simulate under the optimal measure -- where an agent seeks to answer the question: what dynamics are induced by a perturbation of the Value-at-Risk and the average time spent below a barrier on the reference process?

preprint2022arXiv

Portfolio Optimisation within a Wasserstein Ball

We study the problem of active portfolio management where an investor aims to outperform a benchmark strategy's risk profile while not deviating too far from it. Specifically, an investor considers alternative strategies whose terminal wealth lie within a Wasserstein ball surrounding a benchmark's -- being distributionally close -- and that have a specified dependence/copula -- tying state-by-state outcomes -- to it. The investor then chooses the alternative strategy that minimises a distortion risk measure of terminal wealth. In a general (complete) market model, we prove that an optimal dynamic strategy exists and provide its characterisation through the notion of isotonic projections. We further propose a simulation approach to calculate the optimal strategy's terminal wealth, making our approach applicable to a wide range of market models. Finally, we illustrate how investors with different copula and risk preferences invest and improve upon the benchmark using the Tail Value-at-Risk, inverse S-shaped, and lower- and upper-tail distortion risk measures as examples. We find that investors' optimal terminal wealth distribution has larger probability masses in regions that reduce their risk measure relative to the benchmark while preserving the benchmark's structure.

preprint2022arXiv

Principal agent mean field games in REC markets

Principal agent games are a growing area of research which focuses on the optimal behaviour of a principal and an agent, with the former contracting work from the latter, in return for providing a monetary award. While this field canonically considers a single agent, the situation where multiple agents, or even an infinite amount of agents are contracted by a principal are growing in prominence and pose interesting and realistic problems. Here, agents form a Nash equilibrium among themselves, and a Stackelberg equilibrium between themselves as a collective and the principal. We apply this framework to the problem of implementing Renewable Energy Certificate (REC) markets, where the principal requires regulated firms (power generators) to pay a non-compliance penalty which is inversely proportional to the amount of RECs they have. RECs can be obtained by generating electricity from clean sources or purchasing on the market. The agents react to this penalty and optimize their behaviours to navigate the system at minimum cost. In the agents' model we incorporate market clearing as well as agent heterogeneity. For a given market design, we find the Nash equilibrium among agents using techniques from mean field games. We then use techniques from extended McKean-Vlasov control problems to solve the principal (regulators) problem, who aim to choose the penalty function in such a way that balances environmental and revenue impacts optimally. We find through these techniques that the optimal penalty function is linear in the agents' state, suggesting the optimal emissions regulation market is more akin to a tax or rebate, regardless of the principal's utility function.

preprint2021arXiv

Lévy-Ito Models in Finance

We present an overview of the broad class of financial models in which the prices of assets are Lévy-Ito processes driven by an $n$-dimensional Brownian motion and an independent Poisson random measure. The Poisson random measure is associated with an $n$-dimensional Lévy process. Each model consists of a pricing kernel, a money market account, and one or more risky assets. We show how the excess rate of return above the interest rate can be calculated for risky assets in such models, thus showing the relationship between risk and return when asset prices have jumps. The framework is applied to a variety of asset classes, allowing one to construct new models as well as interesting generalizations of familiar models.

preprint2020arXiv

A Variational Analysis Approach to Solving the Merton Problem

We address the Merton problem of maximizing the expected utility of terminal wealth using techniques from variational analysis. Under a general continuous semimartingale market model with stochastic parameters, we obtain a characterization of the optimal portfolio for general utility functions in terms of a forward-backward stochastic differential equation (FBSDE) and derive solutions for a number of well-known utility functions. Our results complement a previous studies conducted on optimal strategies in markets driven by Brownian noise with random drift and volatility parameters.

preprint2020arXiv

Convex Analysis for LQG Systems with Applications to Major Minor LQG Mean-Field Game Systems

We develop a convex analysis approach for solving LQG optimal control problems and apply it to major-minor (MM) LQG mean-field game (MFG) systems. The approach retrieves the best response strategies for the major agent and all minor agents that attain an $ε$-Nash equilibrium. An important and distinctive advantage to this approach is that unlike the classical approach in the literature, we are able to avoid imposing assumptions on the evolution of the mean-field. In particular, this provides a tool for dealing with complex and non-standard systems.

preprint2020arXiv

Double Deep Q-Learning for Optimal Execution

Optimal trade execution is an important problem faced by essentially all traders. Much research into optimal execution uses stringent model assumptions and applies continuous time stochastic control to solve them. Here, we instead take a model free approach and develop a variation of Deep Q-Learning to estimate the optimal actions of a trader. The model is a fully connected Neural Network trained using Experience Replay and Double DQN with input features given by the current state of the limit order book, other trading signals, and available execution actions, while the output is the Q-value function estimating the future rewards under an arbitrary action. We apply our model to nine different stocks and find that it outperforms the standard benchmark approach on most stocks using the measures of (i) mean and median out-performance, (ii) probability of out-performance, and (iii) gain-loss ratios.

preprint2020arXiv

Hedging Non-Tradable Risks with Transaction Costs and Price Impact

A risk-averse agent hedges her exposure to a non-tradable risk factor $U$ using a correlated traded asset $S$ and accounts for the impact of her trades on both factors. The effect of the agent's trades on $U$ is referred to as cross-impact. By solving the agent's stochastic control problem, we obtain a closed-form expression for the optimal strategy when the agent holds a linear position in $U$. When the exposure to the non-tradable risk factor $ψ(U_T)$ is non-linear, we provide an approximation to the optimal strategy in closed-form, and prove that the value function is correctly approximated by this strategy when cross-impact and risk-aversion are small. We further prove that when $ψ(U_T)$ is non-linear, the approximate optimal strategy can be written in terms of the optimal strategy for a linear exposure with the size of the position changing dynamically according to the exposure's "Delta" under a particular probability measure.

preprint2020arXiv

Mixing LSMC and PDE Methods to Price Bermudan Options

We develop a mixed least squares Monte Carlo-partial differential equation (LSMC-PDE) method for pricing Bermudan style options on assets whose volatility is stochastic. The algorithm is formulated for an arbitrary number of assets and volatility processes and we prove the algorithm converges almost surely for a class of models. We also discuss two methods to improve the algorithm's computational complexity. Our numerical examples focus on the single ($2d$) and multi-dimensional ($4d$) Heston models and we compare our hybrid algorithm with classical LSMC approaches. In each case, we find that the hybrid algorithm outperforms standard LSMC in terms of estimating prices and optimal exercise boundaries.

preprint2020arXiv

Optimal Behaviour in Solar Renewable Energy Certificate (SREC) Markets

SREC markets are a relatively novel market-based system to incentivize the production of energy from solar means. A regulator imposes a floor on the amount of energy each regulated firm must generate from solar power in a given period and provides them with certificates for each generated MWh. Firms offset these certificates against the floor and pay a penalty for any lacking certificates. Certificates are tradable assets, allowing firms to purchase/sell them freely. In this work, we formulate a stochastic control problem for generating and trading in SREC markets from a regulated firm's perspective. We account for generation and trading costs, the impact both have on SREC prices, provide a characterization of the optimal strategy, and develop a numerical algorithm to solve this control problem. Through numerical experiments, we explore how a firm who acts optimally behaves under various conditions. We find that an optimal firm's generation and trading behaviour can be separated into various regimes, based on the marginal benefit of obtaining an additional SREC, and validate our theoretical characterization of the optimal strategy. We also conduct parameter sensitivity experiments and conduct comparisons of the optimal strategy to other candidate strategies.

preprint2020arXiv

Trading Foreign Exchange Triplets

We develop the optimal trading strategy for a foreign exchange (FX) broker who must liquidate a large position in an illiquid currency pair. To maximize revenues, the broker considers trading in a currency triplet which consists of the illiquid pair and two other liquid currency pairs. The liquid pairs in the triplet are chosen so that one of the pairs is redundant. The broker is risk-neutral and accounts for model ambiguity in the FX rates to make her strategy robust to model misspecification. When the broker is ambiguity neutral (averse) the trading strategy in each pair is independent (dependent) of the inventory in the other two pairs in the triplet. We employ simulations to illustrate how the robust strategies perform. For a range of ambiguity aversion parameters, we find the mean Profit and Loss (P&L) of the strategy increases and the standard deviation of the P&L decreases as ambiguity aversion increases.