Source author record

Nadia Oudjane

Nadia Oudjane appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.PR math.OC q-fin.PR math.NA Computer Science and Game Theory Cryptography and Security eess.SY Machine Learning Multiagent Systems Systems and Control

Catalog footprint

What is connected

13works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Online Markov Decision Processes with Terminal Law Constraints

Traditional reinforcement learning usually assumes either episodic interactions with resets or continuous operation to minimize average or cumulative loss. While episodic settings have many theoretical results, resets are often unrealistic in practice. The infinite-horizon setting avoids this issue but lacks non-asymptotic guarantees in online scenarios with unknown dynamics. In this work, we move towards closing this gap by introducing a reset-free framework called the periodic framework, where the goal is to find periodic policies: policies that not only minimize cumulative loss but also return the agents to their initial state distribution after a fixed number of steps. We formalize the problem of finding optimal periodic policies and identify sufficient conditions under which it is well-defined for tabular Markov decision processes. To evaluate algorithms in this framework, we introduce the periodic regret, a measure that balances cumulative loss with the terminal law constraint. We then propose the first algorithms for computing periodic policies in two multi-agent settings and show they achieve sublinear periodic regret of order $\tilde O(T^{3/4})$. This provides the first non-asymptotic guarantees for reset-free learning in the setting of $M$ homogeneous agents, for $M > 1$.

preprint2022arXiv

A privacy-preserving distributed computational approach for distributed locational marginal prices

An important issue in today's electricity markets is the management of flexibilities offered by new practices, such as smart home appliances or electric vehicles. By inducing changes in the behavior of residential electric utilities, demand response (DR) seeks to adjust the demand of power to the supply for increased grid stability and better integration of renewable energies. A key role in DR is played by emergent independent entities called load aggregators (LAs). We develop a new decentralized algorithm to solve a convex relaxation of the classical Alternative Current Optimal Power Flow (ACOPF) problem, which relies on local information only. Each computational step can be performed in an entirely privacy-preserving manner, and system-wide coordination is achieved via node-specific distribution locational marginal prices (DLMPs). We demonstrate the efficiency of our approach on a 15-bus radial distribution network.

preprint2020arXiv

A Privacy-preserving Method to Optimize Distributed Resource Allocation

We consider a resource allocation problem involving a large number of agents with individual constraints subject to privacy, and a central operator whose objective is to optimize a global, possibly nonconvex, cost while satisfying the agents' constraints, for instance an energy operator in charge of the management of energy consumption flexibilities of many individual consumers. We provide a privacy-preserving algorithm that does compute the optimal allocation of resources, avoiding each agent to reveal her private information (constraints and individual solution profile) neither to the central operator nor to a third party. Our method relies on an aggregation procedure: we compute iteratively a global allocation of resources, and gradually ensure existence of a disaggregation, that is individual profiles satisfying agents' private constraints, by a protocol involving the generation of polyhedral cuts and secure multiparty computations (SMC). To obtain these cuts, we use an alternate projection method, which is implemented locally by each agent, preserving her privacy needs. We adress especially the case in which the local and global constraints define a transportation polytope. Then, we provide theoretical convergence estimates together with numerical results, showing that the algorithm can be effectively used to solve the allocation problem in high dimension, while addressing privacy issues.

preprint2020arXiv

Efficient Estimation of Equilibria in Large Aggregative Games with Coupling Constraints

Aggregative games have many industrial applications, and computing an equilibrium in those games is challenging when the number of players is large. In the framework of atomic aggregative games with coupling constraints, we show that variational Nash equilibria of a large aggregative game can be approximated by a Wardrop equilibrium of an auxiliary population game of smaller dimension. Each population of this auxiliary game corresponds to a group of atomic players of the initial large game. This approach enables an efficient computation of an approximated equilibrium, as the variational inequality characterizing the Wardrop equilibrium is of smaller dimension than the initial one. This is illustrated on an example in the smart grid context.

preprint2016arXiv

Branching diffusion representation of semilinear PDEs and Monte Carlo approximation

We provide a representation result of parabolic semi-linear PD-Es, with polynomial nonlinearity, by branching diffusion processes. We extend the classical representation for KPP equations, introduced by Skorokhod (1964), Watanabe (1965) and McKean (1975), by allowing for polynomial nonlinearity in the pair $(u, Du)$, where $u$ is the solution of the PDE with space gradient $Du$. Similar to the previous literature, our result requires a non-explosion condition which restrict to "small maturity" or "small nonlinearity" of the PDE. Our main ingredient is the automatic differentiation technique as in Henry Labordere, Tan and Touzi (2015), based on the Malliavin integration by parts, which allows to account for the nonlinearities in the gradient. As a consequence, the particles of our branching diffusion are marked by the nature of the nonlinearity. This new representation has very important numerical implications as it is suitable for Monte Carlo simulation. Indeed, this provides the first numerical method for high dimensional nonlinear PDEs with error estimate induced by the dimension-free Central limit theorem. The complexity is also easily seen to be of the order of the squared dimension. The final section of this paper illustrates the efficiency of the algorithm by some high dimensional numerical experiments.

preprint2016arXiv

Particle system algorithm and chaos propagation related to non-conservative McKean type stochastic differential equations

We discuss numerical aspects related to a new class of nonlinear Stochastic Differential Equations in the sense of McKean, which are supposed to represent non conservative nonlinear Partial Differential equations (PDEs). We propose an original interacting particle system for which we discuss the propagation of chaos. We consider a time-discretized approximation of this particle system to which we associate a random function which is proved to converge to a solution of a regularized version of a nonlinear PDE.

preprint2016arXiv

Unbiased Monte Carlo estimate of stochastic differential equations expectations

We develop a pure Monte Carlo method to compute $E(g(X_T))$ where $g$ is a bounded and Lipschitz function and $X_t$ an Ito process. This approach extends a previously proposed method to the general multidimensional case with a SDE with varying coefficients. A variance reduction method relying on interacting particle systems is also developped.

preprint2015arXiv

Probabilistic representation of a class of non conservative nonlinear Partial Differential Equations

We introduce a new class of nonlinear Stochastic Differential Equations in the sense of McKean, related to non conservative nonlinear Partial Differential equations (PDEs). We discuss existence and uniqueness pathwise and in law under various assumptions. We propose an original interacting particle system for which we discuss the propagation of chaos. To this system, we associate a random function which is proved to converge to a solution of a regularized version of PDE.

preprint2014arXiv

Hedging Expected Losses on Derivatives in Electricity Futures Markets

We investigate the problem of pricing and hedging derivatives of Electricity Futures contract when the underlying asset is not available. We propose to use a cross hedging strategy based on the Futures contract covering the larger delivery period. A quick overview of market data shows a basis risk for this market incompleteness. For that purpose we formulate the pricing problem in a stochastic target form along the lines of Bouchard and al. (2008), with a moment loss function. Following the same techniques as in the latter, we avoid to demonstrate the uniqueness of the value function by comparison arguments and explore convex duality methods to provide a semi-explicit solution to the problem. We then propose numerical results to support the new hedging strategy and compare our method to the Black-Scholes naive approach.

preprint2013arXiv

Variance optimal hedging for continuous time additive processes and applications

For a large class of vanilla contingent claims, we establish an explicit Föllmer-Schweizer decomposition when the underlying is an exponential of an additive process. This allows to provide an efficient algorithm for solving the mean variance hedging problem. Applications to models derived from the electricity market are performed.

preprint2012arXiv

On some expectation and derivative operators related to integral representations of random variables with respect to a PII process

Given a process with independent increments $X$ (not necessarily a martingale) and a large class of square integrable r.v. $H=f(X_T)$, $f$ being the Fourier transform of a finite measure $μ$, we provide explicit Kunita-Watanabe and Föllmer-Schweizer decompositions. The representation is expressed by means of two significant maps: the expectation and derivative operators related to the characteristics of $X$. We also provide an explicit expression for the variance optimal error when hedging the claim $H$ with underlying process $X$. Those questions are motivated by finding the solution of the celebrated problem of global and local quadratic risk minimization in mathematical finance.

preprint2012arXiv

Variance Optimal Hedging for discrete time processes with independent increments. Application to Electricity Markets

We consider the discretized version of a (continuous-time) two-factor model introduced by Benth and coauthors for the electricity markets. For this model, the underlying is the exponent of a sum of independent random variables. We provide and test an algorithm, which is based on the celebrated Foellmer-Schweizer decomposition for solving the mean-variance hedging problem. In particular, we establish that decomposition explicitely, for a large class of vanilla contingent claims. Interest is devoted in the choice of rebalancing dates and its impact on the hedging error, regarding the payoff regularity and the non stationarity of the log-price process.

preprint2010arXiv

Snell envelope with path dependent multiplicative optimality criteria

We analyze the Snell envelope with path dependent multiplicative optimality criteria. Especially for this case, we propose a variation of the Snell envelope backward recursion which allows to extend some classical approxima- tion schemes to the multiplicatively path dependent case. In this framework, we propose an importance sampling particle approximation scheme based on a specific change of measure, designed to concentrate the computational effort in regions pointed out by the criteria. This new algorithm is theoritically studied. We provide non asymptotic convergence estimates and prove that the resulting estimator is high biased.

Nadia Oudjane

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

Online Markov Decision Processes with Terminal Law Constraints

A privacy-preserving distributed computational approach for distributed locational marginal prices

A Privacy-preserving Method to Optimize Distributed Resource Allocation

Efficient Estimation of Equilibria in Large Aggregative Games with Coupling Constraints

Branching diffusion representation of semilinear PDEs and Monte Carlo approximation

Particle system algorithm and chaos propagation related to non-conservative McKean type stochastic differential equations

Unbiased Monte Carlo estimate of stochastic differential equations expectations

Probabilistic representation of a class of non conservative nonlinear Partial Differential Equations

Hedging Expected Losses on Derivatives in Electricity Futures Markets

Variance optimal hedging for continuous time additive processes and applications

On some expectation and derivative operators related to integral representations of random variables with respect to a PII process

Variance Optimal Hedging for discrete time processes with independent increments. Application to Electricity Markets

Snell envelope with path dependent multiplicative optimality criteria