Researcher profile

Péter L. Simon

Péter L. Simon contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2022arXiv

Learning the parameters of a differential equation from its trajectory via the adjoint equation

The paper contributes to strengthening the relation between machine learning and the theory of differential equations. In this context, the inverse problem of fitting the parameters, and the initial condition of a differential equation to some measurements constitutes a key issue. The paper explores an abstraction that can be used to construct a family of loss functions with the aim of fitting the solution of an initial value problem to a set of discrete or continuous measurements. It is shown, that an extension of the adjoint equation can be used to derive the gradient of the loss function as a continuous analogue of backpropagation in machine learning. Numerical evidence is presented that under reasonably controlled circumstances the gradients obtained this way can be used in a gradient descent to fit the solution of an initial value problem to a set of continuous noisy measurements, and a set of discrete noisy measurements that are recorded at uncertain times.

preprint2022arXiv

On parameter identifiability in network-based epidemic models

Many models in mathematical epidemiology are developed with the aim to provide a framework for parameter estimation and then prediction. It is well-known that parameters are not always uniquely identifiable. In this paper we consider network-based mean-field models and explore the problem of parameter identifiability when observations about an epidemic are available. Making use of the analytical tractability of most network-based mean-field models, e.g., explicit analytical expressions for leading eigenvalue and final epidemic size, we set up the parameter identifiability problem as finding the solution or solutions of a system of coupled equations. More precisely, subject to observing/measuring growth rate and final epidemic size, we seek to identify parameter values leading to these measurements. We are particularly concerned with disentangling transmission rate from the network density. To do this we define strong and weak identifiability and we find that except for the simplest model, parameters cannot be uniquely determined, that is they are weakly identifiable. This means that there exists multiple solutions (a manifold of infinite measure) which give rise to model output that is close to the data. Identifying, formalising and analytically describing this problem should lead to a better appreciation of the complexity involved in fitting models with many parameters to data.

preprint2011arXiv

Differential equation approximations of stochastic network processes: an operator semigroup approach

The rigorous linking of exact stochastic models to mean-field approximations is studied. Starting from the differential equation point of view the stochastic model is identified by its Kolmogorov equations, which is a system of linear ODEs that depends on the state space size ($N$) and can be written as $\dot u_N=A_N u_N$. Our results rely on the convergence of the transition matrices $A_N$ to an operator $A$. This convergence also implies that the solutions $u_N$ converge to the solution $u$ of $\dot u=Au$. The limiting ODE can be easily used to derive simpler mean-field-type models such that the moments of the stochastic process will converge uniformly to the solution of appropriately chosen mean-field equations. A bi-product of this method is the proof that the rate of convergence is $\mathcal{O}(1/N)$. In addition, it turns out that the proof holds for cases that are slightly more general than the usual density dependent one. Moreover, for Markov chains where the transition rates satisfy some sign conditions, a new approach for proving convergence to the mean-field limit is proposed. The starting point in this case is the derivation of a countable system of ordinary differential equations for all the moments. This is followed by the proof of a perturbation theorem for this infinite system, which in turn leads to an estimate for the difference between the moments and the corresponding quantities derived from the solution of the mean-field ODE.

preprint2011arXiv

Exact and approximate epidemic models on networks: a new, improved closure relation

Recently, research that focuses on the rigorous understanding of the relation between simulation and/or exact models on graphs and approximate counterparts has gained lots of momentum. This includes revisiting the performance of classic pairwise models with closures at the level of pairs and/or triples as well as effective-degree-type models and those based on the probability generating function formalism. In this paper, for a fully connected graph and the simple $SIS$ (susceptible-infected-susceptible) epidemic model, a novel closure is introduced. This is done via using the equations for the moments of the distribution describing the number of infecteds at all times combined with the empirical observations that this is well described/approximated by a binomial distribution with time dependent parameters. This assumption allows us to express higher order moments in terms of lower order ones and this leads to a new closure. The significant feature of the new closure is that the difference of the exact system, given by the Kolmogorov equations, from the solution of the newly defined approximate system is of order $1/N^2$. This is in contrast with the $\mathcal{O}(1/N)$ difference corresponding to the approximate system obtained via the classic triple closure.