Researcher profile

Yevgeny Liokumovich

Yevgeny Liokumovich contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
6 works
0 followers
4 topics
4 close collaborators

Published work

6 published items

Preprint, 2024, arXiv

Quantifying stability of non-power-seeking in artificial agents

We investigate the question: if an AI agent is known to be safe in one setting, is it also safe in a new setting similar to the first? This is a core question of AI alignment--we train and test models in a certain environment, but deploy them in another, and we need to guarantee that models that seem safe in testing remain so in deployment. Our notion of safety is based on power-seeking--an agent which seeks power is not safe. In particular, we focus on a crucial type of power-seeking: resisting shutdown. We model agents as policies for Markov decision processes, and show (in two cases of interest) that not resisting shutdown is "stable": if an MDP has certain policies which don't avoid shutdown, the corresponding policies for a similar MDP also don't avoid shutdown. We also show that there are natural cases where safety is _not_ stable--arbitrarily small perturbations may result in policies which never shut down. In our first case of interest--near-optimal policies--we use a bisimulation metric on MDPs to prove that small perturbations won't make the agent take longer to shut down. Our second case of interest is policies for MDPs satisfying certain constraints which hold for various models (including language models). Here, we demonstrate a quantitative bound on how fast the probability of not shutting down can increase: by defining a metric on MDPs; proving that the probability of not shutting down, as a function on MDPs, is lower semicontinuous; and bounding how quickly this function decreases.

Preprint, 2022, arXiv

Geodesic nets on non-compact Riemannian manifolds

A geodesic flower is a finite collection of geodesic loops based at the same point $p$ that satisfy the following balancing condition: The sum of all unit tangent vectors to all geodesic arcs meeting at $p$ is equal to the zero vector. In particular, a geodesic flower is a stationary geodesic net. We prove that in every complete non-compact manifold with locally convex ends there exists a non-trivial geodesic flower.

Preprint, 2022, arXiv

Singular behavior and generic regularity of min-max minimal hypersurfaces

We show that for a generic $8$-dimensional Riemannian manifold with positive Ricci curvature, there exists a smooth minimal hypersurface. Without the curvature condition, we show that for a dense set of $8$-dimensional Riemannian metrics there exists a minimal hypersurface with at most one singular point. This extends previous work on generic regularity that only dealt with area-minimizing hypersurfaces. These results are a consequence of a more general estimate for a one-parameter min-max minimal hypersurface $\Sigma \subset (M,g)$ (valid in any dimension): $$\mathcal H^{0} (\mathcal{S}_{nm}(\Sigma)) +{\rm Index}(\Sigma) \leq 1$$ where $\mathcal{S}_{nm}(\Sigma)$ denotes the set of singular points of $\Sigma$ with a unique tangent cone non-area minimizing on either side.

Preprint, 2021, arXiv

Filling metric spaces

We prove a new version of the isoperimetric inequality: Given a positive real $m$, a Banach space $B$, a closed subset $Y$ of a metric space $X$ and a continuous map $f:Y \rightarrow B$ with $f(Y)$ compact, $$\inf_F HC_{m+1}(F(X))\leq c(m)HC_m(f(Y))^{\frac{m+1}{m}},$$ where $HC_m$ denotes the $m$-dimensional Hausdorff content, the infimum is taken over the set of all continuous maps $F:X\rightarrow B$ such that $F(y)=f(y)$ for all $y\in Y$, and $c(m)$ depends only on $m$. Moreover, one can find $F$ with a nearly minimal $HC_{m+1}$ such that its image lies in the $C(m)HC_m(f(Y))^{\frac{1}{m}}$-neighbourhood of $f(Y)$ with the exception of a subset with zero $(m+1)$-dimensional Hausdorff measure. The paper also contains a very general coarea inequality for Hausdorff content and its modifications. As an application we demonstrate an inequality conjectured by Larry Guth that relates the $m$-dimensional Hausdorff content of a compact metric space with its $(m-1)$-dimensional Urysohn width. We show that this result implies new systolic inequalities that both strengthen Gromov's classical systolic inequality for essential Riemannian manifolds and extend this inequality to a wider class of non-simply connected manifolds.