Researcher profile

Florian Schäfer

Florian Schäfer contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

Information geometric regularization of unidimensional pressureless Euler equations yields global strong solutions

Partial differential equations describing compressible fluids are prone to the formation of shock singularities, arising from faster upstream fluid particles catching up to slower, downstream ones. In geometric terms, this causes the deformation map to leave the manifold of diffeomorphisms. Information geometric regularization addresses this issue by changing the manifold geometry to make it geodesically complete. Empirical evidence suggests that this results in smooth solutions without adding artificial viscosity. This work makes a first step towards understanding this phenomenon rigorously, in the setting of the unidimensional pressureless Euler equations. It shows that their information geometric regularization has smooth global solutions. By establishing $Γ$-convergence of its variational description, it proves convergence of these solutions to entropy solutions of the nominal problem, in the limit of vanishing regularization parameter. A consequence of these results is that manifolds of unidimensional diffeomorphisms with information geometric regularization are geodesically complete.

preprint2024arXiv

InRank: Incremental Low-Rank Learning

The theory of greedy low-rank learning (GLRL) aims to explain the impressive generalization capabilities of deep learning. It proves that stochastic gradient-based training implicitly regularizes neural networks towards low-rank solutions through a gradual increase of the rank during training. However, there is a gap between theory and practice since GLRL requires an infinitesimal initialization of the weights, which is not practical due to the fact that it is a saddle point. In this work, we remove the assumption of infinitesimal initialization by focusing on cumulative weight updates. We prove the cumulative weight updates follow an incremental low-rank trajectory for arbitrary orthogonal initialization of weights in a three-layer linear network. Empirically, we demonstrate that our theory holds on a broad range of neural networks (e.g., transformers) and standard training algorithms (e.g., SGD, Adam). However, existing training algorithms do not exploit the low-rank property to improve computational efficiency as the networks are not parameterized in low-rank. To remedy this, we design a new training algorithm Incremental Low-Rank Learning (InRank), which explicitly expresses cumulative weight updates as low-rank matrices while incrementally augmenting their ranks during training. We evaluate InRank on GPT-2, and our results indicate that InRank achieves comparable prediction performance as the full-rank counterpart while requiring at most 33% of the total ranks throughout training. We also propose an efficient version of InRank that achieves a reduction of 37% in total training time and 36% in model size when training GPT-medium on WikiText-103 from scratch.

preprint2023arXiv

Scalable Bayesian transport maps for high-dimensional non-Gaussian spatial fields

A multivariate distribution can be described by a triangular transport map from the target distribution to a simple reference distribution. We propose Bayesian nonparametric inference on the transport map by modeling its components using Gaussian processes. This enables regularization and uncertainty quantification of the map estimation, while still resulting in a closed-form and invertible posterior map. We then focus on inferring the distribution of a nonstationary spatial field from a small number of replicates. We develop specific transport-map priors that are highly flexible and are motivated by the behavior of a large class of stochastic processes. Our approach is scalable to high-dimensional distributions due to data-dependent sparsity and parallel computations. We also discuss extensions, including Dirichlet process mixtures for flexible marginals. We present numerical results to demonstrate the accuracy, scalability, and usefulness of our methods, including statistical emulation of non-Gaussian climate-model output.

preprint2021arXiv

Feshbach resonances of large mass-imbalance Er-Li mixtures

We report on the experimental observation of Feshbach resonances in large mass-imbalance mixtures of Erbium (Er) and Lithium (Li). All combinations between ${}^{168}$Er, ${}^{166}$Er and ${}^7$Li, ${}^6$Li are cooled to temperatures of a few microkelvin, partially by means of sympathetic cooling together with Ytterbium (Yb) as a third mixture component. The Er-Li inelastic interspecies collisional properties are studied for magnetic fields up to 680 G. In all cases resonant interspecies loss features, indicative of Feshbach resonances, have been observed. While most resonances have sub-Gauss widths, a few of them are broad and feature widths of several Gauss. Those broad resonances are a key to the realization of ultracold Er-Li quantum gas mixtures with tunable interactions.

preprint2020arXiv

Comparison of Meta-Heuristics for the Planning of Meshed Power Systems

The power system planning task is a combinatorial optimization problem. The objective function minimizes the economic costs subject to a set of technical and operational constraints. Meta-heuristics are often used as optimization strategies to find solutions to this problem by combining switching, line reinforcement or new line measures. Common heuristics are GA, PSO, HC, ILS or newer methods such as GWO or FWA. In this paper, we compare these algorithms within the same framework. We test each algorithm on 8 different test grids ranging from 73 to 9421 buses. For each grid and algorithm, we start 50 runs with a maximum run time of 1 hour. The results show that the performance of an algorithm depends on the initial grid state, grid size and amount of measures. The ILS method is very robust in most cases. In the larger test grids, more exploratory heuristics, e.g., GA and PSO, find solutions in shorter run times.

preprint2020arXiv

Competitive Gradient Descent

We introduce a new algorithm for the numerical computation of Nash equilibria of competitive two-player games. Our method is a natural generalization of gradient descent to the two-player setting where the update is given by the Nash equilibrium of a regularized bilinear local approximation of the underlying game. It avoids oscillatory and divergent behaviors seen in alternating gradient descent. Using numerical experiments and rigorous analysis, we provide a detailed comparison to methods based on \emph{optimism} and \emph{consensus} and show that our method avoids making any unnecessary changes to the gradient dynamics while achieving exponential (local) convergence for (locally) convex-concave zero sum games. Convergence and stability properties of our method are robust to strong interactions between the players, without adapting the stepsize, which is not the case with previous methods. In our numerical experiments on non-convex-concave problems, existing methods are prone to divergence and instability due to their sensitivity to interactions among the players, whereas we never observe divergence of our algorithm. The ability to choose larger stepsizes furthermore allows our algorithm to achieve faster convergence, as measured by the number of model evaluations.

preprint2020arXiv

Competitive Mirror Descent

Constrained competitive optimization involves multiple agents trying to minimize conflicting objectives, subject to constraints. This is a highly expressive modeling language that subsumes most of modern machine learning. In this work we propose competitive mirror descent (CMD): a general method for solving such problems based on first order information that can be obtained by automatic differentiation. First, by adding Lagrange multipliers, we obtain a simplified constraint set with an associated Bregman potential. At each iteration, we then solve for the Nash equilibrium of a regularized bilinear approximation of the full problem to obtain a direction of movement of the agents. Finally, we obtain the next iterate by following this direction according to the dual geometry induced by the Bregman potential. By using the dual geometry we obtain feasible iterates despite only solving a linear system at each iteration, eliminating the need for projection steps while still accounting for the global nonlinear structure of the constraint set. As a special case we obtain a novel competitive multiplicative weights algorithm for problems on the positive cone.

preprint2020arXiv

Tools for quantum simulation with ultracold atoms in optical lattices

After many years of development of the basic tools, quantum simulation with ultracold atoms has now reached the level of maturity where it can be used to investigate complex quantum processes. Planning of new experiments and upgrading existing set-ups depends crucially on a broad overview of the available techniques, their specific advantages and limitations. This Technical Review aims to provide a comprehensive compendium of the state of the art. We discuss the basic principles, the available techniques and their current range of applications. Focusing on the simulation of varied phenomena in solid-state physics using optical lattice experiments, we review their basics, the necessary techniques and the accessible physical parameters. We outline how to control and use interactions with external potentials and between the atoms, and how to design new synthetic gauge fields and spin-orbit coupling. We discuss the latest progress in site-resolved techniques using quantum gas microscopes, and describe the unique features of quantum simulation experiments with two-electron atomic species.

preprint2019arXiv

Ultracold collisions in the Yb-Li mixture system

We report our experimental results on the collisional physics between non-S-state atoms (ytterbium (Yb), effectively a two-electron system, in the metastable ${}^3\mathrm{P}_2$ state) and S-state atoms (lithium (Li), an alkali metal, in the ground state). At low magnetic fields, by measuring inelastic interspecies collisional losses in the double quantum degenerate mixture we reveal the strong dependence of the inelastic losses on the internal spin states of both species and suppressed losses in stretched state configurations. Increasing the magnetic field up to 800 G we further investigate the magnetic field dependence of the collisional interactions. There, smoothly increasing inelastic losses are observed towards higher fields. The combined knowledge of both the magnetic field and the spin state dependence of the collisional losses of this prototypical mixture system of non-S-state and S-state atoms provides a significant step forward towards controllable impurity physics realized in the Yb-Li ultracold system.