Researcher profile

David Müller

David Müller contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

CoCo-InEKF: State Estimation with Learned Contact Covariances in Dynamic, Contact-Rich Scenarios

Robust state estimation for highly dynamic motion of legged robots remains challenging, especially in dynamic, contact-rich scenarios. Traditional approaches often rely on binary contact states that fail to capture the nuances of partial contact or directional slippage. This paper presents CoCo-InEKF, a differentiable invariant extended Kalman filter that utilizes continuous contact velocity covariances instead of binary contact states. These learned covariances allow the method to dynamically modulate contact confidence, accounting for more nuanced conditions ranging from firm contact to directional slippage or no contact. To predict these covariances for a set of predefined contact candidate points, we employ a lightweight neural network trained end-to-end using a state-error loss. This approach eliminates the need for heuristic ground-truth contact labels. In addition, we propose an automated contact candidate selection procedure and demonstrate that our method is insensitive to their exact placement. Experiments on a bipedal robot demonstrate a superior accuracy-efficiency tradeoff for linear velocity estimation, as well as improved filter consistency compared to baseline methods. This enables the robust execution of challenging motions, including dancing and complex ground interactions -- both in simulation and in the real world.

preprint2026arXiv

ReActor: Reinforcement Learning for Physics-Aware Motion Retargeting

Retargeting human kinematic reference motion onto a robot's morphology remains a formidable challenge. Existing methods often produce physical inconsistencies, such as foot sliding, self-collisions, or dynamically infeasible motions, which hinder downstream imitation learning. We propose a bilevel optimization framework that jointly adapts reference motions to a robot's morphology while training a tracking policy using reinforcement learning. To make the optimization tractable, we derive an approximate gradient for the upper-level loss. Our framework requires only a sparse set of semantic rigid-body correspondences and eliminates the need for manual tuning by identifying optimal values for a parameterization expressive enough to preserve characteristic motion across different embodiments. Moreover, by integrating retargeting directly with physics simulation, we produce physically plausible motions that facilitate robust imitation learning. We validate our method in simulation and on hardware, demonstrating challenging motions for morphologies that differ significantly from a human, including retargeting onto a quadruped.

preprint2021arXiv

Dynamic pricing under nested logit demand

Recently, there is growing interest and need for dynamic pricing algorithms, especially, in the field of online marketplaces by offering smart pricing options for big online stores. We present an approach to adjust prices based on the observed online market data. The key idea is to characterize optimal prices as minimizers of a total expected revenue function, which turns out to be convex. We assume that consumers face information processing costs, hence, follow a discrete choice demand model, and suppliers are equipped with quantity adjustment costs. We prove the strong smoothness of the total expected revenue function by deriving the strong convexity modulus of its dual. Our gradient-based pricing schemes outbalance supply and demand at the convergence rates of $\mathcal{O}(\frac{1}{t})$ and $\mathcal{O}(\frac{1}{t^2})$, respectively. This suggests that the imperfect behavior of consumers and suppliers helps to stabilize the market.

preprint2020arXiv

Chaining of hard disks in nematic needles: particle-based simulation of colloidal interactions in liquid crystals

Colloidal particles suspended in liquid crystals can exhibit various effective anisotropic interactions that can be tuned and utilized in self-assembly processes. We simulate a two-dimensional system of hard disks suspended in a solution of dense hard needles as a model system for colloids suspended in a nematic lyotropic liquid crystal. The novel event-chain Monte Carlo technique enables us to directly measure colloidal interactions in a microscopic simulation with explicit liquid crystal particles in the dense nematic phase. We find a directional short-range attraction for disks along the director, which triggers chaining parallel to the director and seemingly contradicts the standard liquid crystal field theory result of a quadrupolar attraction with a preferred ${45^{\circ}}$ angle. Our results can be explained by a short-range density-dependent depletion interaction, which has been neglected so far. Directionality and strength of the depletion interaction are caused by the weak planar anchoring of hard rods. The depletion attraction robustly dominates over the quadrupolar elastic attraction if disks come close. Self-assembly of many disks proceeds via intermediate chaining, which demonstrates that in lyotropic liquid crystal colloids depletion interactions play an important role in structure formation processes.

preprint2020arXiv

Simulations of the Glasma in 3+1D

The Glasma is a gluonic state of matter which can be created in collisions of relativistic heavy ions and is a precursor to the quark-gluon plasma. The existence of this state is a prediction of the color glass condensate (CGC) effective theory. In many applications of the CGC framework, the boost invariant approximation is employed. It assumes that the longitudinal extent of the nuclei can be approximated as infinitesimally thin. Consequently, the Glasma produced from such a collision is boost invariant and can be effectively described in 2+1D. Therefore, observables of the boost invariant Glasma are by construction independent of rapidity. The main goal of this thesis is to develop a numerical method for the non-boost-invariant setting where nuclei are assumed to be thin, but of finite longitudinal extent. This is in conflict with a number of simplifications that are used in the boost invariant case. In particular, one has to describe the collisions in 3+1D in the laboratory or center-of-mass frame. The change of frame forces the explicit inclusion of the color charges of nuclei. The new method is tested using an extension of the McLerran-Venugopalan model which includes a parameter for longitudinal thickness. It reproduces the boost invariant setting as a limiting case. Studying the pressure components of the Glasma, one finds the pressure anisotropy remains large. The energy density of the Glasma depends on rapidity due to the explicit breaking of boost invariance. The width of the observed rapidity profiles is controlled by the collision energy and can be shown to roughly agree with experimental data. Finally, a new numerical scheme for real-time lattice gauge theory is developed which provides higher numerical stability than the previous method. This new scheme is shown to be gauge-covariant and conserves the Gauss constraint even for large time steps.