Source author record

Klaus Obermayer

Klaus Obermayer appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Neurons and Cognition Computer Vision Artificial Intelligence math.OC nlin.AO Computational Engineering, Finance, and Science Data Structures and Algorithms eess.AS math.DS math.PR Multimedia nlin.PS Sound

Catalog footprint

What is connected

25works

14topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Salience-SGG: Enhancing Unbiased Scene Graph Generation with Iterative Salience Estimation

Scene Graph Generation (SGG) suffers from a long-tailed distribution, where a few predicate classes dominate while many others are underrepresented, leading to biased models that underperform on rare relations. Unbiased-SGG methods address this issue by implementing debiasing strategies, but often at the cost of spatial understanding, resulting in an over-reliance on semantic priors. We introduce Salience-SGG, a novel framework featuring an Iterative Salience Decoder (ISD) that emphasizes triplets with salient spatial structures. To support this, we propose semantic-agnostic salience labels guiding ISD. Evaluations on Visual Genome, Open Images V6, and GQA-200 show that Salience-SGG achieves state-of-the-art performance and improves existing Unbiased-SGG methods in their spatial understanding as demonstrated by the Pairwise Localization Average Precision

preprint2022arXiv

Risk-Sensitive Partially Observable Markov Decision Processes as Fully Observable Multivariate Utility Optimization problems

We provide a new algorithm for solving Risk Sensitive Partially Observable Markov Decisions Processes, when the risk is modeled by a utility function, and both the state space and the space of observations is finite. This algorithm is based on an observation that the change of measure and the subsequent introduction of the information space that is used for exponential utility functions, can be actually extended for sums of exponentials if one introduces an extra vector parameter that tracks the "expected accumulated cost" that corresponds to each exponential. Since every increasing function can be approximated by sums of exponentials in finite intervals, the method can be essentially applied for any utility function, with its complexity depending on the number of exponentials.

preprint2022arXiv

Similarity of Pre-trained and Fine-tuned Representations

In transfer learning, only the last part of the networks - the so-called head - is often fine-tuned. Representation similarity analysis shows that the most significant change still occurs in the head even if all weights are updatable. However, recent results from few-shot learning have shown that representation change in the early layers, which are mostly convolutional, is beneficial, especially in the case of cross-domain adaption. In our paper, we find out whether that also holds true for transfer learning. In addition, we analyze the change of representation in transfer learning, both during pre-training and fine-tuning, and find out that pre-trained structure is unlearned if not usable.

preprint2022arXiv

Transfer Learning for Segmentation Problems: Choose the Right Encoder and Skip the Decoder

It is common practice to reuse models initially trained on different data to increase downstream task performance. Especially in the computer vision domain, ImageNet-pretrained weights have been successfully used for various tasks. In this work, we investigate the impact of transfer learning for segmentation problems, being pixel-wise classification problems that can be tackled with encoder-decoder architectures. We find that transfer learning the decoder does not help downstream segmentation tasks, while transfer learning the encoder is truly beneficial. We demonstrate that pretrained weights for a decoder may yield faster convergence, but they do not improve the overall model performance as one can obtain equivalent results with randomly initialized decoders. However, we show that it is more effective to reuse encoder weights trained on a segmentation or reconstruction task than reusing encoder weights trained on classification tasks. This finding implicates that using ImageNet-pretrained encoders for downstream segmentation problems is suboptimal. We also propose a contrastive self-supervised approach with multiple self-reconstruction tasks, which provides encoders that are suitable for transfer learning in segmentation problems in the absence of segmentation labels.

preprint2020arXiv

The NIGENS General Sound Events Database

Computational auditory scene analysis is gaining interest in the last years. Trailing behind the more mature field of speech recognition, it is particularly general sound event detection that is attracting increasing attention. Crucial for training and testing reasonable models is having available enough suitable data -- until recently, general sound event databases were hardly found. We release and present a database with 714 wav files containing isolated high quality sound events of 14 different types, plus 303 `general' wav files of anything else but these 14 types. All sound events are strongly labeled with perceptual on- and offset times, paying attention to omitting in-between silences. The amount of isolated sound events, the quality of annotations, and the particular general sound class distinguish NIGENS from other databases.

preprint2020arXiv

Training Generative Networks with general Optimal Transport distances

We propose a new algorithm that uses an auxiliary neural network to express the potential of the optimal transport map between two data distributions. In the sequel, we use the aforementioned map to train generative networks. Unlike WGANs, where the Euclidean distance is ${\it implicitly}$ used, this new method allows to ${\it explicitly}$ use ${\it any}$ transportation cost function that can be chosen to match the problem at hand. For example, it allows to use the squared distance as a transportation cost function, giving rise to the Wasserstein-2 metric for probability distributions, which results in fast and stable gradient descends. It also allows to use image centered distances, like the structure similarity index, with notable differences in the results.

preprint2019arXiv

A Fenchel-Moreau-Rockafellar type theorem on the Kantorovich-Wasserstein space with Applications in Partially Observable Markov Decision Processes

By using the fact that the space of all probability measures with finite support can be somehow completed in two different fashions, one generating the Arens-Eells space and another generating the Kantorovich-Wasserstein (Wasserstein-1) space, and by exploiting the duality relationship between the Arens-Eells space with the space of Lipschitz functions, we provide a dual representation of Fenchel-Moreau-Rockafellar type for proper convex functionals on Wasserstein-1. We retrieve dual transportation inequalities as a Corollary and we provide examples where the theorem can be used to easily prove dual expressions like the celebrated Donsker-Varadhan variational formula. Finally our result allows to write convex functions as the supremum over all linear functions that are generated by roots of its conjugate dual, something that we apply to the field of Partially observable Markov decision processes (POMDPs) to approximate the value function of a given POMDP by iterating level sets. This extends the method used in Smallwood 1973 for finite state spaces to the case were the state space is a Polish metric space.

preprint2016arXiv

Controlling Statistical Moments of Stochastic Dynamical Networks

We consider a general class of stochastic networks and ask which network nodes need to be controlled, and how, to stabilize and switch between desired metastable (target) states in terms of the first and second statistical moments of the system. We first show that it is sufficient to directly interfere with a subset of nodes which can be identified using information about the graph of the network only. Then, we develop a suitable method for feedback control which acts on that subset of nodes and preserves the covariance structure of the desired target state. Finally, we demonstrate our theoretical results using a stochastic Hopfield network and a global brain model. Our results are applicable to a variety of (model) networks, and further our understanding of the relationship between network structure and collective dynamics for the benefit of effective control.

preprint2016arXiv

Extending integrate-and-fire model neurons to account for the effects of weak electric fields and input filtering mediated by the dendrite

How extracellular electric fields, as generated endogenously or through transcranial brain stimulation, affect the dynamics of large neuronal populations is of great interest but not well understood. To study the collective dynamics of large populations single-compartment (point) model neurons have been proven very successful. These models, however, lack the dendritic morphology to biophysically account for the effects of electric fields, and for changes in synaptic integration due to morphology alone. Here we (i) characterize the response of a canonical spatial (ball-and-stick) model neuron to fluctuating synaptic input as well as an oscillatory, weak electric field, and (ii) analytically derive an extension for popular integrate-and-fire point neuron models to accurately reproduce these responses. We obtain distinct filters mediated by the dendrite for inputs at the soma (high-pass filter) or at the distal dendritic site (low-pass filter), and find that the electric field induces spike rate resonance in the beta and gamma frequency bands or even higher frequencies, depending on the location of synaptic background input. Due to their computational efficiency the extended point models are well suited for application in large populations of coupled neurons with different morphology, exposed to extracellular electric fields.

preprint2016arXiv

Non-Deterministic Policy Improvement Stabilizes Approximated Reinforcement Learning

This paper investigates a type of instability that is linked to the greedy policy improvement in approximated reinforcement learning. We show empirically that non-deterministic policy improvement can stabilize methods like LSPI by controlling the improvements' stochasticity. Additionally we show that a suitable representation of the value function also stabilizes the solution to some degree. The presented approach is simple and should also be easily transferable to more sophisticated algorithms like deep reinforcement learning.

preprint2015arXiv

On Average Risk-sensitive Markov Control Processes

We introduce the Lyapunov approach to optimal control problems of average risk-sensitive Markov control processes with general risk maps. Motivated by applications in particular to behavioral economics, we consider possibly non-convex risk maps, modeling behavior with mixed risk preference. We introduce classical objective functions to the risk-sensitive setting and we are in particular interested in optimizing the average risk in the infinite-time horizon for Markov Control Processes on general, possibly non-compact, state spaces allowing also unbounded cost. Existence and uniqueness of an optimal control is obtained with a fixed point theorem applied to the nonlinear map modeling the risk-sensitive expected total cost. The necessary contraction is obtained in a suitable chosen seminorm under a new set of conditions: 1) Lyapunov-type conditions on both risk maps and cost functions that control the growth of iterations, and 2) Doeblin-type conditions, known for Markov chains, generalized to nonlinear mappings. In the particular case of the entropic risk map, the above conditions can be replaced by the existence of a Lyapunov function, a local Doeblin-type condition for the underlying Markov chain, and a growth condition on the cost functions.

preprint2015arXiv

Regression with Linear Factored Functions

Many applications that use empirically estimated functions face a curse of dimensionality, because the integrals over most function classes must be approximated by sampling. This paper introduces a novel regression-algorithm that learns linear factored functions (LFF). This class of functions has structural properties that allow to analytically solve certain integrals and to calculate point-wise products. Applications like belief propagation and reinforcement learning can exploit these properties to break the curse and speed up computation. We derive a regularized greedy optimization scheme, that learns factored basis functions during training. The novel regression algorithm performs competitively to Gaussian processes on benchmark tasks, and the learned LFF functions are with 4-9 factored basis functions on average very compact.

preprint2014arXiv

Analyzing critical propagation in a reaction-diffusion-advection model using unstable slow waves

The effect of advection on the critical minimal speed of traveling waves is studied. Previous theoretical studies estimated the effect on the velocity of stable fast waves and predicted the existence of a critical advection strength below which propagating waves are not supported anymore. In this paper, the critical advection strength is calculated taking into account the unstable slow wave solution. Thereby, theoretical results predict, that advection can induce stable wave propagation in the non-excitable parameter regime, if the advection strength exceeds a critical value. In addition, an analytical expression for the advection-velocity relation of the unstable slow wave is derived. Predictions are confirmed numerically in a two-variable reaction-diffusion model.

preprint2014arXiv

Risk-sensitive Markov control processes

We introduce a general framework for measuring risk in the context of Markov control processes with risk maps on general Borel spaces that generalize known concepts of risk measures in mathematical finance, operations research and behavioral economics. Within the framework, applying weighted norm spaces to incorporate also unbounded costs, we study two types of infinite-horizon risk-sensitive criteria, discounted total risk and average risk, and solve the associated optimization problems by dynamic programming. For the discounted case, we propose a new discount scheme, which is different from the conventional form but consistent with the existing literature, while for the average risk criterion, we state Lyapunov-like stability conditions that generalize known conditions for Markov chains to ensure the existence of solutions to the optimality equation.

preprint2014arXiv

Risk-sensitive Reinforcement Learning

We derive a family of risk-sensitive reinforcement learning methods for agents, who face sequential decision-making tasks in uncertain environments. By applying a utility function to the temporal difference (TD) error, nonlinear transformations are effectively applied not only to the received rewards but also to the true transition probabilities of the underlying Markov decision process. When appropriate utility functions are chosen, the agents' behaviors express key features of human behavior as predicted by prospect theory (Kahneman and Tversky, 1979), for example different risk-preferences for gains and losses as well as the shape of subjective probability curves. We derive a risk-sensitive Q-learning algorithm, which is necessary for modeling human behavior when transition probabilities are unknown, and prove its convergence. As a proof of principle for the applicability of the new framework we apply it to quantify human behavior in a sequential investment task. We find, that the risk-sensitive variant provides a significantly better fit to the behavioral data and that it leads to an interpretation of the subject's responses which is indeed consistent with prospect theory. The analysis of simultaneously measured fMRI signals show a significant correlation of the risk-sensitive TD error with BOLD signal change in the ventral striatum. In addition we find a significant correlation of the risk-sensitive Q-values with neural activity in the striatum, cingulate cortex and insula, which is not present if standard Q-values are used.

preprint2013arXiv

Adaptation controls synchrony and cluster states of coupled threshold-model neurons

We analyze zero-lag and cluster synchrony of delay-coupled non-smooth dynamical systems by extending the master stability approach, and apply this to networks of adaptive threshold-model neurons. For a homogeneous population of excitatory and inhibitory neurons we find (i) that subthreshold adaptation stabilizes or destabilizes synchrony depending on whether the recurrent synaptic excitatory or inhibitory couplings dominate, and (ii) that synchrony is always unstable for networks with balanced recurrent synaptic inputs. If couplings are not too strong, synchronization properties are similar for very different coupling topologies, i.e., random connections or spatial networks with localized connectivity. We generalize our approach for two subpopulations of neurons with non-identical local dynamics, including bursting, for which activity-based adaptation controls the stability of cluster states, independent of a specific coupling topology.

preprint2013arXiv

Adaptation controls synchrony and cluster states of coupled threshold-model neurons: Supplemental Material

Derivation of the transition conditions for the variational equations for zero-lag and cluster synchrony.

preprint2013arXiv

Afferent specificity, feature specific connectivity influence orientation selectivity: A computational study in mouse primary visual cortex

Primary visual cortex (V1) provides crucial insights into the selectivity and emergence of specific output features such as orientation tuning. Tuning and selectivity of cortical neurons in mouse visual cortex is not equivocally resolved so far. While many in-vivo experimental studies found inhibitory neurons of all subtypes to be broadly tuned for orientation other studies report inhibitory neurons that are as sharply tuned as excitatory neurons. These diverging findings about the selectivity of excitatory and inhibitory cortical neurons prompted us to ask the following questions: (1) How different or similar is the cortical computation with that in previously described species that relies on map? (2) What is the network mechanism underlying the sharpening of orientation selectivity in the mouse primary visual cortex? Here, we investigate the above questions in a computational framework with a recurrent network composed of Hodgkin-Huxley (HH) point neurons. Our cortical network with random connectivity alone could not account for all the experimental observations, which led us to hypothesize, (a) Orientation dependent connectivity (b) Feedforward afferent specificity to understand orientation selectivity of V1 neurons in mouse. Using population (orientation selectivity index) OSI as a measure of neuronal selectivity to stimulus orientation we test each hypothesis separately and in combination against experimental data. Based on our analysis of orientation selectivity (OS) data we find a good fit of network parameters in a model based on afferent specificity and connectivity that scales with feature similarity. We conclude that this particular model class best supports data sets of orientation selectivity of excitatory and inhibitory neurons in layer 2/3 of primary visual cortex of mouse.

preprint2013arXiv

How adaptation currents change threshold, gain and variability of neuronal spiking

Many types of neurons exhibit spike rate adaptation, mediated by intrinsic slow $\mathrm{K}^+$-currents, which effectively inhibit neuronal responses. How these adaptation currents change the relationship between in-vivo like fluctuating synaptic input, spike rate output and the spike train statistics, however, is not well understood. In this computational study we show that an adaptation current which primarily depends on the subthreshold membrane voltage changes the neuronal input-output relationship (I-O curve) subtractively, thereby increasing the response threshold. A spike-dependent adaptation current alters the I-O curve divisively, thus reducing the response gain. Both types of adaptation currents naturally increase the mean inter-spike interval (ISI), but they can affect ISI variability in opposite ways. A subthreshold current always causes an increase of variability while a spike-triggered current decreases high variability caused by fluctuation-dominated inputs and increases low variability when the average input is large. The effects on I-O curves match those caused by synaptic inhibition in networks with asynchronous irregular activity, for which we find subtractive and divisive changes caused by external and recurrent inhibition, respectively. Synaptic inhibition, however, always increases the ISI variability. We analytically derive expressions for the I-O curve and ISI variability, which demonstrate the robustness of our results. Furthermore, we show how the biophysical parameters of slow $\mathrm{K}^+$-conductances contribute to the two different types of adaptation currents and find that $\mathrm{Ca}^{2+}$-activated $\mathrm{K}^+$-currents are effectively captured by a simple spike-dependent description, while muscarine-sensitive or $\mathrm{Na}^+$-activated $\mathrm{K}^+$-currents show a dominant subthreshold component.

preprint2013arXiv

Impact of adaptation currents on synchronization of coupled exponential integrate-and-fire neurons

Author summary: Synchronization of neuronal spiking in the brain is related to cognitive functions, such as perception, attention, and memory. It is therefore important to determine which properties of neurons influence their collective behavior in a network and to understand how. A prominent feature of many cortical neurons is spike frequency adaptation, which is caused by slow transmembrane currents. We investigated how these adaptation currents affect the synchronization tendency of coupled model neurons. Using the efficient adaptive exponential integrate-and-fire (aEIF) model and a biophysically detailed neuron model for validation, we found that increased adaptation currents promote synchronization of coupled excitatory neurons at lower spike frequencies, as long as the conduction delays between the neurons are negligible. Inhibitory neurons on the other hand synchronize in presence of conduction delays, with or without adaptation currents. Our results emphasize the utility of the aEIF model for computational studies of neuronal network dynamics. We conclude that adaptation currents provide a mechanism to generate low frequency oscillations in local populations of excitatory neurons, while faster rhythms seem to be caused by inhibition rather than excitation.

preprint2012arXiv

Learning in Riemannian Orbifolds

Learning in Riemannian orbifolds is motivated by existing machine learning algorithms that directly operate on finite combinatorial structures such as point patterns, trees, and graphs. These methods, however, lack statistical justification. This contribution derives consistency results for learning problems in structured domains and thereby generalizes learning in vector spaces and manifolds.

preprint2011arXiv

Extending Bron Kerbosch for Solving the Maximum Weight Clique Problem

This contribution extends the Bron Kerbosch algorithm for solving the maximum weight clique problem, where continuous-valued weights are assigned to both, vertices and edges. We applied the proposed algorithm to graph matching problems.

preprint2011arXiv

Probabilistic prototype models for attributed graphs

This contribution proposes a new approach towards developing a class of probabilistic methods for classifying attributed graphs. The key concept is random attributed graph, which is defined as an attributed graph whose nodes and edges are annotated by random variables. Every node/edge has two random processes associated with it- occurence probability and the probability distribution over the attribute values. These are estimated within the maximum likelihood framework. The likelihood of a random attributed graph to generate an outcome graph is used as a feature for classification. The proposed approach is fast and robust to noise.

preprint2010arXiv

Accelerating Competitive Learning Graph Quantization

Vector quantization(VQ) is a lossy data compression technique from signal processing for which simple competitive learning is one standard method to quantize patterns from the input space. Extending competitive learning VQ to the domain of graphs results in competitive learning for quantizing input graphs. In this contribution, we propose an accelerated version of competitive learning graph quantization (GQ) without trading computational time against solution quality. For this, we lift graphs locally to vectors in order to avoid unnecessary calculations of intractable graph distances. In doing so, the accelerated version of competitive learning GQ gradually turns locally into a competitive learning VQ with increasing number of iterations. Empirical results show a significant speedup by maintaining a comparable solution quality.

preprint2010arXiv

Graph Quantization

Vector quantization(VQ) is a lossy data compression technique from signal processing, which is restricted to feature vectors and therefore inapplicable for combinatorial structures. This contribution presents a theoretical foundation of graph quantization (GQ) that extends VQ to the domain of attributed graphs. We present the necessary Lloyd-Max conditions for optimality of a graph quantizer and consistency results for optimal GQ design based on empirical distortion measures and stochastic optimization. These results statistically justify existing clustering algorithms in the domain of graphs. The proposed approach provides a template of how to link structural pattern recognition methods other than GQ to statistical pattern recognition.

Klaus Obermayer

What is connected

Connect this record

See the researcher in context

Building this map preview

25 published item(s)

Salience-SGG: Enhancing Unbiased Scene Graph Generation with Iterative Salience Estimation

Risk-Sensitive Partially Observable Markov Decision Processes as Fully Observable Multivariate Utility Optimization problems

Similarity of Pre-trained and Fine-tuned Representations

Transfer Learning for Segmentation Problems: Choose the Right Encoder and Skip the Decoder

The NIGENS General Sound Events Database

Training Generative Networks with general Optimal Transport distances

A Fenchel-Moreau-Rockafellar type theorem on the Kantorovich-Wasserstein space with Applications in Partially Observable Markov Decision Processes

Controlling Statistical Moments of Stochastic Dynamical Networks

Extending integrate-and-fire model neurons to account for the effects of weak electric fields and input filtering mediated by the dendrite

Non-Deterministic Policy Improvement Stabilizes Approximated Reinforcement Learning

On Average Risk-sensitive Markov Control Processes

Regression with Linear Factored Functions

Analyzing critical propagation in a reaction-diffusion-advection model using unstable slow waves

Risk-sensitive Markov control processes

Risk-sensitive Reinforcement Learning

Adaptation controls synchrony and cluster states of coupled threshold-model neurons

Adaptation controls synchrony and cluster states of coupled threshold-model neurons: Supplemental Material

Afferent specificity, feature specific connectivity influence orientation selectivity: A computational study in mouse primary visual cortex

How adaptation currents change threshold, gain and variability of neuronal spiking

Impact of adaptation currents on synchronization of coupled exponential integrate-and-fire neurons

Learning in Riemannian Orbifolds

Extending Bron Kerbosch for Solving the Maximum Weight Clique Problem

Probabilistic prototype models for attributed graphs

Accelerating Competitive Learning Graph Quantization

Graph Quantization