Source author record

Patrick Krauss

Patrick Krauss appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Neurons and Cognition Quantitative Methods Artificial Intelligence Machine Learning Computer Vision Neural and Evolutionary Computing cond-mat.soft Information Theory math.IT physics.med-ph q-fin.CP

Catalog footprint

What is connected

15works

11topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Are cortical microcircuits optimized for information flux? -- A simulation-based reverse engineering study

A sufficiently large information flux in recurrent neural networks, quantified by the mutual information between successive network states, is considered a prerequisite for rich information processing capabilities. This raises the question of whether biological neural networks, such as cortical microcolumns, may be structurally organized to enhance information flux. To investigate this possibility, we study a simplified model of the cortical layer 5 architecture, in which a densely and strongly interconnected core population is embedded within a larger supporting network. Surprisingly, we find that the embedding network exerts a pronounced flux-enhancing effect on the core dynamics. Systematic reverse-engineering analyses reveal that the embedding network provides two key contributions: first, it generates effective biases that shift core neurons into a higher-entropy operating regime; second, it supplies stochastic fluctuations that prevent the network from becoming trapped in simple fixed-point or oscillatory attractors through the mechanism of Recurrence Resonance. We further show that the information flux can be increased even beyond the biologically embedded case by applying individually optimized biases to the core neurons, and that these biases can emerge from a simple self-organization principle. Our findings are relevant both for the functional interpretation of biological neural circuits and for the design of artificial recurrent systems such as reservoir computers.

preprint2026arXiv

Deep Reinforcement Learning for Optimum Order Execution: Mitigating Risk and Maximizing Returns

Optimal Order Execution is a well-established problem in finance that pertains to the flawless execution of a trade (buy or sell) for a given volume within a specified time frame. This problem revolves around optimizing returns while minimizing risk, yet recent research predominantly focuses on addressing one aspect of this challenge. In this paper, we introduce an innovative approach to Optimal Order Execution within the US market, leveraging Deep Reinforcement Learning (DRL) to effectively address this optimization problem holistically. Our study assesses the performance of our model in comparison to two widely employed execution strategies: Volume Weighted Average Price (VWAP) and Time Weighted Average Price (TWAP). Our experimental findings clearly demonstrate that our DRL-based approach outperforms both VWAP and TWAP in terms of return on investment and risk management. The model's ability to adapt dynamically to market conditions, even during periods of market stress, underscores its promise as a robust solution.

preprint2026arXiv

Neuroscience-Inspired Analyses of Visual Interestingness in Multimodal Transformers

Human attention is the gateway to conscious perception, memory and decision-making. However, its role in modern transformer models remains largely unexplored. As these systems increasingly influence what people see, prefer and buy, the question arises as to whether they encode principles of human interest or merely exploit large-scale correlations. Addressing this issue is crucial for understanding cognition and ensuring the responsible use of AI in communication and marketing. In order to address this issue, the concept of visual interest was examined within the multimodal vision-language-model Qwen3-VL-8B, using a pre-defined Common Interestingness (CI) score derived from large-scale human engagement data on the photo-sharing platform Flickr. Here, we analyzed internal representations across vision and language components using methods from the neurosciences. Our analyses revealed that CI information is linearly decodable from final-layer embeddings, indicating that it is aligned with human-derived measures of visual interestingness. Dimensionality reduction and Generalized Discrimination Value (GDV) analyses demonstrate that CI-related hidden representations emerge in intermediate vision transformer layers and becomes progressively more distinguishable across language model layers. Concept vectors derived using geometric, probe, and Sparse Auto-Encoder based methods converge in higher layers, as confirmed by representational similarity analysis. This indicates a robust and structured encoding of visual interestingness without explicit supervision. Future work will seek to identify shared computational principles linking human brain dynamics and transformer architectures, with the ultimate goal of uncovering the organizing mechanisms that give rise to attention and interest in both biological and artificial systems.

preprint2023arXiv

Extracting continuous sleep depth from EEG data without machine learning

The human sleep-cycle has been divided into discrete sleep stages that can be recognized in electroencephalographic (EEG) and other bio-signals by trained specialists or machine learning systems. It is however unclear whether these human-defined stages can be re-discovered with unsupervised methods of data analysis, using only a minimal amount of generic pre-processing. Based on EEG data, recorded overnight from sleeping human subjects, we investigate the degree of clustering of the sleep stages using the General Discrimination Value as a quantitative measure of class separability. Virtually no clustering is found in the raw data, even after transforming the EEG signals of each thirty-second epoch from the time domain into the more informative frequency domain. However, a Principal Component Analysis (PCA) of these epoch-wise frequency spectra reveals that the sleep stages separate significantly better in the low-dimensional sub-space of certain PCA components. In particular the component $C_1(t)$ can serve as a robust, continuous 'master variable' that encodes the depth of sleep and therefore correlates strongly with the 'hypnogram', a common plot of the discrete sleep stages over time. Moreover, $C_1(t)$ shows persistent trends during extended time periods where the sleep stage is constant, suggesting that sleep may be better understood as a continuum. These intriguing properties of $C_1(t)$ are not only relevant for understanding brain dynamics during sleep, but might also be exploited in low-cost single-channel sleep tracking devices for private and clinical use.

preprint2023arXiv

Multi-Modal Cognitive Maps based on Neural Networks trained on Successor Representations

Cognitive maps are a proposed concept on how the brain efficiently organizes memories and retrieves context out of them. The entorhinal-hippocampal complex is heavily involved in episodic and relational memory processing, as well as spatial navigation and is thought to built cognitive maps via place and grid cells. To make use of the promising properties of cognitive maps, we set up a multi-modal neural network using successor representations which is able to model place cell dynamics and cognitive map representations. Here, we use multi-modal inputs consisting of images and word embeddings. The network learns the similarities between novel inputs and the training database and therefore the representation of the cognitive map successfully. Subsequently, the prediction of the network can be used to infer from one modality to another with over $90\%$ accuracy. The proposed method could therefore be a building block to improve current AI systems for better understanding of the environment and the different modalities in which objects appear. The association of specific modalities with certain encounters can therefore lead to context awareness in novel situations when similar encounters with less information occur and additional information can be inferred from the learned cognitive map. Cognitive maps, as represented by the entorhinal-hippocampal complex in the brain, organize and retrieve context from memories, suggesting that large language models (LLMs) like ChatGPT could harness similar architectures to function as a high-level processing center, akin to how the hippocampus operates within the cortex hierarchy. Finally, by utilizing multi-modal inputs, LLMs can potentially bridge the gap between different forms of data (like images and words), paving the way for context-awareness and grounding of abstract concepts through learned associations, addressing the grounding problem in AI.

preprint2022arXiv

Classification at the Accuracy Limit -- Facing the Problem of Data Ambiguity

Data classification, the process of analyzing data and organizing it into categories, is a fundamental computing problem of natural and artificial information processing systems. Ideally, the performance of classifier models would be evaluated using unambiguous data sets, where the 'correct' assignment of category labels to the input data vectors is unequivocal. In real-world problems, however, a significant fraction of actually occurring data vectors will be located in a boundary zone between or outside of all categories, so that perfect classification cannot even in principle be achieved. We derive the theoretical limit for classification accuracy that arises from the overlap of data categories. By using a surrogate data generation model with adjustable statistical properties, we show that sufficiently powerful classifiers based on completely different principles, such as perceptrons and Bayesian models, all perform at this universal accuracy limit. Remarkably, the accuracy limit is not affected by applying non-linear transformations to the data, even if these transformations are non-reversible and drastically reduce the information content of the input data. We compare emerging data embeddings produced by supervised and unsupervised training, using MNIST and human EEG recordings during sleep. We find that categories are not only well separated in the final layers of classifiers trained with back-propagation, but to a smaller degree also after unsupervised dimensionality reduction. This suggests that human-defined categories, such as hand-written digits or sleep stages, can indeed be considered as 'natural kinds'.

preprint2022arXiv

Neural Network based Successor Representations of Space and Language

How does the mind organize thoughts? The hippocampal-entorhinal complex is thought to support domain-general representation and processing of structural knowledge of arbitrary state, feature and concept spaces. In particular, it enables the formation of cognitive maps, and navigation on these maps, thereby broadly contributing to cognition. It has been proposed that the concept of multi-scale successor representations provides an explanation of the underlying computations performed by place and grid cells. Here, we present a neural network based approach to learn such representations, and its application to different scenarios: a spatial exploration task based on supervised learning, a spatial navigation task based on reinforcement learning, and a non-spatial task where linguistic constructions have to be inferred by observing sample sentences. In all scenarios, the neural network correctly learns and approximates the underlying structure by building successor representations. Furthermore, the resulting neural firing patterns are strikingly similar to experimentally observed place and grid cell firing patterns. We conclude that cognitive maps and neural network-based successor representations of structured knowledge provide a promising way to overcome some of the short comings of deep learning towards artificial general intelligence.

preprint2022arXiv

Predictive coding and stochastic resonance as fundamental principles of auditory perception

How is information processed in the brain during perception? Mechanistic insight is achieved only when experiments are employed to test formal or computational models. In analogy to lesion studies, phantom perception may serve as a vehicle to understand the fundamental processing principles underlying auditory perception. With a special focus on tinnitus -- as the prime example of auditory phantom perception -- we review recent work at the intersection of artificial intelligence, psychology, and neuroscience. In particular, we discuss why everyone with tinnitus suffers from hearing loss, but not everyone with hearing loss suffers from tinnitus. We argue that the increase of sensory precision due to Bayesian inference could be caused by intrinsic neural noise and lead to a prediction error in the cerebral cortex. Hence, two fundamental processing principles - being ubiquitous in the brain - provide the most explanatory power for the emergence of tinnitus: predictive coding as a top-down, and stochastic resonance as a complementary bottom-up mechanism. We conclude that both principles play a crucial role in healthy auditory perception.

preprint2021arXiv

Known Operator Learning and Hybrid Machine Learning in Medical Imaging -- A Review of the Past, the Present, and the Future

In this article, we perform a review of the state-of-the-art of hybrid machine learning in medical imaging. We start with a short summary of the general developments of the past in machine learning and how general and specialized approaches have been in competition in the past decades. A particular focus will be the theoretical and experimental evidence pro and contra hybrid modelling. Next, we inspect several new developments regarding hybrid machine learning with a particular focus on so-called known operator learning and how hybrid approaches gain more and more momentum across essentially all applications in medical imaging and medical image analysis. As we will point out by numerous examples, hybrid models are taking over in image reconstruction and analysis. Even domains such as physical simulation and scanner and acquisition design are being addressed using machine learning grey box modelling approaches. Towards the end of the article, we will investigate a few future directions and point out relevant areas in which hybrid modelling, meta learning, and other domains will likely be able to drive the state-of-the-art ahead.

preprint2020arXiv

Sparsity through evolutionary pruning prevents neuronal networks from overfitting

Modern Machine learning techniques take advantage of the exponentially rising calculation power in new generation processor units. Thus, the number of parameters which are trained to resolve complex tasks was highly increased over the last decades. However, still the networks fail - in contrast to our brain - to develop general intelligence in the sense of being able to solve several complex tasks with only one network architecture. This could be the case because the brain is not a randomly initialized neural network, which has to be trained by simply investing a lot of calculation power, but has from birth some fixed hierarchical structure. To make progress in decoding the structural basis of biological neural networks we here chose a bottom-up approach, where we evolutionarily trained small neural networks in performing a maze task. This simple maze task requires dynamical decision making with delayed rewards. We were able to show that during the evolutionary optimization random severance of connections lead to better generalization performance of the networks compared to fully connected networks. We conclude that sparsity is a central property of neural networks and should be considered for modern Machine learning approaches.

preprint2016arXiv

A statistical method for analyzing and comparing spatiotemporal cortical activation patterns

We present a new statistical method to analyze multichannel steady-state local field potentials (LFP) recorded within different sensory cortices of different rodent species. Our spatiotemporal multi-dimensional cluster statistics (MCS) method enables statistical analyzing and comparing clusters of data points in n-dimensional space. We demonstrate that using this approach stimulus-specific attractor-like spatiotemporal activity patterns can be detected and be significantly different from each other during stimulation with long-lasting stimuli. Our method may be applied to other types of multichannel neuronal data, like EEG, MEG or spiking responses and used for the development of new read-out algorithms of brain activity and by that opens new perspectives for the development of brain-computer interfaces.

preprint2016arXiv

Stochastic resonance controlled upregulation of internal noise after hearing loss as a putative correlate of tinnitus-related neuronal hyperactivity

Subjective tinnitus (ST) is generally assumed to be a consequence of hearing loss (HL). In animal studies acoustic trauma can lead to behavioral signs of ST, in human studies ST patients without increased hearing thresholds were found to suffer from so called hidden HL. Additionally, ST is correlated with pathologically increased spontaneous firing rates and neuronal hyperactivity (NH) along the auditory pathway. Homeostatic plasticity (HP) has been proposed as a compensation mechanism leading to the development of NH, arguing that after HL initially decreased mean firing rates of neurons are subsequently restored by increased spontaneous rates. However all HP models fundamentally lack explanatory power since the function of keeping mean firing rate constant remains elusive as does the benefit this might have in terms of information processing. Furthermore the neural circuitry being able to perform the comparison of preferred with actual mean firing rate remains unclear. Here we propose an entirely new interpretation of ST related development of NH in terms of information theory. We suggest that stochastic resonance (SR) plays a key role in short- and long-term plasticity within the auditory system and is the ultimate cause of NH and ST. SR has been found ubiquitous in neuroscience and refers to the phenomenon that sub-threshold, unperceivable signals can be transmitted by adding noise to sensor input. We argue that after HL, SR serves to lift signals above the increased hearing threshold, hence subsequently decreasing thresholds again. The increased amount of internal noise is the correlate of the NH, which finally leads to the development of ST, due to neuronal plasticity along the auditory pathway. We demonstrate the plausibility of our hypothesis by using a computational model and provide exemplarily findings of human and animal studies that are consistent with our model.

preprint2015arXiv

Adaptive stochastic resonance based on output autocorrelations

Successful detection of weak signals is a universal challenge for numerous technical and biological systems and crucially limits signal transduction and transmission. Stochastic resonance (SR) has been identified to have the potential to tackle this problem, namely to enable non-linear systems to detect small, otherwise sub-threshold signals by means of added non-zero noise. This has been demonstrated within a wide range of systems in physical, technological and biological contexts. Based on its ubiquitous importance, numerous theoretical and technical approaches aim at an optimization of signal transduction based on SR. Several quantities like mutual information, signal-to-noise-ratio, or the cross-correlation between input stimulus and resulting detector response have been used to determine optimal noise intensities for SR. The fundamental shortcoming with all these measures is that knowledge of the signal to be detected is required to compute them. This dilemma prevents the use of adaptive SR procedures in any application where the signal to be detected is unknown. We here show that the autocorrelation function (AC) of the detector response fundamentally overcomes this drawback. For a simplified model system, the equivalence of the output AC with the measures mentioned above is proven analytically. In addition, we test our approach numerically for a variety of systems comprising different input signals and different types of detectors. The results indicate a strong similarity between mutual information and output AC in terms of the optimal noise intensity for SR. Hence, using the output AC to adaptively vary the amount of added noise in order to maximize information transmission via SR might be a fundamental processing principle in nature, in particular within neural systems which could be implemented in future technical applications.

preprint2011arXiv

Poresizes in random line networks

Many natural fibrous networks with fiber diameters much smaller than the average poresize can be described as three-dimensional (3D) random line networks. We consider here a `Mikado' model for such systems, consisting of straight line segments of equal length, distributed homogeneously and isotropically in space. First, we derive analytically the probability density distribution $p(r_{no})$ for the `nearest obstacle distance' $r_{no}$ between a randomly chosen test point within the network pores and its closest neighboring point on a line segment. Second, we show that in the limit where the line segments are much longer than the typical pore size, $p(r_{no})$ becomes a Rayleigh distribution. The single parameter $σ$ of this Rayleigh distribution represents the most probable nearest obstacle distance and can be expressed in terms of the total line length per unit volume. Finally, we show by numerical simulations that $σ$ differs only by a constant factor from the intuitive notion of average `pore size', defined by finding the maximum sphere that fits into each pore and then averaging over the radii of these spheres.

preprint2011arXiv

Reconstructing fiber networks from confocal image stacks

We present a numerically efficient method to reconstruct a disordered network of thin biopolymers, such as collagen gels, from three-dimensional (3D) image stacks recorded with a confocal microscope. Our method is based on a template matching algorithm that simultaneously performs a binarization and skeletonization of the network. The size and intensity pattern of the template is automatically adapted to the input data so that the method is scale invariant and generic. Furthermore, the template matching threshold is iteratively optimized to ensure that the final skeletonized network obeys a universal property of voxelized random line networks, namely, solid-phase voxels have most likely three solid-phase neighbors in a $3\times3$ neighborhood. This optimization criterion makes our method free of user-defined parameters and the output exceptionally robust against imaging noise.

Patrick Krauss

What is connected

Connect this record

See the researcher in context

Building this map preview

15 published item(s)

Are cortical microcircuits optimized for information flux? -- A simulation-based reverse engineering study

Deep Reinforcement Learning for Optimum Order Execution: Mitigating Risk and Maximizing Returns

Neuroscience-Inspired Analyses of Visual Interestingness in Multimodal Transformers

Extracting continuous sleep depth from EEG data without machine learning

Multi-Modal Cognitive Maps based on Neural Networks trained on Successor Representations

Classification at the Accuracy Limit -- Facing the Problem of Data Ambiguity

Neural Network based Successor Representations of Space and Language

Predictive coding and stochastic resonance as fundamental principles of auditory perception

Known Operator Learning and Hybrid Machine Learning in Medical Imaging -- A Review of the Past, the Present, and the Future

Sparsity through evolutionary pruning prevents neuronal networks from overfitting

A statistical method for analyzing and comparing spatiotemporal cortical activation patterns

Stochastic resonance controlled upregulation of internal noise after hearing loss as a putative correlate of tinnitus-related neuronal hyperactivity

Adaptive stochastic resonance based on output autocorrelations

Poresizes in random line networks

Reconstructing fiber networks from confocal image stacks