Researcher profile

Benjamin Schäfer

Benjamin Schäfer contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

Explainable Load Forecasting with Covariate-Informed Time Series Foundation Models

Time Series Foundation Models (TSFMs) have recently emerged as general-purpose forecasting models and show considerable potential for applications in energy systems. However, applications in critical infrastructure like power grids require transparency to ensure trust and reliability and cannot rely on pure black-box models. To enhance the transparency of TSFMs, we propose an efficient algorithm for computing Shapley Additive Explanations (SHAP) tailored to these models. The proposed approach leverages the flexibility of TSFMs with respect to input context length and provided covariates. This property enables efficient temporal and covariate masking (selectively withholding inputs), allowing for a scalable explanation of model predictions using SHAP. We evaluate two TSFMs - Chronos-2 and TabPFN-TS - on a day-ahead load forecasting task for a transmission system operator (TSO). In a zero-shot setting, both models achieve predictive performance competitive with a Transformer model trained specifically on multiple years of TSO data. The explanations obtained through our proposed approach align with established domain knowledge, particularly as the TSFMs appropriately use weather and calendar information for load prediction. Overall, we demonstrate that TSFMs can serve as transparent and reliable tools for operational energy forecasting.

preprint2022arXiv

Lockdown effects on air quality in megacities during the first and second waves of COVID-19 pandemic

Air pollution is among the highest contributors to mortality worldwide, especially in urban areas. During spring 2020, many countries enacted social distancing measures in order to slow down the ongoing COVID-19 pandemic. A particularly drastic measure, the 'lockdown', urged people to stay at home and thereby prevent new COVID-19 infections during the first (2020) and second wave (2021) of the pandemic. In turn, it also reduced traffic and industrial activities. But how much did these lockdown measures improve air quality in large cities, and are there differences in how air quality was affected? Here, we analyse data from two megacities: London as an example for Europe and Delhi as an example for Asia. We consider data during first and second wave lockdowns and compare them to 2019 values. Overall, we find a reduction in almost all air pollutants with intriguing differences between the two cities except Delhi in 2021. In London, despite smaller average concentrations, we still observe high-pollutant states and an increased tendency towards extreme events (a higher kurtosis of the probability density during lockdown) during 2020 and low pollution levels during 2021. For Delhi, we observe a much stronger decrease of pollution concentrations, including high pollution states during 2020 and higher pollution levels in 2021. These results could help to design policies to improve long-term air quality in megacities.

preprint2022arXiv

Microscopic fluctuations in power-grid frequency recordings at the sub-second scale

Complex systems, such as the power grid, are essential for our daily lives. Many complex systems display (multi-)fractal behavior, correlated fluctuations and power laws. Whether the power-grid frequency, an indicator about the balance on supply and demand in the electricity grid, also displays such complexity remains a mostly open question. Within the present article, we utilize highly resolved measurements to quantify the properties of the power-grid frequency. We show that below 1 second, the dynamics may fundamentally change, including a suddenly increasing power spectral density, emergence of multifractality and a change of correlation behavior. We provide a simplified stochastic model involving positively correlated noise to reproduce the observed dynamics, possibly linked to frequency dependent loads. Finally, we stress the need for high-quality measurements and discuss how we obtained the data analyzed here.

preprint2022arXiv

Revealing drivers and risks for power grid frequency stability with explainable AI

Stable operation of the electrical power system requires the power grid frequency to stay within strict operational limits. With millions of consumers and thousands of generators connected to a power grid, detailed human-build models can no longer capture the full dynamics of this complex system. Modern machine learning algorithms provide a powerful alternative for system modelling and prediction, but the intrinsic black-box character of many models impedes scientific insights and poses severe security risks. Here, we show how eXplainable AI (XAI) alleviates these problems by revealing critical dependencies and influences on the power grid frequency. We accurately predict frequency stability indicators (such as RoCoF and Nadir) for three major European synchronous areas and identify key features that determine the power grid stability. Load ramps, specific generation ramps but also prices and forecast errors are central to understand and stabilize the power grid.

preprint2022arXiv

Spatial heterogeneity of air pollution statistics

Air pollution is one of the leading causes of death globally, and continues to have a detrimental effect on our health. In light of these impacts, an extensive range of statistical modelling approaches has been devised in order to better understand air pollution statistics. However, the time-varying statistics of different types of air pollutants are far from being fully understood. The observed probability density functions (PDFs) of concentrations depend very much on the spatial location and on the pollutant substance. In this paper, we analyse a large variety of data from 3544 different European monitoring sites and show that the PDFs of nitric oxide ($NO$), nitrogen dioxide ($NO2$) and particulate matter ($PM10$ and $PM2.5$) concentrations generically exhibit heavy tails and are asymptotically well approximated by $q$-exponential distributions with a given width parameter $λ$. We observe that the power-law parameter $q$ and the width parameter $λ$ vary widely for the different spatial locations. For each substance, we find different patterns of parameter clouds in the $(q, λ)$ plane. These depend on the type of pollutants and on the environmental characteristics (urban/suburban/rural/traffic/industrial/background). This means the effective statistical physics description of air pollution exhibits a strong degree of spatial heterogeneity.

preprint2021arXiv

Exploring deterministic frequency deviations with explainable AI

Deterministic frequency deviations (DFDs) critically affect power grid frequency quality and power system stability. A better understanding of these events is urgently needed as frequency deviations have been growing in the European grid in recent years. DFDs are partially explained by the rapid adjustment of power generation following the intervals of electricity trading, but this intuitive picture fails especially before and around noonday. In this article, we provide a detailed analysis of DFDs and their relation to external features using methods from explainable Artificial Intelligence. We establish a machine learning model that well describes the daily cycle of DFDs and elucidate key interdependencies using SHapley Additive exPlanations (SHAP). Thereby, we identify solar ramps as critical to explain patterns in the Rate of Change of Frequency (RoCoF).

preprint2021arXiv

Secondary control activation analysed and predicted with explainable AI

The transition to a renewable energy system poses challenges for power grid operation and stability. Secondary control is key in restoring the power system to its reference following a disturbance. Underestimating the necessary control capacity may require emergency measures, such as load shedding. Hence, a solid understanding of the emerging risks and the driving factors of control is needed. In this contribution, we establish an explainable machine learning model for the activation of secondary control power in Germany. Training gradient boosted trees, we obtain an accurate description of control activation. Using SHapely Additive exPlanation (SHAP) values, we investigate the dependency between control activation and external features such as the generation mix, forecasting errors, and electricity market data. Thereby, our analysis reveals drivers that lead to high reserve requirements in the German power system. Our transparent approach, utilizing open data and making machine learning models interpretable, opens new scientific discovery avenues.

preprint2020arXiv

Dynamically induced cascading failures in power grids

Reliable functioning of infrastructure networks is essential for our modern society. Cascading failures are the cause of most large-scale network outages. Although cascading failures often exhibit dynamical transients, the modeling of cascades has so far mainly focused on the analysis of sequences of steady states. In this article, we focus on electrical transmission networks and introduce a framework that takes into account both the event-based nature of cascades and the essentials of the network dynamics. We find that transients of the order of seconds in the flows of a power grid play a crucial role in the emergence of collective behaviors. We finally propose a forecasting method to identify critical lines and components in advance or during operation. Overall, our work highlights the relevance of dynamically induced failures on the synchronization dynamics of national power grids of different European countries and provides methods to predict and model cascading failures.

preprint2020arXiv

Multilayer modeling of adoption dynamics in energy demand management

Due to the emergence of new technologies, the whole electricity system is undergoing transformations on a scale and pace never observed before. The decentralization of energy resources and the smart grid have forced utility services to rethink their relationships with customers. Demand response (DR) seeks to adjust the demand for power instead of adjusting the supply. However, DR business models rely on customer participation and can only be effective when large numbers of customers in close geographic vicinity, e.g., connected to the same transformer, opt in. Here, we introduce a model for the dynamics of service adoption on a two-layer multiplex network: the layer of social interactions among customers and the power-grid layer connecting the households. While the adoption process - based on peer-to-peer communication - runs on the social layer, the time-dependent recovery rate of the nodes depends on the states of their neighbors on the power-grid layer, making an infected node surrounded by infectious ones less keen to recover. Numerical simulations of the model on synthetic and real-world networks show that a strong local influence of the customers' actions leads to a discontinuous transition where either none or all the nodes in the network are infected, depending on the infection rate and social pressure to adopt. We find that clusters of early adopters act as points of high local pressure, helping maintaining adopters, and facilitating the eventual adoption of all nodes. This suggests direct marketing strategies on how to efficiently establish and maintain new technologies such as DR schemes.

preprint2020arXiv

Open data base analysis of scaling and spatio-temporal properties of power grid frequencies

The electrical energy system has attracted much attention from an increasingly diverse research community. Many theoretical predictions have been made, from scaling laws of fluctuations to propagation velocities of disturbances. However, to validate any theory, empirical data from large-scale power systems are necessary but are rarely shared openly. Here, we analyse an open data base of measurements of electric power grid frequencies across 17 locations in 12 synchronous areas on three continents. The power grid frequency is of particular interest, as it indicates the balance of supply and demand and carries information on deterministic, stochastic, and control influences. We perform a broad analysis of the recorded data, compare different synchronous areas and validate a previously conjectured scaling law. Furthermore, we show how fluctuations change from local independent oscillations to a homogeneous bulk behaviour. Overall, the presented open data base and analyses constitute a step towards more shared, collaborative

preprint2020arXiv

Predictability of Power Grid Frequency

The power grid frequency is the central observable in power system control, as it measures the balance of electrical supply and demand. A reliable frequency forecast can facilitate rapid control actions and may thus greatly improve power system stability. Here, we develop a weighted-nearest-neighbor (WNN) predictor to investigate how predictable the frequency trajectories are. Our forecasts for up to one hour are more precise than averaged daily profiles and could increase the efficiency of frequency control actions. Furthermore, we gain an increased understanding of the specific properties of different synchronous areas by interpreting the optimal prediction parameters (number of nearest neighbors, the prediction horizon, etc.) in terms of the physical system. Finally, prediction errors indicate the occurrence of exceptional external perturbations. Overall, we provide a diagnostics tool and an accurate predictor of the power grid frequency time series, allowing better understanding of the underlying dynamics.

preprint2020arXiv

Superstatistical approach to air pollution statistics

Air pollution by Nitrogen Oxides (NOx) is a major concern in large cities as it has severe adverse health effects. However, the statistical properties of air pollutants are not fully understood. Here, we use methods borrowed from nonequilibrium statistical mechanics to construct suitable superstatistical models for air pollution statistics. In particular, we analyze time series of Nitritic Oxide ($NO$) and Nitrogen Dioxide ($NO_2$) concentrations recorded at several locations throughout Greater London. We find that the probability distributions of concentrations have heavy tails and that the dynamics is well-described by $χ^2$ superstatistics for $NO$ and inverse $χ^2$ superstatistics for $NO_2$. Our results can be used to give precise risk estimates of high-pollution situations and pave the way to mitigation strategies.

preprint2020arXiv

Universal properties of primary and secondary cosmic ray energy spectra

Atomic nuclei appearing in cosmic rays are typically classified as primary or secondary. However, a better understanding of their origin and propagation properties is still necessary. We analyse the flux of primary (He, C, O) and secondary nuclei (Li, Be, B) detected with rigidity (momentum/charge) between 2 GV and 3 TV by the Alpha Magnetic Spectrometer (AMS) on the International Space Station. We show that $q$-exponential distribution functions, as motivated by generalized versions of statistical mechanics with temperature fluctuations, provide excellent fits for the measured flux of all nuclei considered. Primary and secondary fluxes reveal a universal dependence on kinetic energy per nucleon for which the underlying energy distribution functions are solely distinguished by their effective degrees of freedom. All given spectra are characterized by a universal mean temperature parameter $\sim$ 200 MeV which agrees with the Hagedorn temperature. Our analysis suggests that QCD scattering processes together with nonequilibrium temperature fluctuations provide a plausible explanation for the observed universality in cosmic ray energy spectra. Our analysis suggests that QCD scattering processes together with nonequilibrium temperature fluctuations imprint universally onto the measured cosmic ray spectra, and produce a similar shape of energy spectra as high energy collider experiments on the Earth.

preprint2020arXiv

Wind Power Persistence Characterized by Superstatistics

Mitigating climate change demands a transition towards renewable electricity generation, with wind power being a particularly promising technology. Long periods either of high or of low wind therefore essentially define the necessary amount of storage to balance the power system. While the general statistics of wind velocities have been studied extensively, persistence (waiting) time statistics of wind is far from well understood. Here, we investigate the statistics of both high- and low-wind persistence. We find heavy tails and explain them as a superposition of different wind conditions, requiring $q$-exponential distributions instead of exponential distributions. Persistent wind conditions are not necessarily caused by stationary atmospheric circulation patterns nor by recurring individual weather types but may emerge as a combination of multiple weather types and circulation patterns. Understanding wind persistence statistically and synoptically, may help to ensure a reliable and economically feasible future energy system, which uses a high share of wind generation.

preprint2019arXiv

Data-driven model of the power-grid frequency dynamics

The energy system is rapidly changing to accommodate the increasing number of renewable generators and the general transition towards a more sustainable future. Simultaneously, business models and market designs evolve, affecting power-grid operation and power-grid frequency. Problems raised by this ongoing transition are increasingly addressed by transdisciplinary research approaches, ranging from purely mathematical modelling to applied case studies. These approaches require a stochastic description of consumer behaviour, fluctuations by renewables, market rules, and how they influence the stability of the power-grid frequency. Here, we introduce an easy-to-use, data-driven, stochastic model for the power-grid frequency and demonstrate how it reproduces key characteristics of the observed statistics of the Continental European and British power grids. We offer executable code and guidelines on how to use the model on any power grid for various mathematical or engineering applications.

preprint2019arXiv

Stochastic properties of the frequency dynamics in real and synthetic power grids

The frequency constitutes a key state variable of electrical power grids. However, as the frequency is subject to several sources of fluctuations, ranging from renewable volatility to demand fluctuations and dispatch, it is strongly dynamic. Yet, the statistical and stochastic properties of the frequency fluctuation dynamics are far from fully understood. Here, we analyse properties of power grid frequency trajectories recorded from different synchronous regions. We highlight the non-Gaussian and still approximately Markovian nature of the frequency statistics. Further, we find that the frequency displays significant fluctuations exactly at the time intervals of regulation and trading, confirming the need of having a regulatory and market design that respects the technical and dynamical constraints in future highly renewable power grids. Finally, employing a recently proposed synthetic model for the frequency dynamics, we combine our statistical and stochastic analysis and analyse in how far dynamically modelled frequency properties match the ones of real trajectories.