Researcher profile

Zoltan Nagy

Zoltan Nagy contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2026arXiv

Counter-Dyna: Data-Efficient RL-Based HVAC Control using Counterfactual Building Models

Model-based reinforcement learning (MBRL) offers a promising approach for data-efficient energy management in buildings, combining the strengths of predictive modeling and reinforcement learning. While previous MBRL methods applied to HVAC control have reduced training data requirements, they still require several months of interaction with the building to learn a satisfactory control policy. A key reason is that existing surrogate models attempt to predict the entire state-space, including weather and electricity prices that are unaffected by control actions, or completely ignore these variables. Addressing these issues, we propose Counter-Dyna, a method that enhances the data-efficiency of Dyna, an MBRL method. We create data-efficient counterfactual surrogate models (CSM) by leveraging invariances in the state-space. Using a CSM in Dyna speeds up RL training measured in environment interaction data compared to previous results. In comparison with previous state-of-the-art that used 6-12 months of environment interactions, our method needs only 5 weeks. We evaluate our method in a large simulation study using the literature standard BOPTEST framework and proximal policy algorithm (PPO) as the RL algorithm. Our results show cost-saving potentials of 5.3% to 17.0% in a hypothetical deployment scenario. Our work is a significant step towards making real-world deployment of RL algorithms in HVAC control practically viable.

preprint2026arXiv

Instance camera focus prediction for crystal agglomeration classification

Agglomeration refers to the process of crystal clustering due to interparticle forces. Crystal agglomeration analysis from microscopic images is challenging due to the inherent limitations of two-dimensional imaging. Overlapping crystals may appear connected even when located at different depth layers. Because optical microscopes have a shallow depth of field, crystals that are in-focus and out-of-focus in the same image typically reside on different depth layers and do not constitute true agglomeration. To address this, we first quantified camera focus with an instance camera focus prediction network to predict 2 class focus level that aligns better with visual observations than traditional image processing focus measures. Then an instance segmentation model is combined with the predicted focus level for agglomeration classification. Our proposed method has a higher agglomeration classification and segmentation accuracy than the baseline models on ammonium perchlorate crystal and sugar crystal dataset.

preprint2026arXiv

Toward a foundational thermal model for residential buildings

The building energy community lacks a foundational thermal model, i.e., a single pretrained model capable of generalizing across diverse buildings, climates, and control strategies without building-specific calibration. Achieving this vision requires architectural principles that capture universal thermal dynamics rather than memorizing building-specific patterns. We take a step toward this goal by presenting a physics-informed transformer architecture that embeds domain knowledge, e.g., derivative enrichment and Euler-based numerical integration, into a decoder-only framework. We incorporate static building features extracted from simulation models and employ Rotary Position Embedding attention to capture temporal dependencies. Evaluated on the CityLearn dataset spanning 247 residential buildings across three climate zones, our model achieves one-step prediction accuracy (RMSE of 0.30°C in Texas, 0.29°C in Vermont) while outperforming both traditional baselines and fine-tuned Time-Series Foundation Models. We also demonstrate zero-shot transferability: models trained on as few as two buildings generalize to unseen buildings and climate zones without fine-tuning. Despite the limitation of simulated residential buildings, our results establish physics-informed architectural principles as a promising foundation for universal building thermal models.

preprint2024arXiv

Gauge choice for organizing infrared singularities in QCD

We explore the features of interpolating gauge for QCD. This gauge, defined by Doust and by Baulieu and Zwanziger, interpolates between Feynman gauge or Lorenz gauge and Coulomb gauge. We argue that it could be useful for defining the splitting functions for a parton shower beyond order $\as$ or for defining the infrared subtraction terms for higher order perturbative calculations.

preprint2022arXiv

Multivariable evolution in final state parton shower algorithms

One can use more than one scale variable to specify the family of surfaces in the space of parton splitting parameters that define the evolution of a parton shower. Considering $e^+e^-$ annihilation, we use two variables, with shower evolution following a special path in this two dimensional space. In addition, we treat in a special way the part of the splitting function that has a soft emission singularity but no collinear singularity. This leads to certain advantages compared to the usual shower formulation with only one scale variable.

preprint2022arXiv

Multivariable evolution in parton showers with initial state partons

One can use more than one scale variable to define the family of surfaces in the space of parton splitting parameters that define the evolution of a parton shower. In an earlier paper, we developed this idea for electron-positron annihilation. Here, we use multiple scale variables for a parton shower with initial state partons. Then we need a more sophisticated analysis because the evolution of parton distribution functions must be coordinated with the parton shower evolution. We make the needed connections more precise than in our earlier work, even for the case of just one scale variable. Then we develop an example with three scale variables, which leads to advantages compared to the usual shower formulation with only one scale variable. We provide results for Drell-Yan muon pair production.

preprint2022arXiv

Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings

Building upon prior research that highlighted the need for standardizing environments for building control research, and inspired by recently introduced challenges for real life reinforcement learning control, here we propose a non-exhaustive set of nine real world challenges for reinforcement learning control in grid-interactive buildings. We argue that research in this area should be expressed in this framework in addition to providing a standardized environment for repeatability. Advanced controllers such as model predictive control and reinforcement learning (RL) control have both advantages and disadvantages that prevent them from being implemented in real world problems. Comparisons between the two are rare, and often biased. By focusing on the challenges, we can investigate the performance of the controllers under a variety of situations and generate a fair comparison. As a demonstration, we implement the offline learning challenge in CityLearn and study the impact of different levels of domain knowledge and complexity of RL algorithms. We show that the sequence of operations utilized in a rule based controller (RBC) used for offline training affects the performance of the RL agents when evaluated on a set of four energy flexibility metrics. Longer offline learning from an optimized RBC leads to improved performance in the long run. RL agents that learn from a simplified RBC risk poorer performance as the offline learning period increases. We also observe no impact on performance from information sharing amongst agents. We call for a more interdisciplinary effort of the research community to address the real world challenges, and unlock the potential of grid-interactive building

preprint2021arXiv

MARTINI: Smart Meter Driven Estimation of HVAC Schedules and Energy Savings Based on WiFi Sensing and Clustering

HVAC systems account for a significant portion of building energy use. Nighttime setback scheduling is an energy conservation measure where cooling and heating setpoints are increased and decreased respectively during unoccupied periods with the goal of obtaining energy savings. However, knowledge of a building's real occupancy is required to maximize the success of this measure. In addition, there is the need for a scalable way to estimate energy savings potential from energy conservation measures that is not limited by building specific parameters and experimental or simulation modeling investments. Here, we propose MARTINI, a sMARt meTer drIveN estImation of occupant-derived HVAC schedules and energy savings that leverages the ubiquity of energy smart meters and WiFi infrastructure in commercial buildings. We estimate the schedules by clustering WiFi-derived occupancy profiles and, energy savings by shifting ramp-up and setback times observed in typical/measured load profiles obtained by clustering smart meter energy profiles. Our case-study results with five buildings over seven months show an average of 8.1%-10.8% (summer) and 0.2%-5.9% (fall) chilled water energy savings when HVAC system operation is aligned with occupancy. We validate our method with results from building energy performance simulation (BEPS) and find that estimated average savings of MARTINI are within 0.9%-2.4% of the BEPS predictions. In the absence of occupancy information, we can still estimate potential savings from increasing ramp-up time and decreasing setback start time. In 51 academic buildings, we find savings potentials between 1%-5%.

preprint2009arXiv

On the transverse momentum in Z-boson production in a virtuality ordered parton shower

Cross sections for physical processes that involve very different momentum scales in the same process will involve large logarithms of the ratio of the momentum scales when calculated in perturbation theory. One goal of calculations using parton showers is to sum these large logarithms. We ask whether this goal is achieved for the transverse momentum distribution of a Z-boson produced in hadron-hadron collisions when the shower is organized with higher virtuality parton splittings coming first, followed successively by lower virtuality parton splittings. We find that the virtuality ordered shower works well in reproducing the known QCD result.

preprint2008arXiv

Direct numerical integration of one-loop Feynman diagrams for N-photon amplitudes

One approach to the calculation of cross sections for infrared-safe observables in high energy collisions at next-to-leading order is to perform all of the integrations, including the virtual loop integration, by Monte Carlo numerical integration. In a previous paper, two of us have shown how one can perform such a virtual loop integration numerically after first introducing a Feynman parameter representation. In this paper, we perform the integration directly, without introducing Feynman parameters, after suitably deforming the integration contour. Our example is the N-photon scattering amplitude with a massless electron loop. We report results for N = 6 and N = 8.

preprint2001arXiv

Three-jet cross sections in hadron-hadron collisions at next-to-leading order

We present a new QCD event generator for hadron collider which can calculate one-, two- and three-jet cross sections at next-to-leading order accuracy. In this letter we study the transverse energy spectrum of three-jet hadronic events using the kT algorithm. We show that the next-to-leading order correction significantly reduces the renormalization and factorization scale dependence of the three-jet cross section.