Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
17topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2026arXiv

A Comprehensive Dataset for Human vs. AI Generated Image Detection

Multimodal generative AI systems like Stable Diffusion, DALL-E, and MidJourney have fundamentally changed how synthetic images are created. These tools drive innovation but also enable the spread of misleading content, false information, and manipulated media. As generated images become harder to distinguish from photographs, detecting them has become an urgent priority. To combat this challenge, We release MS COCOAI, a novel dataset for AI generated image detection consisting of 96000 real and synthetic datapoints, built using the MS COCO dataset. To generate synthetic images, we use five generators: Stable Diffusion 3, Stable Diffusion 2.1, SDXL, DALL-E 3, and MidJourney v6. Based on the dataset, we propose two tasks: (1) classifying images as real or generated, and (2) identifying which model produced a given synthetic image. The dataset is available at https://huggingface.co/datasets/Rajarshi-Roy-research/Defactify_Image_Dataset.

preprint2026arXiv

PrefixGuard: From LLM-Agent Traces to Online Failure-Warning Monitors

Large language model (LLM) agents now execute long, tool-using tasks where final outcome checks can arrive too late for intervention. Online warning requires lightweight prefix monitors over heterogeneous traces, but hand-authored event schemas are brittle and deployment-time LLM judging is costly. We introduce PrefixGuard, a trace-to-monitor framework with an offline StepView induction step followed by supervised monitor training. StepView induces deterministic typed-step adapters from raw trace samples, and the monitor learns an event abstraction and prefix-risk scorer from terminal outcomes. Across WebArena, $τ^2$-Bench, SkillsBench, and TerminalBench, the strongest PrefixGuard monitors reach 0.900/0.710/0.533/0.557 AUPRC. Using the strongest backend within each representation, they improve over raw-text controls by an average of +0.137 AUPRC. LLM judges remain substantially weaker under the same prefix-warning protocol. We also derive an observability ceiling on score-based area under the precision-recall curve (AUPRC) that separates monitor error from failures lacking evidence in the observed prefix. For finite-state audit, post-hoc deterministic finite automaton (DFA) extraction remains compact on WebArena and $τ^2$-Bench (29 and 20 states) but expands to 151 and 187 states on SkillsBench and TerminalBench. Finally, first-alert diagnostics show that strong ranking does not imply deployment utility: WebArena ranks well yet fails to support low-false-alarm alerts, whereas $τ^2$-Bench and TerminalBench retain more actionable early alerts. Together, these results position PrefixGuard as a practical monitor-synthesis recipe with explicit diagnostics for when prefix warnings translate into actionable interventions.

preprint2022arXiv

Dark Current and Single Photon Detection by 1550nm Avalanche Photodiodes: Dead Time Corrected Probability Distributions and Entropy Rates

Single photon detectors have dark count rates that depend strongly on the bias level for detector operation. In the case of weak light sources such as novel lasers or single-photon emitters, the rate of counts due to the light source can be comparable to that of the detector dark counts. In such cases, a characterization of the statistical properties of the dark counts is necessary. The dark counts are often assumed to follow a Poisson process that is statistically independent of the incident photon counts. This assumption must be validated for specific types of photodetectors. In this work, we focus on single-photon avalanche photodiodes (SPADs) made for 1550nm. For the InGaAs detectors used, we find the measured distributions often differ significantly from Poisson due to the presence of dead time and afterpulsing with the difference increasing with the bias level. When the dead time is increased to remove the effects of afterpulsing, it is necessary to correct the measured distributions for the effects of the dead time. To this end, we apply an iterative algorithm to remove dead time effects from the probability distribution for dark counts as well as for the case where light from an external weak laser source (known to be Poisson) is detected together with the dark counts. We believe this to be the first instance of the comprehensive application of this algorithm to real data and find that the corrected probability distributions are Poisson distributions in both cases. We additionally use the Grassberger-Procaccia algorithm to estimate the entropy production rates of the dark count processes, which provides a single metric that characterizes the temporal correlations between dark counts as well as the shape of the distribution. We have thus developed a systematic procedure for taking data with 1550nm SPADs and obtaining accurate photocount statistics to examine novel light sources.

preprint2022arXiv

PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning

In this work, we present a reinforcement learning (RL) based approach to designing parallel prefix circuits such as adders or priority encoders that are fundamental to high-performance digital design. Unlike prior methods, our approach designs solutions tabula rasa purely through learning with synthesis in the loop. We design a grid-based state-action representation and an RL environment for constructing legal prefix circuits. Deep Convolutional RL agents trained on this environment produce prefix adder circuits that Pareto-dominate existing baselines with up to 16.0% and 30.2% lower area for the same delay in the 32b and 64b settings respectively. We observe that agents trained with open-source synthesis tools and cell library can design adder circuits that achieve lower area and delay than commercial tool adders in an industrial cell library.

preprint2020arXiv

Critical Switching in Globally Attractive Chimeras

We report on a new type of chimera state that attracts almost all initial conditions and exhibits power-law switching behavior in networks of coupled oscillators. Such switching chimeras consist of two symmetric configurations, which we refer to as subchimeras, in which one cluster is synchronized and the other is incoherent. Despite each subchimera being linearly stable, switching chimeras are extremely sensitive to noise: arbitrarily small noise triggers and sustains persistent switching between the two symmetric subchimeras. The average switching frequency scales as a power law with the noise intensity, which is in contrast with the exponential scaling observed in typical stochastic transitions. Rigorous numerical analysis reveals that the power-law switching behavior originates from intermingled basins of attraction associated with the two subchimeras, which in turn are induced by chaos and symmetry in the system. The theoretical results are supported by experiments on coupled optoelectronic oscillators, which demonstrate the generality and robustness of switching chimeras.

preprint2020arXiv

Laminar Chaos in experiments and nonlinear delayed Langevin equations: A time series analysis toolbox for the detection of Laminar Chaos

Recently, it was shown that certain systems with large time-varying delay exhibit different types of chaos, which are related to two types of time-varying delay: conservative and dissipative delays. The known high-dimensional Turbulent Chaos is characterized by strong fluctuations. In contrast, the recently discovered low-dimensional Laminar Chaos is characterized by nearly constant laminar phases with periodic durations and a chaotic variation of the intensity from phase to phase. In this paper we extend our results from our preceding publication [J. D. Hart, R. Roy, D. Müller-Bender, A. Otto, and G. Radons, PRL 123 154101 (2019)], where it is demonstrated that Laminar Chaos is a robust phenomenon, which can be observed in experimental systems. We provide a time-series analysis toolbox for the detection of robust features of Laminar Chaos. We benchmark our toolbox by experimental time series and time series of a model system which is described by a nonlinear Langevin equation with time-varying delay. The benchmark is done for different noise strengths for both the experimental system and the model system, where Laminar Chaos can be detected, even if it is hard to distinguish from Turbulent Chaos by a visual analysis of the trajectory.

preprint2020arXiv

Learning Interpretable Models in the Property Specification Language

We address the problem of learning human-interpretable descriptions of a complex system from a finite set of positive and negative examples of its behavior. In contrast to most of the recent work in this area, which focuses on descriptions expressed in Linear Temporal Logic (LTL), we develop a learning algorithm for formulas in the IEEE standard temporal logic PSL (Property Specification Language). Our work is motivated by the fact that many natural properties, such as an event happening at every n-th point in time, cannot be expressed in LTL, whereas it is easy to express such properties in PSL. Moreover, formulas in PSL can be more succinct and easier to interpret (due to the use of regular expressions in PSL formulas) than formulas in LTL. Our learning algorithm builds on top of an existing algorithm for learning LTL formulas. Roughly speaking, our algorithm reduces the learning task to a constraint satisfaction problem in propositional logic and then uses a SAT solver to search for a solution in an incremental fashion. We have implemented our algorithm and performed a comparative study between the proposed method and the existing LTL learning algorithm. Our results illustrate the effectiveness of the proposed approach to provide succinct human-interpretable descriptions from examples.

preprint2020arXiv

Optimal Scheduling for Maximizing Information Freshness & System Performance in Industrial Cyber-Physical Systems

Age of Information is a newly introduced metric, getting vivid attention for measuring the freshness of information in real-time networks. This parameter has evolved to guarantee the reception of timely information from the latest status update, received by a user from any real-time application. In this paper, we study a centralized, closed-loop, networked controlled industrial wireless sensor-actuator network for cyber-physical production systems. Here, we jointly address the problem of transmission scheduling of sensor updates and the restoration of an information flow-line after any real-time update having hard-deadline drops from it, resulting a break in the loop. Unlike existing real-time scheduling policies that only ensure timely updates, this work aims to accomplish both the time-sensitivity and data freshness in new and regenerative real-time updates in terms of the age of information. Here, the coexistence of both cyber and physical units and their individual requirements for providing the quality of service to the system, as a whole, seems to be one of the major challenges to handle. In this work, minimization of staleness of the time-critical updates to extract maximum utilization out of its information content and its effects on other network performances are thoroughly investigated. A greedy scheduling policy called Deadline-aware highest latency first has been used to solve this problem; its performance optimality is proved analytically. Finally, our claim is validated by comparing the results obtained by our algorithm with those of other popular scheduling policies through extensive simulations.

preprint2020arXiv

Property-Directed Verification of Recurrent Neural Networks

This paper presents a property-directed approach to verifying recurrent neural networks (RNNs). To this end, we learn a deterministic finite automaton as a surrogate model from a given RNN using active automata learning. This model may then be analyzed using model checking as verification technique. The term property-directed reflects the idea that our procedure is guided and controlled by the given property rather than performing the two steps separately. We show that this not only allows us to discover small counterexamples fast, but also to generalize them by pumping towards faulty flows hinting at the underlying error in the RNN.