Researcher profile

Paul Smith

Paul Smith contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2025arXiv

Causify DataFlow: A Framework For High-performance Machine Learning Stream Computing

We present DataFlow, a computational framework for building, testing, and deploying high-performance machine learning systems on unbounded time-series data. Traditional data science workflows assume finite datasets and require substantial reimplementation when moving from batch prototypes to streaming production systems. This gap introduces causality violations, batch boundary artifacts, and poor reproducibility of real-time failures. DataFlow resolves these issues through a unified execution model based on directed acyclic graphs (DAGs) with point-in-time idempotency: outputs at any time t depend only on a fixed-length context window preceding t. This guarantee ensures that models developed in batch mode execute identically in streaming production without code changes. The framework enforces strict causality by automatically tracking knowledge time across all transformations, eliminating future-peeking bugs. DataFlow supports flexible tiling across temporal and feature dimensions, allowing the same model to operate at different frequencies and memory profiles via configuration alone. It integrates natively with the Python data science stack and provides fit/predict semantics for online learning, caching and incremental computation, and automatic parallelization through DAG-based scheduling. We demonstrate its effectiveness across domains including financial trading, IoT, fraud detection, and real-time analytics.

preprint2022arXiv

Universality for two-dimensional critical cellular automata

We study the class of monotone, two-state, deterministic cellular automata, in which sites are activated (or 'infected') by certain configurations of nearby infected sites. These models have close connections to statistical physics, and several specific examples have been extensively studied in recent years by both mathematicians and physicists. This general setting was first studied only recently, however, by Bollobás, Smith and Uzzell, who showed that the family of all such 'bootstrap percolation' models on $\mathbb{Z}^2$ can be naturally partitioned into three classes, which they termed subcritical, critical and supercritical. In this paper we determine the order of the threshold for percolation (complete occupation) for every critical bootstrap percolation model in two dimensions. This 'universality' theorem includes as special cases results of Aizenman and Lebowitz, Gravner and Griffeath, Mountford, and van Enter and Hulshof, significantly strengthens bounds of Bollobás, Smith and Uzzell, and complements recent work of Balister, Bollobás, Przykucki and Smith on subcritical models.

preprint2020arXiv

On the Estimation of Entropy in the FastICA Algorithm

The fastICA method is a popular dimension reduction technique used to reveal patterns in data. Here we show both theoretically and in practice that the approximations used in fastICA can result in patterns not being successfully recognised. We demonstrate this problem using a two-dimensional example where a clear structure is immediately visible to the naked eye, but where the projection chosen by fastICA fails to reveal this structure. This implies that care is needed when applying fastICA. We discuss how the problem arises and how it is intrinsically connected to the approximations that form the basis of the computational efficiency of fastICA.

preprint2020arXiv

SN 2014ab: An Aspherical Type IIn Supernova with Low Polarization

We present photometry, spectra, and spectropolarimetry of supernova (SN) 2014ab, obtained through $\sim 200$ days after peak brightness. SN 2014ab was a luminous Type IIn SN ($M_V < -19.14$ mag) discovered after peak brightness near the nucleus of its host galaxy, VV 306c. Prediscovery upper limits constrain the time of explosion to within 200 days prior to discovery. While SN 2014ab declined by $\sim 1$ mag over the course of our observations, the observed spectrum remained remarkably unchanged. Spectra exhibit an asymmetric emission-line profile with a consistently stronger blueshifted component, suggesting the presence of dust or a lack of symmetry between the far side and near side of the SN. The Pa$β$ emission line shows a profile very similar to that of H$α$, implying that this stronger blueshifted component is caused either through obscuration by large dust grains, occultation by optically thick material, or a lack of symmetry between the far side and near side of the interaction region. Despite these asymmetric line profiles, our spectropolarimetric data show that SN 2014ab has little detected polarization after accounting for the interstellar polarization. This suggests that we are seeing emission from a photosphere that has only small deviation from circular symmetry face-on. We are likely seeing a SN IIn with nearly circular symmetry in the plane normal to our line of sight, but with either large-grain dust or significant asymmetry in the density of circumstellar material or SN ejecta along our line of sight. We suggest that SN 2014ab and SN 2010jl (as well as other SNe IIn) may be similar events viewed from different directions.

preprint2020arXiv

What Makes Ly$α$ Nebulae Glow? Mapping the Polarization of LABd05

&#34;Ly$α$ nebulae&#34; are giant ($\sim$100 kpc), glowing gas clouds in the distant universe. The origin of their extended Ly$α$ emission remains a mystery. Some models posit that Ly$α$ emission is produced when the cloud is photoionized by UV emission from embedded or nearby sources, while others suggest that the Ly$α$ photons originate from an embedded galaxy or AGN and are then resonantly scattered by the cloud. At least in the latter scenario, the observed Ly$α$ emission will be polarized. To test these possibilities, we are conducting imaging polarimetric observations of seven Ly$α$ nebulae. Here we present our results for LABd05, a cloud at $z$ = 2.656 with an obscured, embedded AGN to the northeast of the peak of Ly$α$ emission. We detect significant polarization. The highest polarization fractions $P$ are $\sim$10-20% at $\sim$20-40 kpc southeast of the Ly$α$ peak, away from the AGN. The lowest $P$, including upper-limits, are $\sim$5% and lie between the Ly$α$ peak and AGN. In other words, the polarization map is lopsided, with $P$ increasing from the Ly$α$ peak to the southeast. The measured polarization angles $θ$ are oriented northeast, roughly perpendicular to the $P$ gradient. This unique polarization pattern suggests that 1) the spatially-offset AGN is photoionizing nearby gas and 2) escaping Ly$α$ photons are scattered by the nebula at larger radii and into our sightline, producing tangentially-oriented, radially-increasing polarization away from the photoionized region. Finally we conclude that the interplay between the gas density and ionization profiles produces the observed central peak in the Ly$α$ emission. This also implies that the structure of LABd05 is more complex than assumed by current theoretical spherical or cylindrical models.

preprint2010arXiv

Unobscured Type 2 AGNs

Type 2 AGNs with intrinsically weak broad emission lines (BELs) would be exceptions to the unified model. After examining a number of proposed candidates critically, we find that the sample is contaminated significantly by objects with BELs of strengths indicating that they actually contain intermediate-type AGNs, plus a few Compton-thick sources as revealed by extremely low ratios of X-ray to nuclear IR luminosities. We develop quantitative metrics that show two (NGC 3147 and NGC 4594) of the remaining candidates to have BELs 2-3 orders of magnitude weaker than those of typical type-1 AGNs. Several more galaxies remain as candidates to have anomalously weak BELs, but this status cannot be confirmed with the existing information. Although the parent sample is poorly defined, the two confirmed objects are well under 1% of its total number of members, showing that the absence of a BEL is possible, but very uncommon in AGN. We evaluate these two objects in detail using multi-wavelength measurements. They have little X-ray extinction with N_H < 10^21 cm^{-2}. Their IR spectra show strong silicate emission (NGC 4594) or weak aromatic features on a generally power law continuum with a suggestion of silicates in emission (NGC 3147). No polarized BEL is detected in NGC 3147. These results indicate that the two unobscured type-2 objects have circumnuclear tori that are approximately face-on. Combined with their X-ray and optical/UV properties, this behavior implies that we have an unobscured view of the nuclei and thus that they have intrinsically weak BELs. We compare their properties with those of the other less-extreme candidates. We then compare the distributions of bolometric luminosities and accretion rates of these objects with theoretical models that predict weak BELs.