Source author record

Sean Moran

Sean Moran appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

astro-ph.CO, Artificial Intelligence, Computer Vision, eess.IV, Software Engineering, astro-ph.GA, astro-ph.IM, Machine Learning

Catalog footprint

What is connected

14 works

8 topics

4 close collaborators

preprint, 2024, arXiv

Using AI/ML to Find and Remediate Enterprise Secrets in Code & Document Sharing Platforms

We introduce a new challenge to the software development community: 1) leveraging AI to accurately detect and flag up secrets in code and on popular document sharing platforms that are frequently used by developers, such as Confluence, and 2) automatically remediating the detections (e.g. by suggesting password vault functionality). This is a challenging and mostly unaddressed task. Existing methods leverage heuristics and regular expressions, which can be very noisy and therefore increase toil on developers. The next step, modifying the code itself to automatically remediate a detection, is a complex task. We introduce two baseline AI models that have good detection performance and propose an automatic mechanism for remediating secrets found in code, opening up the study of this task to the wider community.
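To illustrate why regular-expression heuristics of the kind the abstract mentions tend to be noisy, here is a minimal sketch. The pattern, the function name `find_candidate_secrets` and the sample text are all hypothetical and are not taken from the paper; the point is only that a key/value pattern flags placeholders just as readily as real credentials.

```python
import re

# Hypothetical heuristic: an identifier ending in a "secret-like" word,
# followed by = or : and a quoted value. Real scanners use many more rules.
PATTERN = re.compile(
    r"(?i)(\w*(?:password|passwd|secret|api_key|token))\s*[:=]\s*['\"]([^'\"]+)['\"]"
)

def find_candidate_secrets(text):
    """Return (line_number, key, value) for every match of the heuristic."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for match in PATTERN.finditer(line):
            hits.append((lineno, match.group(1), match.group(2)))
    return hits

# Made-up sample: one plausible credential and two obvious placeholders.
SAMPLE = '''db_password = "hunter2-prod-9f3k"
password = "changeme"  # placeholder in docs
token = "<YOUR_TOKEN_HERE>"
'''

hits = find_candidate_secrets(SAMPLE)
for hit in hits:
    print(hit)
```

All three lines are flagged, although only the first looks like a real credential. Separating such cases is the kind of judgement a learned detector would need to make.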

preprint, 2022, arXiv

Senatus -- A Fast and Accurate Code-to-Code Recommendation Engine

Machine learning on source code (MLOnCode) is a popular research field that has been driven by the availability of large-scale code repositories and the development of powerful probabilistic and deep learning models for mining source code. Code-to-code recommendation is a task in MLOnCode that aims to recommend relevant, diverse and concise code snippets that usefully extend the code currently being written by a developer in their development environment (IDE). Code-to-code recommendation engines hold the promise of increasing developer productivity by reducing context switching from the IDE and increasing code-reuse. Existing code-to-code recommendation engines do not scale gracefully to large codebases, exhibiting a linear growth in query time as the code repository increases in size. In addition, existing code-to-code recommendation engines fail to account for the global statistics of code repositories in the ranking function, such as the distribution of code snippet lengths, leading to sub-optimal retrieval results. We address both of these weaknesses with Senatus, a new code-to-code recommendation engine. At the core of Senatus is De-Skew LSH, a new locality sensitive hashing (LSH) algorithm that indexes the data for fast (sub-linear time) retrieval while also counteracting the skewness in the snippet length distribution using novel abstract syntax tree-based feature scoring and selection algorithms. We evaluate Senatus and find the recommendations to be of higher quality than competing baselines, while achieving faster search. For example, on the CodeSearchNet dataset Senatus improves performance by 31.21% F1 and achieves 147.9x faster query time compared to Facebook Aroma. Senatus also outperforms standard MinHash LSH by 29.2% F1 and achieves 51.02x faster query time.
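For readers unfamiliar with the baseline, the following is a rough sketch of standard MinHash LSH over token sets, the method the abstract compares against. It is not De-Skew LSH and does not reproduce the paper's feature scoring or selection. The class name `MinHashLSHIndex`, the signature length and the band layout are illustrative assumptions.

```python
import hashlib
from collections import defaultdict

NUM_HASHES = 32              # signature length (assumed value)
BANDS = 16                   # number of LSH bands (assumed value)
ROWS = NUM_HASHES // BANDS   # signature rows per band

def _hash(seed, token):
    """Deterministic 64-bit hash of a token under a given seed."""
    digest = hashlib.md5(f"{seed}:{token}".encode()).digest()
    return int.from_bytes(digest[:8], "big")

def minhash_signature(tokens):
    """One minimum per seeded hash function; tokens must be non-empty."""
    return [min(_hash(seed, t) for t in tokens) for seed in range(NUM_HASHES)]

class MinHashLSHIndex:
    """Buckets snippets by bands of their MinHash signature."""

    def __init__(self):
        self.buckets = defaultdict(set)

    def _band_keys(self, signature):
        for b in range(BANDS):
            yield (b, tuple(signature[b * ROWS:(b + 1) * ROWS]))

    def add(self, name, tokens):
        for key in self._band_keys(minhash_signature(tokens)):
            self.buckets[key].add(name)

    def query(self, tokens):
        """Return names sharing at least one band with the query."""
        candidates = set()
        for key in self._band_keys(minhash_signature(tokens)):
            candidates |= self.buckets.get(key, set())
        return candidates

index = MinHashLSHIndex()
index.add("read_file", {"open", "path", "read", "close", "with"})
index.add("http_get", {"requests", "get", "url", "json", "status"})
print(index.query({"open", "path", "read", "close", "with"}))
```

A query only inspects the buckets its own bands fall into, so lookup cost does not grow with a scan over every indexed snippet. That is the sub-linear retrieval property the abstract refers to.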

preprint, 2022, arXiv

ST-FL: Style Transfer Preprocessing in Federated Learning for COVID-19 Segmentation

Chest Computed Tomography (CT) scans offer low cost, speed and objectivity for COVID-19 diagnosis, and deep learning methods have shown great promise in assisting the analysis and interpretation of these images. Most hospitals or countries can train their own models using in-house data; however, empirical evidence shows that those models perform poorly when tested on new unseen cases, surfacing the need for coordinated global collaboration. Due to privacy regulations, medical data sharing between hospitals and nations is extremely difficult. We propose a GAN-augmented federated learning model, dubbed ST-FL (Style Transfer Federated Learning), for COVID-19 image segmentation. Federated learning (FL) permits a centralised model to be learned in a secure manner from heterogeneous datasets located in disparate private data silos. We demonstrate that the widely varying data quality on FL client nodes leads to a sub-optimal centralised FL model for COVID-19 chest CT image segmentation. ST-FL is a novel FL framework that is robust in the face of highly variable data quality at client nodes. The robustness is achieved by a denoising CycleGAN model at each client of the federation that maps arbitrary quality images into the same target quality, counteracting the severe data variability evident in real-world FL use-cases. Each client is provided with the target style, which is the same for all clients, and trains their own denoiser. Our qualitative and quantitative results suggest that this FL model performs comparably to, and in some cases better than, a model that has centralised access to all the training data.
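The abstract does not say which aggregation rule ST-FL uses. As a hedged illustration of how a centralised model can be formed without moving client data, the sketch below shows sample-size-weighted parameter averaging in the style of FedAvg, which is an assumption here. The function name and the toy parameter vectors are hypothetical.

```python
def federated_average(client_weights, client_sizes):
    """Sample-size-weighted mean of client parameter vectors (FedAvg-style).

    client_weights: list of equal-length lists of floats, one per client.
    client_sizes:   number of local training samples at each client.
    """
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[i] * n for w, n in zip(client_weights, client_sizes)) / total
        for i in range(dim)
    ]

# Toy example: two clients, the second holding three times as much data.
global_weights = federated_average([[1.0, 2.0], [3.0, 4.0]], [1, 3])
print(global_weights)  # [2.5, 3.5]
```

Only parameter vectors leave each client; the images stay in their silos. In ST-FL the per-client denoiser would act on the images before local training, a step this sketch omits.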

preprint, 2020, arXiv

DeepLPF: Deep Local Parametric Filters for Image Enhancement

Digital artists often improve the aesthetic quality of digital photographs through manual retouching. Beyond global adjustments, professional image editing programs provide local adjustment tools operating on specific parts of an image. Options include parametric (graduated, radial filters) and unconstrained brush tools. These highly expressive tools enable a diverse set of local image enhancements. However, their use can be time consuming, and requires artistic capability. State-of-the-art automated image enhancement approaches typically focus on learning pixel-level or global enhancements. The former can be noisy and lack interpretability, while the latter can fail to capture fine-grained adjustments. In this paper, we introduce a novel approach to automatically enhance images using learned spatially local filters of three different types (Elliptical Filter, Graduated Filter, Polynomial Filter). We introduce a deep neural network, dubbed Deep Local Parametric Filters (DeepLPF), which regresses the parameters of these spatially localized filters that are then automatically applied to enhance the image. DeepLPF provides a natural form of model regularization and enables interpretable, intuitive adjustments that lead to visually pleasing results. We report on multiple benchmarks and show that DeepLPF produces state-of-the-art performance on two variants of the MIT-Adobe-5K dataset, often using a fraction of the parameters required for competing methods.

preprint, 2020, arXiv

Low Light Video Enhancement using Synthetic Data Produced with an Intermediate Domain Mapping

Advances in low-light video RAW-to-RGB translation are opening up the possibility of fast low-light imaging on commodity devices (e.g. smartphone cameras) without the need for a tripod. However, it is challenging to collect the required paired short-long exposure frames to learn a supervised mapping. Current approaches require a specialised rig or the use of static videos with no subject or object motion, resulting in datasets that are limited in size, diversity, and motion. We address the data collection bottleneck for low-light video RAW-to-RGB by proposing a data synthesis mechanism, dubbed SIDGAN, that can generate abundant dynamic video training pairs. SIDGAN maps videos found 'in the wild' (e.g. internet videos) into a low-light (short, long exposure) domain. By generating dynamic video data synthetically, we enable a recently proposed state-of-the-art RAW-to-RGB model to attain higher image quality (improved colour, reduced artifacts) and improved temporal consistency, compared to the same model trained with only static real video data.

preprint, 2016, arXiv

HectoMAP and Horizon Run 4: Dense Structures and Voids in the Real and Simulated Universe

HectoMAP is a dense redshift survey of red galaxies covering a 53 deg$^{2}$ strip of the northern sky. HectoMAP is 97% complete for galaxies with $r<20.5$, $(g-r)>1.0$, and $(r-i)>0.5$. The survey enables tests of the physical properties of large-scale structure at intermediate redshift against cosmological models. We use the Horizon Run 4, one of the densest and largest cosmological simulations based on the standard $Λ$ Cold Dark Matter ($Λ$CDM) model, to compare the physical properties of observed large-scale structures with simulated ones in a volume-limited sample covering 8$\times10^6$ $h^{-3}$ Mpc$^3$ in the redshift range $0.22<z<0.44$. We apply the same criteria to the observations and simulations to identify over- and under-dense large-scale features of the galaxy distribution. The richness and size distributions of observed over-dense structures agree well with the simulated ones. Observations and simulations also agree for the volume and size distributions of under-dense structures, voids. The properties of the largest over-dense structure and the largest void in HectoMAP are well within the distributions for the largest structures drawn from 300 Horizon Run 4 mock surveys. Overall the size, richness and volume distributions of observed large-scale structures in the redshift range $0.22<z<0.44$ are remarkably consistent with predictions of the standard $Λ$CDM model.

preprint, 2015, arXiv

A systematic study of the inner rotation curves of galaxies observed as part of the GASS and COLD GASS surveys

We present a systematic analysis of the rotation curves of 187 galaxies with masses greater than 10^10 M_sol, with atomic gas masses from the GALEX Arecibo Sloan Survey (GASS), and with follow-up long-slit spectroscopy from the MMT. Our analysis focuses on stellar rotation curves derived by fitting stellar template spectra to the galaxy spectra binned along the slit. In this way, we are able to obtain accurate rotation velocity measurements for a factor of 2 more galaxies than possible with the Halpha line. Galaxies with high atomic gas mass fractions are the most dark-matter dominated galaxies in our sample and have dark matter halo density profiles that are well fit by Navarro, Frenk & White profiles with an average concentration parameter of 10. The inner slopes of the rotation curves correlate more strongly with stellar population age than with galaxy mass or structural parameters. At fixed stellar mass, the rotation curves of more actively star-forming galaxies have steeper inner slopes than less actively star-forming galaxies. The ratio between the galaxy specific angular momentum and the total specific angular momentum of its dark matter halo, R_j, correlates strongly with galaxy mass, structure and gas content. Low mass, disk-dominated galaxies with atomic gas mass fractions greater than 20% have median values of R_j of around 1, but massive, bulge-dominated galaxies have R_j=0.2-0.3. We argue that these trends can be understood in a picture where gas inflows triggered by disk instabilities lead to the formation of passive, bulge-dominated galaxies with low specific angular momentum.

preprint, 2015, arXiv

Data Reduction Pipeline for the MMT and Magellan Infrared Spectrograph

We describe the new spectroscopic data reduction pipeline for the multi-object MMT/Magellan Infrared Spectrograph. The pipeline is implemented in IDL as a stand-alone package and is publicly available in both stable and development versions. We describe novel algorithms for sky subtraction and correction for telluric absorption. We demonstrate that our sky subtraction technique reaches the Poisson limit set by the photon statistics. Our telluric correction uses a hybrid approach by first computing a correction function from an observed stellar spectrum, and then differentially correcting it using a grid of atmosphere transmission models for the target airmass value. The pipeline provides a sufficient level of performance for real time reduction and thus enables data quality control during observations. We reduce an example dataset to demonstrate the high data reduction quality.

preprint, 2013, arXiv

A Multi-Wavelength Analysis of NGC 4178: A Bulgeless Galaxy with an AGN

We present Gemini longslit optical spectroscopy and VLA radio observations of the nuclear region of NGC 4178, a late-type bulgeless disk galaxy recently confirmed to host an AGN through infrared and X-ray observations. Our observations reveal that the dynamical center of the galaxy is coincident with the location of the Chandra X-ray point source discovered in a previous work, providing further support for the presence of an AGN. While the X-ray and IR observations provide robust evidence for an AGN, the optical spectrum shows no evidence for the AGN, underscoring the need for the penetrative power of mid-IR and X-ray observations in finding buried or weak AGNs in this class of galaxy. Finally, the upper limit to the radio flux, together with our previous X-ray and IR results, is consistent with the scenario in which NGC 4178 harbors a deeply buried AGN accreting at a high rate.

preprint, 2012, arXiv

COLD GASS, an IRAM Legacy Survey of Molecular Gas in Massive Galaxies: III. Comparison with semi-analytic models of galaxy formation

We compare the semi-analytic models of galaxy formation of Fu et al. (2010), which track the evolution of the radial profiles of atomic and molecular gas in galaxies, with gas fraction scaling relations derived from the COLD GASS survey (Saintonge et al. 2011). The models provide a good description of how condensed baryons in galaxies with gas are partitioned into stars, atomic and molecular gas as a function of galaxy stellar mass and surface density. The models do not reproduce the tight observed relation between stellar surface density and bulge-to-disk ratio for this population. We then turn to an analysis of the "quenched" population of galaxies without detectable cold gas. The current implementation of radio-mode feedback in the models disagrees strongly with the data. In the models, gas cooling shuts down in nearly all galaxies in dark matter halos above a mass of 10^12 M_sun. As a result, stellar mass is the observable that best predicts whether a galaxy has little or no neutral gas. In contrast, our data show that quenching is largely independent of stellar mass. Instead, there are clear thresholds in bulge-to-disk ratio and in stellar surface density that demarcate the location of quenched galaxies. We propose that processes associated with bulge formation play a key role in depleting the neutral gas in galaxies and that further gas accretion is suppressed following the formation of the bulge, even in dark matter halos of low mass.

preprint, 2011, arXiv

COLD GASS, an IRAM legacy survey of molecular gas in massive galaxies: I. Relations between H2, HI, stellar content and structural properties

We are conducting COLD GASS, a legacy survey for molecular gas in nearby galaxies. Using the IRAM 30m telescope, we measure the CO(1-0) line in a sample of ~350 nearby (D=100-200 Mpc), massive galaxies (log(M*/Msun)>10.0). The sample is selected purely according to stellar mass, and therefore provides an unbiased view of molecular gas in these systems. By combining the IRAM data with SDSS photometry and spectroscopy, GALEX imaging and high-quality Arecibo HI data, we investigate the partition of condensed baryons between stars, atomic gas and molecular gas in 0.1-10L* galaxies. In this paper, we present CO luminosities and molecular hydrogen masses for the first 222 galaxies. The overall CO detection rate is 54%, but our survey also uncovers the existence of sharp thresholds in galaxy structural parameters such as stellar mass surface density and concentration index, below which all galaxies have a measurable cold gas component but above which the detection rate of the CO line drops suddenly. The mean molecular gas fraction MH2/M* of the CO detections is 0.066+/-0.039, and this fraction does not depend on stellar mass, but is a strong function of NUV-r colour. Through stacking, we set a firm upper limit of MH2/M*=0.0016+/-0.0005 for red galaxies with NUV-r>5.0. The average molecular-to-atomic hydrogen ratio in present-day galaxies is 0.3, with significant scatter from one galaxy to the next. The existence of strong detection thresholds in both the HI and CO lines suggests that "quenching" processes have occurred in these systems. Intriguingly, atomic gas strongly dominates in the minority of galaxies with significant cold gas that lie above these thresholds. This suggests that some re-accretion of gas may still be possible following the quenching event.

preprint, 2011, arXiv

COLD GASS, an IRAM Legacy Survey of Molecular Gas in Massive Galaxies: II. The non-universality of the Molecular Gas Depletion Timescale

We study the relation between molecular gas and star formation in a volume-limited sample of 222 galaxies from the COLD GASS survey, with measurements of the CO(1-0) line from the IRAM 30m telescope. The galaxies are at redshifts 0.025<z<0.05 and have stellar masses in the range 10.0<log(M*/Msun)<11.5. The IRAM measurements are complemented by deep Arecibo HI observations and homogeneous SDSS and GALEX photometry. A reference sample that includes both UV and far-IR data is used to calibrate our estimates of star formation rates from the seven optical/UV bands. The mean molecular gas depletion timescale, tdep(H2), for all the galaxies in our sample is 1 Gyr; however, tdep(H2) increases by a factor of 6 from a value of ~0.5 Gyr for galaxies with stellar masses of 10^10 Msun to ~3 Gyr for galaxies with masses of a few times 10^11 Msun. In contrast, the atomic gas depletion timescale remains constant at a value of around 3 Gyr. This implies that in high mass galaxies, molecular and atomic gas depletion timescales are comparable, but in low mass galaxies, molecular gas is being consumed much more quickly than atomic gas. The strongest dependences of tdep(H2) are on the stellar mass of the galaxy (parameterized as log tdep(H2)= (0.36+/-0.07)(log M* - 10.70)+(9.03+/-0.99)), and on the specific star formation rate. A single tdep(H2) versus sSFR relation is able to fit both "normal" star-forming galaxies in our COLD GASS sample, as well as more extreme starburst galaxies (LIRGs and ULIRGs), which have tdep(H2) < 10^8 yr. Normal galaxies at z=1-2 are displaced with respect to the local galaxy population in the tdep(H2) versus sSFR plane and have molecular gas depletion times that are a factor of 3-5 times longer at a given value of sSFR due to their significantly larger gas fractions.
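The parameterized relation quoted in the abstract can be evaluated directly. The sketch below uses only the central values of the fit and ignores the quoted uncertainties, which are large for the intercept (+/-0.99 dex). The function names are illustrative, and tdep(H2) is assumed to be in years, consistent with the ~1 Gyr mean.

```python
def log_tdep_h2(log_mstar):
    """log10 of the molecular gas depletion time in years.

    Central values of the fit quoted in the abstract:
    log tdep(H2) = 0.36 * (log M* - 10.70) + 9.03
    """
    return 0.36 * (log_mstar - 10.70) + 9.03

def tdep_h2_gyr(log_mstar):
    """Depletion time in Gyr for a given log10 stellar mass (solar units)."""
    return 10 ** log_tdep_h2(log_mstar) / 1e9

# Evaluate at the sample's mass limits and at the pivot mass of the fit.
for log_mstar in (10.0, 10.70, 11.5):
    print(f"log M* = {log_mstar:5.2f}  ->  tdep(H2) ~ {tdep_h2_gyr(log_mstar):.2f} Gyr")
```

With central values the fit gives roughly 0.6 Gyr at log M* = 10.0, 1.1 Gyr at the pivot mass and 2.1 Gyr at log M* = 11.5, in line with the trend the abstract describes.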

preprint, 2010, arXiv

LoCuSS: First Results from Strong-lensing Analysis of 20 Massive Galaxy Clusters at z~0.2

We present a statistical analysis of a sample of 20 strong lensing clusters drawn from the Local Cluster Substructure Survey (LoCuSS), based on high resolution Hubble Space Telescope imaging of the cluster cores and follow-up spectroscopic observations using the Keck-I telescope. We use detailed parameterized models of the mass distribution in the cluster cores to measure the total cluster mass and fraction of that mass associated with substructures within R<250kpc. These measurements are compared with the distribution of baryons in the cores, as traced by the old stellar populations and the X-ray emitting intracluster medium. Our main results include: (i) the distribution of Einstein radii is log-normal, with a peak and 1sigma width of <log(RE(z=2))>=1.16+/-0.28; (ii) we detect an X-ray/lensing mass discrepancy of <M_SL/M_X>=1.3 at 3 sigma significance -- clusters with larger substructure fractions displaying greater mass discrepancies, and thus greater departures from hydrostatic equilibrium; (iii) cluster substructure fraction is also correlated with the slope of the gas density profile on small scales, implying a connection between cluster-cluster mergers and gas cooling. Overall our results are consistent with the view that cluster-cluster mergers play a prominent role in shaping the properties of cluster cores, in particular causing departures from hydrostatic equilibrium, and possibly disturbing cool cores. Our results do not support recent claims that large Einstein radius clusters present a challenge to the CDM paradigm.

preprint, 2009, arXiv

Environmental Effects in the Evolution of Galactic Bulges

We investigate possible environmental trends in the evolution of galactic bulges over the redshift range 0<z<0.6. For this purpose, we construct the Fundamental Plane (FP) for cluster and field samples at redshifts <z>=0.4 and <z>=0.54 using surface photometry based on HST imaging and velocity dispersions based on Keck spectroscopy. As a reference point for our study we include data for pure ellipticals, which we model as single-component Sersic profiles; whereas for multi-component galaxies we undertake decompositions using Sersic and exponential models for the bulge and disk respectively. Although the FP for both distant cluster and field samples are offset from the local relation, consistent with evolutionary trends found in earlier studies, we detect significant differences in the zero point of ~=0.2 dex between the field and cluster samples at a given redshift. For both clusters, the environmentally-dependent offset is in the sense expected for an accelerated evolution of bulges in dense environments. By matching the mass range of our samples, we confirm that this difference does not arise as a result of the mass-dependent downsizing effects seen in larger field samples. Our result is also consistent with the hypothesis that - at fixed mass and environment - the star formation histories of galactic bulges and pure spheroids are indistinguishable, and difficult to reconcile with the picture whereby the majority of large bulges form primarily via secular processes within spiral galaxies.
