Researcher profile

J. Line

J. Line contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
13 works
0 followers
4 topics
4 close collaborators

Published work

13 published item(s)

preprint | 2020 | arXiv

Is rotation forest the best classifier for problems with continuous features?

In short, our experiments suggest that yes, on average, rotation forest is better than the most common alternatives when all the attributes are real-valued. Rotation forest is a tree-based ensemble that performs transforms on subsets of attributes prior to constructing each tree. We present an empirical comparison of classifiers for problems with only real-valued features. We evaluate classifiers from three families of algorithms: support vector machines; tree-based ensembles; and neural networks tuned with a large grid search. We compare classifiers on unseen data based on the quality of the decision rule (using classification error), the ability to rank cases (area under the receiver operating characteristic), and the probability estimates (using negative log likelihood). We conclude that, in answer to the question posed in the title, yes, rotation forest is significantly more accurate on average than competing techniques when compared on three distinct sets of datasets. Further, we assess the impact of the design features of rotation forest through an ablative study that transforms random forest into rotation forest. We identify the major limitation of rotation forest as its scalability, particularly in number of attributes. To overcome this problem we develop a model to predict the train time of the algorithm and hence propose a contract version of rotation forest where a run time cap is imposed a priori. We demonstrate that on large problems rotation forest can be made an order of magnitude faster without significant loss of accuracy. We also show that there is no real benefit (on average) from tuning rotation forest. We maintain that without any domain knowledge to indicate an algorithm preference, rotation forest should be the default algorithm of choice for problems with continuous attributes.
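
The three evaluation criteria named in the abstract (classification error, area under the ROC curve, and negative log likelihood) can be illustrated for a binary problem with a minimal pure-Python sketch. The function names and the toy labels and probabilities are illustrative assumptions; they are not taken from the paper or its code.

```python
import math


def classification_error(y_true, y_pred):
    """Fraction of cases whose predicted label differs from the true label."""
    wrong = sum(1 for t, p in zip(y_true, y_pred) if t != p)
    return wrong / len(y_true)


def auroc(y_true, scores):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a random positive case is scored above
    a random negative case, counting ties as one half.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def negative_log_likelihood(y_true, prob_pos, eps=1e-12):
    """Mean negative log likelihood of the true labels.

    prob_pos holds the predicted probability of class 1; probabilities
    are clipped to [eps, 1 - eps] to avoid log(0).
    """
    total = 0.0
    for t, p in zip(y_true, prob_pos):
        p = min(max(p, eps), 1.0 - eps)
        total -= math.log(p if t == 1 else 1.0 - p)
    return total / len(y_true)


# Toy data (hypothetical): true labels and predicted P(class = 1).
y = [0, 0, 1, 1]
probs = [0.1, 0.4, 0.35, 0.8]
labels = [1 if p >= 0.5 else 0 for p in probs]

err = classification_error(y, labels)
auc = auroc(y, probs)
nll = negative_log_likelihood(y, probs)
```

The three measures can disagree: a classifier may rank cases well (high AUROC) while producing poorly calibrated probabilities (high negative log likelihood), which is one reason to report all three.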

preprint | 2016 | arXiv

A High Reliability Survey of Discrete Epoch of Reionization Foreground Sources in the MWA EoR0 Field

Detection of the Epoch of Reionization HI signal requires a precise understanding of the intervening galaxies and AGN, both for instrumental calibration and foreground removal. We present a catalogue of 7394 extragalactic sources at 182 MHz detected in the RA=0 field of the Murchison Widefield Array Epoch of Reionization observation programme. Motivated by unprecedented requirements for precision and reliability we develop new methods for source finding and selection. We apply machine learning methods to self-consistently classify the relative reliability of 9490 source candidates. A subset of 7466 are selected based on reliability class and signal-to-noise ratio criteria. These are statistically cross-matched to four other radio surveys using both position and flux density information. We find 7369 sources to have confident matches, including 90 partially resolved sources that split into a total of 192 sub-components. An additional 25 unmatched sources are included as new radio detections. The catalogue sources have a median spectral index of -0.85. Spectral flattening is seen toward lower frequencies with a median of -0.71 predicted at 182 MHz. The astrometric error is 7 arcsec, compared to a 2.3 arcmin beam FWHM. The resulting catalogue covers approximately 1400 sq. deg. and is complete to approximately 80 mJy within half beam power. This provides the most reliable discrete source sky model available to date in the MWA EoR0 field for precision foreground subtraction.
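
The spectral index quoted in the abstract can be illustrated with a minimal sketch, assuming a simple two-point power law S ∝ ν^α. The function names and the example frequencies are hypothetical; the catalogue's own fitting procedure is not described in this abstract and may differ.

```python
import math


def spectral_index(flux_1, freq_1, flux_2, freq_2):
    """Two-point spectral index alpha, assuming S proportional to nu**alpha.

    Fluxes must share one unit and frequencies must share one unit.
    """
    return math.log(flux_1 / flux_2) / math.log(freq_1 / freq_2)


def extrapolate_flux(flux_ref, freq_ref, freq_target, alpha):
    """Flux density at freq_target predicted by the same power law."""
    return flux_ref * (freq_target / freq_ref) ** alpha


# Hypothetical source: 1.0 Jy at 182 MHz with the catalogue's median index.
flux_1400 = extrapolate_flux(1.0, 182.0, 1400.0, -0.85)

# Recovering the index from the two flux densities gives back -0.85.
alpha = spectral_index(1.0, 182.0, flux_1400, 1400.0)
```

A negative index means the source is brighter at lower frequencies; an index closer to zero at 182 MHz than at higher frequencies corresponds to the spectral flattening mentioned in the abstract.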

preprint | 2016 | arXiv

CHIPS: The Cosmological HI Power Spectrum Estimator

Detection of the cosmological neutral hydrogen signal from the Epoch of Reionization, and estimation of its basic physical parameters, is the principal scientific aim of many current low-frequency radio telescopes. Here we describe the Cosmological HI Power Spectrum Estimator (CHIPS), an algorithm developed and implemented with data from the Murchison Widefield Array (MWA), to compute the two-dimensional and spherically-averaged power spectrum of brightness temperature fluctuations. The principal motivations for CHIPS are the application of realistic instrumental and foreground models to form the optimal estimator, thereby maximising the likelihood of unbiased signal estimation, and allowing a full covariant understanding of the outputs. CHIPS employs an inverse-covariance weighting of the data through the maximum likelihood estimator, thereby allowing use of the full parameter space for signal estimation ("foreground suppression"). We describe the motivation for the algorithm, implementation, application to real and simulated data, and early outputs. Upon application to a set of 3 hours of data, we set a 2$\sigma$ upper limit on the EoR dimensionless power at $k=0.05$ h Mpc$^{-1}$ of $\Delta_k^2<7.6\times10^4$ mK$^2$ in the redshift range $z=[6.2-6.6]$, consistent with previous estimates.

preprint | 2016 | arXiv

Delay Spectrum with Phase-Tracking Arrays: Extracting the HI power spectrum from the Epoch of Reionization

The detection of redshifted 21 cm emission from the epoch of reionization (EoR) is a challenging task owing to strong foregrounds that dominate the signal. In this paper, we propose a general method, based on the delay spectrum approach, to extract HI power spectra that is applicable to tracking observations using an imaging radio interferometer (Delay Spectrum with Imaging Arrays (DSIA)). Our method is based on modelling the HI signal taking into account the impact of wide field effects such as the $w$-term which are then used as appropriate weights in cross-correlating the measured visibilities. Our method is applicable to any radio interferometer that tracks a phase center and could be utilized for arrays such as MWA, LOFAR, GMRT, PAPER and HERA. In the literature the delay spectrum approach has been implemented for near-redundant baselines using drift scan observations. In this paper we explore the scheme for non-redundant tracking arrays, and this is the first application of delay spectrum methodology to such data to extract the HI signal. We analyze 3 hours of MWA tracking data on the EoR1 field. We present both 2-dimensional ($k_\parallel,k_\perp$) and 1-dimensional (k) power spectra from the analysis. Our results are in agreement with the findings of other pipelines developed to analyse the MWA EoR data.

preprint | 2016 | arXiv

First Season MWA EoR Power Spectrum Results at Redshift 7

The Murchison Widefield Array (MWA) has collected hundreds of hours of Epoch of Reionization (EoR) data and now faces the challenge of overcoming foreground and systematic contamination to reduce the data to a cosmological measurement. We introduce several novel analysis techniques such as cable reflection calibration, hyper-resolution gridding kernels, diffuse foreground model subtraction, and quality control methods. Each change to the analysis pipeline is tested against a two-dimensional power spectrum figure of merit to demonstrate improvement. We incorporate the new techniques into a deep integration of 32 hours of MWA data. This data set is used to place a systematic-limited upper limit on the cosmological power spectrum of $\Delta^2 \leq 2.7 \times 10^4$ mK$^2$ at $k=0.27$ h Mpc$^{-1}$ and $z=7.1$, consistent with other published limits, and a modest improvement (factor of 1.4) over previous MWA results. From this deep analysis we have identified a list of improvements to be made to our EoR data analysis strategies. These improvements will be implemented in the future and detailed in upcoming publications.
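
The limit above is quoted as a dimensionless power. A minimal sketch of the conversion to and from the power spectrum P(k) follows, assuming the widely used convention Δ²(k) = k³ P(k) / (2π²); the abstract does not state the paper's convention, and the function names are hypothetical.

```python
import math


def dimensionless_power(k, power):
    """Delta^2(k) = k^3 P(k) / (2 pi^2), the assumed convention.

    k in h/Mpc and P(k) in mK^2 (Mpc/h)^3 give Delta^2 in mK^2.
    """
    return k ** 3 * power / (2.0 * math.pi ** 2)


def power_from_dimensionless(k, delta_sq):
    """Inverse of dimensionless_power under the same convention."""
    return delta_sq * 2.0 * math.pi ** 2 / k ** 3


# The quoted limit: Delta^2 <= 2.7e4 mK^2 at k = 0.27 h/Mpc.
p_limit = power_from_dimensionless(0.27, 2.7e4)

# Converting back recovers the quoted dimensionless value.
delta_sq_check = dimensionless_power(0.27, p_limit)
```

Because of the k³ factor, limits quoted at different k values are not directly comparable as raw numbers; comparisons between results are normally made at matching scale and redshift.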

preprint | 2016 | arXiv

Low frequency observations of linearly polarized structures in the interstellar medium near the south Galactic pole

We present deep polarimetric observations at 154 MHz with the Murchison Widefield Array (MWA), covering 625 deg^2 centered on RA=0 h, Dec=-27 deg. The sensitivity available in our deep observations allows an in-band, frequency-dependent analysis of polarized structure for the first time at long wavelengths. Our analysis suggests that the polarized structures are dominated by intrinsic emission but may also have a foreground Faraday screen component. At these wavelengths, the compactness of the MWA baseline distribution provides excellent snapshot sensitivity to large-scale structure. The observations are sensitive to diffuse polarized emission at ~54' resolution with a sensitivity of 5.9 mJy beam^-1 and compact polarized sources at ~2.4' resolution with a sensitivity of 2.3 mJy beam^-1 for a subset (400 deg^2) of this field. The sensitivity allows the effect of ionospheric Faraday rotation to be spatially and temporally measured directly from the diffuse polarized background. Our observations reveal large-scale structures (~1 deg - 8 deg in extent) in linear polarization clearly detectable in ~2 minute snapshots, which would remain undetectable by interferometers with minimum baseline lengths >110 m at 154 MHz. The brightness temperature of these structures is on average 4 K in polarized intensity, peaking at 11 K. Rotation measure synthesis reveals that the structures have Faraday depths ranging from -2 rad m^-2 to 10 rad m^-2 with a large fraction peaking at ~+1 rad m^-2. We estimate a distance of 51+/-20 pc to the polarized emission based on measurements of the in-field pulsar J2330-2005. We detect four extragalactic linearly polarized point sources within the field in our compact source survey. Based on the known polarized source population at 1.4 GHz and non-detections at 154 MHz, we estimate an upper limit on the depolarization ratio of 0.08 from 1.4 GHz to 154 MHz.
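
Two of the numbers in this abstract can be sanity-checked with a short sketch. It assumes the standard Faraday rotation relation (rotation angle = Faraday depth × λ²) and the order-of-magnitude rule that an interferometer resolves out structure larger than roughly λ / b_min. The function names are hypothetical and the λ / b_min rule is an approximation, not the paper's calculation.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0  # metres per second


def wavelength_m(freq_hz):
    """Observing wavelength in metres."""
    return SPEED_OF_LIGHT / freq_hz


def faraday_rotation_rad(faraday_depth, freq_hz):
    """Polarization angle rotation in radians: depth (rad m^-2) times lambda^2."""
    return faraday_depth * wavelength_m(freq_hz) ** 2


def largest_angular_scale_deg(min_baseline_m, freq_hz):
    """Approximate largest recoverable angular scale, lambda / b_min, in degrees."""
    return math.degrees(wavelength_m(freq_hz) / min_baseline_m)


# At 154 MHz a Faraday depth of +1 rad m^-2 rotates the angle by several radians.
rotation = faraday_rotation_rad(1.0, 154e6)

# A 110 m minimum baseline corresponds to an angular scale of about one degree.
scale = largest_angular_scale_deg(110.0, 154e6)
```

Under these assumptions the 110 m cutoff maps to roughly one degree, in line with the abstract's statement that structures of about 1 to 8 degrees would be missed by arrays lacking shorter baselines.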

preprint | 2016 | arXiv

Parametrising Epoch of Reionization foregrounds: A deep survey of low-frequency point-source spectra with the MWA

Experiments that pursue detection of signals from the Epoch of Reionization (EoR) are relying on spectral smoothness of source spectra at low frequencies. This article empirically explores the effect of foreground spectra on EoR experiments by measuring high-resolution full-polarization spectra for the 586 brightest unresolved sources in one of the MWA EoR fields using 45 h of observation. A novel peeling scheme is used to subtract 2500 sources from the visibilities with ionospheric and beam corrections, resulting in the deepest, confusion-limited MWA image so far. The resulting spectra are found to be affected by instrumental effects, which limit the constraints that can be set on source-intrinsic spectral structure. The sensitivity and power-spectrum of the spectra are analysed, and it is found that the spectra of residuals are dominated by PSF sidelobes from nearby undeconvolved sources. We release a catalogue describing the spectral parameters for each measured source.

preprint | 2016 | arXiv

The Importance of Wide-field Foreground Removal for 21 cm Cosmology: A Demonstration With Early MWA Epoch of Reionization Observations

In this paper we present observations, simulations, and analysis demonstrating the direct connection between the location of foreground emission on the sky and its location in cosmological power spectra from interferometric redshifted 21 cm experiments. We begin with a heuristic formalism for understanding the mapping of sky coordinates into the cylindrically averaged power spectra measurements used by 21 cm experiments, with a focus on the effects of the instrument beam response and the associated sidelobes. We then demonstrate this mapping by analyzing power spectra with both simulated and observed data from the Murchison Widefield Array. We find that removing a foreground model which includes sources in both the main field-of-view and the first sidelobes reduces the contamination in high k_parallel modes by several percent relative to a model which only includes sources in the main field-of-view, with the completeness of the foreground model setting the principal limitation on the amount of power removed. While small, a percent-level amount of foreground power is in itself more than enough to prevent recovery of any EoR signal from these modes. This result demonstrates that foreground subtraction for redshifted 21 cm experiments is truly a wide-field problem, and algorithms and simulations must extend beyond the main instrument field-of-view to potentially recover the full 21 cm power spectrum.

preprint | 2016 | arXiv

The Murchison Widefield Array 21 cm Power Spectrum Analysis Methodology

We present the 21 cm power spectrum analysis approach of the Murchison Widefield Array Epoch of Reionization project. In this paper, we compare the outputs of multiple pipelines for the purpose of validating statistical limits on cosmological hydrogen at redshifts between 6 and 12. Multiple, independent, data calibration and reduction pipelines are used to make power spectrum limits on a fiducial night of data. Comparing the outputs of imaging and power spectrum stages highlights differences in calibration, foreground subtraction and power spectrum calculation. The power spectra found using these different methods span a space defined by the various tradeoffs between speed, accuracy, and systematic control. Lessons learned from comparing the pipelines range from the algorithmic to the prosaically mundane; all demonstrate the many pitfalls of neglecting reproducibility. We briefly discuss the way these different methods attempt to handle the question of evaluating a significant detection in the presence of foregrounds.

preprint | 2015 | arXiv

Confirmation of Wide-Field Signatures in Redshifted 21 cm Power Spectra

We confirm our recent prediction of the "pitchfork" foreground signature in power spectra of high-redshift 21 cm measurements where the interferometer is sensitive to large-scale structure on all baselines. This is due to the inherent response of a wide-field instrument and is characterized by enhanced power from foreground emission in Fourier modes adjacent to those considered to be the most sensitive to the cosmological H I signal. In our recent paper, many signatures from the simulation that predicted this feature were validated against Murchison Widefield Array (MWA) data, but this key pitchfork signature was close to the noise level. In this paper, we improve the data sensitivity through the coherent averaging of 12 independent snapshots with identical instrument settings and provide the first confirmation of the prediction with a signal-to-noise ratio > 10. This wide-field effect can be mitigated by careful antenna designs that suppress sensitivity near the horizon. Simple models for antenna apertures that have been proposed for future instruments such as the Hydrogen Epoch of Reionization Array and the Square Kilometre Array indicate they should suppress foreground leakage from the pitchfork by ~40 dB relative to the MWA and significantly increase the likelihood of cosmological signal detection in these critical Fourier modes in the three-dimensional power spectrum.

preprint | 2015 | arXiv

Empirical Covariance Modeling for 21 cm Power Spectrum Estimation: A Method Demonstration and New Limits from Early Murchison Widefield Array 128-Tile Data

The separation of the faint cosmological background signal from bright astrophysical foregrounds remains one of the most daunting challenges of mapping the high-redshift intergalactic medium with the redshifted 21 cm line of neutral hydrogen. Advances in mapping and modeling of diffuse and point source foregrounds have improved subtraction accuracy, but no subtraction scheme is perfect. Precisely quantifying the errors and error correlations due to missubtracted foregrounds allows for both the rigorous analysis of the 21 cm power spectrum and for the maximal isolation of the "EoR window" from foreground contamination. We present a method to infer the covariance of foreground residuals from the data itself in contrast to previous attempts at a priori modeling. We demonstrate our method by setting limits on the power spectrum using a 3 h integration from the 128-tile Murchison Widefield Array. Observing between 167 and 198 MHz, we find at 95% confidence a best limit of $\Delta^2(k) < 3.7 \times 10^4$ mK$^2$ at comoving scale $k = 0.18$ h Mpc$^{-1}$ and at $z = 6.8$, consistent with existing limits.

preprint | 2015 | arXiv

Foregrounds in Wide-Field Redshifted 21 cm Power Spectra

Detection of 21 cm emission of HI from the epoch of reionization, at redshifts z>6, is limited primarily by foreground emission. We investigate the signatures of wide-field measurements and an all-sky foreground model using the delay spectrum technique that maps the measurements to foreground object locations through signal delays between antenna pairs. We demonstrate interferometric measurements are inherently sensitive to all scales, including the largest angular scales, owing to the nature of wide-field measurements. These wide-field effects are generic to all observations but antenna shapes impact their amplitudes substantially. A dish-shaped antenna yields the most desirable features from a foreground contamination viewpoint, relative to a dipole or a phased array. Comparing data from recent Murchison Widefield Array observations, we demonstrate that the foreground signatures that have the largest impact on the HI signal arise from power received far away from the primary field of view. We identify diffuse emission near the horizon as a significant contributing factor, even on wide antenna spacings that usually represent structures on small scales. For signals entering through the primary field of view, compact emission dominates the foreground contamination. These two mechanisms imprint a characteristic "pitchfork" signature on the "foreground wedge" in Fourier delay space. Based on these results, we propose that selective down-weighting of data based on antenna spacing and time can mitigate foreground contamination substantially by a factor ~100 with negligible loss of sensitivity.
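
The mapping between sky position and signal delay mentioned in the abstract can be sketched with the geometric delay of a single baseline, τ = b · ŝ / c. This is a minimal illustration only: the coordinate convention (local east, north, up; azimuth measured from north through east), the function names and the 100 m baseline are assumptions, not details from the paper.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0  # metres per second


def source_unit_vector(azimuth_deg, altitude_deg):
    """Unit vector toward a source in local (east, north, up) coordinates."""
    az = math.radians(azimuth_deg)
    alt = math.radians(altitude_deg)
    return (math.cos(alt) * math.sin(az),
            math.cos(alt) * math.cos(az),
            math.sin(alt))


def geometric_delay_s(baseline_enu_m, azimuth_deg, altitude_deg):
    """Geometric delay in seconds between two antennas: (b . s) / c."""
    s = source_unit_vector(azimuth_deg, altitude_deg)
    projected = sum(b * c for b, c in zip(baseline_enu_m, s))
    return projected / SPEED_OF_LIGHT


def horizon_delay_s(baseline_enu_m):
    """Largest possible geometric delay, |b| / c, reached at the horizon."""
    length = math.sqrt(sum(b * b for b in baseline_enu_m))
    return length / SPEED_OF_LIGHT


# Hypothetical 100 m east-west baseline.
baseline = (100.0, 0.0, 0.0)

tau_zenith = geometric_delay_s(baseline, 0.0, 90.0)   # source overhead
tau_horizon = geometric_delay_s(baseline, 90.0, 0.0)  # source due east on the horizon
tau_max = horizon_delay_s(baseline)
```

A source overhead arrives at both antennas almost simultaneously, while emission near the horizon along the baseline direction sits at the maximum delay. In this picture the horizon emission discussed in the abstract lands at the outer edge of the foreground wedge.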

preprint | 2015 | arXiv

The low-frequency environment of the Murchison Widefield Array: radio-frequency interference analysis and mitigation

The Murchison Widefield Array (MWA) is a new low-frequency interferometric radio telescope built in Western Australia at one of the locations of the future Square Kilometre Array (SKA). We describe the automated radio-frequency interference (RFI) detection strategy implemented for the MWA, which is based on the AOFlagger platform, and present 72-231-MHz RFI statistics from 10 observing nights. RFI detection removes 1.1% of the data. RFI from digital TV (DTV) is observed 3% of the time due to occasional ionospheric or atmospheric propagation. After RFI detection and excision, almost all data can be calibrated and imaged without further RFI mitigation efforts, including observations within the FM and DTV bands. The results are compared to a previously published Low-Frequency Array (LOFAR) RFI survey. The remote location of the MWA results in a substantially cleaner RFI environment compared to LOFAR's radio environment, but adequate detection of RFI is still required before data can be analysed. We include specific recommendations designed to make the SKA more robust to RFI, including: the availability of sufficient computing power for RFI detection; accounting for RFI in the receiver design; a smooth band-pass response; and the capability of RFI detection at high time and frequency resolution (second and kHz-scale respectively).