Researcher profile

Sean E. Lake

Sean E. Lake contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2022arXiv

An Exploration of How Training Set Composition Bias in Machine Learning Affects Identifying Rare Objects

When training a machine learning classifier on data where one of the classes is intrinsically rare, the classifier will often assign too few sources to the rare class. To address this, it is common to up-weight the examples of the rare class to ensure it isn't ignored. It is also a frequent practice to train on restricted data where the balance of source types is closer to equal for the same reason. Here we show that these practices can bias the model toward over-assigning sources to the rare class. We also explore how to detect when training data bias has had a statistically significant impact on the trained model's predictions, and how to reduce the bias's impact. While the magnitude of the impact of the techniques developed here will vary with the details of the application, for most cases it should be modest. They are, however, universally applicable to every time a machine learning classification model is used, making them analogous to Bessel's correction to the sample variance.

preprint2019arXiv

The Contribution of Galaxies to the $3.4\,\mathrm{μm}$ Cosmic Infrared Background as Measured Using WISE

The study of the extragalactic background light (EBL) in the optical and near infrared has received a lot of attention in the last decade, especially near a wavelength of $λ\approx 3.4\operatorname{μm}$, with remaining tension among different techniques for estimating the background. In this paper we present a measurement of the contribution of galaxies to the EBL at $3.4\operatorname{μm}$ that is based on the measurement of the luminosity function (LF) in Lake et al. (2018) and the mean spectral energy distribution of galaxies in Lake & Wright (2016). The mean and standard deviation of our most reliable Bayesian posterior chain gives a $3.4\operatorname{μm}$ background of $I_ν= 9.0\pm0.5 \operatorname{kJy} \operatorname{sr}^{-1}$ ($νI_ν= 8.0\pm0.4 \operatorname{nW} \operatorname{m}^{-2} \operatorname{sr}^{-1} e\operatorname{-fold}^{-1}$), with systematic uncertainties unlikely to be greater than $2\operatorname{kJy} \operatorname{sr}^{-1}$. This result is higher than most previous efforts to measure the contribution of galaxies to the $3.4\operatorname{μm}$ EBL, but is consistent with the upper limits placed by blazars and the most recent direct measurements of the total $3.4\operatorname{μm}$ EBL.