Source author record

Li-Yen Hsu

Li-Yen Hsu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

astro-ph.GA astro-ph.CO cs.CY Machine Learning

Catalog footprint

What is connected

7works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2021arXiv

Accurate and Interpretable Machine Learning for Transparent Pricing of Health Insurance Plans

Health insurance companies cover half of the United States population through commercial employer-sponsored health plans and pay 1.2 trillion US dollars every year to cover medical expenses for their members. The actuary and underwriter roles at a health insurance company serve to assess which risks to take on and how to price those risks to ensure profitability of the organization. While Bayesian hierarchical models are the current standard in the industry to estimate risk, interest in machine learning as a way to improve upon these existing methods is increasing. Lumiata, a healthcare analytics company, ran a study with a large health insurance company in the United States. We evaluated the ability of machine learning models to predict the per member per month cost of employer groups in their next renewal period, especially those groups who will cost less than 95\% of what an actuarial model predicts (groups with "concession opportunities"). We developed a sequence of two models, an individual patient-level and an employer-group-level model, to predict the annual per member per month allowed amount for employer groups, based on a population of 14 million patients. Our models performed 20\% better than the insurance carrier's existing pricing model, and identified 84\% of the concession opportunities. This study demonstrates the application of a machine learning system to compute an accurate and fair price for health insurance products and analyzes how explainable machine learning models can exceed actuarial models' predictive accuracy while maintaining interpretability.

preprint2018arXiv

A Submillimeter Perspective on the GOODS Fields (SUPER GOODS). III. A Large Sample of ALMA Sources in the GOODS-S

We analyze the >4-sigma sources in the most sensitive 100 arcmin^2 area (rms <0.56 mJy) of a SCUBA-2 850 micron survey of the GOODS-S and present the 75 band 7 ALMA sources (>4.5-sigma) obtained from high-resolution interferometric follow-up observations. The SCUBA-2---and hence ALMA---samples should be complete to 2.25 mJy. Of the 53 SCUBA-2 sources in this complete sample, only five have no ALMA detections, while 13% (68% confidence range 7-19%) have multiple ALMA counterparts. Color-based high-redshift dusty galaxy selection techniques find at most 55% of the total ALMA sample. In addition to using literature spectroscopic and optical/NIR photometric redshifts, we estimate FIR photometric redshifts based on an Arp 220 template. We identify seven z>4 candidates. We see the expected decline with redshift of the 4.5 micron and 24 micron to 850 micron flux ratios, confirming these as good diagnostics of z>4 candidates. We visually classify 52 ALMA sources, finding 44% (68% confidence range 35-53%) to be apparent mergers. We calculate rest-frame 2-8 keV and 8-28 keV luminosities using the 7 Ms Chandra X-ray image. Nearly all of the ALMA sources detected at 0.5-2 keV are consistent with a known X-ray luminosity to 850 micron flux relation for star-forming galaxies, while most of those detected at 2-7 keV are moderate luminosity AGNs that lie just above the 2-7 keV detection threshold. The latter largely have substantial obscurations of log N_H = 23-24 cm^-2, but two of the high-redshift candidates may even be Compton thick.

preprint2017arXiv

A Submillimeter Perspective on the GOODS Fields (SUPER GOODS) - I. An Ultradeep SCUBA-2 Survey of the GOODS-N

In this first paper in the SUPER GOODS series on powerfully star-forming galaxies in the two GOODS fields, we present a deep SCUBA-2 survey of the GOODS-N at both 850 and 450 micron (central rms noise of 0.28 mJy and 2.6 mJy, respectively). In the central region the 850 micron observations cover the GOODS-N to near the confusion limit of ~1.65 mJy, while over a wider 450 arcmin^2 region---well complemented by Herschel far-infrared imaging---they have a median 4-sigma limit of 3.5 mJy. We present >4-sigma catalogs of 186 850 micron and 31 450 micron selected sources. We use interferometric observations from the SMA and the VLA to obtain precise positions for 114 SCUBA-2 sources (28 from the SMA, all of which are also VLA sources). We present new spectroscopic redshifts and include all existing spectroscopic or photometric redshifts. We also compare redshifts estimated using the 20 cm to 850 micron and the 250 micron to 850 micron flux ratios. We show that the redshift distribution increases with increasing flux, and we parameterize the dependence. We compute the star formation history and the star formation rate (SFR) density distribution functions in various redshift intervals, finding that they reach a peak at z=2-3 before dropping to higher redshifts. We show that the number density per unit volume of SFR>500 solar mass per year galaxies measured from the SCUBA-2 sample does not change much relative to that of lower SFR galaxies from UV selected samples over z=2-5, suggesting that, apart from changes in the normalization, the shape in the number density as a function of SFR is invariant over this redshift interval.

preprint2016arXiv

The Hawaii SCUBA-2 Lensing Cluster Survey: Number Counts and Submillimeter Flux Ratios

We present deep number counts at 450 and 850 $μ$m using the SCUBA-2 camera on the James Clerk Maxwell Telescope. We combine data for six lensing cluster fields and three blank fields to measure the counts over a wide flux range at each wavelength. Thanks to the lensing magnification, our measurements extend to fluxes fainter than 1 mJy and 0.2 mJy at 450 $μ$m and 850 $μ$m, respectively. Our combined data highly constrain the faint end of the number counts. Integrating our counts shows that the majority of the extragalactic background light (EBL) at each wavelength is contributed by faint sources with $L_{\rm IR} < 10^{12} L_{\odot }$, corresponding to luminous infrared galaxies (LIRGs) or normal galaxies. By comparing our result with the 500 $μ$m stacking of $K$-selected sources from the literature, we conclude that the $K$-selected LIRGs and normal galaxies still cannot fully account for the EBL that originates from sources with $L_{\rm IR} < 10^{12} L_{\odot }$. This suggests that many faint submillimeter galaxies may not be included in the UV star formation history. We also explore the submillimeter flux ratio between the two bands for our 450 $μ$m and 850 $μ$m selected sources. At 850 $μ$m, we find a clear relation between the flux ratio and the observed flux. This relation can be explained by a redshift evolution, where galaxies at higher redshifts have higher luminosities and star formation rates. In contrast, at 450 $μ$m, we do not see a clear relation between the flux ratio and the observed flux.

preprint2014arXiv

Compact Quiescent Galaxies at Intermediate Redshifts

From several searches of the area common to the Sloan Digital Sky Survey and the United Kingdom Infrared Telescope Infrared Deep Sky Survey, we have selected 22 luminous galaxies between $z \sim$ 0.4 and $z \sim$ 0.9 that have colors and sizes similar to those of the compact quiescent galaxies at $z>2$. By exploring structural parameters and stellar populations, we found that most of these galaxies actually formed most of their stars at $z<2$ and are generally less compact than those found at $z > 2$. Several of these young objects are disk-like or possibly prolate. This lines up with several previous studies which found that massive quiescent galaxies at high redshifts often have disk-like morphologies. If these galaxies were to be confirmed to be disk-like, their formation mechanism must be able to account for both compactness and disks. On the other hand, if these galaxies were to be confirmed to be prolate, the fact that prolate galaxies do not exist in the local universe would indicate that galaxy formation mechanisms have evolved over cosmic time. We also found five galaxies forming over 80% of their stellar masses at $z>2$. Three of these galaxies appear to have been modified to have spheroid-like morphologies, in agreement with the scenario of "inside-out" buildup of massive galaxies. The remaining galaxies, SDSS\,J014355.21+133451.4 and SDSS\,J115836.93+021535.1, have truly old stellar populations and disk-like morphologies. These two objects would be good candidates for nearly unmodified compact quiescent galaxies from high redshifts that are worth future study.

preprint2012arXiv

Cluster Mass Profiles from a Bayesian Analysis of Weak Lensing Distortion and Magnification Measurements: Applications to Subaru Data

We directly construct model-independent mass profiles of galaxy clusters from combined weak-lensing distortion and magnification measurements within a Bayesian statistical framework,which allows for a full parameter-space extraction of the underlying signal. This method applies to the full range of radius outside the Einstein radius, and recovers the absolute mass normalization. We apply our method to deep Subaru imaging of five high-mass (>10^{15}M_{sun}) clusters, A1689, A1703, A370, Cl0024+17, and RXJ1347-11, to obtain accurate profiles to beyond the virial radius (r_{vir}). For each cluster the lens distortion and magnification data are shown to be consistent with each other, and the total signal-to-noise ratio of the combined measurements ranges from 13 to 24 per cluster. We form a model-independent mass profile from stacking the clusters, which is detected at 37σ out to R ~ 1.7r_{vir}. The projected logarithmic slope steepens from -1.01 \pm 0.09 at R ~ 0.1r_{vir} to -1.92 \pm 0.51 at R ~ 0.9r_{vir}. We also derive for each cluster inner strong-lensing based mass profiles from deep HST/ACS observations, which we show overlap well with the outer Subaru-based profiles and together are well described by a generalized form of the Navarro-Frenk-White profile, except for the ongoing merger RXJ1347-11, with modest variations in the central cusp slope (-dlnρ/dlnr < 0.9). The improvement here from adding the magnification measurements is significant, ~30% in terms of cluster mass profile measurements, compared with the lensing distortion signal.

preprint2012arXiv

The three-dimensional geometry and merger history of the massive galaxy cluster MACS J0358.8-2955

We present results of a combined X-ray/optical analysis of the dynamics of the massive cluster MACS J0358.8-2955 (z=0.428) based on observations with the Chandra X-ray Observatory, the Hubble Space Telescope, and the Keck-I telescope on Mauna Kea. MACS J0358.8-2955 is found to be one of the most X-ray luminous clusters known at z>0.3, featuring L_X(<r_500) = 4.24*10^45 erg/s, kT = (9.55 +0.58/-0.37) keV, M^{3D}_{gas}(<r_500) = (9.18+/-1.45)*10^13 M_sun, and M^{3D}_{tot}(<r_500) = (1.12+/-0.18)*10^15 M_sun. The system's high velocity dispersion of (1440 +130/-110) km/s (890 km/s when the correct relativistic equation is used), however, is inflated by infall along the line of sight, as the result of a complex merger of at least three sub-clusters. One collision proceeds close to head-on, while the second features a significant impact parameter. The temperature variations in the intra-cluster gas, two tentative cold fronts, the radial velocities measured for cluster galaxies, and the small offsets between collisional and non-collisional cluster components all suggest that both merger events are observed close to core passage and along axes that are greatly inclined with respect to the plane of the sky. A strong-lensing analysis of the system anchored upon three triple-image systems (two of which have spectroscopic redshifts) yields independent constraints on the mass distribution. For a gas fraction of 8.2%, the resulting strong-lensing mass profile is in good agreement with our X-ray estimates, and the details of the mass distribution are fully consistent with our interpretation of the three-dimensional merger history of this complex system.

Li-Yen Hsu

What is connected

Connect this record

See the researcher in context

Building this map preview

7 published item(s)

Accurate and Interpretable Machine Learning for Transparent Pricing of Health Insurance Plans

A Submillimeter Perspective on the GOODS Fields (SUPER GOODS). III. A Large Sample of ALMA Sources in the GOODS-S

A Submillimeter Perspective on the GOODS Fields (SUPER GOODS) - I. An Ultradeep SCUBA-2 Survey of the GOODS-N

The Hawaii SCUBA-2 Lensing Cluster Survey: Number Counts and Submillimeter Flux Ratios

Compact Quiescent Galaxies at Intermediate Redshifts

Cluster Mass Profiles from a Bayesian Analysis of Weak Lensing Distortion and Magnification Measurements: Applications to Subaru Data

The three-dimensional geometry and merger history of the massive galaxy cluster MACS J0358.8-2955