Researcher profile

Li-Yen Hsu

Li-Yen Hsu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2021arXiv

Accurate and Interpretable Machine Learning for Transparent Pricing of Health Insurance Plans

Health insurance companies cover half of the United States population through commercial employer-sponsored health plans and pay 1.2 trillion US dollars every year to cover medical expenses for their members. The actuary and underwriter roles at a health insurance company serve to assess which risks to take on and how to price those risks to ensure profitability of the organization. While Bayesian hierarchical models are the current standard in the industry to estimate risk, interest in machine learning as a way to improve upon these existing methods is increasing. Lumiata, a healthcare analytics company, ran a study with a large health insurance company in the United States. We evaluated the ability of machine learning models to predict the per member per month cost of employer groups in their next renewal period, especially those groups who will cost less than 95\% of what an actuarial model predicts (groups with "concession opportunities"). We developed a sequence of two models, an individual patient-level and an employer-group-level model, to predict the annual per member per month allowed amount for employer groups, based on a population of 14 million patients. Our models performed 20\% better than the insurance carrier's existing pricing model, and identified 84\% of the concession opportunities. This study demonstrates the application of a machine learning system to compute an accurate and fair price for health insurance products and analyzes how explainable machine learning models can exceed actuarial models' predictive accuracy while maintaining interpretability.

preprint2018arXiv

A Submillimeter Perspective on the GOODS Fields (SUPER GOODS). III. A Large Sample of ALMA Sources in the GOODS-S

We analyze the >4-sigma sources in the most sensitive 100 arcmin^2 area (rms <0.56 mJy) of a SCUBA-2 850 micron survey of the GOODS-S and present the 75 band 7 ALMA sources (>4.5-sigma) obtained from high-resolution interferometric follow-up observations. The SCUBA-2---and hence ALMA---samples should be complete to 2.25 mJy. Of the 53 SCUBA-2 sources in this complete sample, only five have no ALMA detections, while 13% (68% confidence range 7-19%) have multiple ALMA counterparts. Color-based high-redshift dusty galaxy selection techniques find at most 55% of the total ALMA sample. In addition to using literature spectroscopic and optical/NIR photometric redshifts, we estimate FIR photometric redshifts based on an Arp 220 template. We identify seven z>4 candidates. We see the expected decline with redshift of the 4.5 micron and 24 micron to 850 micron flux ratios, confirming these as good diagnostics of z>4 candidates. We visually classify 52 ALMA sources, finding 44% (68% confidence range 35-53%) to be apparent mergers. We calculate rest-frame 2-8 keV and 8-28 keV luminosities using the 7 Ms Chandra X-ray image. Nearly all of the ALMA sources detected at 0.5-2 keV are consistent with a known X-ray luminosity to 850 micron flux relation for star-forming galaxies, while most of those detected at 2-7 keV are moderate luminosity AGNs that lie just above the 2-7 keV detection threshold. The latter largely have substantial obscurations of log N_H = 23-24 cm^-2, but two of the high-redshift candidates may even be Compton thick.

preprint2017arXiv

A Submillimeter Perspective on the GOODS Fields (SUPER GOODS) - I. An Ultradeep SCUBA-2 Survey of the GOODS-N

In this first paper in the SUPER GOODS series on powerfully star-forming galaxies in the two GOODS fields, we present a deep SCUBA-2 survey of the GOODS-N at both 850 and 450 micron (central rms noise of 0.28 mJy and 2.6 mJy, respectively). In the central region the 850 micron observations cover the GOODS-N to near the confusion limit of ~1.65 mJy, while over a wider 450 arcmin^2 region---well complemented by Herschel far-infrared imaging---they have a median 4-sigma limit of 3.5 mJy. We present >4-sigma catalogs of 186 850 micron and 31 450 micron selected sources. We use interferometric observations from the SMA and the VLA to obtain precise positions for 114 SCUBA-2 sources (28 from the SMA, all of which are also VLA sources). We present new spectroscopic redshifts and include all existing spectroscopic or photometric redshifts. We also compare redshifts estimated using the 20 cm to 850 micron and the 250 micron to 850 micron flux ratios. We show that the redshift distribution increases with increasing flux, and we parameterize the dependence. We compute the star formation history and the star formation rate (SFR) density distribution functions in various redshift intervals, finding that they reach a peak at z=2-3 before dropping to higher redshifts. We show that the number density per unit volume of SFR>500 solar mass per year galaxies measured from the SCUBA-2 sample does not change much relative to that of lower SFR galaxies from UV selected samples over z=2-5, suggesting that, apart from changes in the normalization, the shape in the number density as a function of SFR is invariant over this redshift interval.