Researcher profile

Daniel Fryer

Daniel Fryer contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2021arXiv

Shapley values for feature selection: The good, the bad, and the axioms

The Shapley value has become popular in the Explainable AI (XAI) literature, thanks, to a large extent, to a solid theoretical foundation, including four "favourable and fair" axioms for attribution in transferable utility games. The Shapley value is provably the only solution concept satisfying these axioms. In this paper, we introduce the Shapley value and draw attention to its recent uses as a feature selection tool. We call into question this use of the Shapley value, using simple, abstract "toy" counterexamples to illustrate that the axioms may work against the goals of feature selection. From this, we develop a number of insights that are then investigated in concrete simulation settings, with a variety of Shapley value formulations, including SHapley Additive exPlanations (SHAP) and Shapley Additive Global importancE (SAGE).

preprint2020arXiv

SARGDV: Efficient identification of groundwater-dependent vegetation using synthetic aperture radar

Groundwater depletion impacts the sustainability of numerous groundwater-dependent vegetation (GDV) globally, placing significant stress on their capacity to provide environmental and ecological support for flora, fauna, and anthropic benefits. Industries such as mining, agriculture, and plantations are heavily reliant on groundwater, the over-exploitation of which risks impacting groundwater regimes, quality, and accessibility for nearby GDVs. Cost effective methods of GDV identification will enable strategic protection of these critical ecological systems, through improved and sustainable groundwater management by communities and industry. Recent application of synthetic aperture radar (SAR) earth observation data in Australia has demonstrated the utility of radar for identifying terrestrial groundwater-dependent ecosystems at scale. We propose a robust classification method to advance identification of GDVs at scale using processed SAR data products adapted from a recent previous method. The method includes the development of SARGDV, a binary classification model, which uses the extreme gradient boosting (XGBoost) algorithm in conjunction with three data cubes composed of Sentinel-1 SAR interferometric wide images. The images were collected as a one-year time series over Mount Gambier, a region in South Australia, known to support GDVs. The SARGDV model demonstrated high performance for classifying GDVs with 77% precision, 76% true positive rate and 96% accuracy. This method may be used to support the protection of GDV communities globally by providing a long term, cost-effective solution to identify GDVs over variable regions and climates, via the use of freely available, high-resolution, globally available Sentinel-1 SAR data sets.

preprint2020arXiv

Shapley value confidence intervals for attributing variance explained

The coefficient of determination, the $R^2$, is often used to measure the variance explained by an affine combination of multiple explanatory covariates. An attribution of this explanatory contribution to each of the individual covariates is often sought in order to draw inference regarding the importance of each covariate with respect to the response phenomenon. A recent method for ascertaining such an attribution is via the game theoretic Shapley value decomposition of the coefficient of determination. Such a decomposition has the desirable efficiency, monotonicity, and equal treatment properties. Under a weak assumption that the joint distribution is pseudo-elliptical, we obtain the asymptotic normality of the Shapley values. We then utilize this result in order to construct confidence intervals and hypothesis tests regarding such quantities. Monte Carlo studies regarding our results are provided. We found that our asymptotic confidence intervals are computationally superior to competing bootstrap methods and are able to improve upon the performance of such intervals. In an expository application to Australian real estate price modelling, we employ Shapley value confidence intervals to identify significant differences between the explanatory contributions of covariates, between models, which otherwise share approximately the same $R^2$ value. These different models are based on real estate data from the same periods in 2019 and 2020, the latter covering the early stages of the arrival of the novel coronavirus, COVID-19.