Researcher profile

Matthew Malloy

Matthew Malloy contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2022arXiv

Geometry of the Minimum Volume Confidence Sets

Computation of confidence sets is central to data science and machine learning, serving as the workhorse of A/B testing and underpinning the operation and analysis of reinforcement learning algorithms. This paper studies the geometry of the minimum-volume confidence sets for the multinomial parameter. When used in place of more standard confidence sets and intervals based on bounds and asymptotic approximation, learning algorithms can exhibit improved sample complexity. Prior work showed the minimum-volume confidence sets are the level-sets of a discontinuous function defined by an exact p-value. While the confidence sets are optimal in that they have minimum average volume, computation of membership of a single point in the set is challenging for problems of modest size. Since the confidence sets are level-sets of discontinuous functions, little is apparent about their geometry. This paper studies the geometry of the minimum volume confidence sets by enumerating and covering the continuous regions of the exact p-value function. This addresses a fundamental question in A/B testing: given two multinomial outcomes, how can one determine if their corresponding minimum volume confidence sets are disjoint? We answer this question in a restricted setting.

preprint2020arXiv

Digital Contact Tracing Using IP Colocation

The spread of an infectious disease through a population can be modeled using a network or a graph. In digital advertising, internet device graphs are graph data sets that organize identifiers produced by mobile phones, PCs, TVs, and tablets as they access media on the internet. Characterized by immense scale, they have become ubiquitous as they enable targeted advertising, content customization and tracking. This paper posits that internet device graphs, in particular those based on IP colocation, can provide significant utility in predicting and modeling the spread of infectious disease. Starting the week of March 16th, 2020, in the United States, many individuals began to `shelter-in-place' as schools and workplaces across the nation closed because of the COVID-19 pandemic. This paper quantifies the effect of the shelter-in-place orders on a large scale internet device graph with more than a billion nodes by studying the graph before and after orders went into effect. The effects are clearly visible. The structure of the graph suggests behavior least conducive to transmission of infection occurred in the US between April 12th and 19th, 2020. This paper also discusses the utility of device graphs for i) contact tracing, ii) prediction of `hot spots', iii) simulation of infectious disease spread, and iv) delivery of advertisement-based warnings to potentially exposed individuals. The paper also posits an overarching question: can systems and datasets amassed by entities in the digital ad ecosystem aid in the fight against COVID-19?