Source author record

Stephen Hardy

Stephen Hardy appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Distributed, Parallel, and Cluster Computing Machine Learning math.FA math.LO math.OA

Catalog footprint

What is connected

2works

5topics

3close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2016arXiv

Fast Learning from Distributed Datasets without Entity Matching

Consider the following data fusion scenario: two datasets/peers contain the same real-world entities described using partially shared features, e.g. banking and insurance company records of the same customer base. Our goal is to learn a classifier in the cross product space of the two domains, in the hard case in which no shared ID is available -- e.g. due to anonymization. Traditionally, the problem is approached by first addressing entity matching and subsequently learning the classifier in a standard manner. We present an end-to-end solution which bypasses matching entities, based on the recently introduced concept of Rademacher observations (rados). Informally, we replace the minimisation of a loss over examples, which requires to solve entity resolution, by the equivalent minimisation of a (different) loss over rados. Among others, key properties we show are (i) a potentially huge subset of these rados does not require to perform entity matching, and (ii) the algorithm that provably minimizes the rado loss over these rados has time and space complexities smaller than the algorithm minimizing the equivalent example loss. Last, we relax a key assumption of the model, that the data is vertically partitioned among peers --- in this case, we would not even know the existence of a solution to entity resolution. In this more general setting, experiments validate the possibility of significantly beating even the optimal peer in hindsight.

preprint2016arXiv

Pseudocompact C$^*$-algebras

We study the class of pseudocompact C*-algebras, which are the logical limits of finite-dimensional C*-algebras. The pseudocompact C*-algebras are unital, stably finite, real rank zero, stable rank one, and tracial. We show that the pseudocompact C*-algebras have trivial K_1 groups and the Dixmier property. The class is stable under direct sums, tensoring by finite-dimensional C*-algebras, taking corners, and taking centers. We give an explicit axiomatization of the commutative pseudocompact C*-algebras. We also study the subclass of pseudomatricial C*-algebras, which have unique tracial states, strict comparison of projections, and trivial centers. We give some information about the K_0 groups of the pseudomatricial C*-algebras.

Stephen Hardy

What is connected

Connect this record

See the researcher in context

Building this map preview

2 published item(s)

Fast Learning from Distributed Datasets without Entity Matching

Pseudocompact C$^*$-algebras