Source author record

Dominic Dotterrer

Dominic Dotterrer appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

math.CO, math.GT, math.MG, Computation and Language, Cryptography and Security, Discrete Mathematics, Machine Learning, math.AT, math.DG, math.PR

Catalog footprint

What is connected

7 works

10 topics

4 close collaborators

preprint · 2023 · arXiv

Empirical Differential Privacy

We show how to achieve differential privacy with no or reduced added noise, based on the empirical noise in the data itself. Unlike previous works on noiseless privacy, the empirical viewpoint avoids making any explicit assumptions about the random process generating the data.

preprint · 2021 · arXiv

Long Document Summarization in a Low Resource Setting using Pretrained Language Models

Abstractive summarization is the task of compressing a long document into a coherent short document while retaining salient information. Modern abstractive summarization methods are based on deep neural networks which often require large training datasets. Since collecting summarization datasets is an expensive and time-consuming task, practical industrial settings are usually low-resource. In this paper, we study a challenging low-resource setting of summarizing long legal briefs with an average source document length of 4268 words and only 120 available (document, summary) pairs. To account for data scarcity, we used a modern pretrained abstractive summarizer BART (Lewis et al., 2020), which only achieves 17.9 ROUGE-L as it struggles with long documents. We thus attempt to compress these long documents by identifying salient sentences in the source which best ground the summary, using a novel algorithm based on GPT-2 (Radford et al., 2019) language model perplexity scores, that operates within the low resource regime. On feeding the compressed documents to BART, we observe a 6.0 ROUGE-L improvement. Our method also beats several competitive salience detection baselines. Furthermore, the identified salient sentences tend to agree with an independent human labeling by domain experts.

preprint · 2017 · arXiv

Quantitative null-cobordism

For a given null-cobordant Riemannian $n$-manifold, how does the minimal geometric complexity of a null-cobordism depend on the geometric complexity of the manifold? In [Gro99], Gromov conjectured that this dependence should be linear. We show that it is at most a polynomial whose degree depends on $n$. This construction relies on another of independent interest. Take $X$ and $Y$ to be sufficiently nice compact metric spaces, such as Riemannian manifolds or simplicial complexes. Suppose $Y$ is simply connected and rationally homotopy equivalent to a product of Eilenberg-MacLane spaces: for example, any simply connected Lie group. Then two homotopic $L$-Lipschitz maps $f, g : X \rightarrow Y$ are homotopic via a $CL$-Lipschitz homotopy. We present a counterexample to show that this is not true for larger classes of spaces $Y$.

preprint · 2016 · arXiv

On Expansion and Topological Overlap

We give a detailed and easily accessible proof of Gromov's Topological Overlap Theorem. Let $X$ be a finite simplicial complex or, more generally, a finite polyhedral cell complex of dimension $d$. Informally, the theorem states that if $X$ has sufficiently strong higher-dimensional expansion properties (which generalize edge expansion of graphs and are defined in terms of cellular cochains of $X$) then $X$ has the following topological overlap property: for every continuous map $X\rightarrow \mathbf{R}^d$ there exists a point $p\in \mathbf{R}^d$ that is contained in the images of a positive fraction $\mu>0$ of the $d$-cells of $X$. More generally, the conclusion holds if $\mathbf{R}^d$ is replaced by any $d$-dimensional piecewise-linear (PL) manifold $M$, with a constant $\mu$ that depends only on $d$ and on the expansion properties of $X$, but not on $M$.

preprint · 2014 · arXiv

Higher dimensional distortion of random complexes

Using the random complexes of Linial and Meshulam, we exhibit a large family of simplicial complexes for which, whenever affinely embedded into Euclidean space, the filling areas of simplicial cycles are greatly distorted. This phenomenon can be regarded as a higher order analogue of the metric distortion of embeddings of random graphs.

preprint · 2012 · arXiv

The filling problem in the cube

We prove an isoperimetric inequality for filling cellular cycles in a high dimensional cube with cellular chains. In addition, we provide a family of cubical cellular cycles for which the exponent in the inequality is optimal.

preprint · 2011 · arXiv

Coboundary expanders

We describe a natural topological generalization of edge expansion for graphs to regular CW complexes and prove that this property holds with high probability for certain random complexes.