Source author record

Benjamin Hsu

Benjamin Hsu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Computation and Language, cond-mat.stat-mech, quant-ph, Artificial Intelligence, cond-mat.mes-hall, cond-mat.str-el, hep-th

Catalog footprint

What is connected

7 works

7 topics

4 close collaborators

preprint, 2022, arXiv

CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality

The machine translation (MT) task is typically formulated as that of returning a single translation for an input segment. However, in many cases, multiple different translations are valid and the appropriate translation may depend on the intended target audience, characteristics of the speaker, or even the relationship between speakers. Specific problems arise when dealing with honorifics, particularly translating from English into languages with formality markers. For example, the sentence "Are you sure?" can be translated in German as "Sind Sie sich sicher?" (formal register) or "Bist du dir sicher?" (informal). Using wrong or inconsistent tone may be perceived as inappropriate or jarring for users of certain cultures and demographics. This work addresses the problem of learning to control target language attributes, in this case formality, from a small amount of labeled contrastive data. We introduce an annotated dataset (CoCoA-MT) and an associated evaluation metric for training and evaluating formality-controlled MT models for six diverse target languages. We show that we can train formality-controlled models by fine-tuning on labeled contrastive data, achieving high accuracy (82% in-domain and 73% out-of-domain) while maintaining overall quality.

preprint, 2022, arXiv

Contrastive Representation Learning for Cross-Document Coreference Resolution of Events and Entities

Identifying related entities and events within and across documents is fundamental to natural language understanding. We present an approach to entity and event coreference resolution utilizing contrastive representation learning. Earlier state-of-the-art methods have formulated this problem as a binary classification problem and leveraged large transformers in a cross-encoder architecture to achieve their results. For large collections of documents and a corresponding set of $n$ mentions, the necessity of performing $n^{2}$ transformer computations in these earlier approaches can be computationally intensive. We show that it is possible to reduce this burden by applying contrastive learning techniques that only require $n$ transformer computations at inference time. Our method achieves state-of-the-art results on a number of key metrics on the ECB+ corpus and is competitive on others.
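The cost argument in this abstract can be illustrated with a minimal sketch. It is not the paper's model: `toy_encode` is a hypothetical stand-in for a transformer encoder (a normalized letter-count vector), and the mention strings are invented. The point is only that embedding each mention once takes $n$ encoder calls, while scoring every pair jointly, as a cross-encoder does, takes a number of calls that grows as $n^{2}$.

```python
import math
from itertools import combinations

def toy_encode(mention):
    """Hypothetical stand-in for a transformer encoder:
    returns a unit-length letter-count vector for the mention."""
    vec = [0.0] * 26
    for ch in mention.lower():
        if "a" <= ch <= "z":
            vec[ord(ch) - ord("a")] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

def pairwise_similarities(mentions):
    # One encoder call per mention: n calls in total.
    embeddings = [toy_encode(m) for m in mentions]
    encoder_calls = len(mentions)
    # Pair scores are plain dot products; no further encoder calls are needed.
    scores = {}
    for i, j in combinations(range(len(mentions)), 2):
        scores[(i, j)] = sum(a * b for a, b in zip(embeddings[i], embeddings[j]))
    return scores, encoder_calls

def cross_encoder_calls(n):
    # A cross-encoder processes each unordered pair jointly:
    # n(n-1)/2 encoder calls, which grows as n^2.
    return n * (n - 1) // 2

# Invented example mentions, for illustration only.
mentions = ["earthquake", "the earthquake", "election", "quake"]
scores, calls = pairwise_similarities(mentions)
```

With four mentions the sketch makes 4 encoder calls, against 6 for pairwise joint scoring; the gap widens quadratically as the mention set grows.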

preprint, 2022, arXiv

Sockeye 3: Fast Neural Machine Translation with PyTorch

Sockeye 3 is the latest version of the Sockeye toolkit for Neural Machine Translation (NMT). Now based on PyTorch, Sockeye 3 provides faster model implementations and more advanced features with a further streamlined codebase. This enables broader experimentation with faster iteration, efficient training of stronger and faster models, and the flexibility to move new ideas quickly from research to production. When running comparable models, Sockeye 3 is up to 126% faster than other PyTorch implementations on GPUs and up to 292% faster on CPUs. Sockeye 3 is open source software released under the Apache 2.0 license.

preprint, 2013, arXiv

Dynamical stability of the quantum Lifshitz theory in 2+1 Dimensions

The role of magnetic and electric perturbations to the quantum Lifshitz model in 2+1 dimensions is examined in this paper. The quantum Lifshitz model is an effective field theory for quantum multicritical systems, which include generalized 2D quantum dimer models on bipartite lattices and their generalizations. It describes a class of quantum phase transitions between ordered and topological phases in 2+1 dimensions. Magnetic perturbations break the dimer conservation law. Electric excitations, whose condensation leads to ordered phases, have been studied extensively both in the classical 3D model and in the quantum 2D model. However, the role of magnetic vortex excitations, whose condensation drives these systems into a $\mathbb{Z}_2$ topological phase, has been largely ignored. To study the interplay of both excitations, we perform a perturbative renormalization group study to one-loop order and study the stability of the theory away from quantum multicriticality. This is done by generalizing the operator product expansion to anisotropic models. The relation with recent classical Monte Carlo simulations in Rokhsar-Kivelson wave functions will be discussed.

preprint, 2013, arXiv

Kramers-Wannier Duality of Statistical Mechanics Applied to the Boolean Satisfiability Problem of Computer Science

We present a novel application of the Kramers-Wannier duality to one of the most important problems of computer science, the Boolean satisfiability problem (SAT). More specifically, we focus on sharp-SAT, or equivalently #SAT: the problem of counting the number of solutions to a Boolean satisfaction formula. #SAT can be cast into a statistical-mechanical language, where it reduces to calculating the partition function of an Ising spin Hamiltonian with multi-spin interactions. We show that Kramers-Wannier duality can be generalized to apply to such multi-connected spin networks. We present an exact dual partner to #SAT and explicitly verify their equivalence with a few simple examples. It is shown that the NP-completeness of the original problem maps onto the complexity of the dual problem of enumerating the number of non-negative solutions to a Diophantine system of equations. We discuss the implications of this duality and the prospects of similar dualities applied to computer science problems.
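The statistical-mechanical reading of #SAT mentioned in this abstract can be sketched by brute force. This is not the paper's dual construction: it is a plain enumeration, under the assumption that the energy of a spin configuration is taken to be its number of violated clauses, so that the zero-temperature partition function counts satisfying assignments. The clause encoding (signed integers for literals) and the example formula are illustrative choices.

```python
from itertools import product

def violated_clauses(clauses, spins):
    """Energy of a spin configuration: the number of violated clauses.
    A literal k > 0 asks for variable k to be true, k < 0 for it to be false.
    spins[k - 1] is +1 (true) or -1 (false)."""
    energy = 0
    for clause in clauses:
        satisfied = any((spins[abs(lit) - 1] == 1) == (lit > 0) for lit in clause)
        if not satisfied:
            energy += 1
    return energy

def count_solutions(clauses, num_vars):
    # Zero-temperature partition function: sum over all 2^n spin
    # configurations, keeping only those with zero energy.
    return sum(
        1
        for spins in product((1, -1), repeat=num_vars)
        if violated_clauses(clauses, spins) == 0
    )

# Illustrative formula: (x1 OR x2) AND (NOT x1 OR x3)
clauses = [(1, 2), (-1, 3)]
num_solutions = count_solutions(clauses, 3)  # 4 satisfying assignments
```

The enumeration visits all $2^{n}$ configurations, which is exactly the exponential cost that makes the counting problem hard and motivates looking for an alternative formulation.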

preprint, 2013, arXiv

The Renyi Entropy and the Multifractal Spectrum of Systems Near the Localization Transition

We show that the Rényi entropies of single particle, extended wave functions for disordered systems contain information about the multifractal spectrum. It is shown that for moments of the Rényi entropy, $S_{n}$, where $|n|<1$, it is possible to extract universal information about the multifractality of such systems. This is shown through a generic calculation and then illustrated through two example models. We find good agreement between our analytic formula and numerical simulations of the two test models. Our formalism is easily extendable to generic non-interacting fermion models. It is also suggested that recent experimental advances in measuring the multifractal spectrum might allow some moments of the Rényi entropy to be measured.
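As a small illustration of the quantity discussed here, the sketch below evaluates a Rényi entropy of a single-particle wave function on a lattice. It assumes the standard definition $S_{n} = \frac{1}{1-n}\ln\sum_{i}|\psi_{i}|^{2n}$ applied to the site probabilities $|\psi_{i}|^{2}$; the abstract does not spell out the paper's conventions, so this definition, the function name, and the example states are assumptions.

```python
import math

def renyi_entropy(amplitudes, n):
    """Rényi entropy S_n of the site probabilities |psi_i|^2.
    Assumes the standard definition S_n = ln(sum_i p_i^n) / (1 - n), n != 1.
    Sites with zero probability are skipped (a convention of this sketch)."""
    if n == 1:
        raise ValueError("n = 1 is the Shannon limit and is not handled here")
    probs = [abs(a) ** 2 for a in amplitudes]
    total = sum(probs)
    probs = [p / total for p in probs]  # normalize the wave function
    return math.log(sum(p ** n for p in probs if p > 0)) / (1 - n)

# Fully extended state on 4 sites: S_n = ln 4 for every n.
extended = renyi_entropy([0.5, 0.5, 0.5, 0.5], 0.5)
# State localized on a single site: S_n = 0.
localized = renyi_entropy([1.0, 0.0, 0.0, 0.0], 0.5)
```

The two limiting cases bracket the interesting regime: a multifractal state sits between fully extended and fully localized, and the way $S_{n}$ varies with $n$ and with system size is what carries the multifractal information.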

preprint, 2010, arXiv

Universal Behavior of Entanglement in 2D Quantum Critical Dimer Models

We examine the scaling behavior of the entanglement entropy for the 2D quantum dimer model (QDM) at criticality and derive the universal finite sub-leading correction $\gamma_{QCP}$. We compute the value of $\gamma_{QCP}$ without approximation, working directly with the wave function of a generalized 2D QDM at the Rokhsar-Kivelson QCP in the continuum limit. Using the replica approach, we construct the conformal boundary state corresponding to the cyclic identification of $n$ copies along the boundary of the observed region. We find that the universal finite term is $\gamma_{QCP}=\ln R-1/2$, where $R$ is the compactification radius of the Bose field of the quantum Lifshitz model, the effective field theory of the 2D QDM at quantum criticality. We also demonstrate that the entanglement spectrum of the critical wave function on a large but finite region is described by the characters of the underlying conformal field theory. It is shown that this is formally related to the problems of quantum Brownian motion on $n$-dimensional lattices, or equivalently a system of strings interacting with a brane containing a background electromagnetic field, and can be written as an expectation value of a vertex operator.
