Researcher profile

Sebastian Schuster

Sebastian Schuster contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
9works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

9 published item(s)

preprint2022arXiv

ADM mass in warp drive spacetimes

What happens when a warp bubble has mass? This seemingly innocent question forces one to carefully formalize exactly what one means by a warp bubble, exactly what one means by having the warp bubble "move" with respect to the fixed stars, and forces one to more carefully examine the notion of mass in warp-drive spacetimes. This is the goal of the present article. In this process, we will see that often-made throw-away comments regarding "payloads" are even simpler than commonly assumed, while there are two further, distinct yet subtle ways in which a mass can appear in connection with a warp drive space-time: One, that the warp bubble (not its payload) has the mass; two, that the mass is a background feature in front of which the warp drive moves. For simplicity, we consider generic Natário warp drives with zero-vorticity flow field. The resulting spacetimes are sufficiently simple to allow an exact and fully explicit computation of all of the stress-energy components, and verify that (as expected) the null energy condition (NEC) is violated. Likewise the weak, strong, and dominant energy conditions (WEC, SEC, DEC) are violated. Indeed, this confirms the community's folk wisdom, and recent (fully general, but implicit) results of the present authors which closed previous gaps in the argument. However, folk wisdom should be carefully and critically examined before being believed, and the present examples for general results will greatly aid physical intuition.

preprint2022arXiv

Coloring the Blank Slate: Pre-training Imparts a Hierarchical Inductive Bias to Sequence-to-sequence Models

Relations between words are governed by hierarchical structure rather than linear ordering. Sequence-to-sequence (seq2seq) models, despite their success in downstream NLP applications, often fail to generalize in a hierarchy-sensitive manner when performing syntactic transformations - for example, transforming declarative sentences into questions. However, syntactic evaluations of seq2seq models have only observed models that were not pre-trained on natural language data before being trained to perform syntactic transformations, in spite of the fact that pre-training has been found to induce hierarchical linguistic generalizations in language models; in other words, the syntactic capabilities of seq2seq models may have been greatly understated. We address this gap using the pre-trained seq2seq models T5 and BART, as well as their multilingual variants mT5 and mBART. We evaluate whether they generalize hierarchically on two transformations in two languages: question formation and passivization in English and German. We find that pre-trained seq2seq models generalize hierarchically when performing syntactic transformations, whereas models trained from scratch on syntactic transformations do not. This result presents evidence for the learnability of hierarchical syntactic information from non-annotated natural language text while also demonstrating that seq2seq models are capable of syntactic generalization, though only after exposure to much more language data than human learners receive.

preprint2022arXiv

Generic warp drives violate the null energy condition

Three very recent articles have claimed that it is possible to, at least in theory, either set up positive energy warp drives satisfying the weak energy condition (WEC), or at the very least, to minimize the WEC violations. These claims are at best incomplete, since the arguments presented only demonstrate the existence of one set of timelike observers, the co-moving Eulerian observers, who see "nice" physics. While these observers might see a positive energy density, the WEC requires all timelike observers to see positive energy density. Therefore, one should revisit this issue. A more careful analysis shows that the situation is actually much grimmer than advertised -- all physically reasonable warp drives will violate the null energy condition, and so also automatically violate the WEC, and both the strong and dominant energy conditions. While warp drives are certainly interesting examples of speculative physics, the violation of the energy conditions, at least within the framework of standard general relativity, is unavoidable. Even in modified gravity, physically reasonable warp drives will still violate the purely geometrical null convergence condition and the timelike convergence condition which, in turn, will place very strong constraints on any modified-gravity warp drive.

preprint2022arXiv

When a sentence does not introduce a discourse entity, Transformer-based models still sometimes refer to it

Understanding longer narratives or participating in conversations requires tracking of discourse entities that have been mentioned. Indefinite noun phrases (NPs), such as 'a dog', frequently introduce discourse entities but this behavior is modulated by sentential operators such as negation. For example, 'a dog' in 'Arthur doesn't own a dog' does not introduce a discourse entity due to the presence of negation. In this work, we adapt the psycholinguistic assessment of language models paradigm to higher-level linguistic phenomena and introduce an English evaluation suite that targets the knowledge of the interactions between sentential operators and indefinite NPs. We use this evaluation suite for a fine-grained investigation of the entity tracking abilities of the Transformer-based models GPT-2 and GPT-3. We find that while the models are to a certain extent sensitive to the interactions we investigate, they are all challenged by the presence of multiple NPs and their behavior is not systematic, which suggests that even models at the scale of GPT-3 do not fully acquire basic entity tracking abilities.

preprint2021arXiv

Tractor beams, pressor beams, and stressor beams in general relativity

The metrics of general relativity generally fall into two categories: Those which are solutions of the Einstein equations for a given source energy-momentum tensor, and the "reverse engineered" metrics -- metrics bespoke for a certain purpose. Their energy-momentum tensors are then calculated by inserting these into the Einstein equations. This latter approach has found frequent use when confronted with creative input from fiction, wormholes and warp drives being the most famous examples. In this paper, we shall again take inspiration from fiction, and see what general relativity can tell us about the possibility of a gravitationally induced tractor beam. We will base our construction on warp drives and show how versatile this ansatz alone proves to be. Not only can we easily find tractor beams (attracting objects); repulsor/pressor beams are just as attainable, and a generalization to "stressor" beams is seen to present itself quite naturally. We show that all of these metrics would violate various energy conditions. This will provide an opportunity to ruminate on the meaning of energy conditions as such, and what we can learn about whether an arbitrarily advanced civilization might have access to such beams.

preprint2020arXiv

Harnessing the linguistic signal to predict scalar inferences

Pragmatic inferences often subtly depend on the presence or absence of linguistic features. For example, the presence of a partitive construction (of the) increases the strength of a so-called scalar inference: listeners perceive the inference that Chris did not eat all of the cookies to be stronger after hearing "Chris ate some of the cookies" than after hearing the same utterance without a partitive, "Chris ate some cookies." In this work, we explore to what extent neural network sentence encoders can learn to predict the strength of scalar inferences. We first show that an LSTM-based sentence encoder trained on an English dataset of human inference strength ratings is able to predict ratings with high accuracy (r=0.78). We then probe the model's behavior using manually constructed minimal sentence pairs and corpus data. We find that the model inferred previously established associations between linguistic features and inference strength, suggesting that the model learns to use linguistic features to predict pragmatic inferences.

preprint2020arXiv

Orientation Averaging of Optical Chirality Near Nanoparticles and Aggregates

Artificial nanostructures enable fine control of electromagnetic fields at the nanoscale, a possibility that has recently been extended to the interaction between polarised light and chiral matter. The theoretical description of such interactions, and its application to the design of optimised structures for chiroptical spectroscopies, brings new challenges to the common set of tools used in nano-optics. In particular, chiroptical effects often depend crucially on the relative orientation of the scatterer and the incident light, but many experiments are performed with randomly-oriented scatterers, dispersed in a solution. We derive new expressions for the orientation-averaged local degree of optical chirality of the electromagnetic field in the presence of a nanoparticle aggregate. This is achieved using the superposition T -matrix framework, ideally suited for the derivation of efficient orientation-averaging formulas in light scattering problems. Our results are applied to a few model examples, and illustrate several non-intuitive aspects in the distribution of orientation-averaged degree of chirality around nanostructures. The results will be of significant interest for the study of nanoparticle assemblies designed to enhance chiroptical spectroscopies, and where the numerically-efficient computation of the averaged degree of optical chirality enables a more comprehensive exploration of the many possible nanostructures.

preprint2020arXiv

Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection

Universal Dependencies is an open community effort to create cross-linguistically consistent treebank annotation for many languages within a dependency-based lexicalist framework. The annotation consists in a linguistically motivated word segmentation; a morphological layer comprising lemmas, universal part-of-speech tags, and standardized morphological features; and a syntactic layer focusing on syntactic relations between predicates, arguments and modifiers. In this paper, we describe version 2 of the guidelines (UD v2), discuss the major changes from UD v1 to UD v2, and give an overview of the currently available treebanks for 90 languages.

preprint2019arXiv

Sparsity of Hawking Radiation in $D+1$ Space-Time Dimensions Including Particle Masses

Hawking radiation from an evaporating black hole has often been compared to black body radiation. However, this comparison misses an important feature of Hawking radiation: Its low density of states. This can be captured in an easy to calculate, heuristic, and semi-analytic measure called "sparsity". In this letter we shall present both the concept of sparsities and its application to $D+1$-dimensional Tangherlini black holes and their evaporation. In particular, we shall also publish for the first time sparsity expressions taking into account in closed form effects of non-zero particle mass. We will also see how this comparatively simple method reproduces results of (massless) Hawking radiation in higher dimensions and how different spins contribute to the total radiation in this context.