Source author record

Joachim Wagner

Joachim Wagner appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Computation and Language, cond-mat.soft, physics.chem-ph

Catalog footprint

What is connected

4 works

3 topics

4 close collaborators


Preprint, 2022, arXiv

gaBERT -- an Irish Language Model

The BERT family of neural language models have become highly popular due to their ability to provide sequences of text with rich context-sensitive token encodings which are able to generalise well to many NLP tasks. We introduce gaBERT, a monolingual BERT model for the Irish language. We compare our gaBERT model to multilingual BERT and the monolingual Irish WikiBERT, and we show that gaBERT provides better representations for a downstream parsing task. We also show how different filtering criteria, vocabulary size and the choice of subword tokenisation model affect downstream performance. We compare the results of fine-tuning a gaBERT model with an mBERT model for the task of identifying verbal multiword expressions, and show that the fine-tuned gaBERT model also performs better at this task. We release gaBERT and related code to the community.

Preprint, 2020, arXiv

The ADAPT Enhanced Dependency Parser at the IWPT 2020 Shared Task

We describe the ADAPT system for the 2020 IWPT Shared Task on parsing enhanced Universal Dependencies in 17 languages. We implement a pipeline approach using UDPipe and UDPipe-future to provide initial levels of annotation. The enhanced dependency graph is either produced by a graph-based semantic dependency parser or is built from the basic tree using a small set of heuristics. Our results show that, for the majority of languages, a semantic dependency parser can be successfully applied to the task of parsing enhanced dependencies. Unfortunately, we did not ensure a connected graph as part of our pipeline approach, and our competition submission relied on a last-minute fix to pass the validation script, which harmed our official evaluation scores significantly. Our submission ranked eighth in the official evaluation with a macro-averaged coarse ELAS F1 of 67.23 and a treebank average of 67.49. We later implemented our own graph-connecting fix, which resulted in a score of 79.53 (language average) or 79.76 (treebank average), which would have placed fourth in the competition evaluation.
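
The connectivity requirement mentioned in this abstract can be illustrated with a minimal sketch. It assumes that "connected" means every token is reachable from an artificial root node, and it only detects the problem; it is not the authors' graph-connecting fix. The function name `unreachable_tokens` and the `(head, dependent)` edge representation are hypothetical choices made for this illustration.

```python
from collections import deque


def unreachable_tokens(num_tokens, edges, root=0):
    """Return the tokens (numbered 1..num_tokens) that cannot be reached from `root`.

    `edges` is a list of (head, dependent) pairs; this representation is an
    assumption made for the sketch, not a format defined by the shared task.
    """
    # Build an adjacency list from head to dependents.
    children = {}
    for head, dependent in edges:
        children.setdefault(head, []).append(dependent)

    # Breadth-first search starting at the root node.
    seen = {root}
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for child in children.get(node, []):
            if child not in seen:
                seen.add(child)
                queue.append(child)

    return [token for token in range(1, num_tokens + 1) if token not in seen]


# Tokens 3 and 4 only point at each other, so they are cut off from the root.
example_edges = [(0, 2), (2, 1), (4, 3), (3, 4)]
disconnected = unreachable_tokens(4, example_edges)
```

An empty result would indicate that the graph satisfies the reachability condition assumed here.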

Preprint, 2020, arXiv

Treebank Embedding Vectors for Out-of-domain Dependency Parsing

A recent advance in monolingual dependency parsing is the idea of a treebank embedding vector, which allows all treebanks for a particular language to be used as training data while at the same time allowing the model to prefer training data from one treebank over others and to select the preferred treebank at test time. We build on this idea by 1) introducing a method to predict a treebank vector for sentences that do not come from a treebank used in training, and 2) exploring what happens when we move away from predefined treebank embedding vectors during test time and instead devise tailored interpolations. We show that 1) there are interpolated vectors that are superior to the predefined ones, and 2) treebank vectors can be predicted with sufficient accuracy, for nine out of ten test languages, to match the performance of an oracle approach that knows the most suitable predefined treebank embedding for the test set.
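
The idea of a tailored interpolation of treebank vectors can be sketched as a weighted combination of the predefined vectors. This is a minimal illustration, assuming that the weights sum to one and that vectors are plain lists of floats; the function name `interpolate_treebank_vectors` and the toy two-dimensional vectors are hypothetical and do not come from the paper.

```python
def interpolate_treebank_vectors(vectors, weights):
    """Combine predefined treebank vectors into one interpolated vector.

    Assumption for this sketch: the weights sum to one. The abstract does not
    state which constraints the authors place on the weights.
    """
    if len(vectors) != len(weights):
        raise ValueError("need exactly one weight per treebank vector")
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights are assumed to sum to one")
    dim = len(vectors[0])
    if any(len(vector) != dim for vector in vectors):
        raise ValueError("all treebank vectors must have the same dimension")

    # Weighted sum, computed component by component.
    return [
        sum(weight * vector[i] for weight, vector in zip(weights, vectors))
        for i in range(dim)
    ]


# Two toy treebank vectors; the interpolated vector leans toward the second.
treebank_a = [1.0, 0.0]
treebank_b = [0.0, 1.0]
mixed = interpolate_treebank_vectors([treebank_a, treebank_b], [0.25, 0.75])
```

Setting one weight to 1 and the rest to 0 recovers a predefined treebank vector, so the predefined vectors are special cases of this scheme.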

Preprint, 2015, arXiv

Depolarized light scattering from prolate anisotropic particles: the influence of the particle shape on the field autocorrelation function

We provide a theoretical analysis for the intermediate scattering function typically measured in depolarized dynamic light scattering experiments. We calculate the field autocorrelation function $g_1^{\rm VH}(Q,t)$ as a function of the wave vector $Q$ and the time $t$ explicitly in a vertical-horizontal scattering geometry for differently shaped solids of revolution. The shape of prolate cylinders, spherocylinders, spindles, and double cones with variable aspect ratio is expanded in rotational invariants $f_{lm}(r)$. By Fourier transform of these expansion coefficients, a formal multipole expansion of the scattering function is obtained, which is used to calculate the weighting coefficients appearing in the depolarized scattering function. In addition to translational and rotational diffusion, especially the translational-rotational coupling of shape-anisotropic objects is considered. From the short-time behavior of the intermediate scattering function, the first cumulants $\Gamma(Q)$ are calculated. In a depolarized scattering experiment, they deviate from the simple proportionality to $Q^2$. The coefficients $f_{lm}(Q)$ strongly depend on the geometry and aspect ratio of the particles. The time dependence, in addition, is governed by the translational and rotational diffusion tensors, which are calculated by means of bead models for differently shaped particles as a function of their aspect ratio. Therefore, our analysis shows how details of the particle shape---beyond their aspect ratio---can be determined by a precise scattering experiment. This is of high relevance in understanding smart materials which involve suspensions of anisotropic colloidal particles.
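
For orientation, the first cumulant referred to in this abstract is conventionally defined as the initial logarithmic decay rate of the field autocorrelation function. The second line below is the textbook limiting case for small, optically anisotropic particles with decoupled translation and rotation. It is shown only to illustrate why the depolarized cumulant is not simply proportional to $Q^2$, and it is not the paper's full result. The scalar coefficients $D_t$ and $D_r$ are assumptions introduced for this illustration and stand in for the diffusion tensors mentioned in the abstract.

```latex
% Standard definition of the first cumulant from the short-time decay
$$
\Gamma(Q) = -\lim_{t \to 0} \frac{\partial}{\partial t} \ln g_1^{\rm VH}(Q,t)
$$

% Simplest limiting case (assumed): decoupled translational and rotational diffusion
$$
g_1^{\rm VH}(Q,t) \propto \exp\!\left[-\left(D_t Q^2 + 6 D_r\right) t\right]
\quad\Longrightarrow\quad
\Gamma(Q) = D_t Q^2 + 6 D_r
$$
```

In this limit the rotational term $6 D_r$ gives a nonzero intercept at $Q \to 0$, which is the simplest form of the deviation from pure $Q^2$ scaling described in the abstract.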