Source author record

Ikuya Yamada

Ikuya Yamada appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language cond-mat.str-el cond-mat.supr-con Machine Learning

Catalog footprint

What is connected

5works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

EASE: Entity-Aware Contrastive Learning of Sentence Embedding

We present EASE, a novel method for learning sentence embeddings via contrastive learning between sentences and their related entities. The advantage of using entity supervision is twofold: (1) entities have been shown to be a strong indicator of text semantics and thus should provide rich training signals for sentence embeddings; (2) entities are defined independently of languages and thus offer useful cross-lingual alignment supervision. We evaluate EASE against other unsupervised models both in monolingual and multilingual settings. We show that EASE exhibits competitive or better performance in English semantic textual similarity (STS) and short text clustering (STC) tasks and it significantly outperforms baseline methods in multilingual settings on a variety of tasks. Our source code, pre-trained models, and newly constructed multilingual STC dataset are available at https://github.com/studio-ousia/ease.

preprint2022arXiv

Global Entity Disambiguation with BERT

We propose a global entity disambiguation (ED) model based on BERT. To capture global contextual information for ED, our model treats not only words but also entities as input tokens, and solves the task by sequentially resolving mentions to their referent entities and using resolved entities as inputs at each step. We train the model using a large entity-annotated corpus obtained from Wikipedia. We achieve new state-of-the-art results on five standard ED datasets: AIDA-CoNLL, MSNBC, AQUAINT, ACE2004, and WNED-WIKI. The source code and model checkpoint are available at https://github.com/studio-ousia/luke.

preprint2022arXiv

MIA 2022 Shared Task: Evaluating Cross-lingual Open-Retrieval Question Answering for 16 Diverse Languages

We present the results of the Workshop on Multilingual Information Access (MIA) 2022 Shared Task, evaluating cross-lingual open-retrieval question answering (QA) systems in 16 typologically diverse languages. In this task, we adapted two large-scale cross-lingual open-retrieval QA datasets in 14 typologically diverse languages, and newly annotated open-retrieval QA data in 2 underrepresented languages: Tagalog and Tamil. Four teams submitted their systems. The best system leveraging iteratively mined diverse negative examples and larger pretrained models achieves 32.2 F1, outperforming our baseline by 4.5 points. The second best system uses entity-aware contextualized representations for document retrieval, and achieves significant improvements in Tamil (20.8 F1), whereas most of the other systems yield nearly zero scores.

preprint2022arXiv

mLUKE: The Power of Entity Representations in Multilingual Pretrained Language Models

Recent studies have shown that multilingual pretrained language models can be effectively improved with cross-lingual alignment information from Wikipedia entities. However, existing methods only exploit entity information in pretraining and do not explicitly use entities in downstream tasks. In this study, we explore the effectiveness of leveraging entity representations for downstream cross-lingual tasks. We train a multilingual language model with 24 languages with entity representations and show the model consistently outperforms word-based pretrained models in various cross-lingual transfer tasks. We also analyze the model and the key insight is that incorporating entity representations into the input allows us to extract more language-agnostic features. We also evaluate the model with a multilingual cloze prompt task with the mLAMA dataset. We show that entity-based prompt elicits correct factual knowledge more likely than using only word representations. Our source code and pretrained models are available at https://github.com/studio-ousia/luke.

preprint2013arXiv

Phonon anomalies and lattice dynamics in superconducting oxychlorides Ca$_{2-x}$CuO$_2$Cl

We present a comprehensive study of the phonon dispersion in an underdoped, superconducting Ca$_{2-x}$CuO$_2$Cl$_2$ crystal. We interpret the results using lattice dynamical calculations based on a shell model, and we compare the results, to other hole-doped cuprates, in particular to the ones isomorphic to La$_{2-x}$Sr$_x$CuO$_4$ (LSCO). We found that an anomalous dip in the Cu-O bond stretching dispersion develops in oxychlorides with a simultaneous marked broadening of the mode. The broadening is maximum at $\approx (π/ (2a) ~ 0 ~ 0)$ that corresponds to the charge-modulations propagation vector. Our analysis also suggests that screening effects in calculations may cause an apparent cosine-shaped bending of the Cu-O bond-stretching dispersion along both the ($q$ 0 0) and ($q$ $q$ 0) directions, that is not observed on the data close to optimal doping. This observation suggests that the discrepancy between experimental data and \textit{ab-initio} calculations on this mode originates from an overestimation of the doping effects on the mode.

Ikuya Yamada

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

EASE: Entity-Aware Contrastive Learning of Sentence Embedding

Global Entity Disambiguation with BERT

MIA 2022 Shared Task: Evaluating Cross-lingual Open-Retrieval Question Answering for 16 Diverse Languages

mLUKE: The Power of Entity Representations in Multilingual Pretrained Language Models

Phonon anomalies and lattice dynamics in superconducting oxychlorides Ca$_{2-x}$CuO$_2$Cl