Researcher profile

Yuji Matsumoto

Yuji Matsumoto contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2022arXiv

A Super-Earth Orbiting Near the Inner Edge of the Habitable Zone around the M4.5-dwarf Ross 508

We report the near-infrared radial-velocity (RV) discovery of a super-Earth planet on a 10.77-day orbit around the M4.5 dwarf Ross 508 ($J_\mathrm{mag}=9.1$). Using precision RVs from the Subaru Telescope IRD (InfraRed Doppler) instrument, we derive a semi-amplitude of $3.92^{+0.60}_{-0.58}$ ${\rm m\,s}^{-1}$, corresponding to a planet with a minimum mass $m \sin i = 4.00^{+0.53}_{-0.55}\ M_{\oplus}$. We find no evidence of significant signals at the detected period in spectroscopic stellar activity indicators or MEarth photometry. The planet, Ross 508 b, has a semimajor-axis of $0.05366^{+0.00056}_{-0.00049}$ au. This gives an orbit-averaged insolation of $\approx$1.4 times the Earth's value, placing Ross 508 b near the inner edge of its star's habitable zone. We have explored the possibility that the planet has a high eccentricity and its host is accompanied by an additional unconfirmed companion on a wide orbit. Our discovery demonstrates that the near-infrared RV search can play a crucial role to find a low-mass planet around cool M dwarfs like Ross 508.

preprint2022arXiv

Global Entity Disambiguation with BERT

We propose a global entity disambiguation (ED) model based on BERT. To capture global contextual information for ED, our model treats not only words but also entities as input tokens, and solves the task by sequentially resolving mentions to their referent entities and using resolved entities as inputs at each step. We train the model using a large entity-annotated corpus obtained from Wikipedia. We achieve new state-of-the-art results on five standard ED datasets: AIDA-CoNLL, MSNBC, AQUAINT, ACE2004, and WNED-WIKI. The source code and model checkpoint are available at https://github.com/studio-ousia/luke.

preprint2022arXiv

Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning

Discriminativeness is a desirable feature of image captions: captions should describe the characteristic details of input images. However, recent high-performing captioning models, which are trained with reinforcement learning (RL), tend to generate overly generic captions despite their high performance in various other criteria. First, we investigate the cause of the unexpectedly low discriminativeness and show that RL has a deeply rooted side effect of limiting the output words to high-frequency words. The limited vocabulary is a severe bottleneck for discriminativeness as it is difficult for a model to describe the details beyond its vocabulary. Then, based on this identification of the bottleneck, we drastically recast discriminative image captioning as a much simpler task of encouraging low-frequency word generation. Hinted by long-tail classification and debiasing methods, we propose methods that easily switch off-the-shelf RL models to discriminativeness-aware models with only a single-epoch fine-tuning on the part of the parameters. Extensive experiments demonstrate that our methods significantly enhance the discriminativeness of off-the-shelf RL models and even outperform previous discriminativeness-aware methods with much smaller computational costs. Detailed analysis and human evaluation also verify that our methods boost the discriminativeness without sacrificing the overall quality of captions.

preprint2020arXiv

A Fast and Easy Regression Technique for k-NN Classification Without Using Negative Pairs

This paper proposes an inexpensive way to learn an effective dissimilarity function to be used for $k$-nearest neighbor ($k$-NN) classification. Unlike Mahalanobis metric learning methods that map both query (unlabeled) objects and labeled objects to new coordinates by a single transformation, our method learns a transformation of labeled objects to new points in the feature space whereas query objects are kept in their original coordinates. This method has several advantages over existing distance metric learning methods: (i) In experiments with large document and image datasets, it achieves $k$-NN classification accuracy better than or at least comparable to the state-of-the-art metric learning methods. (ii) The transformation can be learned efficiently by solving a standard ridge regression problem. For document and image datasets, training is often more than two orders of magnitude faster than the fastest metric learning methods tested. This speed-up is also due to the fact that the proposed method eliminates the optimization over "negative" object pairs, i.e., objects whose class labels are different. (iii) The formulation has a theoretical justification in terms of reducing hubness in data.

preprint2020arXiv

Breaking Resonant Chains: Destabilization of Resonant Planets due to Long-term Mass Evolution

Recent exoplanet observations reported a large number of multiple-planet systems, in which some of the planets are in a chain of resonances. The fraction of resonant systems to non-resonant systems provides clues about their formation history. We investigated the orbital stability of planets in resonant chains by considering the long-term evolution of planetary mass and stellar mass and using orbital calculations. We found that while resonant chains were stable, they can be destabilized by a change of $\sim$10% in planetary mass. Such a mass evolution can occur by atmospheric escape due to photoevaporation. We also found that resonant chains can be broken by a stellar mass loss of $\lesssim1$%, which would be explained by stellar winds or coronal mass ejections. The long-term mass change of planets and stars plays an important role in the orbital evolutions of planetary systems including super-Earths.

preprint2020arXiv

Formation of multiple-planet systems in resonant chains around M dwarfs

Recent observations have revealed the existence of multiple-planet systems composed of Earth-mass planets around late M dwarfs. Most of their orbits are close to commensurabilities, which suggests that planets were commonly trapped in resonant chains in their formation around low-mass stars. We investigate the formation of multiple-planet systems in resonant chains around low-mass stars. A time-evolution model of the multiple-planet formation via pebble accretion in the early phase of the disk evolution is constructed based on the formation model for the TRAPPIST-1 system by Ormel et al. (2017). Our simulations show that knowing the protoplanet appearance timescale is important for determining the number of planets and their trapped resonances: as the protoplanet appearance timescale increases, fewer planets are formed, which are trapped in more widely separated resonances. We find that there is a range of the protoplanet appearance timescale for forming the stable multiple-planet systems in resonant chains. This range depends on the stellar mass and disk size. We suggest that the protoplanet appearance timescale is a key parameter for studying the formation of multiple-planet systems with planets in resonant chains around low-mass stars. The composition of the planets in our model is also discussed.

preprint2020arXiv

Length-controllable Abstractive Summarization by Guiding with Summary Prototype

We propose a new length-controllable abstractive summarization model. Recent state-of-the-art abstractive summarization models based on encoder-decoder models generate only one summary per source text. However, controllable summarization, especially of the length, is an important aspect for practical applications. Previous studies on length-controllable abstractive summarization incorporate length embeddings in the decoder module for controlling the summary length. Although the length embeddings can control where to stop decoding, they do not decide which information should be included in the summary within the length constraint. Unlike the previous models, our length-controllable abstractive summarization model incorporates a word-level extractive module in the encoder-decoder model instead of length embeddings. Our model generates a summary in two steps. First, our word-level extractor extracts a sequence of important words (we call it the "prototype text") from the source text according to the word-level importance scores and the length constraint. Second, the prototype text is used as additional input to the encoder-decoder model, which generates a summary by jointly encoding and copying words from both the prototype text and source text. Since the prototype text is a guide to both the content and length of the summary, our model can generate an informative and length-controlled summary. Experiments with the CNN/Daily Mail dataset and the NEWSROOM dataset show that our model outperformed previous models in length-controlled settings.

preprint2019arXiv

Aggregate Growth and Internal structures of Chondrite Parent Bodies Forming from Dense Clumps

Major components of chondrites are chondrules and matrix. Measurements of the volatile abundance in Semarkona chondrules suggest that chondrules formed in a dense clump that had a higher solid density than the gas density in the solar nebula. We investigate collisions between chondrules and matrix in the surface region of dense clumps using fluffy aggregate growth models. Our simulations show that the collisional growth of aggregates composed of chondrules and matrix takes place in the clumps well before they experience gravitational collapse. The internal structure of chondrite parent bodies (CPBs) can be thereby determined by aggregate growth. We find that the aggregate growth generates two scales within CPBs. The first scale is involved with the small scale distribution of chondrules and determined by the early growth stage, where chondrules accrete aggregates composed of matrix grains. This accretion can reproduce the thickness of the matrix layer around chondrules found in chondrites. The other scale is related to the large scale distribution of chondrules. Its properties (e.g., the abundance of chondrules and the overall size) depend on the gas motion within the clump, which is parameterized in this work. Our work thus suggests that the internal structure of CPBs may provide important clues about their formation conditions and mechanisms.