Researcher profile

Matthew Richardson

Matthew Richardson contributes to research discovery and scholarly infrastructure.

Researcher · Affiliation not imported · Open to collaborate

Trust snapshot

Quick read

Trust 15 - Unverified · Verification L1 · Unclaimed author
3 works
0 followers
4 topics
4 close collaborators

Published work

3 published items

Preprint · 2022 · arXiv

Structure-Grounded Pretraining for Text-to-SQL

Learning to capture text-table alignment is essential for tasks like text-to-SQL. A model needs to correctly recognize natural language references to columns and values and to ground them in the given database schema. In this paper, we present a novel weakly supervised Structure-Grounded pretraining framework (StruG) for text-to-SQL that can effectively learn to capture text-table alignment based on a parallel text-table corpus. We identify a set of novel prediction tasks: column grounding, value grounding and column-value mapping, and leverage them to pretrain a text-table encoder. Additionally, to evaluate different methods under more realistic text-table alignment settings, we create a new evaluation set, Spider-Realistic, based on the Spider dev set with explicit mentions of column names removed, and adopt eight existing text-to-SQL datasets for cross-database evaluation. StruG brings significant improvement over BERT-LARGE in all settings. Compared with existing pretraining methods such as GRAPPA, StruG achieves similar performance on Spider, and outperforms all baselines on more realistic sets. The Spider-Realistic dataset is available at https://doi.org/10.5281/zenodo.5205322.

Preprint · 2022 · arXiv

The CosmoQuest Moon Mappers Community Science Project: The Effect of Incidence Angle on the Lunar Surface Crater Distribution

The CosmoQuest virtual community science platform facilitates the creation and implementation of astronomical research projects performed by citizen scientists. One such project, called Moon Mappers, aids in determining the feasibility of producing crowd-sourced cratering statistics of the surface of the Moon. Lunar crater population statistics are an important metric used to understand the formation and evolutionary history of lunar surface features, to estimate relative and absolute model ages of regions on the Moon's surface, and to establish chronologies for other planetary surfaces via extrapolation from the lunar record. It has been suggested and shown that solar incidence angle has an effect on the identification of craters, particularly at meter scales. We have used high-resolution image data taken by the Lunar Reconnaissance Orbiter's Narrow-Angle Camera of the Apollo 15 landing site over a range of solar incidence angles and have compiled catalogs of crater identifications obtained by minimally trained members of the general public participating in CosmoQuest's Moon Mappers project. We have studied the effects of solar incidence angle spanning from approximately 27.5 deg to approximately 83 deg (extending the incidence angle range examined in previous works), down to a minimum crater size of 10 m, and find that the solar incidence angle has a significant effect on the crater identification process, as has been determined by subject matter experts in other studies. The results of this analysis not only highlight the ability to use crowd-sourced data in reproducing and validating scientific analyses but also indicate the potential to perform original research.

Preprint · 2022 · arXiv

The Nature of Low-Albedo Small Bodies from 3-$μ$m Spectroscopy: One Group that Formed Within the Ammonia Snow Line and One that Formed Beyond It

We present evidence, via a large survey of 191 new spectra along with previously-published spectra, of a divide in the 3-$μ$m spectral properties of the low-albedo asteroid population. One group ("Sharp-types" or ST, with band centers $<$ 3 $μ$m) has a spectral shape consistent with carbonaceous chondrite meteorites, while the other group ("not-Sharp-types" or NST, with bands centered $>$ 3 $μ$m) is not represented in the meteorite literature but is as abundant as the STs among large objects. Both groups are present in most low-albedo asteroid taxonomic classes, and except in limited cases taxonomic classifications based on 0.5-2.5-$μ$m data alone cannot predict whether an asteroid is ST or NST. Statistical tests show the STs and NSTs differ in average band depth, semi-major axis, and perihelion at confidence levels $\ge$98\%, while not showing significant differences in albedo. We also show that many NSTs have a 3-$μ$m absorption band shape like Comet 67P, and likely represent an important small-body composition throughout the solar system. A simple explanation for the origin of these groups is formation on opposite sides of the ammonia snow line, with the NST group accreting H2O and NH3 and the ST group only accreting H2O, with subsequent thermal and chemical evolution resulting in the minerals seen today. Such an explanation is consistent with recent dynamical modeling of planetesimal formation and delivery, and suggests that much more outer solar system material was delivered to the main asteroid belt than would be thought based on the number of D-class asteroids found today.