Source author record

Sweta Agrawal

Sweta Agrawal appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Computation and Language, Artificial Intelligence, Neurons and Cognition

Catalog footprint

What is connected

4 works

3 topics

4 close collaborators


preprint, 2022, arXiv

An Imitation Learning Curriculum for Text Editing with Non-Autoregressive Models

We propose a framework for training non-autoregressive sequence-to-sequence models for editing tasks, where the original input sequence is iteratively edited to produce the output. We show that the imitation learning algorithms designed to train such models for machine translation introduce mismatches between training and inference that lead to undertraining and poor generalization in editing scenarios. We address this issue with two complementary strategies: 1) a roll-in policy that exposes the model to intermediate training sequences that it is more likely to encounter during inference, and 2) a curriculum that presents easy-to-learn edit operations first, gradually increasing the difficulty of training samples as the model becomes competent. We show the efficacy of these strategies on two challenging English editing tasks: controllable text simplification and abstractive summarization. Our approach significantly improves output quality on both tasks and controls output complexity better on the simplification task.

preprint, 2022, arXiv

Controlling Translation Formality Using Pre-trained Multilingual Language Models

This paper describes the University of Maryland's submission to the Special Task on Formality Control for Spoken Language Translation at IWSLT, which evaluates translation from English into 6 languages with diverse grammatical formality markers. We investigate to what extent this problem can be addressed with a single multilingual model, simultaneously controlling its output for target language and formality. Results show that this strategy can approach the translation quality and formality control achieved by dedicated translation models. However, the nature of the underlying pre-trained language model and of the finetuning samples greatly impacts results.

preprint, 2022, arXiv

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

With the success of large-scale pre-training and multilingual modeling in Natural Language Processing (NLP), recent years have seen a proliferation of large, web-mined text datasets covering hundreds of languages. We manually audit the quality of 205 language-specific corpora released with five major public datasets (CCAligned, ParaCrawl, WikiMatrix, OSCAR, mC4). Lower-resource corpora have systematic issues: at least 15 corpora have no usable text, and in a significant fraction, fewer than 50% of sentences are of acceptable quality. In addition, many are mislabeled or use nonstandard or ambiguous language codes. We demonstrate that these issues are easy to detect even for non-proficient speakers, and supplement the human audit with automatic analyses. Finally, we recommend techniques to evaluate and improve multilingual corpora and discuss potential risks that come with low-quality data releases.

preprint, 2022, arXiv

The two body problem: proprioception and motor control across the metamorphic divide

Like a rocket being propelled into space, evolution has engineered flies to launch into adulthood via multiple stages. Flies develop and deploy two distinct bodies, linked by the transformative process of metamorphosis. The fly larva is a soft hydraulic tube that can crawl to find food and avoid predators. The adult fly has a stiff exoskeleton with articulated limbs capable of long-distance navigation and rich social interactions. Because the larval and adult forms are so distinct in structure, they require distinct strategies for sensing and moving the body. The metamorphic divide thus presents an opportunity for comparative analysis of neural circuits. Here, we review recent progress toward understanding the neural mechanisms of proprioception and motor control in larval and adult Drosophila. We highlight commonalities that point toward general principles of sensorimotor control and differences that may reflect unique constraints imposed by biomechanics. Finally, we discuss emerging opportunities for comparative analysis of neural circuit architecture in the fly and other animal species.
