Source author record

Jonathan Chang

Jonathan Chang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Applications Populations and Evolution Computation and Language Genomics physics.soc-ph Robotics Social and Information Networks

Catalog footprint

What is connected

7works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

MobILE: Model-Based Imitation Learning From Observation Alone

This paper studies Imitation Learning from Observations alone (ILFO) where the learner is presented with expert demonstrations that consist only of states visited by an expert (without access to actions taken by the expert). We present a provably efficient model-based framework MobILE to solve the ILFO problem. MobILE involves carefully trading off strategic exploration against imitation - this is achieved by integrating the idea of optimism in the face of uncertainty into the distribution matching imitation learning (IL) framework. We provide a unified analysis for MobILE, and demonstrate that MobILE enjoys strong performance guarantees for classes of MDP dynamics that satisfy certain well studied notions of structural complexity. We also show that the ILFO problem is strictly harder than the standard IL problem by presenting an exponential sample complexity separation between IL and ILFO. We complement these theoretical results with experimental simulations on benchmark OpenAI Gym tasks that indicate the efficacy of MobILE. Code for implementing the MobILE framework is available at https://github.com/rahulkidambi/MobILE-NeurIPS2021.

preprint2022arXiv

Multitask Prompted Training Enables Zero-Shot Task Generalization

Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language models' pretraining (Radford et al., 2019). Can zero-shot generalization instead be directly induced by explicit multitask learning? To test this question at scale, we develop a system for easily mapping any natural language tasks into a human-readable prompted form. We convert a large set of supervised datasets, each with multiple prompts with diverse wording. These prompted datasets allow for benchmarking the ability of a model to perform completely held-out tasks. We fine-tune a pretrained encoder-decoder model (Raffel et al., 2020; Lester et al., 2021) on this multitask mixture covering a wide variety of tasks. The model attains strong zero-shot performance on several standard datasets, often outperforming models up to 16x its size. Further, our approach attains strong performance on a subset of tasks from the BIG-bench benchmark, outperforming models up to 6x its size. All trained models are available at https://github.com/bigscience-workshop/t-zero and all prompts are available at https://github.com/bigscience-workshop/promptsource.

preprint2021arXiv

Learning Deep Parameterized Skills from Demonstration for Re-targetable Visuomotor Control

Robots need to learn skills that can not only generalize across similar problems but also be directed to a specific goal. Previous methods either train a new skill for every different goal or do not infer the specific target in the presence of multiple goals from visual data. We introduce an end-to-end method that represents targetable visuomotor skills as a goal-parameterized neural network policy. By training on an informative subset of available goals with the associated target parameters, we are able to learn a policy that can zero-shot generalize to previously unseen goals. We evaluate our method in a representative 2D simulation of a button-grid and on both button-pressing and peg-insertion tasks on two different physical arms. We demonstrate that our model trained on 33% of the possible goals is able to generalize to more than 90% of the targets in the scene for both simulation and robot experiments. We also successfully learn a mapping from target pixel coordinates to a robot policy to complete a specified goal.

preprint2016arXiv

Competition and extinction explain the evolution of diversity in American automobiles

One of the most remarkable aspects of our species is that while we show surprisingly little genetic diversity, we demonstrate astonishing amounts of cultural diversity. Perhaps most impressive is the diversity of our technologies, broadly defined as all the physical objects we produce and the skills we use to produce them. Despite considerable focus on the evolution of technology by social scientists and philosophers, there have been few attempts to systematically quantify technological diversity and therefore the dynamics of technological change remain poorly understood. Here we show a novel Bayesian model for examining technological diversification adopted from paleontological analysis of occurrence data. We use this framework to estimate the tempo of diversification in American car and truck models produced between 1896 and 2014 and to test the relative importance of competition and extrinsic factors in shaping changes in macroevolutionary rates. Our results identify a four-fold decrease in the origination and extinction rates of car models and a negative net diversification rate over the last thirty years. We also demonstrate that competition played a more significant role in car model diversification than either changes in oil prices or gross domestic product. Together our analyses provide a set of tools that can enhance current research on technological and cultural evolution by providing a flexible and quantitative framework for exploring the dynamics of diversification.

preprint2014arXiv

Joint Inference of Multiple Label Types in Large Networks

We tackle the problem of inferring node labels in a partially labeled graph where each node in the graph has multiple label types and each label type has a large number of possible labels. Our primary example, and the focus of this paper, is the joint inference of label types such as hometown, current city, and employers, for users connected by a social network. Standard label propagation fails to consider the properties of the label types and the interactions between them. Our proposed method, called EdgeExplain, explicitly models these, while still enabling scalable inference under a distributed message-passing architecture. On a billion-node subset of the Facebook social network, EdgeExplain significantly outperforms label propagation for several label types, with lifts of up to 120% for recall@1 and 60% for recall@3.

preprint2012arXiv

TAPIR enables high-throughput estimation and comparison of phylogenetic informativeness using locus-specific substitution models

Massively parallel DNA sequencing techniques are rapidly changing the dynamics of phylogenetic study design by exponentially increasing the discovery of phylogenetically useful loci. This increase in the number of phylogenetic markers potentially provides researchers the opportunity to select subsets of loci best-addressing particular phylogenetic hypotheses based on objective measures of performance over different time scales. Investigators may also want to determine the power of particular phylogenetic markers relative to each other. However, currently available tools are designed to evaluate a small number of markers and are not well-suited to screening hundreds or thousands of candidate loci across the genome. TAPIR is an alternative implementation of Townsend's estimate of phylogenetic informativeness (PI) that enables rapid estimation and summary of PI when applied to data sets containing hundreds to thousands of candidate, phylogenetically informative loci.

preprint2010arXiv

Hierarchical relational models for document networks

We develop the relational topic model (RTM), a hierarchical model of both network structure and node attributes. We focus on document networks, where the attributes of each document are its words, that is, discrete observations taken from a fixed vocabulary. For each pair of documents, the RTM models their link as a binary random variable that is conditioned on their contents. The model can be used to summarize a network of documents, predict links between them, and predict words within them. We derive efficient inference and estimation algorithms based on variational methods that take advantage of sparsity and scale with the number of links. We evaluate the predictive performance of the RTM for large networks of scientific abstracts, web documents, and geographically tagged news.