Researcher profile

Matthew Johnson

Matthew Johnson contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2022arXiv

3D face reconstruction with dense landmarks

Landmarks often play a key role in face analysis, but many aspects of identity or expression cannot be represented by sparse landmarks alone. Thus, in order to reconstruct faces more accurately, landmarks are often combined with additional signals like depth images or techniques like differentiable rendering. Can we keep things simple by just using more landmarks? In answer, we present the first method that accurately predicts 10x as many landmarks as usual, covering the whole head, including the eyes and teeth. This is accomplished using synthetic training data, which guarantees perfect landmark annotations. By fitting a morphable model to these dense landmarks, we achieve state-of-the-art results for monocular 3D face reconstruction in the wild. We show that dense landmarks are an ideal signal for integrating face shape information across frames by demonstrating accurate and expressive facial performance capture in both monocular and multi-view scenarios. This approach is also highly efficient: we can predict dense landmarks and fit our 3D face model at over 150FPS on a single CPU thread. Please see our website: https://microsoft.github.io/DenseLandmarks/.

preprint2022arXiv

AutoMat: Accelerated Computational Electrochemical systems Discovery

Large-scale electrification is vital to addressing the climate crisis, but several scientific and technological challenges remain to fully electrify both the chemical industry and transportation. In both of these areas, new electrochemical materials will be critical, but their development currently relies heavily on human-time-intensive experimental trial and error and computationally expensive first-principles, meso-scale and continuum simulations. We present an automated workflow, AutoMat, that accelerates these computational steps by introducing both automated input generation and management of simulations across scales from first principles to continuum device modeling. Furthermore, we show how to seamlessly integrate multi-fidelity predictions such as machine learning surrogates or automated robotic experiments "in-the-loop". The automated framework is implemented with design space search techniques to dramatically accelerate the overall materials discovery pipeline by implicitly learning design features that optimize device performance across several metrics. We discuss the benefits of AutoMat using examples in electrocatalysis and energy storage and highlight lessons learned.

preprint2022arXiv

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based language model performance across a wide range of model scales -- from models with tens of millions of parameters up to a 280 billion parameter model called Gopher. These models are evaluated on 152 diverse tasks, achieving state-of-the-art performance across the majority. Gains from scale are largest in areas such as reading comprehension, fact-checking, and the identification of toxic language, but logical and mathematical reasoning see less benefit. We provide a holistic analysis of the training dataset and model's behaviour, covering the intersection of model scale with bias and toxicity. Finally we discuss the application of language models to AI safety and the mitigation of downstream harms.

preprint2022arXiv

Snowmass2021 CMB-HD White Paper

CMB-HD is a proposed millimeter-wave survey over half the sky that would be ultra-deep (0.5 uK-arcmin) and have unprecedented resolution (15 arcseconds at 150 GHz). Such a survey would answer many outstanding questions about the fundamental physics of the Universe. Major advances would be 1.) the use of gravitational lensing of the primordial microwave background to map the distribution of matter on small scales (k~10 h Mpc^(-1)), which probes dark matter particle properties. It will also allow 2.) measurements of the thermal and kinetic Sunyaev-Zel&#39;dovich effects on small scales to map the gas density and velocity, another probe of cosmic structure. In addition, CMB-HD would allow us to cross critical thresholds: 3.) ruling out or detecting any new, light (< 0.1 eV) particles that were in thermal equilibrium with known particles in the early Universe, 4.) testing a wide class of multi-field models that could explain an epoch of inflation in the early Universe, and 5.) ruling out or detecting inflationary magnetic fields. CMB-HD would also provide world-leading constraints on 6.) axion-like particles, 7.) cosmic birefringence, 8.) the sum of the neutrino masses, and 9.) the dark energy equation of state. The CMB-HD survey would be delivered in 7.5 years of observing 20,000 square degrees of sky, using two new 30-meter-class off-axis crossed Dragone telescopes to be located at Cerro Toco in the Atacama Desert. Each telescope would field 800,000 detectors (200,000 pixels), for a total of 1.6 million detectors.

preprint2022arXiv

Snowmass2021 Cosmic Frontier: Cosmic Microwave Background Measurements White Paper

This is a solicited whitepaper for the Snowmass 2021 community planning exercise. The paper focuses on measurements and science with the Cosmic Microwave Background (CMB). The CMB is foundational to our understanding of modern physics and continues to be a powerful tool driving our understanding of cosmology and particle physics. In this paper, we outline the broad and unique impact of CMB science for the High Energy Cosmic Frontier in the upcoming decade. We also describe the progression of ground-based CMB experiments, which shows that the community is prepared to develop the key capabilities and facilities needed to achieve these transformative CMB measurements.

preprint2022arXiv

Towards Mapping and Assessing Sidewalk Accessibility Across Sociocultural and Geographic Contexts

Despite the important role of sidewalks in supporting mobility, accessibility, and public health, there is a lack of high-quality datasets and corresponding analyses on sidewalk existence and condition. Our work explores a twofold vision: first, to develop scalable mechanisms to locate and assess sidewalks in cities across the world, and second, to use this data to support new urban analyses and mobility tools. We report on two preliminary urban science explorations enabled by our approach: exploring geo-spatial patterns and key correlates of sidewalk accessibility and examining differences in sidewalk infrastructure across regions.

preprint2022arXiv

Unified Scaling Laws for Routed Language Models

The performance of a language model has been shown to be effectively modeled as a power-law in its parameter count. Here we study the scaling behaviors of Routing Networks: architectures that conditionally use only a subset of their parameters while processing an input. For these models, parameter count and computational requirement form two independent axes along which an increase leads to better performance. In this work we derive and justify scaling laws defined on these two variables which generalize those known for standard language models and describe the performance of a wide range of routing architectures trained via three different techniques. Afterwards we provide two applications of these laws: first deriving an Effective Parameter Count along which all models scale at the same rate, and then using the scaling coefficients to give a quantitative comparison of the three routing techniques considered. Our analysis derives from an extensive evaluation of Routing Networks across five orders of magnitude of size, including models with hundreds of experts and hundreds of billions of parameters.

preprint2022arXiv

VolTeMorph: Realtime, Controllable and Generalisable Animation of Volumetric Representations

The recent increase in popularity of volumetric representations for scene reconstruction and novel view synthesis has put renewed focus on animating volumetric content at high visual quality and in real-time. While implicit deformation methods based on learned functions can produce impressive results, they are `black boxes&#39; to artists and content creators, they require large amounts of training data to generalise meaningfully, and they do not produce realistic extrapolations outside the training data. In this work we solve these issues by introducing a volume deformation method which is real-time, easy to edit with off-the-shelf software and can extrapolate convincingly. To demonstrate the versatility of our method, we apply it in two scenarios: physics-based object deformation and telepresence where avatars are controlled using blendshapes. We also perform thorough experiments showing that our method compares favourably to both volumetric approaches combined with implicit deformation and methods based on mesh deformation.

preprint2020arXiv

A high fidelity synthetic face framework for computer vision

Analysis of faces is one of the core applications of computer vision, with tasks ranging from landmark alignment, head pose estimation, expression recognition, and face recognition among others. However, building reliable methods requires time-consuming data collection and often even more time-consuming manual annotation, which can be unreliable. In our work we propose synthesizing such facial data, including ground truth annotations that would be almost impossible to acquire through manual annotation at the consistency and scale possible through use of synthetic data. We use a parametric face model together with hand crafted assets which enable us to generate training data with unprecedented quality and diversity (varying shape, texture, expression, pose, lighting, and hair).

preprint2020arXiv

CMB-HD: Astro2020 RFI Response

CMB-HD is a proposed ultra-deep (0.5 uk-arcmin), high-resolution (15 arcseconds) millimeter-wave survey over half the sky that would answer many outstanding questions in both fundamental physics of the Universe and astrophysics. This survey would be delivered in 7.5 years of observing 20,000 square degrees, using two new 30-meter-class off-axis cross-Dragone telescopes to be located at Cerro Toco in the Atacama Desert. Each telescope would field 800,000 detectors (200,000 pixels), for a total of 1.6 million detectors.

preprint2020arXiv

High Resolution Zero-Shot Domain Adaptation of Synthetically Rendered Face Images

Generating photorealistic images of human faces at scale remains a prohibitively difficult task using computer graphics approaches. This is because these require the simulation of light to be photorealistic, which in turn requires physically accurate modelling of geometry, materials, and light sources, for both the head and the surrounding scene. Non-photorealistic renders however are increasingly easy to produce. In contrast to computer graphics approaches, generative models learned from more readily available 2D image data have been shown to produce samples of human faces that are hard to distinguish from real data. The process of learning usually corresponds to a loss of control over the shape and appearance of the generated images. For instance, even simple disentangling tasks such as modifying the hair independently of the face, which is trivial to accomplish in a computer graphics approach, remains an open research question. In this work, we propose an algorithm that matches a non-photorealistic, synthetically generated image to a latent vector of a pretrained StyleGAN2 model which, in turn, maps the vector to a photorealistic image of a person of the same pose, expression, hair, and lighting. In contrast to most previous work, we require no synthetic training data. To the best of our knowledge, this is the first algorithm of its kind to work at a resolution of 1K and represents a significant leap forward in visual realism.

preprint2019arXiv

CMB-HD: An Ultra-Deep, High-Resolution Millimeter-Wave Survey Over Half the Sky

A millimeter-wave survey over half the sky, that spans frequencies in the range of 30 to 350 GHz, and that is both an order of magnitude deeper and of higher-resolution than currently funded surveys would yield an enormous gain in understanding of both fundamental physics and astrophysics. By providing such a deep, high-resolution millimeter-wave survey (about 0.5 uK-arcmin noise and 15 arcsecond resolution at 150 GHz), CMB-HD will enable major advances. It will allow 1) the use of gravitational lensing of the primordial microwave background to map the distribution of matter on small scales (k~10/hMpc), which probes dark matter particle properties. It will also allow 2) measurements of the thermal and kinetic Sunyaev-Zel&#39;dovich effects on small scales to map the gas density and gas pressure profiles of halos over a wide field, which probes galaxy evolution and cluster astrophysics. In addition, CMB-HD would allow us to cross critical thresholds in fundamental physics: 3) ruling out or detecting any new, light (< 0.1eV), thermal particles, which could potentially be the dark matter, and 4) testing a wide class of multi-field models that could explain an epoch of inflation in the early Universe. Such a survey would also 5) monitor the transient sky by mapping the full observing region every few days, which opens a new window on gamma-ray bursts, novae, fast radio bursts, and variable active galactic nuclei. Moreover, CMB-HD would 6) provide a census of planets, dwarf planets, and asteroids in the outer Solar System, and 7) enable the detection of exo-Oort clouds around other solar systems, shedding light on planet formation. CMB-HD will deliver this survey in 5 years of observing half the sky, using two new 30-meter-class off-axis cross-Dragone telescopes to be located at Cerro Toco in the Atacama Desert. The telescopes will field about 2.4 million detectors (600,000 pixels) in total.