Source author record

Shuang Ma

Shuang Ma appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

cond-mat.mtrl-sci, physics.optics, Robotics, Artificial Intelligence, eess.SY, Machine Learning, Systems and Control

Catalog footprint

What is connected

5 works

7 topics

4 close collaborators

preprint, 2022, arXiv

COMPASS: Contrastive Multimodal Pretraining for Autonomous Systems

Learning representations that generalize across tasks and domains is challenging yet necessary for autonomous systems. Although task-driven approaches are appealing, designing models specific to each application can be difficult in the face of limited data, especially when dealing with highly variable multimodal input spaces arising from different tasks in different environments. We introduce the first general-purpose pretraining pipeline, COntrastive Multimodal Pretraining for AutonomouS Systems (COMPASS), to overcome the limitations of task-specific models and existing pretraining approaches. COMPASS constructs a multimodal graph by considering the essential information for autonomous systems and the properties of different modalities. Through this graph, multimodal signals are connected and mapped into two factorized spatio-temporal latent spaces: a "motion pattern space" and a "current state space." By learning from multimodal correspondences in each latent space, COMPASS creates state representations that model necessary information such as temporal dynamics, geometry, and semantics. We pretrain COMPASS on a large-scale multimodal simulation dataset TartanAir \cite{tartanair2020iros} and evaluate it on drone navigation, vehicle racing, and visual odometry tasks. The experiments indicate that COMPASS can tackle all three scenarios and can also generalize to unseen environments and real-world data.

preprint, 2022, arXiv

Reshaping Robot Trajectories Using Natural Language Commands: A Study of Multi-Modal Data Alignment Using Transformers

Natural language is the most intuitive medium for us to interact with other people when expressing commands and instructions. However, using language is seldom an easy task when humans need to express their intent towards robots, since most of the current language interfaces require rigid templates with a static set of action targets and commands. In this work, we provide a flexible language-based interface for human-robot collaboration, which allows a user to reshape existing trajectories for an autonomous agent. We take advantage of recent advancements in the field of large language models (BERT and CLIP) to encode the user command, and then combine these features with trajectory information using multi-modal attention transformers. We train the model using imitation learning over a dataset containing robot trajectories modified by language commands, and treat the trajectory generation process as a sequence prediction problem, analogously to how language generation architectures operate. We evaluate the system in multiple simulated trajectory scenarios, and show a significant performance increase of our model over baseline approaches. In addition, our real-world experiments with a robot arm show that users significantly prefer our natural language interface over traditional methods such as kinesthetic teaching or cost-function programming. Our study shows how the field of robotics can take advantage of large pre-trained language models towards creating more intuitive interfaces between robots and machines. Project webpage: https://arthurfenderbucker.github.io/NL_trajectory_reshaper/

preprint, 2015, arXiv

The thermal stability and separation characteristic of anti-sticking layers of Pt/Cr films for hot slumping technology

The thermal stability and separation characteristics of Pt/Cr films used as anti-sticking layers were studied in this paper. Several types of adhesion layers were investigated: 10.0 nm Pt, 1.5 nm Cr+50.0 nm Pt, 2.5 nm Cr+50.0 nm Pt and 3.5 nm Cr+50.0 nm Pt, all fabricated using direct current magnetron sputtering. Variations in the layer thickness, roughness, crystallization and surface topography of the Pt/Cr films were analyzed by grazing incidence X-ray reflectometry, large angle X-ray diffraction and an optical profiler before and after heating. The 2.5 nm Cr+50.0 nm Pt films exhibit the best thermal stability and separation characteristics according to the heating and hot slumping experiments. The films were also applied as anti-sticking layers to optimize the maximum temperature of hot slumping technology.

preprint, 2012, arXiv

The transition from amorphous to crystalline in Al/Zr multilayers

The amorphous-to-crystalline transition in Al(1.0%wtSi)/Zr and Al(Pure)/Zr multilayers grown by a direct-current magnetron sputtering system has been characterized over a range of Al layer thicknesses (1.0-5.0 nm) using a series of complementary measurements, including grazing incidence X-ray reflectometry, atomic force microscopy, X-ray diffraction and high-resolution transmission electron microscopy. The dependence on Al layer thickness shows that Si doping in Al not only disfavors the crystallization of Al but also influences the trends in surface roughness and in the position of the Al<111> diffraction peak. An interesting feature of the presence of Si in the Al layer is that it influences the transition process: the critical thickness shifts from 1.6 nm for the Al(Pure) layer in Al(Pure)/Zr to 1.8 nm for the Al(1.0%wtSi) layer in Al(1.0%wtSi)/Zr multilayers. We also found that the Zr-on-Al interlayer is wider than the Al-on-Zr interlayer in both systems and, according to SAED patterns, that Al layers thinner than 3.0 nm have no specific crystal orientation in the direction perpendicular to the layers. Above an Al layer thickness of 3.0 nm, the Al layers are highly oriented along Al<111>, so that the transformation from asymmetrical to symmetrical interlayers can be observed. Based on the analysis of all measurements, we propose a four-step model that could explain the Al layer thickness transition process in terms of a critical thickness for the nucleation of Al(Pure) and Al(1%wtSi) crystallites.

preprint, 2012, arXiv

Thermally induced structural modification in the Al/Zr multilayers

The effect of increasing temperature on the structural stability and interactions of two kinds of Al/Zr multilayer mirrors, Al(1%wtSi)/Zr and Al(Pure)/Zr, is investigated. All Al/Zr multilayers, annealed at temperatures from 200 °C to 500 °C, were deposited on Si wafers by direct-current magnetron sputtering. A detailed and consistent picture of the thermally induced changes in the microstructure is obtained using an array of complementary measurements, including grazing incidence X-ray reflectance, atomic force microscopy, X-ray diffraction and high-resolution transmission electron microscopy. The first significant structural changes in the two systems are observed at 250 °C, characterized by the appearance of asymmetric interlayers at the interfaces. At 290 °C, the interface, which consisted of amorphous Al-Zr alloy, transforms into amorphous Al-Zr alloy and cubic ZrAl3 in both systems. By 298 °C for Al(1%wtSi)/Zr and 295 °C for Al(Pure)/Zr multilayers, the interfacial Al-Zr alloy phases transform completely into polycrystalline mixtures of hcp-ZrAl2 and cubic-ZrAl3, which smooth the interface boundary and lower the surface roughness of the multilayers. Up to 500 °C, the multilayer structure still exists in both systems, and the differences between the asymmetric interlayers are much larger in the multilayers. Finally, we discuss the transformation from symmetric to asymmetric interlayers during annealing for other systems.
