Source author record

Zhao Chen

Zhao Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mtrl-sci Machine Learning Computer Vision cond-mat.mes-hall physics.optics Applications Computation and Language Databases eess.SY math.NA Numerical Analysis physics.app-ph physics.data-an physics.ins-det Systems and Control

Catalog footprint

What is connected

11works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting

We propose GradTail, an algorithm that uses gradients to improve model performance on the fly in the face of long-tailed training data distributions. Unlike conventional long-tail classifiers which operate on converged - and possibly overfit - models, we demonstrate that an approach based on gradient dot product agreement can isolate long-tailed data early on during model training and improve performance by dynamically picking higher sample weights for that data. We show that such upweighting leads to model improvements for both classification and regression models, the latter of which are relatively unexplored in the long-tail literature, and that the long-tail examples found by gradient alignment are consistent with our semantic expectations.

preprint2022arXiv

HyperPrompt: Prompt-based Task-Conditioning of Transformers

Prompt-Tuning is a new paradigm for finetuning pre-trained language models in a parameter-efficient way. Here, we explore the use of HyperNetworks to generate hyper-prompts: we propose HyperPrompt, a novel architecture for prompt-based task-conditioning of self-attention in Transformers. The hyper-prompts are end-to-end learnable via generation by a HyperNetwork. HyperPrompt allows the network to learn task-specific feature maps where the hyper-prompts serve as task global memories for the queries to attend to, at the same time enabling flexible information sharing among tasks. We show that HyperPrompt is competitive against strong multi-task learning baselines with as few as $0.14\%$ of additional task-conditioning parameters, achieving great parameter and computational efficiency. Through extensive empirical experiments, we demonstrate that HyperPrompt can achieve superior performances over strong T5 multi-task learning baselines and parameter-efficient adapter variants including Prompt-Tuning and HyperFormer++ on Natural Language Understanding benchmarks of GLUE and SuperGLUE across many model sizes.

preprint2021arXiv

State-resolved ultrafast charge and spin dynamics in [Co/Pd] multilayers

We use transient absorption spectroscopy with circularly polarized x-rays to detect laser-excited hole states below the Fermi level and compare their dynamics with that of unoccupied states above the Fermi level in ferromagnetic [Co/Pd] multilayers. While below the Fermi level an instantaneous and significantly stronger demagnetization is observed, above the Fermi level the demagnetization is delayed by 35+/-10 fs. This provides a direct visualization of how ultrafast demagnetization proceeds via initial spin-flip scattering of laser-excited holes to the subsequent formation of spin waves.

preprint2020arXiv

48 channels 100-GHz tunable laser by integrating 16 DFB lasers with high wavelength-spacing uniformity

We report a 48-channel 100-GHz tunable laser near 1550 nm by integrating 16 DFB lasers. High wavelength-spacing uniformity is guaranteed by the reconstruction-equivalent-chirp technique, which enables a temperature tuning range below 20 Celsius degree.

preprint2020arXiv

Sparse representation for damage identification of structural systems

Identifying damage of structural systems is typically characterized as an inverse problem which might be ill-conditioned due to aleatory and epistemic uncertainties induced by measurement noise and modeling error. Sparse representation can be used to perform inverse analysis for the case of sparse damage. In this paper, we propose a novel two-stage sensitivity analysis-based framework for both model updating and sparse damage identification. Specifically, an $\ell_2$ Bayesian learning method is firstly developed for updating the intact model and uncertainty quantification so as to set forward a baseline for damage detection. A sparse representation pipeline built on a quasi-$\ell_0$ method, e.g., Sequential Threshold Least Squares (STLS) regression, is then presented for damage localization and quantification. Additionally, Bayesian optimization together with cross validation is developed to heuristically learn hyperparameters from data, which saves the computational cost of hyperparameter tuning and produces more reliable identification result. The proposed framework is verified by three examples, including a 10-story shear-type building, a complex truss structure, and a shake table test of an eight-story steel frame. Results show that the proposed approach is capable of both localizing and quantifying structural damage with high accuracy.

preprint2019arXiv

Ultrafast X-Ray Induced Changes of the Electronic and Magnetic Response of Solids Due to Valence Electron Redistribution

We report a novel mechanism, consisting of redistribution of valence electrons near the Fermi level, during interactions of intense femtosecond X-ray pulses with a Co/Pd multilayer. The changes in Co 3d valence shell occupation were directly revealed by fluence-dependent changes of the Co L$_3$ X-ray absorption and magnetic circular dichroism spectra near the excitation threshold. The valence shell redistribution arises from inelastic scattering of high energy Auger electrons and photoelectrons that lead to transient holes below and electrons above the Fermi level on the femtosecond time scale. The valence electron reshuffling effect scales with the energy deposited by X-rays and within 17 fs extends to valence states within 2 eV of the Fermi level. As a consequence the sample demagnetizes by more than twenty percent due to magnon generation.

preprint2016arXiv

3-D Convolutional Neural Networks for Glioblastoma Segmentation

Convolutional Neural Networks (CNN) have emerged as powerful tools for learning discriminative image features. In this paper, we propose a framework of 3-D fully CNN models for Glioblastoma segmentation from multi-modality MRI data. By generalizing CNN models to true 3-D convolutions in learning 3-D tumor MRI data, the proposed approach utilizes a unique network architecture to decouple image pixels. Specifically, we design a convolutional layer with pre-defined Difference- of-Gaussian (DoG) filters to perform true 3-D convolution incorporating local neighborhood information at each pixel. We then use three trained convolutional layers that act to decouple voxels from the initial 3-D convolution. The proposed framework allows identification of high-level tumor structures on MRI. We evaluate segmentation performance on the BRATS segmentation dataset with 274 tumor samples. Extensive experimental results demonstrate encouraging performance of the proposed approach comparing to the state-of-the-art methods. Our data-driven approach achieves a median Dice score accuracy of 89% in whole tumor glioblastoma segmentation, revealing a generalized low-bias possibility to learn from medium-size MRI datasets.

preprint2015arXiv

Femtosecond X-ray magnetic circular dichroism absorption spectroscopy at an X-ray free electron laser

X-ray magnetic circular dichroism spectroscopy using an X-ray free electron laser is demonstrated with spectra over the Fe L$_{3,2}$-edges. This new ultrafast time-resolved capability is then applied to a fluence-dependent study of all-optical magnetic switching dynamics of Fe and Gd magnetic sublattices in a GdFeCo thin film above its magnetization compensation temperature. At the magnetic switching fuence, we corroborate the existence of a transient ferromagnetic-like state. The timescales of the dynamics, however, are longer than previously observed below the magnetization compensation temperature. Above and below the switching fluence range, we observe secondary demagnetization with about 5 ps timescales. This indicates that the spin thermalization takes longer than 5 ps.

preprint2015arXiv

Microwave soft x-ray microscopy for nanoscale magnetization dynamics in the 5-10 GHz frequency range

We present a scanning transmission x-ray microscopy setup combined with a novel microwave synchronization scheme in order to study high frequency magnetization dynamics at synchrotron light sources. The sensitivity necessary to detect small changes of the magnetization on short time scales and nanometer spatial dimensions is achieved by combination of the developed excitation mechanism with a single photon counting electronics that is locked to the synchrotron operation frequency. The required mechanical stability is achieved by a compact design of the microscope. Our instrument is capable of creating direct images of dynamical phenomena in the 5-10 GHz range, with 35 nm resolution. When used together with circularly polarized x-rays, the above capabilities can be combined to study magnetic phenomena at microwave frequencies, such as ferromagnetic resonance (FMR) and spin waves. We demonstrate the capabilities of our technique by presenting phase resolved images of a 6 GHz nanoscale spin wave generated by a spin torque oscillator, as well as the uniform ferromagnetic precession with ~0.1 deg amplitude at 9 GHz in a micrometer-sized cobalt strip.

preprint2015arXiv

Reliable Diversity-Based Spatial Crowdsourcing by Moving Workers

With the rapid development of mobile devices and the crowdsourcig platforms, the spatial crowdsourcing has attracted much attention from the database community, specifically, spatial crowdsourcing refers to sending a location-based request to workers according to their positions. In this paper, we consider an important spatial crowdsourcing problem, namely reliable diversity-based spatial crowdsourcing (RDB-SC), in which spatial tasks (such as taking videos/photos of a landmark or firework shows, and checking whether or not parking spaces are available) are time-constrained, and workers are moving towards some directions. Our RDB-SC problem is to assign workers to spatial tasks such that the completion reliability and the spatial/temporal diversities of spatial tasks are maximized. We prove that the RDB-SC problem is NP-hard and intractable. Thus, we propose three effective approximation approaches, including greedy, sampling, and divide-and-conquer algorithms. In order to improve the efficiency, we also design an effective cost-model-based index, which can dynamically maintain moving workers and spatial tasks with low cost, and efficiently facilitate the retrieval of RDB-SC answers. Through extensive experiments, we demonstrate the efficiency and effectiveness of our proposed approaches over both real and synthetic data sets.

preprint2014arXiv

Nanoscale confinement of all-optical switching in TbFeCo using plasmonic antennas

All-optical switching (AOS) of magnetic domains by femtosecond laser pulses was first observed in the transition metal-rare earth (TM-RE) alloy GdFeCo1-5; this phenomenon demonstrated the potential for optical control of magnetism for the development of ever faster future magnetic recording technologies. The technological potential of AOS has recently increased due to the discovery of the same effect in other materials, including RE-free magnetic multilayers6,7. However, to be technologically meaningful, AOS must compete with the bit densities of conventional storage devices, restricting optically-switched magnetic areas to sizes well below the diffraction limit. Here, we demonstrate reproducible and robust all-optical switching of magnetic domains of 53 nm size in a ferrimagnetic TbFeCo alloy using gold plasmonic antenna structures. The confined nanoscale magnetic reversal is imaged around and beneath plasmonic antennas using x-ray resonant holographic imaging. Our results demonstrate the potential of future AOS-based magnetic recording technologies.

Zhao Chen

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting

HyperPrompt: Prompt-based Task-Conditioning of Transformers

State-resolved ultrafast charge and spin dynamics in [Co/Pd] multilayers

48 channels 100-GHz tunable laser by integrating 16 DFB lasers with high wavelength-spacing uniformity

Sparse representation for damage identification of structural systems

Ultrafast X-Ray Induced Changes of the Electronic and Magnetic Response of Solids Due to Valence Electron Redistribution

3-D Convolutional Neural Networks for Glioblastoma Segmentation

Femtosecond X-ray magnetic circular dichroism absorption spectroscopy at an X-ray free electron laser

Microwave soft x-ray microscopy for nanoscale magnetization dynamics in the 5-10 GHz frequency range

Reliable Diversity-Based Spatial Crowdsourcing by Moving Workers

Nanoscale confinement of all-optical switching in TbFeCo using plasmonic antennas