Researcher profile

Zhao Chen

Zhao Chen contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
12topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2022arXiv

GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting

We propose GradTail, an algorithm that uses gradients to improve model performance on the fly in the face of long-tailed training data distributions. Unlike conventional long-tail classifiers which operate on converged - and possibly overfit - models, we demonstrate that an approach based on gradient dot product agreement can isolate long-tailed data early on during model training and improve performance by dynamically picking higher sample weights for that data. We show that such upweighting leads to model improvements for both classification and regression models, the latter of which are relatively unexplored in the long-tail literature, and that the long-tail examples found by gradient alignment are consistent with our semantic expectations.

preprint2022arXiv

HyperPrompt: Prompt-based Task-Conditioning of Transformers

Prompt-Tuning is a new paradigm for finetuning pre-trained language models in a parameter-efficient way. Here, we explore the use of HyperNetworks to generate hyper-prompts: we propose HyperPrompt, a novel architecture for prompt-based task-conditioning of self-attention in Transformers. The hyper-prompts are end-to-end learnable via generation by a HyperNetwork. HyperPrompt allows the network to learn task-specific feature maps where the hyper-prompts serve as task global memories for the queries to attend to, at the same time enabling flexible information sharing among tasks. We show that HyperPrompt is competitive against strong multi-task learning baselines with as few as $0.14\%$ of additional task-conditioning parameters, achieving great parameter and computational efficiency. Through extensive empirical experiments, we demonstrate that HyperPrompt can achieve superior performances over strong T5 multi-task learning baselines and parameter-efficient adapter variants including Prompt-Tuning and HyperFormer++ on Natural Language Understanding benchmarks of GLUE and SuperGLUE across many model sizes.

preprint2021arXiv

State-resolved ultrafast charge and spin dynamics in [Co/Pd] multilayers

We use transient absorption spectroscopy with circularly polarized x-rays to detect laser-excited hole states below the Fermi level and compare their dynamics with that of unoccupied states above the Fermi level in ferromagnetic [Co/Pd] multilayers. While below the Fermi level an instantaneous and significantly stronger demagnetization is observed, above the Fermi level the demagnetization is delayed by 35+/-10 fs. This provides a direct visualization of how ultrafast demagnetization proceeds via initial spin-flip scattering of laser-excited holes to the subsequent formation of spin waves.

preprint2020arXiv

Sparse representation for damage identification of structural systems

Identifying damage of structural systems is typically characterized as an inverse problem which might be ill-conditioned due to aleatory and epistemic uncertainties induced by measurement noise and modeling error. Sparse representation can be used to perform inverse analysis for the case of sparse damage. In this paper, we propose a novel two-stage sensitivity analysis-based framework for both model updating and sparse damage identification. Specifically, an $\ell_2$ Bayesian learning method is firstly developed for updating the intact model and uncertainty quantification so as to set forward a baseline for damage detection. A sparse representation pipeline built on a quasi-$\ell_0$ method, e.g., Sequential Threshold Least Squares (STLS) regression, is then presented for damage localization and quantification. Additionally, Bayesian optimization together with cross validation is developed to heuristically learn hyperparameters from data, which saves the computational cost of hyperparameter tuning and produces more reliable identification result. The proposed framework is verified by three examples, including a 10-story shear-type building, a complex truss structure, and a shake table test of an eight-story steel frame. Results show that the proposed approach is capable of both localizing and quantifying structural damage with high accuracy.

preprint2019arXiv

Ultrafast X-Ray Induced Changes of the Electronic and Magnetic Response of Solids Due to Valence Electron Redistribution

We report a novel mechanism, consisting of redistribution of valence electrons near the Fermi level, during interactions of intense femtosecond X-ray pulses with a Co/Pd multilayer. The changes in Co 3d valence shell occupation were directly revealed by fluence-dependent changes of the Co L$_3$ X-ray absorption and magnetic circular dichroism spectra near the excitation threshold. The valence shell redistribution arises from inelastic scattering of high energy Auger electrons and photoelectrons that lead to transient holes below and electrons above the Fermi level on the femtosecond time scale. The valence electron reshuffling effect scales with the energy deposited by X-rays and within 17 fs extends to valence states within 2 eV of the Fermi level. As a consequence the sample demagnetizes by more than twenty percent due to magnon generation.