Researcher profile

Yong Zhao

Yong Zhao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
27works
0followers
17topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

27 published item(s)

preprint2024arXiv

Autonomous Crowdsensing: Operating and Organizing Crowdsensing for Sensing Automation

The precise characterization and modeling of Cyber-Physical-Social Systems (CPSS) requires more comprehensive and accurate data, which imposes heightened demands on intelligent sensing capabilities. To address this issue, Crowdsensing Intelligence (CSI) has been proposed to collect data from CPSS by harnessing the collective intelligence of a diverse workforce. Our first and second Distributed/Decentralized Hybrid Workshop on Crowdsensing Intelligence (DHW-CSI) have focused on principles and high-level processes of organizing and operating CSI, as well as the participants, methods, and stages involved in CSI. This letter reports the outcomes of the latest DHW-CSI, focusing on Autonomous Crowdsensing (ACS) enabled by a range of technologies such as decentralized autonomous organizations and operations, large language models, and human-oriented operating systems. Specifically, we explain what ACS is and explore its distinctive features in comparison to traditional crowdsensing. Moreover, we present the ``6A-goal" of ACS and propose potential avenues for future research.

preprint2023arXiv

Generalized Parton Distributions from Lattice QCD with Asymmetric Momentum Transfer: Unpolarized Quarks

Traditionally, lattice QCD computations of generalized parton distributions (GPDs) have been carried out in a symmetric frame, where the transferred momentum is symmetrically distributed between the incoming and outgoing hadrons. However, such frames are inconvenient since they require a separate calculation for each value of the momentum transfer, increasing significantly the computational cost. In this work, by focusing on the quasi-distribution approach, we lay the foundation for faster and more effective lattice QCD calculations of GPDs exploiting asymmetric frames, with freedom in the transferred momentum distribution. An important ingredient of our approach is the Lorentz covariant parameterization of the matrix elements in terms of Lorentz-invariant amplitudes, which allows one to relate matrix elements in different frames. We also use this amplitude approach to propose a new definition of quasi-GPDs that is frame-independent and, more importantly, may lead to smaller power corrections in the matching relations to the light-cone GPDs. We demonstrate the efficacy of the formalism through numerical calculations using one ensemble of $N_f$=2+1+1 twisted mass fermions with a clover improvement. The value of the light-quark masses lead to a pion mass of about 260 MeV. Concentrating on the proton, and limiting ourselves to a vanishing longitudinal momentum transfer to the target, we extract the invariant amplitudes from matrix element calculations in both the symmetric and asymmetric frame, and obtain results for the twist-2 light-cone GPDs for unpolarized quarks, that is, $H$ and $E$.

preprint2023arXiv

GPDs in asymmetric frames

It is often taken for granted that Generalized Parton Distributions (GPDs) are defined in the "symmetric" frame, where the transferred momentum is symmetrically distributed between the incoming/outgoing hadrons. However, such frames pose computational challenges for the lattice QCD practitioners. In these proceedings, we lay the foundation for lattice QCD calculations of GPDs in "asymmetric" frames, where the transferred momentum is not symmetrically distributed between the incoming/outgoing hadrons. The novelty of our work relies on the parameterization of the matrix elements in terms of Lorentz-invariant amplitudes, which not only helps in establishing relations between the said frames but also helps in isolating higher-twist contaminations. As an example, we focus on the unpolarized GPDs for spin-1/2 particles.

preprint2022arXiv

Conditional gradient method for vector optimization

In this paper, we propose a conditional gradient method for solving constrained vector optimization problems with respect to a partial order induced by a closed, convex and pointed cone with nonempty interior. When the partial order under consideration is the one induced by the non-negative orthant, we regain the method for multiobjective optimization recently proposed by Assunção et al. (Comput Optim Appl 78(3):741--768, 2021). In our method, the construction of auxiliary subproblem is based on the well-known oriented distance function. Three different types of step size strategies (Armijio, adaptative and nonmonotone) are considered. Without any assumptions, we prove that stationarity of accumulation points of the sequences produced by the proposed method equipped with the Armijio or the nonmonotone step size rule. To obtain the convergence result of the method with the adaptative step size strategy, we introduce an useful cone convexity condition which allows to circumvent the intricate question of the Lipschitz continuity of Jocabian for the objective function. This condition helps us generalize the classical descent lemma to the vector optimization case. Under suitable convexity assumptions for the objective function, it is proved that all accumulation points of any generated sequences obtained by our method are weakly efficient solutions.

preprint2022arXiv

DeepXRD, a Deep Learning Model for Predicting of XRD spectrum from Materials Composition

One of the long-standing problems in materials science is how to predict a material's structure and then its properties given only its composition. Experimental characterization of crystal structures has been widely used for structure determination, which is however too expensive for high-throughput screening. At the same time, directly predicting crystal structures from compositions remains a challenging unsolved problem. Herein we propose a deep learning algorithm for predicting the XRD spectrum given only the composition of a material, which can then be used to infer key structural features for downstream structural analysis such as crystal system or space group classification or crystal lattice parameter determination or materials property predictions. Benchmark studies on two datasets show that our DeepXRD algorithm can achieve good performance for XRD prediction as evaluated over our test sets. It can thus be used in high-throughput screening in the huge materials composition space for new materials discovery.

preprint2022arXiv

Factorization connecting continuum and lattice TMDs

Transverse-momentum-dependent parton distribution functions (TMDs) can be studied from first principles by a perturbative matching onto lattice-calculable quantities: so-called lattice TMDs, which are a class of equal-time correlators that includes quasi-TMDs and TMDs in the Lorentz-invariant approach. We introduce a general correlator that includes as special cases these two Lattice TMDs and continuum TMDs, like the Collins scheme. Then, to facilitate the derivation of a factorization relation between lattice and continuum TMDs, we construct a new scheme, the Large Rapidity (LR) scheme, intermediate between the Collins and quasi-TMDs. The LR and Collins schemes differ only by an order of limits, and can be matched onto one another by a multiplicative kernel. We show that this same matching also holds between quasi and Collins TMDs, which enables us to prove a factorization relation between these quantities to all orders in $α_s$. Our results imply that there is no mixing between various quark flavors or gluons when matching Collins and quasi TMDs, making the lattice calculation of individual flavors and gluon TMDs easier than anticipated. We cross-check these results explicitly at one loop and discuss implications for other physical-to-lattice scheme factorizations.

preprint2022arXiv

Lattice QCD Calculations of Parton Physics

In this document, we summarize the status and challenges of calculating parton physics in lattice QCD for the US Particle Physics Community Planning Exercise (a.k.a. "Snowmass"). While PDF-moments calculations have been very successful and been continuously improved, new methods have been developed to calculate distributions directly in $x$-space. Many recent lattice studies have been focused on calculating isovector PDFs of the pion and nucleon, learning to control systematics associated with excited-state contamination, renormalization and continuum extrapolations, pion-mass and finite-volume effects, etc. Although in some cases, the lattice results are already competitive with experimental data, to reach the level of precision in a wide range of $x$ for unpolarized nucleon PDFs impactful for future collider physics remains a challenge, and may require exascale supercomputing power. The new theoretical methods open the door for calculating other partonic observables which will be the focus of the experimental program in nuclear physics, including generalized parton distributions and transverse-momentum dependent PDFs. A fruitful interplay between experimental data and lattice-QCD calculations will usher in a new era for parton physics and hadron structure.

preprint2022arXiv

Lattice QCD Determination of the Bjorken-$x$ Dependence of Parton Distribution Functions at Next-to-next-to-leading Order

We report the first lattice QCD calculation of pion valence quark distribution with next-to-next-to-leading order perturbative matching correction, which is done using two fine lattices with spacings $a=0.04$ fm and $0.06$ fm and valence pion mass $m_π=300$ MeV, at boost momentum as large as $2.42$ GeV. As a crucial step to control the systematics, we renormalize the pion valence quasi distribution in the recently proposed hybrid scheme, which features a Wilson-line mass subtraction at large distances in coordinate space, and develop a procedure to match it to the $\overline{\rm MS}$ scheme. We demonstrate that the renormalization and the perturbative matching in Bjorken-$x$ space yield a reliable determination of the valence quark distribution for $0.03\lesssim x \lesssim 0.80$ with 5-20\% uncertainties.

preprint2022arXiv

Memory gradient method for multiobjective optimization

In this paper, we propose a new descent method, termed as multiobjective memory gradient method, for finding Pareto critical points of a multiobjective optimization problem. The main thought in this method is to select a combination of the current descent direction and past multi-step iterative information as a new search direction and to obtain a stepsize by virtue of two types of strategies. It is proved that the developed direction with suitable parameters always satisfies the sufficient descent condition at each iteration. Based on mild assumptions, we obtain the global convergence and the rates of convergence for our method. Computational experiments are given to demonstrate the effectiveness of the proposed method.

preprint2022arXiv

Pion form factor and charge radius from Lattice QCD at physical point

We present our results on the electromagnetic form factor of pion over a wide range of $Q^2$ using lattice QCD simulations with Wilson-clover valence quarks and HISQ sea quarks. We study the form factor at the physical point with a lattice spacing $a=0.076$ fm. To study the lattice spacing and quark mass effects, we also present results for 300 MeV pion at two different lattice spacings $a=0.04$ and 0.06 fm. The lattice calculations at the physical quark mass appear to agree with the experimental results. Through fits to the form factor, we estimate the charge radius of pion for physical pion mass to be $\langle r_π^2 \rangle=0.42(2)~{\rm fm}^2$.

preprint2021arXiv

A Hybrid Renormalization Scheme for Quasi Light-Front Correlations in Large-Momentum Effective Theory

In large-momentum effective theory (LaMET), calculating parton physics starts from calculating coordinate-space-$z$ correlation functions $\tilde h(z, a,P^z)$ in a hadron of momentum $P^z$ in lattice QCD. Such correlation functions involve both linear and logarithmic divergences in lattice spacing $a$, and thus need to be properly renormalized. We introduce a hybrid renormalization procedure to match these lattice correlations to those in the continuum $\overline{\rm MS}$ scheme, without introducing extra non-perturbative effects at large $z$. We analyze the effect of ${\cal O}(Λ_{\rm QCD})$ ambiguity in the Wilson line self-energy subtraction involved in this hybrid scheme. To obtain the momentum-space distributions, we recommend to extrapolate the lattice data to the asymptotic $z$-region using the generic properties of the coordinate space correlations at moderate and large $P^z$, respectively.

preprint2021arXiv

Active learning based generative design for the discovery of wide bandgap materials

Active learning has been increasingly applied to screening functional materials from existing materials databases with desired properties. However, the number of known materials deposited in the popular materials databases such as ICSD and Materials Project is extremely limited and consists of just a tiny portion of the vast chemical design space. Herein we present an active generative inverse design method that combines active learning with a deep variational autoencoder neural network and a generative adversarial deep neural network model to discover new materials with a target property in the whole chemical design space. The application of this method has allowed us to discover new thermodynamically stable materials with high band gap (SrYF$_5$) and semiconductors with specified band gap ranges (SrClF$_3$, CaClF$_5$, YCl$_3$, SrC$_2$F$_3$, AlSCl, As$_2$O$_3$), all of which are verified by the first principle DFT calculations. Our experiments show that while active learning itself may sample chemically infeasible candidates, these samples help to train effective screening models for filtering out materials with desired properties from the hypothetical materials created by the generative model. The experiments show the effectiveness of our active generative inverse design approach.

preprint2020arXiv

Collins-Soper Kernel for TMD Evolution from Lattice QCD

The Collins-Soper kernel relates transverse momentum-dependent parton distribution functions (TMDPDFs) at different energy scales. For small parton transverse momentum $q_T\sim Λ_\text{QCD}$, this kernel is non-perturbative and can only be determined with controlled uncertainties through experiment or first-principles calculations. This work presents the first exploratory determination of the Collins-Soper kernel using the lattice formulation of Quantum Chromodynamics. In a quenched calculation, the $N_f=0$ kernel is determined at scales in the range 250 MeV $< q_T < 2$ GeV, and an analysis of the remaining systematic uncertainties is undertaken.

preprint2020arXiv

Cross-Channel Intragroup Sparsity Neural Network

Modern deep neural networks rely on overparameterization to achieve state-of-the-art generalization. But overparameterized models are computationally expensive. Network pruning is often employed to obtain less demanding models for deployment. Fine-grained pruning removes individual weights in parameter tensors and can achieve a high model compression ratio with little accuracy degradation. However, it introduces irregularity into the computing dataflow and often does not yield improved model inference efficiency in practice. Coarse-grained model pruning, while realizing satisfactory inference speedup through removal of network weights in groups, e.g. an entire filter, often lead to significant accuracy degradation. This work introduces the cross-channel intragroup (CCI) sparsity structure, which can prevent the inference inefficiency of fine-grained pruning while maintaining outstanding model performance. We then present a novel training algorithm designed to perform well under the constraint imposed by the CCI-Sparsity. Through a series of comparative experiments we show that our proposed CCI-Sparsity structure and the corresponding pruning algorithm outperform prior art in inference efficiency by a substantial margin given suited hardware acceleration in the future.

preprint2020arXiv

Dedge-AGMNet:an effective stereo matching network optimized by depth edge auxiliary task

To improve the performance in ill-posed regions, this paper proposes an atrous granular multi-scale network based on depth edge subnetwork(Dedge-AGMNet). According to a general fact, the depth edge is the binary semantic edge of instance-sensitive. This paper innovatively generates the depth edge ground-truth by mining the semantic and instance dataset simultaneously. To incorporate the depth edge cues efficiently, our network employs the hard parameter sharing mechanism for the stereo matching branch and depth edge branch. The network modifies SPP to Dedge-SPP, which fuses the depth edge features to the disparity estimation network. The granular convolution is extracted and extends to 3D architecture. Then we design the AGM module to build a more suitable structure. This module could capture the multi-scale receptive field with fewer parameters. Integrating the ranks of different stereo datasets, our network outperforms other stereo matching networks and advances state-of-the-art performances on the Sceneflow, KITTI 2012 and KITTI 2015 benchmark datasets.

preprint2020arXiv

Global Attention based Graph Convolutional Neural Networks for Improved Materials Property Prediction

Machine learning (ML) methods have gained increasing popularity in exploring and developing new materials. More specifically, graph neural network (GNN) has been applied in predicting material properties. In this work, we develop a novel model, GATGNN, for predicting inorganic material properties based on graph neural networks composed of multiple graph-attention layers (GAT) and a global attention layer. Through the application of the GAT layers, our model can efficiently learn the complex bonds shared among the atoms within each atom&#39;s local neighborhood. Subsequently, the global attention layer provides the weight coefficients of each atom in the inorganic crystal material which are used to considerably improve our model&#39;s performance. Notably, with the development of our GATGNN model, we show that our method is able to both outperform the previous models&#39; predictions and provide insight into the crystallization of the material.

preprint2020arXiv

Impact of JD Bernal Thoughts in the Science of Science upon China: Implications for Quantitative Studies of Science Today

John Desmond Bernal (1901-1970) was one of the most eminent scientists in molecular biology, and also regarded as the founding father of the Science of Science. His book The Social Function of Science laid the theoretical foundations for the discipline. In this article, we summarize four chief characteristics of his ideas in the Science of Science: the socio-historical perspective, theoretical models, qualitative and quantitative approaches, and studies of science planning and policy. China has constantly reformed its scientific and technological system based on research evidence of the Science of Science. Therefore, we analyze the impact of Bernal Science-of-Science thoughts on the development of Science of Science in China, and discuss how they might be usefully taken still further in quantitative studies of science.

preprint2020arXiv

Machine Learning based prediction of noncentrosymmetric crystal materials

Noncentrosymmetric materials play a critical role in many important applications such as laser technology, communication systems,quantum computing, cybersecurity, and etc. However, the experimental discovery of new noncentrosymmetric materials is extremely difficult. Here we present a machine learning model that could predict whether the composition of a potential crystalline structure would be centrosymmetric or not. By evaluating a diverse set of composition features calculated using matminer featurizer package coupled with different machine learning algorithms, we find that Random Forest Classifiers give the best performance for noncentrosymmetric material prediction, reaching an accuracy of 84.8% when evaluated with 10 fold cross-validation on the dataset with 82,506 samples extracted from Materials Project. A random forest model trained with materials with only 3 elements gives even higher accuracy of 86.9%. We apply our ML model to screen potential noncentrosymmetric materials from 2,000,000 hypothetical materials generated by our inverse design engine and report the top 20 candidate noncentrosymmetric materials with 2 to 4 elements and top 20 borate candidates

preprint2020arXiv

Nonperturbative renormalization of staple-shaped Wilson line operators in lattice QCD

Quark bilinear operators with staple-shaped Wilson lines are used to study transverse-momentum-dependent parton distribution functions (TMDPDFs) from lattice quantum chromodynamics (QCD). Here, the renormalization factors for the isovector operators, including all mixings between operators with different Dirac structures, are computed nonperturbatively in the regularization-independent momentum subtraction scheme for the first time. This study is undertaken in quenched QCD with three different lattice spacings. With Wilson flow applied to the gauge fields in the calculations, the operator mixing pattern due to chiral symmetry breaking with the lattice regularization is found to be significantly different from that predicted by one-loop lattice perturbation theory calculations. These results constitute a critical step towards the systematic extraction of TMDPDFs from lattice QCD.

preprint2020arXiv

Parton distribution function for the gluon condensate

Motivated by the desire to understand the nucleon mass structure in terms of light-cone distributions, we introduce the twist-four parton distribution function $F(x)$ whose first moment is the gluon condensate in the nucleon. We present the equation of motion relations for $F(x)$ and discuss the possible existence of the delta function (`zero mode&#39;) contribution at $x=0$. We also perform one-loop calculations for quark and gluon targets.

preprint2020arXiv

Pion valence quark PDF from lattice QCD

We present lattice results on the valence-quark structure of the pion using a coordinate space method within the framework of Large Momentum Effective Theory (LaMET). In this method one relies on the matrix elements of a Euclidean correlator in boosted hadronic states, which have an operator product expansion at short distance that allows us to extract the moments of PDFs. We renormalize the Euclidean correlator by forming the reduced Ioffe-time distribution (rITD), and reconstruct the second and fourth moments of the pion PDF by taking into account of QCD evolution effects.

preprint2020arXiv

Proton spin after 30 years: what we know and what we don&#39;t?

More than three decades has passed since the European Muon Collaboration published the first surprising result on the spin structure of the proton. Much theoretical and experimental progress has been made in understanding the origins of the proton spin. In this review, we will discuss what we have learned so far, what are still missing, and what we shall expect to learn from the upcoming experiments including JLab 12 GeV and Electron-Ion Collider. In particular, we focus on first principles calculations and experimental measurements of the total gluon helicity $ΔG$, and quark and gluon orbital angular momenta.

preprint2020arXiv

Renormalization and Matching for the Collins-Soper Kernel from Lattice QCD

The Collins-Soper kernel, which governs the energy evolution of transverse-momentum dependent parton distribution functions (TMDPDFs), is required to accurately predict Drell-Yan like processes at small transverse momentum, and is a key ingredient for extracting TMDPDFs from experiment. Earlier we proposed a method to calculate this kernel from ratios of the so-called quasi-TMDPDFs determined with lattice QCD, which are defined as hadronic matrix elements of staple-shaped Euclidean Wilson line operators. Here we provide the one-loop renormalization of these operators in a regularization-independent momentum subtraction (RI$^\prime$/MOM) scheme, as well as the conversion factor from the RI$^\prime$/MOM-renormalized quasi-TMDPDF to the $\overline{\rm MS}$ scheme. We also propose a procedure for calculating the Collins-Soper kernel directly from position space correlators, which simplifies the lattice determination.

preprint2020arXiv

Reverse-engineering Bar Charts Using Neural Networks

Reverse-engineering bar charts extracts textual and numeric information from the visual representations of bar charts to support application scenarios that require the underlying information. In this paper, we propose a neural network-based method for reverse-engineering bar charts. We adopt a neural network-based object detection model to simultaneously localize and classify textual information. This approach improves the efficiency of textual information extraction. We design an encoder-decoder framework that integrates convolutional and recurrent neural networks to extract numeric information. We further introduce an attention mechanism into the framework to achieve high accuracy and robustness. Synthetic and real-world datasets are used to evaluate the effectiveness of the method. To the best of our knowledge, this work takes the lead in constructing a complete neural network-based method of reverse-engineering bar charts.

preprint2019arXiv

Generative adversarial networks (GAN) based efficient sampling of chemical space for inverse design of inorganic materials

A major challenge in materials design is how to efficiently search the vast chemical design space to find the materials with desired properties. One effective strategy is to develop sampling algorithms that can exploit both explicit chemical knowledge and implicit composition rules embodied in the large materials database. Here, we propose a generative machine learning model (MatGAN) based on a generative adversarial network (GAN) for efficient generation of new hypothetical inorganic materials. Trained with materials from the ICSD database, our GAN model can generate hypothetical materials not existing in the training dataset, reaching a novelty of 92.53% when generating 2 million samples. The percentage of chemically valid (charge neutral and electronegativity balanced) samples out of all generated ones reaches 84.5% by our GAN when trained with materials from ICSD even though no such chemical rules are explicitly enforced in our GAN model, indicating its capability to learn implicit chemical composition rules. Our algorithm could be used to speed up inverse design or computational screening of inorganic materials.

preprint2019arXiv

Unpolarized isovector quark distribution function from Lattice QCD: A systematic analysis of renormalization and matching

We present a detailed Lattice QCD study of the unpolarized isovector quark Parton Distribution Function (PDF) using large-momentum effective theory framework. We choose a quasi-PDF defined by a spatial correlator which is free from mixing with other operators of the same dimension. In the lattice simulation, we use a Gaussian-momentum-smeared source at $M_π=356$ MeV and $P_z \in \{1.8,2.3\}$ GeV. To control the systematics associated with the excited states, we explore {five different source-sink separations}. The nonperturbative renormalization is conducted in a regularization-independent momentum subtraction scheme, and the matching between the renormalized quasi-PDF and $\bar{\rm MS}$ PDF is calculated based on perturbative QCD up to one-loop order. Systematic errors due to renormalization and perturbative matching are also analyzed in detail. Our results for lightcone PDF are in reasonable agreement with the latest phenomenological analysis.

preprint2016arXiv

Long distance co-propagation of quantum key distribution and terabit classical optical data channels

Quantum key distribution (QKD) generates symmetric keys between two remote parties, and guarantees the keys not accessible to any third party. Wavelength division multiplexing (WDM) between QKD and classical optical communications by sharing the existing fibre optics infrastructure is highly desired in order to reduce the cost of QKD applications. However, quantum signals are extremely weak and thus easily affected by the spontaneous Raman scattering effect from intensive classical light. Here, by means of wavelength selecting and spectral and temporal filtering, we realize the multiplexing and long distance co-propagation of QKD and Terabit classical coherent optical communication system up to 80km. The data capacity is two orders of magnitude larger than the previous results. Our demonstration verifies the feasibility of QKD and classical communication to share the resources of backbone fibre links, and thus taking the utility of QKD a great step forward.