Source author record

Yu Zhu

Yu Zhu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

37works

20topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

AI of Brain and Cognitive Sciences: From the Perspective of First Principles

Nowadays, we have witnessed the great success of AI in various applications, including image classification, game playing, protein structure analysis, language translation, and content generation. Despite these powerful applications, there are still many tasks in our daily life that are rather simple to humans but pose great challenges to AI. These include image and language understanding, few-shot learning, abstract concepts, and low-energy cost computing. Thus, learning from the brain is still a promising way that can shed light on the development of next-generation AI. The brain is arguably the only known intelligent machine in the universe, which is the product of evolution for animals surviving in the natural environment. At the behavior level, psychology and cognitive sciences have demonstrated that human and animal brains can execute very intelligent high-level cognitive functions. At the structure level, cognitive and computational neurosciences have unveiled that the brain has extremely complicated but elegant network forms to support its functions. Over years, people are gathering knowledge about the structure and functions of the brain, and this process is accelerating recently along with the initiation of giant brain projects worldwide. Here, we argue that the general principles of brain functions are the most valuable things to inspire the development of AI. These general principles are the standard rules of the brain extracting, representing, manipulating, and retrieving information, and here we call them the first principles of the brain. This paper collects six such first principles. They are attractor network, criticality, random network, sparse coding, relational memory, and perceptual learning. On each topic, we review its biological background, fundamental property, potential application to AI, and future development.

preprint2022arXiv

AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results

This paper reviews the Challenge on Super-Resolution of Compressed Image and Video at AIM 2022. This challenge includes two tracks. Track 1 aims at the super-resolution of compressed image, and Track~2 targets the super-resolution of compressed video. In Track 1, we use the popular dataset DIV2K as the training, validation and test sets. In Track 2, we propose the LDV 3.0 dataset, which contains 365 videos, including the LDV 2.0 dataset (335 videos) and 30 additional videos. In this challenge, there are 12 teams and 2 teams that submitted the final results to Track 1 and Track 2, respectively. The proposed methods and solutions gauge the state-of-the-art of super-resolution on compressed image and video. The proposed LDV 3.0 dataset is available at https://github.com/RenYang-home/LDV_dataset. The homepage of this challenge is at https://github.com/RenYang-home/AIM22_CompressSR.

preprint2022arXiv

Coarse-to-Fine Cascaded Networks with Smooth Predicting for Video Facial Expression Recognition

Facial expression recognition plays an important role in human-computer interaction. In this paper, we propose the Coarse-to-Fine Cascaded network with Smooth Predicting (CFC-SP) to improve the performance of facial expression recognition. CFC-SP contains two core components, namely Coarse-to-Fine Cascaded networks (CFC) and Smooth Predicting (SP). For CFC, it first groups several similar emotions to form a rough category, and then employs a network to conduct a coarse but accurate classification. Later, an additional network for these grouped emotions is further used to obtain fine-grained predictions. For SP, it improves the recognition capability of the model by capturing both universal and unique expression features. To be specific, the universal features denote the general characteristic of facial emotions within a period and the unique features denote the specific characteristic at this moment. Experiments on Aff-Wild2 show the effectiveness of the proposed CFSP. We achieved 3rd place in the Expression Classification Challenge of the 3rd Competition on Affective Behavior Analysis in-the-wild. The code will be released at https://github.com/BR-IDL/PaddleViT.

preprint2022arXiv

Exploring and Evaluating Image Restoration Potential in Dynamic Scenes

In dynamic scenes, images often suffer from dynamic blur due to superposition of motions or low signal-noise ratio resulted from quick shutter speed when avoiding motions. Recovering sharp and clean results from the captured images heavily depends on the ability of restoration methods and the quality of the input. Although existing research on image restoration focuses on developing models for obtaining better restored results, fewer have studied to evaluate how and which input image leads to superior restored quality. In this paper, to better study an image's potential value that can be explored for restoration, we propose a novel concept, referring to image restoration potential (IRP). Specifically, We first establish a dynamic scene imaging dataset containing composite distortions and applied image restoration processes to validate the rationality of the existence to IRP. Based on this dataset, we investigate several properties of IRP and propose a novel deep model to accurately predict IRP values. By gradually distilling and selective fusing the degradation features, the proposed model shows its superiority in IRP prediction. Thanks to the proposed model, we are then able to validate how various image restoration related applications are benefited from IRP prediction. We show the potential usages of IRP as a filtering principle to select valuable frames, an auxiliary guidance to improve restoration models, and even an indicator to optimize camera settings for capturing better images under dynamic scenarios.

preprint2022arXiv

Free Energy Node Embedding via Generalized Skip-gram with Negative Sampling

A widely established set of unsupervised node embedding methods can be interpreted as consisting of two distinctive steps: i) the definition of a similarity matrix based on the graph of interest followed by ii) an explicit or implicit factorization of such matrix. Inspired by this viewpoint, we propose improvements in both steps of the framework. On the one hand, we propose to encode node similarities based on the free energy distance, which interpolates between the shortest path and the commute time distances, thus, providing an additional degree of flexibility. On the other hand, we propose a matrix factorization method based on a loss function that generalizes that of the skip-gram model with negative sampling to arbitrary similarity matrices. Compared with factorizations based on the widely used $\ell_2$ loss, the proposed method can better preserve node pairs associated with higher similarity scores. Moreover, it can be easily implemented using advanced automatic differentiation toolkits and computed efficiently by leveraging GPU resources. Node clustering, node classification, and link prediction experiments on real-world datasets demonstrate the effectiveness of incorporating free-energy-based similarities as well as the proposed matrix factorization compared with state-of-the-art alternatives.

preprint2022arXiv

Hypergraph Cuts with Edge-Dependent Vertex Weights

We develop a framework for incorporating edge-dependent vertex weights (EDVWs) into the hypergraph minimum s-t cut problem. These weights are able to reflect different importance of vertices within a hyperedge, thus leading to better characterized cut properties. More precisely, we introduce a new class of hyperedge splitting functions that we call EDVWs-based, where the penalty of splitting a hyperedge depends only on the sum of EDVWs associated with the vertices on each side of the split. Moreover, we provide a way to construct submodular EDVWs-based splitting functions and prove that a hypergraph equipped with such splitting functions can be reduced to a graph sharing the same cut properties. In this case, the hypergraph minimum s-t cut problem can be solved using well-developed solutions to the graph minimum s-t cut problem. In addition, we show that an existing sparsification technique can be easily extended to our case and makes the reduced graph smaller and sparser, thus further accelerating the algorithms applied to the reduced graph. Numerical experiments using real-world data demonstrate the effectiveness of our proposed EDVWs-based splitting functions in comparison with the all-or-nothing splitting function and cardinality-based splitting functions commonly adopted in existing work.

preprint2022arXiv

Signal Processing on Higher-Order Networks: Livin' on the Edge ... and Beyond

In this tutorial, we provide a didactic treatment of the emerging topic of signal processing on higher-order networks. Drawing analogies from discrete and graph signal processing, we introduce the building blocks for processing data on simplicial complexes and hypergraphs, two common higher-order network abstractions that can incorporate polyadic relationships. We provide brief introductions to simplicial complexes and hypergraphs, with a special emphasis on the concepts needed for the processing of signals supported on these structures. Specifically, we discuss Fourier analysis, signal denoising, signal interpolation, node embeddings, and nonlinear processing through neural networks, using these two higher-order network models. In the context of simplicial complexes, we specifically focus on signal processing using the Hodge Laplacian matrix, a multi-relational operator that leverages the special structure of simplicial complexes and generalizes desirable properties of the Laplacian matrix in graph signal processing. For hypergraphs, we present both matrix and tensor representations, and discuss the trade-offs in adopting one or the other. We also highlight limitations and potential research avenues, both to inform practitioners and to motivate the contribution of new researchers to the area.

preprint2022arXiv

Signal processing on simplicial complexes

Higher-order networks have so far been considered primarily in the context of studying the structure of complex systems, i.e., the higher-order or multi-way relations connecting the constituent entities. More recently, a number of studies have considered dynamical processes that explicitly account for such higher-order dependencies, e.g., in the context of epidemic spreading processes or opinion formation. In this chapter, we focus on a closely related, but distinct third perspective: how can we use higher-order relationships to process signals and data supported on higher-order network structures. In particular, we survey how ideas from signal processing of data supported on regular domains, such as time series or images, can be extended to graphs and simplicial complexes. We discuss Fourier analysis, signal denoising, signal interpolation, and nonlinear processing through neural networks based on simplicial complexes. Key to our developments is the Hodge Laplacian matrix, a multi-relational operator that leverages the special structure of simplicial complexes and generalizes desirable properties of the Laplacian matrix in graph signal processing.

preprint2022arXiv

The generality of uncooperative and cooperative effects in elementary hydrogen-bonded systems

The cooperative effect plays a significant role in understanding the intermolecular donor-acceptor interactions of hydrogen bonds (H-bonds, D-H...A). Herein, using the benchmark method of high-precision ab initio, the well-known cooperative effect is reproduced in elementary H-bonded systems with different D and A atoms. That is, with the decreasing of intermolecular distance, the D-H bond length first increases and then decreases, while the H...A bond length decreases. On the contrary, when D and A are the same, as the intermolecular distance decreases, the D-H bond length decreases without increasing, which is referred to as the uncooperative effect. Further analyses conclude that compared to cooperative H-bonded systems, uncooperative systems at their respective equilibrium position have a larger core-valence bifurcation (CVB) index (>0.022) and lower binding energies (<0.25 eV), showing a clear linear inverse relationship related to H-bond strength. Therefore, the intermolecular non-H-bonding interactions are predicted to reflect the uncooperative characteristics, which is confirmed by high-precision ab initio calculations. These findings provide a direction for the comprehensive understanding of H-bonds.

preprint2021arXiv

Co-clustering Vertices and Hyperedges via Spectral Hypergraph Partitioning

We propose a novel method to co-cluster the vertices and hyperedges of hypergraphs with edge-dependent vertex weights (EDVWs). In this hypergraph model, the contribution of every vertex to each of its incident hyperedges is represented through an edge-dependent weight, conferring the model higher expressivity than the classical hypergraph. In our method, we leverage random walks with EDVWs to construct a hypergraph Laplacian and use its spectral properties to embed vertices and hyperedges in a common space. We then cluster these embeddings to obtain our proposed co-clustering method, of particular relevance in applications requiring the simultaneous clustering of data entities and features. Numerical experiments using real-world data demonstrate the effectiveness of our proposed approach in comparison with state-of-the-art alternatives.

preprint2021arXiv

Joint Active and Passive Beamforming for Intelligent Reflecting Surface Aided Multiuser MIMO Communications

This letter investigates the joint active and passive beamforming optimization for intelligent reflecting surface (IRS) aided multiuser multiple-input multiple-output systems with the objective of maximizing the weighted sum-rate. We show that this problem can be solved via a matrix weighted mean square error minimization equivalence. In particular, for the optimization of the passive IRS beamforming, we first propose an iterative algorithm with excellent performance based on the manifold optimization. By using the matrix fractional programming technique to obtain a more tractable object function, we then propose a low complexity algorithm based on the majorization-minimization method. Numerical results verify the convergence of our proposed algorithms and the significant performance improvement over the communication scenario without IRS assistance.

preprint2021arXiv

Learning Depth via Leveraging Semantics: Self-supervised Monocular Depth Estimation with Both Implicit and Explicit Semantic Guidance

Self-supervised depth estimation has made a great success in learning depth from unlabeled image sequences. While the mappings between image and pixel-wise depth are well-studied in current methods, the correlation between image, depth and scene semantics, however, is less considered. This hinders the network to better understand the real geometry of the scene, since the contextual clues, contribute not only the latent representations of scene depth, but also the straight constraints for depth map. In this paper, we leverage the two benefits by proposing the implicit and explicit semantic guidance for accurate self-supervised depth estimation. We propose a Semantic-aware Spatial Feature Alignment (SSFA) scheme to effectively align implicit semantic features with depth features for scene-aware depth estimation. We also propose a semantic-guided ranking loss to explicitly constrain the estimated depth maps to be consistent with real scene contextual properties. Both semantic label noise and prediction uncertainty is considered to yield reliable depth supervisions. Extensive experimental results show that our method produces high quality depth maps which are consistently superior either on complex scenes or diverse semantic categories, and outperforms the state-of-the-art methods by a significant margin.

preprint2020arXiv

A short-range metastable defect in the double layer ice

Although the phase of water has extensively investigated whether there exists a defect distorting only locally the structure still under debate. Here we report a localized 5775 defect phase presented in the double layer ice on the Au (111) surface, which is a metastable structure with 5- and 7-membered rings compared with a perfect hexagonal one. Without altering the total number of the hydrogen bonds of the ice, the defect only introduces 0.08 Å molecular displacement and 3.27% interaction energy change outside the defected area. Such defect also exists without Au support but causes a larger lattice relaxation or smaller interaction energy change. The excessively high barrier as well as the low quantum tunneling and thermodynamic probabilities hinder the formation of the defect by post-grown isomerization from the perfect to the defected structure. This finding indicates that the defected ice is stable, and the defect can be formed during the ice growth stage.

preprint2020arXiv

AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor x4 based on a set of prior examples of low and corresponding high resolution images. The goal is to devise a network that reduces one or several aspects such as runtime, parameter count, FLOPs, activations, and memory consumption while at least maintaining PSNR of MSRResNet. The track had 150 registered participants, and 25 teams submitted the final results. They gauge the state-of-the-art in efficient single image super-resolution.

preprint2020arXiv

Attention-based network for low-light image enhancement

The captured images under low light conditions often suffer insufficient brightness and notorious noise. Hence, low-light image enhancement is a key challenging task in computer vision. A variety of methods have been proposed for this task, but these methods often failed in an extreme low-light environment and amplified the underlying noise in the input image. To address such a difficult problem, this paper presents a novel attention-based neural network to generate high-quality enhanced low-light images from the raw sensor data. Specifically, we first employ attention strategy (i.e. channel attention and spatial attention modules) to suppress undesired chromatic aberration and noise. The channel attention module guides the network to refine redundant colour features. The spatial attention module focuses on denoising by taking advantage of the non-local correlation in the image. Furthermore, we propose a new pooling layer, called inverted shuffle layer, which adaptively selects useful information from previous features. Extensive experiments demonstrate the superiority of the proposed network in terms of suppressing the chromatic aberration and noise artifacts in enhancement, especially when the low-light image has severe noise.

preprint2020arXiv

Channel Estimation for Intelligent Reflecting Surface-Assisted Millimeter Wave MIMO Systems

Intelligent reflecting surfaces (IRSs) are regarded as promising enablers for future millimeter wave (mmWave) wireless communication, due to their ability to create favorable line-of-sight (LoS) propagation environments. In this paper, we investigate channel estimation in downlink IRS-assisted mmWave multiple-input multiple-output (MIMO) systems. By leveraging the sparsity of mmWave channels, we formulate the channel estimation problem as a fixed-rank constrained non-convex optimization problem. To tackle the non-convexity, an efficient algorithm is proposed by capitalizing on alternating minimization and manifold optimization (MO), which yields a locally optimal solution. Simulation results show that the proposed MO-based estimation (MO-EST) algorithm significantly outperforms two benchmark schemes and demonstrate the robustness of the MO-EST algorithm with respect to imperfect knowledge of the sparsity level of the channels in practical implementations.

preprint2020arXiv

GINet: Graph Interaction Network for Scene Parsing

Recently, context reasoning using image regions beyond local convolution has shown great potential for scene parsing. In this work, we explore how to incorporate the linguistic knowledge to promote context reasoning over image regions by proposing a Graph Interaction unit (GI unit) and a Semantic Context Loss (SC-loss). The GI unit is capable of enhancing feature representations of convolution networks over high-level semantics and learning the semantic coherency adaptively to each sample. Specifically, the dataset-based linguistic knowledge is first incorporated in the GI unit to promote context reasoning over the visual graph, then the evolved representations of the visual graph are mapped to each local representation to enhance the discriminated capability for scene parsing. GI unit is further improved by the SC-loss to enhance the semantic representations over the exemplar-based semantic graph. We perform full ablation studies to demonstrate the effectiveness of each component in our approach. Particularly, the proposed GINet outperforms the state-of-the-art approaches on the popular benchmarks, including Pascal-Context and COCO Stuff.

preprint2020arXiv

Network Inference from Consensus Dynamics with Unknown Parameters

We explore the problem of inferring the graph Laplacian of a weighted, undirected network from snapshots of a single or multiple discrete-time consensus dynamics, subject to parameter uncertainty, taking place on the network. Specifically, we consider three problems in which we assume different levels of knowledge about the diffusion rates, observation times, and the input signal power of the dynamics. To solve these underdetermined problems, we propose a set of algorithms that leverage the spectral properties of the observed data and tools from convex optimization. Furthermore, we provide theoretical performance guarantees associated with these algorithms. We complement our theoretical work with numerical experiments, that demonstrate how our proposed methods outperform current state-of-the-art algorithms and showcase their effectiveness in recovering both synthetic and real-world networks.

preprint2020arXiv

Unidirectional Oriented Water Wire in Short Nanotube

The orientation of water molecules is the key factor for the fast transport of water in small nanotubes. It has been accepted that the bidirectional water burst in short nanotubes can be transformed into unidirectional transport when the orientation of water molecules is maintained in long nanotubes under the external field. In this work, based on molecular dynamics simulations and first-principles calculations, we showed without external field, it only needs 21 water molecules to maintain the unidirectional single file water intrinsically in carbon nanotube at seconds. Detailed analysis indicates that the surprising result comes from the step by step process for the flip of water chain, which is different with the perceived concerted mechanism. Considering the thickness of cell membrane (normally 5-10 nm) is larger than the length threshold of the unidirectional water wire, this study suggests it may not need the external field to maintain the unidirectional flow in the water channel at the macroscopic timescale.

preprint2019arXiv

Beamforming Design for Large-Scale Antenna Arrays Using Deep Learning

Beamforming (BF) design for large-scale antenna arrays with limited radio frequency chains and the phase-shifter-based analog BF architecture, has been recognized as a key issue in millimeter wave communication systems. It becomes more challenging with imperfect channel state information (CSI). In this letter, we propose a deep learning based BF design approach and develop a BF neural network (BFNN) which can be trained to learn how to optimize the beamformer for maximizing the spectral efficiency with hardware limitation and imperfect CSI. Simulation results show that the proposed BFNN achieves significant performance improvement and strong robustness to imperfect CSI over the conventional BF algorithms.

preprint2016arXiv

Strong interfacial exchange field in the graphene/EuS heterostructure

Exploiting 2D materials for spintronic applications can potentially realize next-generation devices featuring low-power consumption and quantum operation capability. The magnetic exchange field (MEF) induced by an adjacent magnetic insulator enables efficient control of local spin generation and spin modulation in 2D devices without compromising the delicate material structures. Using graphene as a prototypical 2D system, we demonstrate that its coupling to the model magnetic insulator (EuS) produces a substantial MEF (> 14 T) with potential to reach hundreds of Tesla, which leads to orders-of-magnitude enhancement in the spin signal originated from Zeeman spin-Hall effect. Furthermore, the new ferromagnetic ground state of Dirac electrons resulting from the strong MEF may give rise to quantized spin-polarized edge transport. The MEF effect shown in our graphene/EuS devices therefore provides a key functionality for future spin logic and memory devices based on emerging 2D materials in classical and quantum information processing.

preprint2015arXiv

Fidelity-Naturalness Evaluation of Single Image Super Resolution

We study the problem of evaluating super resolution methods. Traditional evaluation methods usually judge the quality of super resolved images based on a single measure of their difference with the original high resolution images. In this paper, we proposed to use both fidelity (the difference with original images) and naturalness (human visual perception of super resolved images) for evaluation. For fidelity evaluation, a new metric is proposed to solve the bias problem of traditional evaluation. For naturalness evaluation, we let humans label preference of super resolution results using pair-wise comparison, and test the correlation between human labeling results and image quality assessment metrics' outputs. Experimental results show that our fidelity-naturalness method is better than the traditional evaluation method for super resolution methods, which could help future research on single-image super resolution.

preprint2015arXiv

Model-Free Sure Screening via Maximum Correlation

We consider the problem of screening features in an ultrahigh-dimensional setting. Using maximum correlation, we develop a novel procedure called MC-SIS for feature screening, and show that MC-SIS possesses the sure screen property without imposing model or distributional assumptions on the response and predictor variables. Therefore, MC-SIS is a model-free sure independence screening method as in contrast with some other existing model-based sure independence screening methods in the literature. Simulation examples and a real data application are used to demonstrate the performance of MC-SIS as well as to compare MC-SIS with other existing sure screening methods. The results show that MC-SIS outperforms those methods when their model assumptions are violated, and it remains competitive when the model assumptions hold.

preprint2015arXiv

Spin-Polarized Tunneling Study on Spin-Momentum Locking in the Topological Insulators

We demonstrate that the charge-spin conversion efficiency of topological insulators (TI) can be experimentally determined by injecting spin-polarized tunneling electrons into a TI. Through a comparative study between bismuth selenide and bismuth antimony telluride, we verified the topological-surface-state origin of the observed giant spin signals. By injecting energetic electrons into bismuth selenide, we further studied the energy dependence of the effective spin polarization at the TI surface. The experimentally verified large spin polarization, as well as our calculations, provides new insights into optimizing TI materials for near room-temperature spintronic applications.

preprint2014arXiv

Direct tunneling through high-$κ$ amorphous HfO$_2$: effects of chemical modification

We report first principles modeling of quantum tunneling through amorphous HfO$_2$ dielectric layer of metal-oxide-semiconductor (MOS) nanostructures in the form of n-Si/HfO$_2$/Al. In particular we predict that chemically modifying the amorphous HfO$_2$ barrier by doping N and Al atoms in the middle region - far from the two interfaces of the MOS structure, can reduce the gate-to-channel tunnel leakage by more than one order of magnitude. Several other types of modification are found to enhance tunneling or induce substantial band bending in the Si, both are not desired from leakage point of view. By analyzing transmission coefficients and projected density of states, the microscopic physics of electron traversing the tunnel barrier with or without impurity atoms in the high-$κ$ dielectric is revealed.

preprint2014arXiv

Statistical calibration of qRT-PCR, microarray and RNA-Seq gene expression data with measurement error models

The accurate quantification of gene expression levels is crucial for transcriptome study. Microarray platforms are commonly used for simultaneously interrogating thousands of genes in the past decade, and recently RNA-Seq has emerged as a promising alternative. The gene expression measurements obtained by microarray and RNA-Seq are, however, subject to various measurement errors. A third platform called qRT-PCR is acknowledged to provide more accurate quantification of gene expression levels than microarray and RNA-Seq, but it has limited throughput capacity. In this article, we propose to use a system of functional measurement error models to model gene expression measurements and calibrate the microarray and RNA-Seq platforms with qRT-PCR. Based on the system, a two-step approach was developed to estimate the biases and error variance components of the three platforms and calculate calibrated estimates of gene expression levels. The estimated biases and variance components shed light on the relative strengths and weaknesses of the three platforms and the calibrated estimates provide a more accurate and consistent quantification of gene expression levels. Theoretical and simulation studies were conducted to establish the properties of those estimates. The system was applied to analyze two gene expression data sets from the Microarray Quality Control (MAQC) and Sequencing Quality Control (SEQC) projects.

preprint2013arXiv

A generic tight-binding model for monolayer, bilayer and bulk MoS2

Molybdenum disulfide (MoS2) is a layered semiconductor which has become very important recently as an emerging electronic device material. Being an intrinsic semiconductor the two-dimensional MoS2 has major advantages as the channel material in field-effect transistors. In this work we determine the electronic structure of MoS2 with the highly accurate screened hybrid functional within the density functional theory (DFT) including the spin-orbit coupling. Using the DFT electronic structures as target, we have developed a single generic tight-binding (TB) model that accurately produces the electronic structures for three different forms of MoS2 - bulk, bilayer and monolayer. Our TB model is based on the Slater-Koster method with non-orthogonal sp3d5 orbitals, nearest-neighbor interactions and spin-orbit coupling. The TB model is useful for atomistic modeling of quantum transport in MoS2 based electronic devices.

preprint2013arXiv

Band offset of GaAs/AlxGa1-xAs heterojunctions from atomistic first principles

Using an atomistic first principles approach, we investigate the band offset of the GaAs/AlxGa1-xAs heterojunctions for the entire range of the Al doping concentration 0<x<=1. We apply the coherent potential approach to handle the configuration average of Al doping and a recently proposed semi-local exchange potential to accurately determine the band gaps of the materials. The calculated band structures of the GaAs, AlAs crystals and band gaps of the GaAs/AlxGa1-xAs alloys, are in very good agreement with the experimental results. We predict that valence band offset of the GaAs/AlxGa1-xAs heterojunction scales with the Al concentration x in a linear fashion as VBO(x)~0.587 x, and the conduction band offset scales with x in a nonlinear fashion. Quantitative comparisons to the corresponding experimental data are made.

preprint2013arXiv

Electric control of spin in monolayer WSe2 field effect transistors

We report a first principles theoretical investigation of quantum transport in monolayer WSe2 field effect transistor (FET). Due to a strong spin-orbit interaction (SOI) and the atomic structure of the two-dimensional (2D) lattice, monolayer WSe2 has an interesting electronic structure that exhibits Zeeman-like up-down spin texture near the K and K' points of the Brillouin zone. In a FET, the gate electric field induces an extra, externally tunable SOI that re-orients the spins into a Rashba-like texture thereby realizing electric control of the spin. Quantum transport is modulated by the spin texture, namely by if the spin orientation of the carrier after the gated channel region, matches or miss-matches that of the FET drain electrode. The carrier current in the FET is labelled both the spin index and the valley index, realizing spintronics and valleytronics in the same device.

preprint2013arXiv

Electronic structures of III-V zinc-blende semiconductors from atomistic first principles

For analyzing quantum transport in semiconductor devices, accurate electronic structures are critical for quantitative predictions. Here we report theoretical analysis of electronic structures of all III-V zinc-blende semiconductor compounds. Our calculations are from density functional theory with the semi-local exchange proposed recently [F. Tran and P. Blaha, Phys. Rev. Lett. 102, 226401 (2009)], within the linear muffin tin orbital scheme. The calculated band gaps and effective masses are compared to experimental data and good quantitative agreement is obtained. Using the theoretical scheme presented here, quantum transport in nanostructures of III-V compounds can be confidently predicted.

preprint2013arXiv

Nonequilibrium Green's function theory for predicting device-to-device variability

Due to random dopant fluctuations, the device-to-device variability is a serious challenge to emerging nanoelectronics. In this work we present theoretical formalisms and numerical simulations of quantum transport variability, based on the nonequilibrium Green's functions and the multiple scattering theory. We have developed a general formalism using the diagrammatic technique within the coherent potential approximation (CPA) that can be applied to a wide range of disorder concentrations. In addition, we have developed a method by using a perturbative expansion within the low concentration approximation (LCA) that is extremely useful for typical nanoelectronic devices having low dopant concentration. Applying both formalisms, transport fluctuations due to random impurities can be predicted without lengthy brute force computation of ensemble of devices structures. Numerical implementations of the formalisms are demonstrated using both tight-binding models and first principles models.

preprint2013arXiv

Quantum Transport Theory with the Nonequilibrium Coherent Potentials

Since any realistic electronic device has some degree of disorder, predicting disorder effects in quantum transport is a critical problem. Here we report the theory of nonequilibrium coherent potential approximation (NECPA) for analyzing disorder effects in nonequilibrium quantum transport of nanoelectronic devices. The NECPA is formulated by contour ordered nonequilibrium Green's function (NEGF) where the disorder average is carried out within the coherent potential approximation on the complex-time contour. We have derived a set of new rules that supplement the celebrated Langreth theorem and, as a whole, the generalized Langreth rules allow us to derive NECPA equations for real time Green's functions. The solution of NECPA equations provide the disorder averaged nonequilibrium density matrix as well as other relevant quantities for quantum transport calculations. We establish the excellent accuracy of NECPA by comparing its results to brute force numerical calculations of disordered tight-binding models. Moreover, the connection of NECPA equations which are derived on the complex-time contour, to the nonequilibrium vertex correction theory which is derived on the real-time axis, is made. As an application, we demonstrate that NECPA can be combined with density functional theory to enable analysis of nanoelectronic device physics from atomistic first principles.

preprint2012arXiv

Structure and electronic transport in graphene wrinkles

Wrinkling is a ubiquitous phenomenon in two-dimensional membranes. In particular, in the large-scale growth of graphene on metallic substrates, high densities of wrinkles are commonly observed. Despite their prevalence and potential impact on large-scale graphene electronics, relatively little is known about their structural morphology and electronic properties. Surveying the graphene landscape using atomic force microscopy, we found that wrinkles reach a certain maximum height before folding over. Calculations of the energetics explain the morphological transition, and indicate that the tall ripples are collapsed into narrow standing wrinkles by van der Waals forces, analogous to large-diameter nanotubes. Quantum transport calculations show that conductance through these collapsed wrinkle structures is limited mainly by a density-of-states bottleneck and by interlayer tunneling across the collapsed bilayer region. Also through systematic measurements across large numbers of devices with wide folded wrinkles, we find a distinct anisotropy in their electrical resistivity, consistent with our transport simulations. These results highlight the coupling between morphology and electronic properties, which has important practical implications for large-scale high-speed graphene electronics.

preprint2011arXiv

Multilayer epitaxial graphene formed by pyrolysis of polycrystalline silicon-carbide grown on c-plane sapphire substrates

We use ultra-high vacuum chemical vapor deposition to grow polycrystalline silicon carbide (SiC) on c-plane sapphire wafers which are then annealed between 1250 and 1450°C in vacuum to create epitaxial multilayer graphene (MLG). Despite the surface roughness and small domain size of the polycrystalline SiC, a conformal MLG film is formed. By planarizing the SiC prior to graphene growth, a reduction of the Raman defect band is observed in the final MLG. The graphene formed on polished SiC films also demonstrates significantly more ordered layer-by-layer growth and increased carrier mobility for the same carrier density as the non-polished samples.

preprint2010arXiv

Wafer-scale Epitaxial Graphene Growth on the Si-face of Hexagonal SiC (0001) for High Frequency Transistors

Up to two layers of epitaxial graphene have been grown on the Si-face of two-inch SiC wafers exhibiting room-temperature Hall mobilities up to 1800 cm^2/Vs, measured from ungated, large, 160 micron x 200 micron Hall bars, and up to 4000 cm^2/Vs, from top-gated, small, 1 micron x 1.5 micron Hall bars. The growth process involved a combination of a cleaning step of the SiC in a Si-containing gas, followed by an annealing step in Argon for epitaxial graphene formation. The structure and morphology of this graphene has been characterized using AFM, HRTEM, and Raman spectroscopy. Furthermore, top-gated radio frequency field effect transistors (RF-FETs) with a peak cutoff frequency fT of 100 GHz for a gate length of 240 nm were fabricated using epitaxial graphene grown on the Si face of SiC that exhibited Hall mobilities up to 1450 cm^2/Vs from ungated Hall bars and 1575 cm^2/Vs from top-gated ones. This is by far the highest cut-off frequency measured from any kind of graphene.

preprint2003arXiv

ac Josephson effect in the resonant tunneling through mesoscopic superconducting junctions

We investigate ac Josephson effect in the resonant tunneling through mesoscopic superconducting junctions. In the presence of microwave irradiation, we show that the trajectory of multiple Andreev reflections can be closed by emitting or absorbing photons. Consequently, photon-assisted Andreev states are formed and play the role of carrying supercurrent. On the Shapiro steps, dc component appears when the resonant level is near a series of positions with spacing of half of the microwave frequency. Analytical result is derived in the limit of infinite superconducting gap, based on which new features of ac Josephson effect are revealed.

preprint2002arXiv

Microwave-induced pi-junction transition in a superconductor / quantum-dot / superconductor structure

Using the nonequilibrium Green function, we show that microwave irradiation can reverse the supercurrent flowing through a superconductor / quantum-dot / superconductor structure. In contrast with the conventional sideband effect in normal-metal / quantum-dot / normal-metal junctions, the photon-assisted structures appear near $E_{0}=\frac{n}{2}\hbar ω(n=\pm 1,\pm 2...)$, where $E_{0}$ is the resonant energy level of the quantum dot and $ω$ is the frequency of microwave field. Each photon-assisted structure is composed of a negative and a positive peak, with an abrupt jump from the negative peak to the positive peak around $E_{0}=\frac{n}{2}\hbar ω$. The microwave-induced $π$-junction transition is interpreted in the picture of photon-assisted Andreev bound states, which are formed due to multiple photon-assisted Andreev reflection between the two superconductors. Moreover, the main resonance located at $E_{0}=0$ can also be reversed with proper microwave strength and frequency.

Yu Zhu

What is connected

Connect this record

See the researcher in context

Building this map preview

37 published item(s)

AI of Brain and Cognitive Sciences: From the Perspective of First Principles

AIM 2022 Challenge on Super-Resolution of Compressed Image and Video: Dataset, Methods and Results

Coarse-to-Fine Cascaded Networks with Smooth Predicting for Video Facial Expression Recognition

Exploring and Evaluating Image Restoration Potential in Dynamic Scenes

Free Energy Node Embedding via Generalized Skip-gram with Negative Sampling

Hypergraph Cuts with Edge-Dependent Vertex Weights

Signal Processing on Higher-Order Networks: Livin' on the Edge ... and Beyond

Signal processing on simplicial complexes

The generality of uncooperative and cooperative effects in elementary hydrogen-bonded systems

Co-clustering Vertices and Hyperedges via Spectral Hypergraph Partitioning

Joint Active and Passive Beamforming for Intelligent Reflecting Surface Aided Multiuser MIMO Communications

Learning Depth via Leveraging Semantics: Self-supervised Monocular Depth Estimation with Both Implicit and Explicit Semantic Guidance

A short-range metastable defect in the double layer ice

AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

Attention-based network for low-light image enhancement

Channel Estimation for Intelligent Reflecting Surface-Assisted Millimeter Wave MIMO Systems

GINet: Graph Interaction Network for Scene Parsing

Network Inference from Consensus Dynamics with Unknown Parameters

Unidirectional Oriented Water Wire in Short Nanotube

Beamforming Design for Large-Scale Antenna Arrays Using Deep Learning

Strong interfacial exchange field in the graphene/EuS heterostructure

Fidelity-Naturalness Evaluation of Single Image Super Resolution

Model-Free Sure Screening via Maximum Correlation

Spin-Polarized Tunneling Study on Spin-Momentum Locking in the Topological Insulators

Direct tunneling through high-$κ$ amorphous HfO$_2$: effects of chemical modification

Statistical calibration of qRT-PCR, microarray and RNA-Seq gene expression data with measurement error models

A generic tight-binding model for monolayer, bilayer and bulk MoS2

Band offset of GaAs/AlxGa1-xAs heterojunctions from atomistic first principles

Electric control of spin in monolayer WSe2 field effect transistors

Electronic structures of III-V zinc-blende semiconductors from atomistic first principles

Nonequilibrium Green's function theory for predicting device-to-device variability

Quantum Transport Theory with the Nonequilibrium Coherent Potentials

Structure and electronic transport in graphene wrinkles

Multilayer epitaxial graphene formed by pyrolysis of polycrystalline silicon-carbide grown on c-plane sapphire substrates

Wafer-scale Epitaxial Graphene Growth on the Si-face of Hexagonal SiC (0001) for High Frequency Transistors

ac Josephson effect in the resonant tunneling through mesoscopic superconducting junctions

Microwave-induced pi-junction transition in a superconductor / quantum-dot / superconductor structure