Source author record

Yin Wang

Yin Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Computer Vision; cond-mat.mtrl-sci; cond-mat.mes-hall; eess.IV; Machine Learning; Networking and Internet Architecture; physics.comp-ph; Distributed, Parallel, and Cluster Computing; math.NA; Multimedia; nucl-th; physics.ao-ph; physics.app-ph; physics.optics; Programming Languages; quant-ph; Quantitative Methods

Catalog footprint

What is connected

22 works

17 topics

4 close collaborators

preprint · 2025 · arXiv

OptFormer: Optical Flow-Guided Attention and Phase Space Reconstruction for SST Forecasting

Sea Surface Temperature (SST) prediction plays a vital role in climate modeling and disaster forecasting. However, it remains challenging due to its nonlinear spatiotemporal dynamics and extended prediction horizons. To address this, we propose OptFormer, a novel encoder-decoder model that integrates phase-space reconstruction with a motion-aware attention mechanism guided by optical flow. Unlike conventional attention, our approach leverages inter-frame motion cues to highlight relative changes in the spatial field, allowing the model to focus on dynamic regions and capture long-range temporal dependencies more effectively. Experiments on NOAA SST datasets across multiple spatial scales demonstrate that OptFormer achieves superior performance under a 1:1 training-to-prediction setting, significantly outperforming existing baselines in accuracy and robustness.

preprint · 2022 · arXiv

Spherical Transformer: Adapting Spherical Signal to CNNs

Convolutional neural networks (CNNs) have been widely used in various vision tasks, such as image classification and semantic segmentation. Unfortunately, standard 2D CNNs are not well suited for spherical signals such as panorama images or spherical projections, as the sphere is an unstructured grid. In this paper, we present Spherical Transformer, which can transform spherical signals into vectors that can be directly processed by standard CNNs, such that many well-designed CNN architectures can be reused across tasks and datasets by pretraining. To this end, the proposed method first uses local structured sampling methods such as HEALPix to construct a transformer grid by using the information of spherical points and their adjacent points, and then transforms the spherical signals to the vectors through the grid. By building the Spherical Transformer module, we can use multiple CNN architectures directly. We evaluate our approach on the tasks of spherical MNIST recognition, 3D object classification and omnidirectional image semantic segmentation. For 3D object classification, we further propose a rendering-based projection method to improve the performance and a rotational-equivariant model to improve the anti-rotation ability. Experimental results on three tasks show that our approach achieves superior performance over state-of-the-art methods.

preprint · 2022 · arXiv

Training a universal instance segmentation network for live cell images of various cell types and imaging modalities

We share our recent findings in an attempt to train a universal segmentation network for various cell types and imaging modalities. Our method was built on the generalized U-Net architecture, which allows the evaluation of each component individually. We modified the traditional binary training targets to include three classes for direct instance segmentation. Detailed experiments were performed on how training schemes, training settings, network backbones, and individual modules affect the segmentation performance. Our proposed training scheme draws minibatches in turn from each dataset, and the gradients are accumulated before an optimization step. We found that the key to training a universal network is all-time supervision on all datasets, and it is necessary to sample each dataset in an unbiased way. Our experiments also suggest that there might exist common features to define cell boundaries across cell types and imaging modalities, which could allow application of trained models to totally unseen datasets. A few training tricks can further boost the segmentation performance, including uneven class weights in the cross-entropy loss function, a well-designed learning rate scheduler, larger image crops for contextual information, and additional loss terms for unbalanced classes. We also found that segmentation performance can benefit from a group normalization layer and an Atrous Spatial Pyramid Pooling module, thanks to their more reliable statistics estimation and improved semantic understanding, respectively. We participated in the 6th Cell Tracking Challenge (CTC) held at the IEEE International Symposium on Biomedical Imaging (ISBI) 2021 using one of the developed variants. Our method was evaluated as the best runner-up during the initial submission for the primary track, and also secured 3rd place in an additional round of competition in preparation for the summary publication.
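The training scheme described in this abstract (one minibatch from each dataset in turn, gradients accumulated, then a single optimization step) can be illustrated with a minimal sketch. This is not the authors' code: the scalar model `y = w * x`, the toy datasets, and every function name below are hypothetical stand-ins chosen so the example runs with the standard library only.

```python
import random


def make_dataset(slope, n, seed):
    """Hypothetical toy dataset: n pairs (x, slope * x)."""
    rng = random.Random(seed)
    return [(x, slope * x) for x in (rng.uniform(-1.0, 1.0) for _ in range(n))]


def minibatch_gradient(w, batch):
    """Gradient of the mean squared error for the scalar model y = w * x."""
    return sum(2.0 * (w * x - y) * x for x, y in batch) / len(batch)


def train_round_robin(datasets, steps=200, batch_size=8, lr=0.5, seed=0):
    rng = random.Random(seed)
    w = 0.0
    for _ in range(steps):
        accumulated = 0.0
        # Draw one minibatch from every dataset in turn, so each dataset
        # contributes to every optimization step with equal weight.
        for data in datasets:
            batch = rng.sample(data, batch_size)
            accumulated += minibatch_gradient(w, batch)
        # One optimizer step on the accumulated (here: averaged) gradient.
        w -= lr * accumulated / len(datasets)
    return w


# Two toy "datasets" that disagree on the best slope (1.0 versus 3.0).
datasets = [make_dataset(1.0, 64, seed=1), make_dataset(3.0, 64, seed=2)]
w = train_round_robin(datasets)
```

Because both datasets supervise every step, the learned slope settles between the two dataset-specific optima instead of drifting toward whichever dataset was sampled last.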

preprint · 2021 · arXiv

HEROHE Challenge: assessing HER2 status in breast cancer without immunohistochemistry or in situ hybridization

Breast cancer is the most common malignancy in women, being responsible for more than half a million deaths every year. As such, early and accurate diagnosis is of paramount importance. Human expertise is required to diagnose and correctly classify breast cancer and define appropriate therapy, which depends on the evaluation of the expression of different biomarkers such as the transmembrane protein receptor HER2. This evaluation requires several steps, including special techniques such as immunohistochemistry or in situ hybridization to assess HER2 status. With the goal of reducing the number of steps and human bias in diagnosis, the HEROHE Challenge was organized as a parallel event of the 16th European Congress on Digital Pathology, aiming to automate the assessment of HER2 status based only on hematoxylin-and-eosin-stained tissue samples of invasive breast cancer. Methods to assess HER2 status were presented by 21 teams worldwide, and the results achieved by some of the proposed methods open potential perspectives to advance the state of the art.

preprint · 2020 · arXiv

Key Frame Proposal Network for Efficient Pose Estimation in Videos

Human pose estimation in video relies on local information by either estimating each frame independently or tracking poses across frames. In this paper, we propose a novel method combining local approaches with global context. We introduce a lightweight, unsupervised key frame proposal network (K-FPN) to select informative frames and a learned dictionary to recover the entire pose sequence from these frames. The K-FPN speeds up the pose estimation and provides robustness to bad frames with occlusion, motion blur, and illumination changes, while the learned dictionary provides global dynamic context. Experiments on Penn Action and sub-JHMDB datasets show that the proposed method achieves state-of-the-art accuracy, with substantial speed-up.

preprint · 2020 · arXiv

Largely enhanced photogalvanic effects in the phosphorene photodetector by strain-increased device asymmetry

The photogalvanic effect (PGE) occurring in noncentrosymmetric materials enables the generation of an open-circuit voltage that is much larger than the bandgap, making it rather attractive in solar cells. However, the magnitude of the PGE photocurrent is usually small, which severely hampers its practical application. Here we propose a mechanism to largely enhance the PGE photocurrent by mechanical strain, based on quantum transport simulations for the two-dimensional nickel-phosphorene-nickel photodetector. Broadband PGE photocurrent governed by the Cs noncentrosymmetry is generated at zero bias under the illumination of linearly polarized light. The photocurrent depends linearly on the device asymmetry, while nonlinearly on the optical absorption. By applying appropriate mechanical tensile stress on the phosphorene, the photocurrent can be substantially enhanced by up to 3 orders of magnitude, which is primarily ascribed to the largely increased device asymmetry. The change in the optical absorption in some cases can also play a critical role in tuning the photocurrent due to the nonlinear dependence. Moreover, the photocurrent can be further enhanced by mechanical bending, mainly owing to the considerably enhanced device asymmetry. Our results reveal the dependence of the PGE photocurrent on the device asymmetry and absorption in the transport process through a device, and also explore the potential of the PGE in self-powered low-dimensional flexible optoelectronics.

preprint · 2020 · arXiv

The 1st Agriculture-Vision Challenge: Methods and Results

The first Agriculture-Vision Challenge aims to encourage research in developing novel and effective algorithms for agricultural pattern recognition from aerial images, especially for the semantic segmentation task associated with our challenge dataset. Around 57 participating teams from various countries competed to achieve state-of-the-art results in aerial agriculture semantic segmentation. The Agriculture-Vision Challenge Dataset was employed, which comprises 21,061 aerial and multi-spectral farmland images. This paper provides a summary of notable methods and results in the challenge. Our submission server and leaderboard will remain open to researchers who are interested in this challenge dataset and task; the link can be found here.

preprint · 2016 · arXiv

Impurity-limited quantum transport variability in magnetic tunnel junctions

We report an extensive first-principles investigation of impurity-induced device-to-device variability of spin-polarized quantum tunneling through Fe/MgO/Fe magnetic tunnel junctions (MTJ). In particular, we calculated the tunnel magnetoresistance ratio (TMR) and the average values and variances of the currents and spin transfer torque (STT) of an interfacially doped Fe/MgO/Fe MTJ. Further, we predicted that N-doped MgO can improve the performance of a doped Fe/MgO/Fe MTJ. Our first-principles calculations of the fluctuations of the on/off currents and STT provide vital information for future predictions of the long-term reliability of spintronic devices, which is imperative for high-volume production.

preprint · 2016 · arXiv

Large influence of capping layers on tunnel magnetoresistance in magnetic tunnel junctions

It has been reported in experiments that capping layers which enhance the perpendicular magnetic anisotropy (PMA) of magnetic tunnel junctions (MTJs) have a great impact on the tunnel magnetoresistance (TMR). To explore the essential influence caused by capping layers, we carry out ab initio calculations on TMR in the X(001)|CoFe(001)|MgO(001)|CoFe(001)|X(001) MTJ, where X represents the capping layer material, which can be tungsten, tantalum or hafnium. We report TMR in different MTJs and demonstrate that tungsten is an ideal candidate for a giant TMR ratio. The transmission spectrum in the Brillouin zone is presented. It can be seen that in the parallel condition of the MTJ, sharp transmission peaks appear in the minority-spin channel. This phenomenon is attributed to the resonant tunnel transmission effect, and we explain it by the layer-resolved density of states (DOS). In order to explore transport properties in MTJs, the density of scattering states (DOSS) was studied from the point of view of band symmetry. It has been found that the CoFe|tungsten interface blocks scattering-state transmission in the anti-parallel condition. This work reports TMR and transport properties in MTJs with different capping layers, and shows that tungsten is a proper capping layer material, which would benefit the design and optimization of MTJs.

preprint · 2016 · arXiv

Nonequilibrium spin injection in monolayer black phosphorus

Monolayer black phosphorus (MBP) is an interesting emerging electronic material with a direct band gap and relatively high carrier mobility. In this work we report a theoretical investigation of nonequilibrium spin injection and spin-polarized quantum transport in MBP from ferromagnetic Ni contacts, in two-dimensional magnetic tunneling structures. We investigate physical properties such as the spin injection efficiency, the tunnel magnetoresistance ratio, spin-polarized currents, charge currents and transmission coefficients as a function of external bias voltage, for two different device contact structures where MBP is contacted by Ni(111) and by Ni(100). While both structures are predicted to give respectable spin-polarized quantum transport, the Ni(100)/MBP/Ni(100) trilayer has the superior properties, in that the spin injection efficiency and magnetoresistance ratio maintain an almost constant value against the bias voltage. The nonequilibrium quantum transport phenomenon is understood by analyzing the transmission spectrum at nonequilibrium.

preprint · 2016 · arXiv

Spin-polarized quantum transport properties through flexible phosphorene

We report a first-principles study on the tunnel magnetoresistance (TMR) and spin-injection efficiency (SIE) through phosphorene with nickel electrodes under mechanical tension and bending of the phosphorene region. Both the TMR and SIE are largely improved under these mechanical deformations. For the uniaxial tension ($\varepsilon_y$) varying from 0 to 15% applied along the armchair transport ($y$-)direction of the phosphorene, the TMR ratio is enhanced with a maximum of 107% at $\varepsilon_y=10\%$, while the SIE increases monotonically from 8% up to 43% with increasing strain. Under the out-of-plane bending, the TMR overall increases from 7% to 50% within the bending ratio of 0-3.9%, and meanwhile the SIE is largely improved to around 70%, as compared to that (30%) of the flat phosphorene. Such behaviors of the TMR and SIE are mainly affected by the transmission of spin-up electrons in the parallel configuration, which is highly dependent on the applied mechanical tension and bending. Our results indicate that phosphorene-based tunnel junctions have promising applications in flexible electronics.

preprint · 2015 · arXiv

Automatic Objects Removal for Scene Completion

With the explosive growth of web-based cameras and mobile devices, billions of photographs are uploaded to the internet. We can trivially collect a huge number of photo streams for various goals, such as 3D scene reconstruction and other big data applications. However, this is not an easy task because the retrieved photos are neither aligned nor calibrated. Furthermore, with the occlusion of unexpected foreground objects such as people and vehicles, it is even more challenging to find feature correspondences and reconstruct realistic scenes. In this paper, we propose a structure-based image completion algorithm for object removal that produces visually plausible content with consistent structure and scene texture. We use an edge matching technique to infer the potential structure of the unknown region. Driven by the estimated structure, texture synthesis is performed automatically along the estimated curves. We evaluate the proposed method on different types of images, from highly structured indoor environments to natural scenes. Our experimental results demonstrate satisfactory performance that can potentially be used for subsequent big data processing: 3D scene reconstruction and location recognition.

preprint · 2015 · arXiv

Shape phase transition in the odd Sm nuclei: effective order parameter and odd-even effect

Some binding-energy-related quantities serving as effective order parameters have been used to analyze the shape phase transition in the odd Sm nuclei. It is found that the signals of phase transition in the odd Sm nuclei are greatly enhanced in contrast to the even Sm nuclei. A further analysis shows that the transitional behaviors related to pairing in the Sm nuclei can be well described by the mean field plus pairing interaction model, with a monotonic decrease in the pairing strength $G$.

preprint · 2014 · arXiv

Compression of Video Tracking and Bandwidth Balancing Routing in Wireless Multimedia Sensor Networks

There has been tremendous growth in multimedia applications over wireless networks. Wireless Multimedia Sensor Networks (WMSNs) have become the premier choice in many research communities and in industry. Many state-of-the-art applications, such as surveillance, traffic monitoring, and remote health care, are essentially video tracking and transmission in WMSNs. The transmission speed is constrained by the large size of video data and by fixed bandwidth allocation along a constant routing path. In this paper, we present a CamShift-based algorithm to compress the tracking of videos. Then we propose a bandwidth balancing strategy in which each sensor node is able to dynamically select the next-hop node with the highest potential bandwidth capacity to resume communication. Key to the strategy is that each node maintains merely two parameters that contain its historical bandwidth trend, and from them predicts its near-future bandwidth capacity. The forwarding node then selects the next hop with the highest potential bandwidth capacity. Simulations demonstrate that our approach significantly increases the data received by the sink node and decreases the delay of video transmission in a Wireless Multimedia Sensor Network environment.
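The next-hop rule in this abstract can be sketched as follows. The abstract does not say what the two per-node parameters are or how the prediction is made, so this example assumes a smoothed level and a smoothed trend (double exponential smoothing); that predictor, the class `BandwidthEstimate`, the function `select_next_hop`, and the sample values are all illustrative assumptions rather than the paper's method.

```python
from dataclasses import dataclass


@dataclass
class BandwidthEstimate:
    """Two values kept per node (an assumed reading of 'two parameters')."""
    level: float        # smoothed bandwidth
    trend: float = 0.0  # smoothed change between observations

    def update(self, observed, alpha=0.5, beta=0.5):
        """Fold one new bandwidth observation into level and trend."""
        previous_level = self.level
        self.level = alpha * observed + (1 - alpha) * (self.level + self.trend)
        self.trend = beta * (self.level - previous_level) + (1 - beta) * self.trend

    def predict(self):
        """One-step-ahead forecast of the node's bandwidth capacity."""
        return self.level + self.trend


def select_next_hop(neighbors):
    """Return the id of the neighbor with the highest predicted bandwidth."""
    return max(neighbors, key=lambda node_id: neighbors[node_id].predict())


# Node "a" starts higher but is falling; node "b" starts lower but is rising.
neighbors = {"a": BandwidthEstimate(level=10.0), "b": BandwidthEstimate(level=8.0)}
for sample_a, sample_b in [(9.0, 10.0), (8.0, 12.0), (7.0, 14.0)]:
    neighbors["a"].update(sample_a)
    neighbors["b"].update(sample_b)

next_hop = select_next_hop(neighbors)
```

Keeping a trend alongside the level lets the forwarding node prefer a neighbor whose bandwidth is rising over one that is currently similar but declining.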

preprint · 2014 · arXiv

Direct tunneling through high-$κ$ amorphous HfO$_2$: effects of chemical modification

We report first principles modeling of quantum tunneling through the amorphous HfO$_2$ dielectric layer of metal-oxide-semiconductor (MOS) nanostructures in the form of n-Si/HfO$_2$/Al. In particular, we predict that chemically modifying the amorphous HfO$_2$ barrier by doping N and Al atoms in the middle region, far from the two interfaces of the MOS structure, can reduce the gate-to-channel tunnel leakage by more than one order of magnitude. Several other types of modification are found to enhance tunneling or induce substantial band bending in the Si, neither of which is desirable from a leakage point of view. By analyzing transmission coefficients and projected density of states, the microscopic physics of electrons traversing the tunnel barrier with or without impurity atoms in the high-$κ$ dielectric is revealed.

preprint · 2013 · arXiv

Band offset of GaAs/AlxGa1-xAs heterojunctions from atomistic first principles

Using an atomistic first principles approach, we investigate the band offset of the GaAs/AlxGa1-xAs heterojunctions for the entire range of the Al doping concentration $0 < x \le 1$. We apply the coherent potential approach to handle the configuration average of Al doping and a recently proposed semi-local exchange potential to accurately determine the band gaps of the materials. The calculated band structures of the GaAs and AlAs crystals and the band gaps of the GaAs/AlxGa1-xAs alloys are in very good agreement with the experimental results. We predict that the valence band offset of the GaAs/AlxGa1-xAs heterojunction scales with the Al concentration $x$ in a linear fashion as $\mathrm{VBO}(x) \sim 0.587\,x$, and the conduction band offset scales with $x$ in a nonlinear fashion. Quantitative comparisons to the corresponding experimental data are made.

preprint · 2013 · arXiv

Electronic structures of III-V zinc-blende semiconductors from atomistic first principles

For analyzing quantum transport in semiconductor devices, accurate electronic structures are critical for quantitative predictions. Here we report theoretical analysis of electronic structures of all III-V zinc-blende semiconductor compounds. Our calculations are from density functional theory with the semi-local exchange proposed recently [F. Tran and P. Blaha, Phys. Rev. Lett. 102, 226401 (2009)], within the linear muffin tin orbital scheme. The calculated band gaps and effective masses are compared to experimental data and good quantitative agreement is obtained. Using the theoretical scheme presented here, quantum transport in nanostructures of III-V compounds can be confidently predicted.

preprint · 2013 · arXiv

Structure and Dielectric Properties of Amorphous High-kappa Oxides: HfO2, ZrO2 and their alloys

High-$κ$ metal oxides are a class of materials playing an increasingly important role in modern device physics and technology. Here we report theoretical investigations of the structural properties and lattice dielectric constants of bulk amorphous metal oxides by a combined approach of classical molecular dynamics (MD) for structure evolution, and quantum mechanical first principles density functional theory (DFT) for electronic structure analysis. Using classical MD based on the Born-Mayer-Buckingham potential function within a melt-and-quench scheme, amorphous structures of high-$κ$ metal oxides Hf$_{1-x}$Zr$_x$O$_2$ with different values of the concentration $x$ are generated. The coordination numbers and the radial distribution functions of the structures are in good agreement with the corresponding experimental data. We then calculate the lattice dielectric constants of the materials from quantum mechanical first principles, and the values averaged over an ensemble of samples agree well with the available experimental data, and are very close to the dielectric constants of their cubic form.

preprint · 2012 · arXiv

A stable algorithm for non-homogeneous waveguide equation based on DtN maps

A new stable computational method for the non-homogeneous waveguide equation with a piecewise uniform structure along the main propagation direction is constructed, based on the modified Dirichlet-to-Neumann (DtN) map of each uniform segment. For segments with the same structure, only one DtN map needs to be calculated, and then the solution of the equation can be derived recursively. Numerical examples demonstrate that it is a stable and efficient algorithm for waveguide equations. This method can greatly reduce the memory requirement and the amount of computation compared with traditional algorithms.

preprint · 2012 · arXiv

Register Allocation By Model Transformer Semantics

Register allocation has long been formulated as a graph coloring problem, coloring the conflict graph with physical registers. Such a formulation does not fully capture the goal of the allocation, which is to minimize the traffic between registers and memory. Linear scan has been proposed as an alternative to graph coloring, but in essence, it can be viewed as a greedy algorithm for graph coloring: coloring the vertices not in the order of their degrees, but in the order of their occurrence in the program. Thus it suffers from almost the same constraints as graph coloring. In this article, I propose a new method of register allocation based on the ideas of model transformer semantics (MTS) and static cache replacement (SCR). Model transformer semantics captures the semantics of registers and the stack. Static cache replacement relaxes the assumptions made by graph coloring and linear scan, aiming directly at reducing register-memory traffic. The method explores a much larger solution space than that of graph coloring and linear scan, thus providing more opportunities for optimization. It seamlessly performs live range splitting, an optimization found in extensions to graph coloring and linear scan. Also, it simplifies the compiler, and its semantics-based approach provides possibilities for simplifying the formal verification of compilers.
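For readers unfamiliar with the baseline this abstract argues against, here is a minimal sketch of classic linear scan allocation over live intervals. It illustrates the "greedy in order of occurrence" behaviour only; it is not the MTS/SCR method proposed in the article, and the function name, the interval encoding, and the example ranges are assumptions made for illustration.

```python
def linear_scan(intervals, num_registers):
    """Assign registers to live intervals in order of their start points.

    intervals: dict mapping a variable name to (start, end), end inclusive.
    Returns a dict mapping each name to a register index or to "spill".
    """
    free = list(range(num_registers))
    active = []      # (end, name) for variables currently holding a register
    assignment = {}
    for name, (start, end) in sorted(intervals.items(), key=lambda kv: kv[1][0]):
        # Release registers whose intervals ended before this one starts.
        for old_end, old_name in list(active):
            if old_end < start:
                active.remove((old_end, old_name))
                free.append(assignment[old_name])
        if free:
            assignment[name] = free.pop(0)
            active.append((end, name))
        else:
            # No register left: spill whichever interval ends last.
            last_end, last_name = max(active)
            if last_end > end:
                assignment[name] = assignment[last_name]
                assignment[last_name] = "spill"
                active.remove((last_end, last_name))
                active.append((end, name))
            else:
                assignment[name] = "spill"
    return assignment


# Hypothetical live ranges, in instruction positions, with two registers.
ranges = {"a": (0, 6), "b": (1, 3), "c": (2, 5), "d": (4, 7)}
allocation = linear_scan(ranges, num_registers=2)
```

Each variable is either kept in one register for its whole interval or spilled entirely, which is the kind of constraint the abstract says live range splitting is meant to relax.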

preprint · 2012 · arXiv

Triggercast: Enabling Wireless Collisions Constructive

It is generally considered that concurrent transmissions should be avoided in order to reduce collisions in wireless sensor networks. Constructive interference (CI) envisions concurrent transmissions that positively interfere at the receiver. CI potentially allows orders-of-magnitude reductions in energy consumption and improvements in link quality. In this paper, we theoretically introduce a sufficient condition to construct CI with IEEE 802.15.4 radio for the first time. Moreover, we propose Triggercast, a distributed middleware, and show it is feasible to generate CI in TMote Sky sensor nodes. To synchronize transmissions of multiple senders at the chip level, Triggercast effectively compensates for propagation and radio processing delays, and has $95^{th}$ percentile synchronization errors of at most 250 ns. Triggercast also intelligently decides which co-senders participate in simultaneous transmissions, and aligns their transmission times to maximize the overall link PRR, under the condition of maximal system robustness. Extensive experiments in real testbeds reveal that Triggercast significantly improves PRR from 5% to 70% with 7 concurrent senders. We also demonstrate that Triggercast provides on average $1.3\times$ PRR performance gains when integrated with existing data forwarding protocols.

preprint · 2010 · arXiv

A progressive diagonalization scheme for the Rabi Hamiltonian

A diagonalization scheme for the Rabi Hamiltonian, which describes a qubit interacting with a single-mode radiation field via a dipole interaction, is proposed. It is shown that the Rabi Hamiltonian can be solved almost exactly using a progressive scheme that involves a finite set of one-variable polynomial equations. The scheme is especially efficient for the lower part of the spectrum. Some low-lying energy levels of the model with several sets of parameters are calculated and compared to those provided by the recently proposed generalized rotating-wave approximation and full matrix diagonalization.
