Researcher profile

Lin Gu

Lin Gu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
34works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

34 published item(s)

preprint2026arXiv

FluxFlow: Conservative Flow-Matching for Astronomical Image Super-Resolution

Ground-to-space astronomical super-resolution requires recovering space-quality images from ground-based observations that are simultaneously limited by pixel sampling resolution and atmospheric seeing, which imposes a stochastic, spatially varying PSF that cannot be resolved through upsampling alone. Existing methods rely on synthetic training pairs that fail to capture real atmospheric statistics and are prone to either over-smoothed reconstructions or hallucination sources with no physical counterpart in the observed sky. We propose FluxFlow, a conservative pixel-space flow-matching framework that incorporates observation uncertainty and source-region importance weights during training, and a training-free Wiener-regularized test-time correction to suppress hallucination sources while preserving recovered detail. We further construct the DESI--HST Dataset, the large-scale real-world benchmark comprising 19,500 real co-registered ground-to-space image pairs with real atmospheric PSF variation. Experiments demonstrate that FluxFlow consistently outperforms existing baseline methods in both photometric and scientific accuracy.

preprint2023arXiv

Aleth-NeRF: Low-light Condition View Synthesis with Concealing Fields

Common capture low-light scenes are challenging for most computer vision techniques, including Neural Radiance Fields (NeRF). Vanilla NeRF is viewer-centred simplifies the rendering process only as light emission from 3D locations in the viewing direction, thus failing to model the low-illumination induced darkness. Inspired by the emission theory of ancient Greeks that visual perception is accomplished by rays casting from eyes, we make slight modifications on vanilla NeRF to train on multiple views of low-light scenes, we can thus render out the well-lit scene in an unsupervised manner. We introduce a surrogate concept, Concealing Fields, that reduces the transport of light during the volume rendering stage. Specifically, our proposed method, Aleth-NeRF, directly learns from the dark image to understand volumetric object representation and concealing field under priors. By simply eliminating Concealing Fields, we can render a single or multi-view well-lit image(s) and gain superior performance over other 2D low-light enhancement methods. Additionally, we collect the first paired LOw-light and normal-light Multi-view (LOM) datasets for future research. This version is invalid, please refer to our new AAAI version: arXiv:2312.09093

preprint2023arXiv

Miniature Magnetic Nano islands in a Morphotropic Cobaltite Matrix

High-density magnetic memories are key components in spintronics, quantum computing, and energy-efficient electronics. Reduced dimensionality and magnetic domain stability at the nanoscale are essential for the miniaturization of magnetic storage units. Yet, inducing magnetic order, and selectively tuning spin-orbital coupling at specific locations have remained challenging. Here we demonstrate the construction of switchable magnetic nano-islands in a nonmagnetic matrix based on cobaltite homo-structures. The magnetic and electronic states are laterally modified by epitaxial strain, which is regionally controlled by freestanding membranes. Atomically sharp grain boundaries isolate the crosstalk between magnetically distinct regions. The minimal size of magnetic nano-islands reaches 35 nm in diameter, enabling an areal density of 400 Gbit per inch square. Besides providing an ideal platform for precisely controlled read and write schemes, this methodology can enable scalable and patterned memories on silicon and flexible substrates for various applications.

preprint2023arXiv

Pressure-Induced Superconductivity in Topological Heterostructure (PbSe)5(Bi2Se3)6

Recently, the natural heterostructure of (PbSe)5(Bi2Se3)6 has been theoretically predicted and experimentally confirmed as a topological insulator. In this work, we induce superconductivity in (PbSe)5(Bi2Se3)6 by implementing high pressure. As increasing pressure up to 10 GPa, superconductivity with Tc ~ 4.6 K suddenly appears, followed by an abrupt decrease. Remarkably, upon further compression above 30 GPa, a new superconducting state arises, where pressure raises the Tc to an unsaturated 6.0 K within the limit of our research. Combining XRD and Raman spectroscopies, we suggest that the emergence of two distinct superconducting states occurs concurrently with the pressure-induced structural transition in this topological heterostructure (PbSe)5(Bi2Se3)6.

preprint2022arXiv

A new artificial photosynthetic system coupling photovoltaic electrocatalysis with photothermal catalysis

In this work, we present a novel artificial photosynthetic paradigm with square meter (m2) level scalable production by integrating photovoltaic electrolytic water splitting device and solar heating CO2 hydrogenation device, successfully achieving the synergy of 1 sun driven 19.4% solar to chemical energy efficiency (STC) for CO production (2.7 times higher than state of the art of large-sized artificial photosynthetic systems) with a low cost (equivalent to 1/7 of reported artificial photosynthetic systems). Furthermore, the outdoor artificial photosynthetic demonstration with 1.268 m2 of scale exhibits the CO generation amount of 258.4 L per day, the STC of ~15.5% for CO production in winter, which could recover the cost within 833 suuny days of operation by selling CO.

preprint2022arXiv

A Privacy-Preserving Unsupervised Domain Adaptation Framework for Clinical Text Analysis

Unsupervised domain adaptation (UDA) generally aligns the unlabeled target domain data to the distribution of the source domain to mitigate the distribution shift problem. The standard UDA requires sharing the source data with the target, having potential data privacy leaking risks. To protect the source data's privacy, we first propose to share the source feature distribution instead of the source data. However, sharing only the source feature distribution may still suffer from the membership inference attack who can infer an individual's membership by the black-box access to the source model. To resolve this privacy issue, we further study the under-explored problem of privacy-preserving domain adaptation and propose a method with a novel differential privacy training strategy to protect the source data privacy. We model the source feature distribution by Gaussian Mixture Models (GMMs) under the differential privacy setting and send it to the target client for adaptation. The target client resamples differentially private source features from GMMs and adapts on target data with several state-of-art UDA backbones. With our proposed method, the source data provider could avoid leaking source data privacy during domain adaptation as well as reserve the utility. To evaluate our proposed method's utility and privacy loss, we apply our model on a medical report disease label classification task using two noisy challenging clinical text datasets. The results show that our proposed method can preserve source data's privacy with a minor performance influence on the text classification task.

preprint2022arXiv

Atomically engineered cobaltite layers for robust ferromagnetism

Emergent phenomena at heterointerfaces are directly associated with the bonding geometry of adjacent layers. Effective control of accessible parameters, such as the bond length and bonding angles, offers an elegant method to tailor competing energies of the electronic and magnetic ground states. In this study, we construct unit thick syntactic layers of cobaltites within a strongly tilted octahedral matrix via atomically precise synthesis. The octahedral tilt patterns of adjacent layers propagate into cobaltites, leading to a continuation of octahedral tilting while maintaining significant misfit tensile strain. These effects induce severe rumpling within an atomic plane of neighboring layers triggers the electronic reconstruction between the splitting orbitals. First-principles calculations reveal that the cobalt ions transits to a higher spin state level upon octahedral tilting, resulting in robust ferromagnetism in ultrathin cobaltites. This work demonstrates a design methodology for fine-tuning the lattice and spin degrees of freedom in correlated quantum heterostructures by exploiting epitaxial geometric engineering.

preprint2022arXiv

Braiding lateral morphotropic grain boundary in homogeneitic oxides

Interfaces formed by correlated oxides offer a critical avenue for discovering emergent phenomena and quantum states. However, the fabrication of oxide interfaces with variable crystallographic orientations and strain states integrated along a film plane is extremely challenge by conventional layer-by-layer stacking or self-assembling. Here, we report the creation of morphotropic grain boundaries (GBs) in laterally interconnected cobaltite homostructures. Single-crystalline substrates and suspended ultrathin freestanding membranes provide independent templates for coherent epitaxy and constraint on the growth orientation, resulting in seamless and atomically sharp GBs. Electronic states and magnetic behavior in hybrid structures are laterally modulated and isolated by GBs, enabling artificially engineered functionalities in the planar matrix. Our work offers a simple and scalable method for fabricating unprecedented innovative interfaces through controlled synthesis routes as well as provides a platform for exploring potential applications in neuromorphics, solid state batteries, and catalysis.

preprint2022arXiv

Caging-Pnictogen-Induced Superconductivity in Skutterudites IrX3 (X = As, P)

Here we report on a new kind of compound, XδIr4X12-δ (X = P, As), the first hole-doped skutterudites superconductor. We provide atomic resolution images of the caging As atoms using scanning transmission electron microscopy (STEM). By inserting As atoms into the caged structure under a high pressure, superconductivity emerges with a maximum transition temperature (Tc) of 4.4 K (4.8 K) in IrAs3 (IrP3). In contrast to all of the electron-doped skutterudites, the electronic states around the Fermi level in XδIr4X12-δ are dominated by the caged X atom, which can be described by a simple body-centered tight-binding model, implying a distinct paring mechanism. Our density functional theory (DFT) calculations reveal an intimate relationship between the pressure-dependent local-phonon mode and the enhancement of Tc. The discovery of XδIr4X12-δ provides an arena to investigate the uncharted territory of hole-doped skutterudites, and the method proposed here represents a new strategy of carrier doping in caged structures, without introducing extra elements.

preprint2022arXiv

Emergent magnetic states and tunable exchange bias at all 3d nitride heterointerfaces

Interfacial magnetism stimulates the discovery of giant magnetoresistance and spin-orbital coupling across the heterointerfaces, facilitating the intimate correlation between spin transport and complex magnetic structures. Over decades, functional heterointerfaces composed of nitrides are seldomly explored due to the difficulty in synthesizing high-quality and correct composition nitride films. Here we report the fabrication of single-crystalline ferromagnetic Fe3N thin films with precisely controlled thickness. As film thickness decreasing, the magnetization deteriorates dramatically, and electronic state transits from metallic to insulating. Strikingly, the high-temperature ferromagnetism maintains in a Fe3N layer with a thickness down to 2 u. c. (~ 8 Å). The magnetoresistance exhibits a strong in-plane anisotropy and meanwhile the anomalous Hall resistance reserves its sign when Fe3N layer thickness exceeds 5 u. c. Furthermore, we observe a sizable exchange bias at the interfaces between a ferromagnetic Fe3N and an antiferromagnetic CrN. The exchange bias field and saturation moment strongly depend on the controllable bending curvature using cylinder diameter engineering (CDE) technique, implying the tunable magnetic states under lattice deformation. This work provides a guideline for exploring functional nitride films and applying their interfacial phenomena for innovative perspectives towards the practical applications.

preprint2022arXiv

Epitaxial stabilization of an orthorhombic Mg-Ti-O superconductor

The family of titanium oxide superconductors exhibits many intriguing phenomena comparable to cuprates and iron pnictides/chalcogenides, and thus provides an ideal platform to contrastively study the unconventional pairing mechanism of high-temperature superconductors. Here, we successfully deposit superconducting Mg-Ti-O films on MgAl$_2$O$_4$ substrates with three principal orientations by ablating a MgTi$_2$O$_4$ target. Particularly, it is striking to observed that a single-crystalline film of an unintended structure has been grown on the (011)-oriented substrate, with the highest zero resistance transition temperature ($T_{\mathrm{c}0}$) of 5.0 K among them. The film has a highly reduced Mg/Ti ratio and an orthorhombic Ti$_9$O$_{10}$-like structure (denoted as Mg: Ti$_9$O$_{10}$), demonstrated by further characterizations of chemical composition and structure. Such a structure is unstable in bulk but favorable to be epitaxially stabilized on the (011)-surface of MgAl$_2$O$_4$ due to a relatively small strain at the formed interface. An isotropic upper critical field ($B_{\mathrm{c}2}$) up to 13.7 T that breaks the Pauli limit is observed in the Mg: Ti$_9$O$_{10}$ film, analogous to other superconducting titanium oxides. The similarity points to a common origin for the superconductivity in the family, which will provide valuable opinions for the mechanism of unconventional superconductivity in transition metal compounds.

preprint2022arXiv

Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection

Image restoration algorithms such as super resolution (SR) are indispensable pre-processing modules for object detection in low quality images. Most of these algorithms assume the degradation is fixed and known a priori. However, in practical, either the real degradation or optimal up-sampling ratio rate is unknown or differs from assumption, leading to a deteriorating performance for both the pre-processing module and the consequent high-level task such as object detection. Here, we propose a novel self-supervised framework to detect objects in degraded low resolution images. We utilizes the downsampling degradation as a kind of transformation for self-supervised signals to explore the equivariant representation against various resolutions and other degradation conditions. The Auto Encoding Resolution in Self-supervision (AERIS) framework could further take the advantage of advanced SR architectures with an arbitrary resolution restoring decoder to reconstruct the original correspondence from the degraded input image. Both the representation learning and object detection are optimized jointly in an end-to-end training fashion. The generic AERIS framework could be implemented on various mainstream object detection architectures with different backbones. The extensive experiments show that our methods has achieved superior performance compared with existing methods when facing variant degradation situations. Code would be released at https://github.com/cuiziteng/ECCV_AERIS.

preprint2022arXiv

Memory Efficient Temporal & Visual Graph Model for Unsupervised Video Domain Adaptation

Existing video domain adaption (DA) methods need to store all temporal combinations of video frames or pair the source and target videos, which are memory cost expensive and can't scale up to long videos. To address these limitations, we propose a memory-efficient graph-based video DA approach as follows. At first our method models each source or target video by a graph: nodes represent video frames and edges represent the temporal or visual similarity relationship between frames. We use a graph attention network to learn the weight of individual frames and simultaneously align the source and target video into a domain-invariant graph feature space. Instead of storing a large number of sub-videos, our method only constructs one graph with a graph attention mechanism for one video, reducing the memory cost substantially. The extensive experiments show that, compared with the state-of-art methods, we achieved superior performance while reducing the memory cost significantly.

preprint2022arXiv

Multitask AET with Orthogonal Tangent Regularity for Dark Object Detection

Dark environment becomes a challenge for computer vision algorithms owing to insufficient photons and undesirable noise. To enhance object detection in a dark environment, we propose a novel multitask auto encoding transformation (MAET) model which is able to explore the intrinsic pattern behind illumination translation. In a self-supervision manner, the MAET learns the intrinsic visual structure by encoding and decoding the realistic illumination-degrading transformation considering the physical noise model and image signal processing (ISP). Based on this representation, we achieve the object detection task by decoding the bounding box coordinates and classes. To avoid the over-entanglement of two tasks, our MAET disentangles the object and degrading features by imposing an orthogonal tangent regularity. This forms a parametric manifold along which multitask predictions can be geometrically formulated by maximizing the orthogonality between the tangents along the outputs of respective tasks. Our framework can be implemented based on the mainstream object detection architecture and directly trained end-to-end using normal target detection datasets, such as VOC and COCO. We have achieved the state-of-the-art performance using synthetic and real-world datasets. Code is available at https://github.com/cuiziteng/MAET.

preprint2022arXiv

RestoreDet: Degradation Equivariant Representation for Object Detection in Low Resolution Images

Image restoration algorithms such as super resolution (SR) are indispensable pre-processing modules for object detection in degraded images. However, most of these algorithms assume the degradation is fixed and known a priori. When the real degradation is unknown or differs from assumption, both the pre-processing module and the consequent high-level task such as object detection would fail. Here, we propose a novel framework, RestoreDet, to detect objects in degraded low resolution images. RestoreDet utilizes the downsampling degradation as a kind of transformation for self-supervised signals to explore the equivariant representation against various resolutions and other degradation conditions. Specifically, we learn this intrinsic visual structure by encoding and decoding the degradation transformation from a pair of original and randomly degraded images. The framework could further take the advantage of advanced SR architectures with an arbitrary resolution restoring decoder to reconstruct the original correspondence from the degraded input image. Both the representation learning and object detection are optimized jointly in an end-to-end training fashion. RestoreDet is a generic framework that could be implemented on any mainstream object detection architectures. The extensive experiment shows that our framework based on CenterNet has achieved superior performance compared with existing methods when facing variant degradation situations. Our code would be released soon.

preprint2022arXiv

Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective

Despite recent stereo matching networks achieving impressive performance given sufficient training data, they suffer from domain shifts and generalize poorly to unseen domains. We argue that maintaining feature consistency between matching pixels is a vital factor for promoting the generalization capability of stereo matching networks, which has not been adequately considered. Here we address this issue by proposing a simple pixel-wise contrastive learning across the viewpoints. The stereo contrastive feature loss function explicitly constrains the consistency between learned features of matching pixel pairs which are observations of the same 3D points. A stereo selective whitening loss is further introduced to better preserve the stereo feature consistency across domains, which decorrelates stereo features from stereo viewpoint-specific style information. Counter-intuitively, the generalization of feature consistency between two viewpoints in the same scene translates to the generalization of stereo matching performance to unseen domains. Our method is generic in nature as it can be easily embedded into existing stereo networks and does not require access to the samples in the target domain. When trained on synthetic data and generalized to four real-world testing sets, our method achieves superior performance over several state-of-the-art networks.

preprint2022arXiv

Surgical Skill Assessment via Video Semantic Aggregation

Automated video-based assessment of surgical skills is a promising task in assisting young surgical trainees, especially in poor-resource areas. Existing works often resort to a CNN-LSTM joint framework that models long-term relationships by LSTMs on spatially pooled short-term CNN features. However, this practice would inevitably neglect the difference among semantic concepts such as tools, tissues, and background in the spatial dimension, impeding the subsequent temporal relationship modeling. In this paper, we propose a novel skill assessment framework, Video Semantic Aggregation (ViSA), which discovers different semantic parts and aggregates them across spatiotemporal dimensions. The explicit discovery of semantic parts provides an explanatory visualization that helps understand the neural network's decisions. It also enables us to further incorporate auxiliary information such as the kinematic data to improve representation learning and performance. The experiments on two datasets show the competitiveness of ViSA compared to state-of-the-art methods. Source code is available at: bit.ly/MICCAI2022ViSA.

preprint2022arXiv

Two-Dimensional Electron Gas with High Mobility Forming at BaO/SrTiO3 Interface

Two-dimensional electron gas (2DEG) formed at the interface between two insulating oxides offers an opportunity for fundamental research and device applications. Binary alkaline earth metal oxides possess compatible lattice constants with both silicon and perovskite oxides, exhibiting an enormous potential to bridging those two materials classes for multifunctionalities. Here we report the formation of 2DEG at the interface between the rock-salt BaO and SrTiO3. The highest electron mobility reaches 69000 cm^2 V.S^-1 at 2 K, leading to the typical Shubniko de Haas (SdH) oscillations under the high magnetic fields. The presence of SdH oscillations at different field-angles reveals a quasi-two-dimensional character of the Fermi surface. The first-principles calculations suggest that the effective charge transfer from the BaO to Ti 3dxy orbital at the interfaces is responsible to the observed high carrier mobility. Our results demonstrate that the BaO/STO heterointerface is a platform for exploring the correlated quantum phases, opening a door to the low-power and mesoscopic electronic devices.

preprint2021arXiv

A magnetic Weyl semimetallic phase in thin films of Eu$_2$Ir$_2$O$_7$

The interplay between electronic interactions and strong spin-orbit coupling is expected to create a plethora of fascinating correlated topological states of quantum matter. Of particular interest are magnetic Weyl semimetals originally proposed in the pyrochlore iridates, which are only expected to reveal their topological nature in thin film form. To date, however, direct experimental demonstrations of these exotic phases remain elusive, due to the lack of usable single crystals and the insufficient quality of available films. Here, we report on the discovery of the long-sought magnetic Weyl semi-metallic phase in (111)-oriented Eu$_2$Ir$_2$O$_7$ high-quality epitaxial thin films. The topological magnetic state shows an intrinsic anomalous Hall effect with colossal coercivity but vanishing net magnetization, which emerges below the onset of a peculiar magnetic phase with all-in-all-out antiferromagnetic ordering. The observed anomalous Hall conductivity arises from the non-zero Berry curvature emanated by Weyl node pairs near the Fermi level that act as sources and sinks of Berry flux, activated by broken cubic crystal symmetry at the top and bottom terminations of the thin film.

preprint2021arXiv

Dynamics of anisotropic oxygen-ion migration in strained cobaltites

Orientation control of oxygen vacancy channel (OVC) is a highly desirable for tailoring oxygen diffusion as it serves fast transport channel in ion conductors, which is widespread exploited in solid-state fuel cells, catalysts, and ion-batteries. Direct observation of oxygen-ions hopping towards preferential vacant sites is a key to clarifying migration pathways. Here we report the anisotropic oxygen-ion migration mediated by strain in ultrathin cobaltites via in-situ thermal activation in an atomic-resolved transmission electron microscopy. Oxygen migration pathways are constructed on the basis of the atomic structure during the OVC switching, which is manifested as the vertical-to-horizontal OVC switching under tensile strain, but the horizontal-to-diagonal switching under compression. We evaluate the topotactic structural changes to OVC, determine the crucial role of tolerance factor for OVC stability and establish the strain-dependent phase diagram. Our work provides a practical guide for engineering OVC orientation that is applicable ionic-oxide electronics.

preprint2021arXiv

Explainable Diabetic Retinopathy Detection and Retinal Image Generation

Though deep learning has shown successful performance in classifying the label and severity stage of certain diseases, most of them give few explanations on how to make predictions. Inspired by Koch's Postulates, the foundation in evidence-based medicine (EBM) to identify the pathogen, we propose to exploit the interpretability of deep learning application in medical diagnosis. By determining and isolating the neuron activation patterns on which diabetic retinopathy (DR) detector relies to make decisions, we demonstrate the direct relation between the isolated neuron activation and lesions for a pathological explanation. To be specific, we first define novel pathological descriptors using activated neurons of the DR detector to encode both spatial and appearance information of lesions. Then, to visualize the symptom encoded in the descriptor, we propose Patho-GAN, a new network to synthesize medically plausible retinal images. By manipulating these descriptors, we could even arbitrarily control the position, quantity, and categories of generated lesions. We also show that our synthesized images carry the symptoms directly related to diabetic retinopathy diagnosis. Our generated images are both qualitatively and quantitatively superior to the ones by previous methods. Besides, compared to existing methods that take hours to generate an image, our second level speed endows the potential to be an effective solution for data augmentation.

preprint2021arXiv

Ferromagnetic Enhancement in LaMnO3 Films with Release and Flexure

A variety of novel phenomena and functionalities emerge from lowering the dimensionality of materials and enriching the degrees of freedom in modulation. In this work, it is found that the saturation magnetization of LaMnO3 (LMO) films is largely enhanced by 56% after releasing from a brand-new phase of tetragonal strontium aluminate buffer layer, and is significantly increased by 92% with bending films to a curvature of 1 mm-1 using a water-assisted direct-transferring method. Meanwhile, the Curie temperature of LMO films has been improved by 13 K. High-resolution spherical aberration-corrected scanning transmission electron microscopy and first-principles calculations unambiguously demonstrate that the enhanced ferromagnetism is attributed to the strengthened Mn-O-Mn super-exchange interactions from the augmented characteristics of the unconventional P21/n structure caused by the out-of-plane lattice shrinking after strain releasing and increased flexure degree of freestanding LMO films. This work paves a way to achieve large-scale and crack-and-wrinkle-free freestanding films of oxides with largely improved functionalities.

preprint2021arXiv

Goal-Oriented Gaze Estimation for Zero-Shot Learning

Zero-shot learning (ZSL) aims to recognize novel classes by transferring semantic knowledge from seen classes to unseen classes. Since semantic knowledge is built on attributes shared between different classes, which are highly local, strong prior for localization of object attribute is beneficial for visual-semantic embedding. Interestingly, when recognizing unseen images, human would also automatically gaze at regions with certain semantic clue. Therefore, we introduce a novel goal-oriented gaze estimation module (GEM) to improve the discriminative attribute localization based on the class-level attributes for ZSL. We aim to predict the actual human gaze location to get the visual attention regions for recognizing a novel object guided by attribute description. Specifically, the task-dependent attention is learned with the goal-oriented GEM, and the global image features are simultaneously optimized with the regression of local attribute features. Experiments on three ZSL benchmarks, i.e., CUB, SUN and AWA2, show the superiority or competitiveness of our proposed method against the state-of-the-art ZSL methods. The ablation analysis on real gaze data CUB-VWSW also validates the benefits and accuracy of our gaze estimation module. This work implies the promising benefits of collecting human gaze dataset and automatic gaze estimation algorithms on high-level computer vision tasks. The code is available at https://github.com/osierboy/GEM-ZSL.

preprint2021arXiv

Information Bottleneck Constrained Latent Bidirectional Embedding for Zero-Shot Learning

Zero-shot learning (ZSL) aims to recognize novel classes by transferring semantic knowledge from seen classes to unseen classes. Though many ZSL methods rely on a direct mapping between the visual and the semantic space, the calibration deviation and hubness problem limit the generalization capability to unseen classes. Recently emerged generative ZSL methods generate unseen image features to transform ZSL into a supervised classification problem. However, most generative models still suffer from the seen-unseen bias problem as only seen data is used for training. To address these issues, we propose a novel bidirectional embedding based generative model with a tight visual-semantic coupling constraint. We learn a unified latent space that calibrates the embedded parametric distributions of both visual and semantic spaces. Since the embedding from high-dimensional visual features comprise much non-semantic information, the alignment of visual and semantic in latent space would inevitably been deviated. Therefore, we introduce information bottleneck (IB) constraint to ZSL for the first time to preserve essential attribute information during the mapping. Specifically, we utilize the uncertainty estimation and the wake-sleep procedure to alleviate the feature noises and improve model abstraction capability. In addition, our method can be easily extended to transductive ZSL setting by generating labels for unseen images. We then introduce a robust loss to solve this label noise problem. Extensive experimental results show that our method outperforms the state-of-the-art methods in different ZSL settings on most benchmark datasets. The code will be available at https://github.com/osierboy/IBZSL.

preprint2021arXiv

Observation of Aharonov-Bohm effect in PbTe nanowire networks

We report phase coherent electron transport in PbTe nanowire networks with a loop geometry. Magneto-conductance shows Aharonov-Bohm (AB) oscillations with periods of $h/e$ and $h/2e$ in flux. The amplitude of $h/2e$ oscillations is enhanced near zero magnetic field, possibly due to interference between time-reversal paths. Temperature dependence of the AB amplitudes suggests a phase coherence length $\sim$ 8 - 12 $μ$m at 50 mK. This length scale is larger than the typical geometry of PbTe-based hybrid semiconductor-superconductor nanowire devices.

preprint2021arXiv

Room-temperature ferromagnetism at an oxide/nitride interface

Heterointerfaces have led to the discovery of novel electronic and magnetic states because of their strongly entangled electronic degrees of freedom. Single-phase chromium compounds always exhibit antiferromagnetism following the prediction of Goodenough-Kanamori rules. So far, exchange coupling between chromium ions via hetero-anions has not been explored and the associated quantum states is unknown. Here we report the successful epitaxial synthesis and characterizations of chromium oxide (Cr2O3)-chromium nitride (CrN) superlattices. Room-temperature ferromagnetic spin ordering is achieved at the interfaces between these two antiferromagnets, and the magnitude of the effect decays with increasing layer thickness. First-principles calculations indicate that robust ferromagnetic spin interaction between Cr3+ ions via anion-hybridizations across the interface yields the lowest total energy. This work opens the door to fundamental understanding of the unexpected and exceptional properties of oxide-nitride interfaces and provides access to hidden phases at low-dimensional quantum heterostructures.

preprint2021arXiv

Selective area epitaxy of PbTe-Pb hybrid nanowires on a lattice-matched substrate

Topological quantum computing is based on braiding of Majorana zero modes encoding topological qubits. A promising candidate platform for Majorana zero modes is semiconductor-superconductor hybrid nanowires. The realization of topological qubits and braiding operations requires scalable and disorder-free nanowire networks. Selective area growth of in-plane InAs and InSb nanowires, together with shadow-wall growth of superconductor structures, have demonstrated this scalability by achieving various network structures. However, the noticeable lattice mismatch at the nanowire-substrate interface, acting as a disorder source, imposes a serious obstacle along with this roadmap. Here, combining selective area and shadow-wall growth, we demonstrate the fabrication of PbTe-Pb hybrid nanowires - another potentially promising Majorana system - on a nearly perfectly lattice-matched substrate CdTe, all done in one molecular beam epitaxy chamber. Transmission electron microscopy shows the single-crystal nature of the PbTe nanowire and its atomically sharp and clean interfaces to the CdTe substrate and the Pb overlayer, without noticeable inter-diffusion or strain. The nearly ideal interface condition, together with the strong screening of charge impurities due to the large dielectric constant of PbTe, hold promise towards a clean nanowire system to study Majorana zero modes and topological quantum computing.

preprint2021arXiv

Structural Twinning-induced Insulating Phase in CrN (111) Films

Electronic states of a correlated material can be effectively modified by structural variations delivered from a single-crystal substrate. In this letter, we show that the CrN films grown on MgO (001) substrates have a (001) orientation, whereas the CrN films on α-Al2O3 (0001) substrates are oriented along (111) direction parallel to the surface normal. Transport properties of CrN films are remarkably different depending on crystallographic orientations. The critical thickness for the metal-insulator transition (MIT) in CrN 111 films is significantly larger than that of CrN 001 films. In contrast to CrN 001 films without apparent defects, scanning transmission electron microscopy results reveal that CrN 111 films exhibit strain-induced structural defects, e. g. the periodic horizontal twinning domains, resulting in an increased electron scattering facilitating an insulating state. Understanding the key parameters that determine the electronic properties of ultrathin conductive layers is highly desirable for future technological applications.

preprint2020arXiv

Beyond Triplet Loss: Person Re-identification with Fine-grained Difference-aware Pairwise Loss

Person Re-IDentification (ReID) aims at re-identifying persons from different viewpoints across multiple cameras. Capturing the fine-grained appearance differences is often the key to accurate person ReID, because many identities can be differentiated only when looking into these fine-grained differences. However, most state-of-the-art person ReID approaches, typically driven by a triplet loss, fail to effectively learn the fine-grained features as they are focused more on differentiating large appearance differences. To address this issue, we introduce a novel pairwise loss function that enables ReID models to learn the fine-grained features by adaptively enforcing an exponential penalization on the images of small differences and a bounded penalization on the images of large differences. The proposed loss is generic and can be used as a plugin to replace the triplet loss to significantly enhance different types of state-of-the-art approaches. Experimental results on four benchmark datasets show that the proposed loss substantially outperforms a number of popular loss functions by large margins; and it also enables significantly improved data efficiency.

preprint2020arXiv

Electric-field-controllable high-spin SrRuO3 driven by a solid ionic junction

Controlling magnetism and spin structures in strongly correlated systems by using electric field is of fundamental importance but challenging. Here, a high-spin ruthenate phase is achieved via a solid ionic chemical junction at SrRuO3/SrTiO3 interface with distinct formation energies and diffusion barriers of oxygen vacancies, analogue to electronic band alignment in semiconductor heterojunction. Oxygen vacancies trapped within this interfacial SrRuO3 reconstruct Ru-4d electronic structure and orbital occupancy, leading to an enhanced magnetic moment. Furthermore, an interfacial magnetic phase can be switched reversibly by electric-field-rectifying oxygen migration in a solid-state ionic gating device, providing a framework for atomic design of functionalities in strongly correlated oxides using a way of solid chemistry.

preprint2020arXiv

Robust ferromagnetism in highly strained SrCoO3 thin films

Epitaxial strain provides important pathways to control the magnetic and electronic states in transition metal oxides. However, the large strain is usually accompanied by a strong reduction of the oxygen vacancy formation energy, which hinders the direct manipulation of their intrinsic properties. Here using a post-deposition ozone annealing method, we obtained a series of oxygen stoichiometric SrCoO3 thin films with the tensile strain up to 3.0%. We observed a robust ferromagnetic ground state in all strained thin films, while interestingly the tensile strain triggers a distinct metal to insulator transition along with the increase of the tensile strain. The persistent ferromagnetic state across the electrical transition therefore suggests that the magnetic state is directly correlated with the localized electrons, rather than the itinerant ones, which then calls for further investigation of the intrinsic mechanism of this magnetic compound beyond the double-exchange mechanism.

preprint2019arXiv

Emergent superconductivity in single crystalline $\mathrm{MgTi}_2\mathrm{O}_4$ films via structural engineering

Spinel compounds have demonstrated rich functionalities but rarely shown superconductivity. Here, we report the emergence of superconductivity in the spinel $\mathrm{MgTi}_2\mathrm{O}_4$, known to be an insulator with a complicated order. The superconducting transition is achieved by engineering a superlattice of $\mathrm{MgTi}_2\mathrm{O}_4$ and $\mathrm{SrTiO}_3$. The onset transition temperature in the $\mathrm{MgTi}_2\mathrm{O}_4$ layer can be tuned from 0 to 5 K in such geometry, concurrently with a stretched $c$-axis (from 8.51 to 8.53 Å) compared to the bulk material. Such a positive correlation without saturation suggests ample room for the further enhancement. Intriguingly, the superlattice exhibits isotropic upper critical field $H_{\mathrm{c}2}$ that breaks the Pauli limit, distinct from the highly anisotropic feature of interface superconductivity. The origin of superconductivity in the $\mathrm{MgTi}_2\mathrm{O}_4$ layer is understood in combination with the electron energy loss spectra and the first-principles electronic structure calculations, which point to the birth of superconductivity in the $\mathrm{MgTi}_2\mathrm{O}_4$ layer by preventing the Ti-Ti dimerization. Our discovery not only provides a platform to explore the interplay between the superconductivity and other exotic states, but also opens a new window to realize superconductivity in the spinel compounds as well as other titanium oxides.

preprint2019arXiv

Non-Volatile Superconductivity in an Insulating Copper Oxide Induced via Ionic Liquid Gating

Manipulating the superconducting states of high-T_c cuprate superconductors in an efficient and reliable way is of great importance for their applications in next-generation electronics. Traditional methods are mostly based on a trial-and-error method that is difficult to implement and time consuming. Here, employing ionic liquid gating, a selective control of volatile and non-volatile superconductivity is achieved in pristine insulating Pr_2CuO_{4\pmδ} film, based on two distinct mechanisms: 1) with positive electric fields, the film can be reversibly switched between non-superconducting and superconducting states, attributed to the carrier doping effect. 2) The film becomes more resistive by applying negative bias voltage up to -4 V, but strikingly, a non-volatile superconductivity is achieved once the gate voltage is removed. Such a persistent superconducting state represents a novel phenomenon in copper oxides, resulting from the doping healing of oxygen vacancies in copper-oxygen planes as unraveled by high-resolution scanning transmission electron microscope and in-situ x-ray diffraction experiments. The effective manipulation and mastering of volatile/non-volatile superconductivity in the same parent cuprate opens the door to more functionalities for superconducting electronics, as well as supplies flexible samples for investigating the nature of quantum phase transitions in high-T_c superconductors.

preprint2018arXiv

Pathological Evidence Exploration in Deep Retinal Image Diagnosis

Though deep learning has shown successful performance in classifying the label and severity stage of certain disease, most of them give few evidence on how to make prediction. Here, we propose to exploit the interpretability of deep learning application in medical diagnosis. Inspired by Koch's Postulates, a well-known strategy in medical research to identify the property of pathogen, we define a pathological descriptor that can be extracted from the activated neurons of a diabetic retinopathy detector. To visualize the symptom and feature encoded in this descriptor, we propose a GAN based method to synthesize pathological retinal image given the descriptor and a binary vessel segmentation. Besides, with this descriptor, we can arbitrarily manipulate the position and quantity of lesions. As verified by a panel of 5 licensed ophthalmologists, our synthesized images carry the symptoms that are directly related to diabetic retinopathy diagnosis. The panel survey also shows that our generated images is both qualitatively and quantitatively superior to existing methods.