Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2022arXiv

Causality Inspired Representation Learning for Domain Generalization

Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn representations independent of domain. Nevertheless, the statistical models are superficial descriptions of reality since they are only required to model dependence instead of the intrinsic causal mechanism. When the dependence changes with the target distribution, the statistic models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, the theoretical idea is far from practical of DG since the required causal/non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: separated from the non-causal ones, jointly independent, and causally sufficient for the classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach.

preprint2022arXiv

DeepRecon: Joint 2D Cardiac Segmentation and 3D Volume Reconstruction via A Structure-Specific Generative Method

Joint 2D cardiac segmentation and 3D volume reconstruction are fundamental to building statistical cardiac anatomy models and understanding functional mechanisms from motion patterns. However, due to the low through-plane resolution of cine MR and high inter-subject variance, accurately segmenting cardiac images and reconstructing the 3D volume are challenging. In this study, we propose an end-to-end latent-space-based framework, DeepRecon, that generates multiple clinically essential outcomes, including accurate image segmentation, synthetic high-resolution 3D image, and 3D reconstructed volume. Our method identifies the optimal latent representation of the cine image that contains accurate semantic information for cardiac structures. In particular, our model jointly generates synthetic images with accurate semantic information and segmentation of the cardiac structures using the optimal latent representation. We further explore downstream applications of 3D shape reconstruction and 4D motion pattern adaptation by the different latent-space manipulation strategies.The simultaneously generated high-resolution images present a high interpretable value to assess the cardiac shape and motion.Experimental results demonstrate the effectiveness of our approach on multiple fronts including 2D segmentation, 3D reconstruction, downstream 4D motion pattern adaption performance.

preprint2022arXiv

FAT: An In-Memory Accelerator with Fast Addition for Ternary Weight Neural Networks

Convolutional Neural Networks (CNNs) demonstrate excellent performance in various applications but have high computational complexity. Quantization is applied to reduce the latency and storage cost of CNNs. Among the quantization methods, Binary and Ternary Weight Networks (BWNs and TWNs) have a unique advantage over 8-bit and 4-bit quantization. They replace the multiplication operations in CNNs with additions, which are favoured on In-Memory-Computing (IMC) devices. IMC acceleration for BWNs has been widely studied. However, though TWNs have higher accuracy and better sparsity than BWNs, IMC acceleration for TWNs has limited research. TWNs on existing IMC devices are inefficient because the sparsity is not well utilized, and the addition operation is not efficient. In this paper, we propose FAT as a novel IMC accelerator for TWNs. First, we propose a Sparse Addition Control Unit, which utilizes the sparsity of TWNs to skip the null operations on zero weights. Second, we propose a fast addition scheme based on the memory Sense Amplifier to avoid the time overhead of both carry propagation and writing back the carry to memory cells. Third, we further propose a Combined-Stationary data mapping to reduce the data movement of activations and weights and increase the parallelism across memory columns. Simulation results show that for addition operations at the Sense Amplifier level, FAT achieves 2.00X speedup, 1.22X power efficiency, and 1.22X area efficiency compared with a State-Of-The-Art IMC accelerator ParaPIM. FAT achieves 10.02X speedup and 12.19X energy efficiency compared with ParaPIM on networks with 80% average sparsity.

preprint2022arXiv

Improving Robustness of Convolutional Neural Networks Using Element-Wise Activation Scaling

Recent works reveal that re-calibrating the intermediate activation of adversarial examples can improve the adversarial robustness of a CNN model. The state of the arts [Baiet al., 2021] and [Yanet al., 2021] explores this feature at the channel level, i.e. the activation of a channel is uniformly scaled by a factor. In this paper, we investigate the intermediate activation manipulation at a more fine-grained level. Instead of uniformly scaling the activation, we individually adjust each element within an activation and thus propose Element-Wise Activation Scaling, dubbed EWAS, to improve CNNs' adversarial robustness. Experimental results on ResNet-18 and WideResNet with CIFAR10 and SVHN show that EWAS significantly improves the robustness accuracy. Especially for ResNet18 on CIFAR10, EWAS increases the adversarial accuracy by 37.65% to 82.35% against C&W attack. EWAS is simple yet very effective in terms of improving robustness. The codes are anonymously available at https://anonymous.4open.science/r/EWAS-DD64.

preprint2022arXiv

Leading Two-loop corrections to the mass of Higgs boson in the High scale Dirac gaugino supersymmetry

Precision measurements of the Higgs mass have become a powerful constraint on models of physics beyond the standard model. We revisit supersymmetric models with Dirac gauginos and study the contributions to the Higgs mass. We calculate the leading two-loop corrections to the SM-like Higgs mass by constructing a series of EFTs and iteratively integrating out heavy particles. We then apply these calculations to a variety of scenarios, including a simple Dirac gluino, and split Dirac models of supersymmetry. We present the detailed formulae for threshold corrections and compare with previous results, where available. In general, the contributions are small, but the additional precision allows us to make more concrete statements about the relevant scales in Dirac SUSY models.

preprint2022arXiv

Spin-optical dynamics and quantum efficiency of single V1 center in silicon carbide

Color centers in silicon carbide are emerging candidates for distributed spin-based quantum applications due to the scalability of host materials and the demonstration of integration into nanophotonic resonators. Recently, silicon vacancy centers in silicon carbide have been identified as a promising system with excellent spin and optical properties. Here, we in-depth study the spin-optical dynamics of single silicon vacancy center at hexagonal lattice sites, namely V1, in 4H-polytype silicon carbide. By utilizing resonant and above-resonant sub-lifetime pulsed excitation, we determine spin-dependent excited-state lifetimes and intersystem-crossing rates. Our approach to inferring the intersystem-crossing rates is based on all-optical pulsed initialization and readout scheme, and is applicable to spin-active color centers with similar dynamics models. In addition, the optical transition dipole strength and the quantum efficiency of V1 defect are evaluated based on coherent optical Rabi measurement and local-field calibration employing electric-field simulation. The measured rates well explain the results of spin-state polarization dynamics, and we further discuss the altered photoemission dynamics in resonant enhancement structures such as radiative lifetime shortening and Purcell enhancement. By providing a thorough description of V1 center's spin-optical dynamics, our work provides deep understanding of the system which guides implementations of scalable quantum applications based on silicon vacancy centers in silicon carbide.

preprint2022arXiv

TransFusion: Multi-view Divergent Fusion for Medical Image Segmentation with Transformers

Combining information from multi-view images is crucial to improve the performance and robustness of automated methods for disease diagnosis. However, due to the non-alignment characteristics of multi-view images, building correlation and data fusion across views largely remain an open problem. In this study, we present TransFusion, a Transformer-based architecture to merge divergent multi-view imaging information using convolutional layers and powerful attention mechanisms. In particular, the Divergent Fusion Attention (DiFA) module is proposed for rich cross-view context modeling and semantic dependency mining, addressing the critical issue of capturing long-range correlations between unaligned data from different image views. We further propose the Multi-Scale Attention (MSA) to collect global correspondence of multi-scale feature representations. We evaluate TransFusion on the Multi-Disease, Multi-View \& Multi-Center Right Ventricular Segmentation in Cardiac MRI (M\&Ms-2) challenge cohort. TransFusion demonstrates leading performance against the state-of-the-art methods and opens up new perspectives for multi-view imaging integration towards robust medical image segmentation.

preprint2022arXiv

You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms

Benefiting from the search efficiency, differentiable neural architecture search (NAS) has evolved as the most dominant alternative to automatically design competitive deep neural networks (DNNs). We note that DNNs must be executed under strictly hard performance constraints in real-world scenarios, for example, the runtime latency on autonomous vehicles. However, to obtain the architecture that meets the given performance constraint, previous hardware-aware differentiable NAS methods have to repeat a plethora of search runs to manually tune the hyper-parameters by trial and error, and thus the total design cost increases proportionally. To resolve this, we introduce a lightweight hardware-aware differentiable NAS framework dubbed LightNAS, striving to find the required architecture that satisfies various performance constraints through a one-time search (i.e., \underline{\textit{you only search once}}). Extensive experiments are conducted to show the superiority of LightNAS over previous state-of-the-art methods.

preprint2021arXiv

Nanofabricated and integrated colour centres in silicon carbide with high-coherence spin-optical properties

Optically addressable spin defects in silicon carbide (SiC) are an emerging platform for quantum information processing. Lending themselves to modern semiconductor nanofabrication, they promise scalable high-efficiency spin-photon interfaces. We demonstrate here nanoscale fabrication of silicon vacancy centres (VSi) in 4H-SiC without deterioration of their intrinsic spin-optical properties. In particular, we show nearly transform limited photon emission and record spin coherence times for single defects generated via ion implantation and in triangular cross section waveguides. For the latter, we show further controlled operations on nearby nuclear spin qubits, which is crucial for fault-tolerant quantum information distribution based on cavity quantum electrodynamics.

preprint2021arXiv

Narrow inhomogeneous distribution of spin-active emitters in silicon carbide

Optically active solid-state spin registers have demonstrated their unique potential in quantum computing, communication and sensing. Realizing scalability and increasing application complexity requires entangling multiple individual systems, e.g. via photon interference in an optical network. However, most solid-state emitters show relatively broad spectral distributions, which hinders optical interference experiments. Here, we demonstrate that silicon vacancy centres in semiconductor silicon carbide (SiC) provide a remarkably small natural distribution of their optical absorption/emission lines despite an elevated defect concentration of $\approx 0.43\,\rm μm^{-3}$. In particular, without any external tuning mechanism, we show that only 13 defects have to be investigated until at least two optical lines overlap within the lifetime-limited linewidth. Moreover, we identify emitters with overlapping emission profiles within diffraction limited excitation spots, for which we introduce simplified schemes for generation of computationally-relevant Greenberger-Horne-Zeilinger (GHZ) and cluster states. Our results underline the potential of the CMOS-compatible SiC platform toward realizing networked quantum technology applications.

preprint2021arXiv

Transverse mode-encoded quantum gate on a silicon photonic chip

As an important degree of freedom (DoF) in integrated photonic circuits, the orthogonal transverse mode provides a promising and flexible way to increasing communication capability, for both classical and quantum information processing. To construct large-scale on-chip multimode multi-DoF quantum systems, a transverse mode-encoded controlled-NOT (CNOT) gate is necessary. Here, through design and integrate transverse mode-dependent directional coupler and attenuators on a silicon photonic chip, we demonstrate the first multimode implementation of a two-qubit quantum gate. With the aid of state preparation and analysis parts, we show the ability of the gate to entangle two separated transverse mode qubits with an average fidelity of $0.89\pm0.02$ and the achievement of 10 standard deviations of violations in the quantum nonlocality verification. In addition, a fidelity of $0.82\pm0.01$ was obtained from quantum process tomography used to completely characterize the CNOT gate. Our work paves the way for universal transverse mode-encoded quantum operations and large-scale multimode multi-DoF quantum systems.

preprint2020arXiv

A fast approximate skeleton with guarantees for any cloud of points in a Euclidean space

The tree reconstruction problem is to find an embedded straight-line tree that approximates a given cloud of unorganized points in $\mathbb{R}^m$ up to a certain error. A practical solution to this problem will accelerate a discovery of new colloidal products with desired physical properties such as viscosity. We define the Approximate Skeleton of any finite point cloud $C$ in a Euclidean space with theoretical guarantees. The Approximate Skeleton ASk$(C)$ always belongs to a given offset of $C$, i.e. the maximum distance from $C$ to ASk$(C)$ can be a given maximum error. The number of vertices in the Approximate Skeleton is close to the minimum number in an optimal tree by factor 2. The new Approximate Skeleton of any unorganized point cloud $C$ is computed in a near linear time in the number of points in $C$. Finally, the Approximate Skeleton outperforms past skeletonization algorithms on the size and accuracy of reconstruction for a large dataset of real micelles and random clouds.

preprint2020arXiv

Energy-Aware Offloading in Time-Sensitive Networks with Mobile Edge Computing

Mobile Edge Computing (MEC) enables rich services in close proximity to the end users to provide high quality of experience (QoE) and contributes to energy conservation compared with local computing, but results in increased communication latency. In this paper, we investigate how to jointly optimize task offloading and resource allocation to minimize the energy consumption in an orthogonal frequency division multiple access-based MEC networks, where the time-sensitive tasks can be processed at both local users and MEC server via partial offloading. Since the optimization variables of the problem are strongly coupled, we first decompose the orignal problem into three subproblems named as offloading selection (PO ), transmission power optimization (PT ), and subcarriers and computing resource allocation (PS ), and then propose an iterative algorithm to deal with them in a sequence. To be specific, we derive the closed-form solution for PO , employ the equivalent parametric convex programming to cope with the objective function which is in the form of sum of ratios in PT , and deal with PS by an alternating way in the dual domain due to its NP-hardness. Simulation results demonstrate that the proposed algorithm outperforms the existing schemes.

preprint2020arXiv

Spin Excitations and Spin Wave Gap in the Ferromagnetic Weyl Semimetal Co$_3$Sn$_2$S$_2$

We report a comprehensive neutron scattering study on the spin excitations in the magnetic Weyl semimetal Co$_3$Sn$_2$S$_2$ with quasi-two-dimensional structure. Both in-plane and out-of-plane dispersions of the spin waves are revealed in the ferromagnetic state, similarly dispersive but damped spin excitations persist into the paramagnetic state. The effective exchange interactions have been estimated by a semi-classical Heisenberg model to consistently reproduce the experimental $T_C$ and spin stiffness. However, a full spin wave gap below $E_g=2.3$ meV is observed at $T=4$ K, much larger than the estimated magnetic anisotropy energy ($\sim0.6$ meV), while its temperature dependence indicates a significant contribution from the Weyl fermions. These results suggest that Co$_3$Sn$_2$S$_2$ is a three-dimensional correlated system with large spin stiffness, and the low-energy spin dynamics could interplay with the topological electron states.

preprint2020arXiv

Spin-controlled generation of indistinguishable and distinguishable photons from silicon vacancy centres in silicon carbide

Quantum systems combining indistinguishable photon generation and spin-based quantum information processing are essential for remote quantum applications and networking. However, identification of suitable systems in scalable platforms remains a challenge. Here, we investigate the silicon vacancy centre in silicon carbide and demonstrate controlled emission of indistinguishable and distinguishable photons via coherent spin manipulation. Using strong off-resonant excitation and collecting photons from the ultra-stable zero-phonon line optical transitions, we show a two-photon interference contrast close to 90% in Hong-Ou-Mandel type experiments. Further, we exploit the system's intimate spin-photon relation to spin-control the colour and indistinguishability of consecutively emitted photons. Our results provide a deep insight into the system's spin-phonon-photon physics and underline the potential of the industrially compatible silicon carbide platform for measurement-based entanglement distribution and photonic cluster state generation. Additional coupling to quantum registers based on recently demonstrated coupled individual nuclear spins would further allow for high-level network-relevant quantum information processing, such as error correction and entanglement purification.