Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
41works
0followers
35topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

41 published item(s)

preprint2026arXiv

Mantis: Mamba-native Tuning is Efficient for 3D Point Cloud Foundation Models

Pre-trained 3D point cloud foundation models (PFMs) have demonstrated strong transferability across diverse downstream tasks. However, full fine-tuning these models is computationally expensive and storage-intensive. Parameter-efficient fine-tuning (PEFT) offers a promising alternative, but existing PEFT approaches are primarily designed for Transformer-based backbones and rely on token-level prompting or feature transformation. Mamba-based backbones introduce a granularity mismatch between token-level adaptation and state-level sequence dynamics. Consequently, straightforward transfer of existing PEFT approaches to frozen Mamba backbones leads to substantial accuracy degradation and unstable optimization. To address this issue, we propose Mantis, the first Mamba-native PEFT framework for 3D PFMs. Specifically, a State-Aware Adapter (SAA) is introduced to inject lightweight task-conditioned control signals into selective state-space updates, enabling state-level adaptation while keeping the pre-trained backbone frozen. Moreover, different valid point cloud serializations are regularized by Dual-Serialization Consistency Distillation (DSCD), thereby reducing serialization-induced instability. Extensive experiments across multiple benchmarks demonstrate that our Mantis achieves competitive performance with only about 5% trainable parameters. Our code is available at https://github.com/gzhhhhhhh/Mantis.

preprint2024arXiv

Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification

Identifying labels that did not appear during training, known as multi-label zero-shot learning, is a non-trivial task in computer vision. To this end, recent studies have attempted to explore the multi-modal knowledge of vision-language pre-training (VLP) models by knowledge distillation, allowing to recognize unseen labels in an open-vocabulary manner. However, experimental evidence shows that knowledge distillation is suboptimal and provides limited performance gain in unseen label prediction. In this paper, a novel query-based knowledge sharing paradigm is proposed to explore the multi-modal knowledge from the pretrained VLP model for open-vocabulary multi-label classification. Specifically, a set of learnable label-agnostic query tokens is trained to extract critical vision knowledge from the input image, and further shared across all labels, allowing them to select tokens of interest as visual clues for recognition. Besides, we propose an effective prompt pool for robust label embedding, and reformulate the standard ranking learning into a form of classification to allow the magnitude of feature vectors for matching, which both significantly benefit label recognition. Experimental results show that our framework significantly outperforms state-of-the-art methods on zero-shot task by 5.9% and 4.5% in mAP on the NUS-WIDE and Open Images, respectively.

preprint2024arXiv

SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment

Multimodal alignment between language and vision is the fundamental topic in current vision-language model research. Contrastive Captioners (CoCa), as a representative method, integrates Contrastive Language-Image Pretraining (CLIP) and Image Caption (IC) into a unified framework, resulting in impressive results. CLIP imposes a bidirectional constraints on global representation of entire images and sentences. Although IC conducts an unidirectional image-to-text generation on local representation, it lacks any constraint on local text-to-image reconstruction, which limits the ability to understand images at a fine-grained level when aligned with texts. To achieve multimodal alignment from both global and local perspectives, this paper proposes Symmetrizing Contrastive Captioners (SyCoCa), which introduces bidirectional interactions on images and texts across the global and local representation levels. Specifically, we expand a Text-Guided Masked Image Modeling (TG-MIM) head based on ITC and IC heads. The improved SyCoCa can further leverage textual cues to reconstruct contextual images and visual cues to predict textual contents. When implementing bidirectional local interactions, the local contents of images tend to be cluttered or unrelated to their textual descriptions. Thus, we employ an attentive masking strategy to select effective image patches for interaction. Extensive experiments on five vision-language tasks, including image-text retrieval, image-captioning, visual question answering, and zero-shot/finetuned image classification, validate the effectiveness of our proposed method.

preprint2022arXiv

"Adversarial Examples" for Proof-of-Learning

In S&P '21, Jia et al. proposed a new concept/mechanism named proof-of-learning (PoL), which allows a prover to demonstrate ownership of a machine learning model by proving integrity of the training procedure. It guarantees that an adversary cannot construct a valid proof with less cost (in both computation and storage) than that made by the prover in generating the proof. A PoL proof includes a set of intermediate models recorded during training, together with the corresponding data points used to obtain each recorded model. Jia et al. claimed that an adversary merely knowing the final model and training dataset cannot efficiently find a set of intermediate models with correct data points. In this paper, however, we show that PoL is vulnerable to ``adversarial examples''! Specifically, in a similar way as optimizing an adversarial example, we could make an arbitrarily-chosen data point ``generate'' a given model, hence efficiently generating intermediate models with correct data points. We demonstrate, both theoretically and empirically, that we are able to generate a valid proof with significantly less cost than generating a proof by the prover.

preprint2022arXiv

Auditing Privacy Defenses in Federated Learning via Generative Gradient Leakage

Federated Learning (FL) framework brings privacy benefits to distributed learning systems by allowing multiple clients to participate in a learning task under the coordination of a central server without exchanging their private data. However, recent studies have revealed that private information can still be leaked through shared gradient information. To further protect user's privacy, several defense mechanisms have been proposed to prevent privacy leakage via gradient information degradation methods, such as using additive noise or gradient compression before sharing it with the server. In this work, we validate that the private training data can still be leaked under certain defense settings with a new type of leakage, i.e., Generative Gradient Leakage (GGL). Unlike existing methods that only rely on gradient information to reconstruct data, our method leverages the latent space of generative adversarial networks (GAN) learned from public image datasets as a prior to compensate for the informational loss during gradient degradation. To address the nonlinearity caused by the gradient operator and the GAN model, we explore various gradient-free optimization methods (e.g., evolution strategies and Bayesian optimization) and empirically show their superiority in reconstructing high-quality images from gradients compared to gradient-based optimizers. We hope the proposed method can serve as a tool for empirically measuring the amount of privacy leakage to facilitate the design of more robust defense mechanisms.

preprint2022arXiv

Conditional Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation

Token-level adaptive training approaches can alleviate the token imbalance problem and thus improve neural machine translation, through re-weighting the losses of different target tokens based on specific statistical metrics (e.g., token frequency or mutual information). Given that standard translation models make predictions on the condition of previous target contexts, we argue that the above statistical metrics ignore target context information and may assign inappropriate weights to target tokens. While one possible solution is to directly take target contexts into these statistical metrics, the target-context-aware statistical computing is extremely expensive, and the corresponding storage overhead is unrealistic. To solve the above issues, we propose a target-context-aware metric, named conditional bilingual mutual information (CBMI), which makes it feasible to supplement target context information for statistical metrics. Particularly, our CBMI can be formalized as the log quotient of the translation model probability and language model probability by decomposing the conditional joint distribution. Thus CBMI can be efficiently calculated during model training without any pre-specific statistical calculations and large storage overhead. Furthermore, we propose an effective adaptive training approach based on both the token- and sentence-level CBMI. Experimental results on WMT14 English-German and WMT19 Chinese-English tasks show our approach can significantly outperform the Transformer baseline and other related methods.

preprint2022arXiv

Demonstration of Parametric Instability suppression through optical feedback

We demonstrate the suppression of parametric instability using through optical actuation in an electro-optical feedback loop, stabilising the high order optical mode content in an 80 metre long Fabry-Perot cavity. The loop suppression of the high order mode is achieved by injecting a high order mode with the same frequency and opposite phase. Frequency matching is achieved by measuring the beat note signal between the fundamental and high order mode in transmission of the cavity and applying that signal to an electro-optical modulator to create the required frequency sideband. Spatial mode matching of the sideband to the high order mode is accomplished through the inherent mode overlap between the input injected beam and the high order mode of the cavity. The paper presents the theoretical analysis and experimental demonstration of parametric instability suppression, for an instability which would normally ring up with a parametric gain of approximately 2.5.

preprint2022arXiv

DProQ: A Gated-Graph Transformer for Protein Complex Structure Assessment

Proteins interact to form complexes to carry out essential biological functions. Computational methods have been developed to predict the structures of protein complexes. However, an important challenge in protein complex structure prediction is to estimate the quality of predicted protein complex structures without any knowledge of the corresponding native structures. Such estimations can then be used to select high-quality predicted complex structures to facilitate biomedical research such as protein function analysis and drug discovery. We challenge this significant task with DProQ, which introduces a gated neighborhood-modulating Graph Transformer (GGT) designed to predict the quality of 3D protein complex structures. Notably, we incorporate node and edge gates within a novel Graph Transformer framework to control information flow during graph message passing. We train and evaluate DProQ on four newly-developed datasets that we make publicly available in this work. Our rigorous experiments demonstrate that DProQ achieves state-of-the-art performance in ranking protein complex structures.

preprint2022arXiv

EGR: Equivariant Graph Refinement and Assessment of 3D Protein Complex Structures

Protein complexes are macromolecules essential to the functioning and well-being of all living organisms. As the structure of a protein complex, in particular its region of interaction between multiple protein subunits (i.e., chains), has a notable influence on the biological function of the complex, computational methods that can quickly and effectively be used to refine and assess the quality of a protein complex's 3D structure can directly be used within a drug discovery pipeline to accelerate the development of new therapeutics and improve the efficacy of future vaccines. In this work, we introduce the Equivariant Graph Refiner (EGR), a novel E(3)-equivariant graph neural network (GNN) for multi-task structure refinement and assessment of protein complexes. Our experiments on new, diverse protein complex datasets, all of which we make publicly available in this work, demonstrate the state-of-the-art effectiveness of EGR for atomistic refinement and assessment of protein complexes and outline directions for future work in the field. In doing so, we establish a baseline for future studies in macromolecular refinement and structure analysis.

preprint2022arXiv

Exceptional complete intersection maps of local rings

This work concerns surjective maps $φ\colon R\to S$ of commutative noetherian local rings with kernel generated by a regular sequence that is part of a minimal generating set for the maximal ideal of $R$. The main result provides criteria for detecting such exceptional complete intersection maps in terms of the lattices of thick subcategories of the derived category of complexes of finite length homology. A key input is a characterization of such maps in terms of the truncated Atiyah class of $φ$.

preprint2022arXiv

Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification

Hierarchical multi-granularity classification (HMC) assigns hierarchical multi-granularity labels to each object and focuses on encoding the label hierarchy, e.g., ["Albatross", "Laysan Albatross"] from coarse-to-fine levels. However, the definition of what is fine-grained is subjective, and the image quality may affect the identification. Thus, samples could be observed at any level of the hierarchy, e.g., ["Albatross"] or ["Albatross", "Laysan Albatross"], and examples discerned at coarse categories are often neglected in the conventional setting of HMC. In this paper, we study the HMC problem in which objects are labeled at any level of the hierarchy. The essential designs of the proposed method are derived from two motivations: (1) learning with objects labeled at various levels should transfer hierarchical knowledge between levels; (2) lower-level classes should inherit attributes related to upper-level superclasses. The proposed combinatorial loss maximizes the marginal probability of the observed ground truth label by aggregating information from related labels defined in the tree hierarchy. If the observed label is at the leaf level, the combinatorial loss further imposes the multi-class cross-entropy loss to increase the weight of fine-grained classification loss. Considering the hierarchical feature interaction, we propose a hierarchical residual network (HRN), in which granularity-specific features from parent levels acting as residual connections are added to features of children levels. Experiments on three commonly used datasets demonstrate the effectiveness of our approach compared to the state-of-the-art HMC approaches and fine-grained visual classification (FGVC) methods exploiting the label hierarchy.

preprint2022arXiv

Monolithically integrated active passive waveguide array fabricated on thin film lithium niobate using a single continuous photolithography process

We demonstrate a robust low-loss optical interface by tiling passive (i.e., without doping of active ions) thin film lithium niobate (TFLN) and active (i.e., doped with rare earth ions) TFLN substrates for monolithic integration of passive/active lithium niobate photonics. The tiled substrates composed of both active and passive areas allow to pattern the mask of the integrated active passive photonic device at once using a single continuous photolithography process. The interface loss of tiled substrate is measured as low as 0.26 dB. Thanks to the stability provided by this approach, a four-channel waveguide amplifier is realized in a straightforward manner, which shows a net gain of ~5 dB at 1550-nm wavelength and that of ~8 dB at 1530-nm wavelength for each channel. The robust low-loss optical interface for passive/active photonic integration will facilitate large-scale high performance photonic devices which require on-chip light sources and amplifiers.

preprint2022arXiv

Monolithically integrated waveguide-coupled single-frequency microlaser on erbium-doped thin film lithium niobate

We overcome the difficulty in realizing a monolithic waveguide-coupled microring laser integrated on erbium-doped thin film lithium niobate (Er: TFLN) using photolithography assisted chemo-mechanical etching (PLACE) technique. We demonstrate an integrated single-frequency microring laser operating around 1531 nm wavelength. The PLACE technique, enabling integrated Er: TFLN photonics with low propagation loss, can thus be used to realize low cost mass production of monolithic on-chip microlasers with applications ranging from optical communication and photonic integrated circuit (PIC) to precision metrology and large-scale sensing.

preprint2022arXiv

Multi-Forgery Detection Challenge 2022: Push the Frontier of Unconstrained and Diverse Forgery Detection

In this paper, we present the Multi-Forgery Detection Challenge held concurrently with the IEEE Computer Society Workshop on Biometrics at CVPR 2022. Our Multi-Forgery Detection Challenge aims to detect automatic image manipulations including but not limited to image editing, image synthesis, image generation, image photoshop, etc. Our challenge has attracted 674 teams from all over the world, with about 2000 valid result submission counts. We invited the Top 10 teams to present their solutions to the challenge, from which three teams are awarded prizes in the grand finale. In this paper, we present the solutions from the Top 3 teams, in order to boost the research work in the field of image forgery detection.

preprint2022arXiv

Multi-parametric Analysis for Mixed Integer Linear Programming: An Application to Transmission Planning and Congestion Control

Enhancing existing transmission lines is a useful tool to combat transmission congestion and guarantee transmission security with increasing demand and boosting the renewable energy source. This study concerns the selection of lines whose capacity should be expanded and by how much from the perspective of independent system operator (ISO) to minimize the system cost with the consideration of transmission line constraints and electricity generation and demand balance conditions, and incorporating ramp-up and startup ramp rates, shutdown ramp rates, ramp-down rate limits and minimum up and minimum down times. For that purpose, we develop the ISO unit commitment and economic dispatch model and show it as a right-hand side uncertainty multiple parametric analysis for the mixed integer linear programming (MILP) problem. We first relax the binary variable to continuous variables and employ the Lagrange method and Karush-Kuhn-Tucker conditions to obtain optimal solutions (optimal decision variables and objective function) and critical regions associated with active and inactive constraints. Further, we extend the traditional branch and bound method for the large-scale MILP problem by determining the upper bound of the problem at each node, then comparing the difference between the upper and lower bounds and reaching the approximate optimal solution within the decision makers' tolerated error range. In additional, the objective function's first derivative on the parameters of each line is used to inform the selection of lines to ease congestion and maximize social welfare. Finally, the amount of capacity upgrade will be chosen by balancing the cost-reduction rate of the objective function on parameters and the cost of the line upgrade. Our findings are supported by numerical simulation and provide transmission line planners with decision-making guidance.

preprint2022arXiv

Negative Zero-Point-Energy Parameter in the Meyer-Miller Mapping Model for Nonadiabatic Dynamics

The celebrated Meyer-Miller mapping model has been a useful approach for generating practical trajectory-based nonadiabatic dynamics methods. It is generally assumed that the zero-point-energy (ZPE) parameter is positive. The constraint implied in the conventional Meyer-Miller mapping Hamiltonian for an F-electronic-state system actually requires that parameter γis larger than -1/F for the ZPE parameter for each electronic degree of freedom. Both negative and positive values are possible for such a parameter. We first establish a rigorous formulation to construct exact mapping models in the Cartesian phase space when the constraint is applied. When nuclear dynamics is approximated by the linearized semiclassical initial value representation, a negative ZPE parameter could lead to reasonably good performance in describing dynamic behaviors in typical spin-boson models for condensed-phase two-state systems, even at challenging zero temperature.

preprint2022arXiv

New Phase Space Formulations and Quantum Dynamics Approaches

We report recent progress on the phase space formulation of quantum mechanics with coordinate-momentum variables, focusing more on new theory of (weighted) constraint coordinate-momentum phase space for discrete-variable quantum systems. This leads to a general coordinate-momentum phase space formulation of composite quantum systems, where conventional representations on infinite phase space are employed for continuous variables. It is convenient to utilize (weighted) constraint coordinate-momentum phase space for representing the quantum state and describing nonclassical features. Various numerical tests demonstrate that new trajectory-based quantum dynamics approaches derived from the (weighted) constraint phase space representation are useful and practical for describing dynamical processes of composite quantum systems in gas phase as well as in condensed phase.

preprint2022arXiv

On the Cayley-persistence algebra

In this paper, we introduce a persistent (co)homology theory for Cayley digraph grading. We give the algebraic structures of Cayley-persistence object. Specifically, we consider the module structure of persistent (co)homology and show the decomposition of a finitely generated Cayley-persistence module. Moreover, we introduce the persistence-cup product on the Cayley-persistence module and study the twisted structure with respect to the persistence-cup product. As an application on manifolds, we show that the persistent (co)homology is closely related to the persistent map of fundamental classes.

preprint2022arXiv

On-chip integrated Yb3+-doped waveguide amplifiers on thin film lithium niobate

We report the fabrication and optical characterization of Yb3+-doped waveguide amplifiers (YDWA) on the thin film lithium niobate fabricated by photolithography assisted chemo-mechanical etching. The fabricated Yb3+-doped lithium niobate waveguides demonstrates low propagation loss of 0.13 dB/cm at 1030 nm and 0.1 dB/cm at 1060 nm. The internal net gain of 5 dB at 1030 nm and 8 dB at 1060 nm are measured on a 4.0 cm long waveguide pumped by 976nm laser diodes, indicating the gain per unit length of 1.25 dB/cm at 1030 nm and 2 dB/cm at 1060 nm, respectively. The integrated Yb3+-doped lithium niobate waveguide amplifiers will benefit the development of a powerful gain platform and are expected to contribute to the high-density integration of thin film lithium niobate based photonic chip.

preprint2022arXiv

Radiation hardness study on a CMOS pixel sensor for charged particle tracking

A CMOS pixel sensor, named Supix-1, is developed for a pixelated silicon tracker for the Circular Electron-Positron Collider (CEPC) project. The sensor, consisted of nine sectors varying in pixel sizes, diode sizes and geometries, is fabricated with a 180 nm CMOS Image Sensor (CIS) process to study the particle detection performance of enlarged pixels. In this work, the radiation-induced effects on the charge collection of the sensor under the fluence of 1 $\times$ 10^13 1 MeV neq/cm^2 are studied by the measurements with the radioactive source of Fe-55 and the Technology Computer Aided Design (TCAD) simulations, since the radiation hardness of 6.8 $\times$ 10^12 1 MeV neq/cm^2 per year for Non-Ionizing Energy Loss (NIEL) effects is required. In measurements, the sensor gain has been calibrated using the k-$α$ peak of Fe-55 before and after irradiation. The pixel-wise equivalent noise charge (ENC), charge collection efficiency (CCE) and signal-to-noise ratio (SNR) were evaluated. The radiation-induced effects on cluster properties are studied through a self-developed reconstruction algorithm. In TCAD simulations, charge collections in 5 $\times$ 5 pixel matrixes for two typical impinging cases of incident particles were simulated with and without irradiation. Both measurements and simulations indicate that enlarged pixels with area of 21 $μ$m $\times$ 84 $μ$m, though suffering greater loss on sensor performance than small pixels do, still have satisfactory noise and charge collection performance after irradiation for particle tracking in the upcoming collider detectors.

preprint2022arXiv

RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN

Recently backdoor attack has become an emerging threat to the security of deep neural network (DNN) models. To date, most of the existing studies focus on backdoor attack against the uncompressed model; while the vulnerability of compressed DNNs, which are widely used in the practical applications, is little exploited yet. In this paper, we propose to study and develop Robust and Imperceptible Backdoor Attack against Compact DNN models (RIBAC). By performing systematic analysis and exploration on the important design knobs, we propose a framework that can learn the proper trigger patterns, model parameters and pruning masks in an efficient way. Thereby achieving high trigger stealthiness, high attack success rate and high model efficiency simultaneously. Extensive evaluations across different datasets, including the test against the state-of-the-art defense mechanisms, demonstrate the high robustness, stealthiness and model efficiency of RIBAC. Code is available at https://github.com/huyvnphan/ECCV2022-RIBAC

preprint2022arXiv

Spatiotemporal 2-D Channel Coding for Very Low Latency Reliable MIMO Transmission

To fully support vertical industries, 5G and its corresponding channel coding are expected to meet requirements of different applications. However, for applications of 5G and beyond 5G (B5G) such as URLLC, the transmission latency is required to be much shorter than that in eMBB. Therefore, the resulting channel code length reduces drastically. In this case, the traditional 1-D channel coding suffers a lot from the performance degradation and fails to deliver strong reliability with very low latency. To remove this bottleneck, new channel coding scheme beyond the existing 1-D one is in urgent need. By making full use of the spacial freedom of massive MIMO systems, this paper devotes itself in proposing a spatiotemporal 2-D channel coding for very low latency reliable transmission. For a very short time-domain code length $N^{\text{time}}=16$, $64 \times 128$ MIMO system employing the proposed spatiotemporal 2-D coding scheme successfully shows more than $3$\,dB performance gain at $\text{FER}=10^{-3}$, compared to the 1-D time-domain channel coding. It is noted that the proposed coding scheme is suitable for different channel codes and enjoys high flexibility to adapt to difference scenarios. By appropriately selecting the code rate, code length, and the number of codewords in the time and space domains, the proposed coding scheme can achieve a good trade-off between the transmission latency and reliability.

preprint2022arXiv

Unified Formulation of Phase Space Mapping Approaches for Nonadiabatic Quantum Dynamics

Nonadiabatic dynamical processes are one of the most important quantum mechanical phenomena in chemical, materials, biological, and environmental molecular systems, where the coupling between different electronic states is either inherent in the molecular structure or induced by the (intense) external field. The curse of dimensionality indicates the intractable exponential scaling of calculation effort with system size and restricts the implementation of numerically exact approaches for realistic large systems. The phase space formulation of quantum mechanics offers an important theoretical framework for constructing practical approximate trajectory-based methods for quantum dynamics. This Account reviews our recent progress in phase space mapping theory: a unified framework for constructing the mapping Hamiltonian on phase space for coupled F-state systems where the renowned Meyer-Miller Hamiltonian model is a special case, a general phase space formulation of quantum mechanics for nonadiabatic systems where the electronic degrees of freedom are mapped onto constraint space and the nuclear degrees of freedom are mapped onto infinite space, and an isomorphism between the mapping phase space approach for nonadiabatic systems and that for nonequilibrium electron transport processes.

preprint2022arXiv

VPNets: Volume-preserving neural networks for learning source-free dynamics

We propose volume-preserving networks (VPNets) for learning unknown source-free dynamical systems using trajectory data. We propose three modules and combine them to obtain two network architectures, coined R-VPNet and LA-VPNet. The distinct feature of the proposed models is that they are intrinsic volume-preserving. In addition, the corresponding approximation theorems are proved, which theoretically guarantee the expressivity of the proposed VPNets to learn source-free dynamics. The effectiveness, generalization ability and structure-preserving property of the VP-Nets are demonstrated by numerical experiments.

preprint2022arXiv

Watermark Vaccine: Adversarial Attacks to Prevent Watermark Removal

As a common security tool, visible watermarking has been widely applied to protect copyrights of digital images. However, recent works have shown that visible watermarks can be removed by DNNs without damaging their host images. Such watermark-removal techniques pose a great threat to the ownership of images. Inspired by the vulnerability of DNNs on adversarial perturbations, we propose a novel defence mechanism by adversarial machine learning for good. From the perspective of the adversary, blind watermark-removal networks can be posed as our target models; then we actually optimize an imperceptible adversarial perturbation on the host images to proactively attack against watermark-removal networks, dubbed Watermark Vaccine. Specifically, two types of vaccines are proposed. Disrupting Watermark Vaccine (DWV) induces to ruin the host image along with watermark after passing through watermark-removal networks. In contrast, Inerasable Watermark Vaccine (IWV) works in another fashion of trying to keep the watermark not removed and still noticeable. Extensive experiments demonstrate the effectiveness of our DWV/IWV in preventing watermark removal, especially on various watermark removal networks.

preprint2021arXiv

CelebA-Spoof Challenge 2020 on Face Anti-Spoofing: Methods and Results

As facial interaction systems are prevalently deployed, security and reliability of these systems become a critical issue, with substantial research efforts devoted. Among them, face anti-spoofing emerges as an important area, whose objective is to identify whether a presented face is live or spoof. Recently, a large-scale face anti-spoofing dataset, CelebA-Spoof which comprised of 625,537 pictures of 10,177 subjects has been released. It is the largest face anti-spoofing dataset in terms of the numbers of the data and the subjects. This paper reports methods and results in the CelebA-Spoof Challenge 2020 on Face AntiSpoofing which employs the CelebA-Spoof dataset. The model evaluation is conducted online on the hidden test set. A total of 134 participants registered for the competition, and 19 teams made valid submissions. We will analyze the top ranked solutions and present some discussion on future work directions.

preprint2021arXiv

Enabling Fast and Universal Audio Adversarial Attack Using Generative Model

Recently, the vulnerability of DNN-based audio systems to adversarial attacks has obtained the increasing attention. However, the existing audio adversarial attacks allow the adversary to possess the entire user's audio input as well as granting sufficient time budget to generate the adversarial perturbations. These idealized assumptions, however, makes the existing audio adversarial attacks mostly impossible to be launched in a timely fashion in practice (e.g., playing unnoticeable adversarial perturbations along with user's streaming input). To overcome these limitations, in this paper we propose fast audio adversarial perturbation generator (FAPG), which uses generative model to generate adversarial perturbations for the audio input in a single forward pass, thereby drastically improving the perturbation generation speed. Built on the top of FAPG, we further propose universal audio adversarial perturbation generator (UAPG), a scheme crafting universal adversarial perturbation that can be imposed on arbitrary benign audio input to cause misclassification. Extensive experiments show that our proposed FAPG can achieve up to 167X speedup over the state-of-the-art audio adversarial attack methods. Also our proposed UAPG can generate universal adversarial perturbation that achieves much better attack performance than the state-of-the-art solutions.

preprint2021arXiv

Proposal for measuring Newtonian constant of gravitation at an exceptional point in an optomechanical system

We develop a quantum mechanical method of measuring the Newtonian constant of gravitation, G. In this method, an optomechanical system consisting of two cavities and two membrane resonators is used. The added source mass would induce the shifts of the eigenfrequencies of the supermodes. Via detecting the shifts, we can perform our measurement of G. Furthermore, our system can features exceptional point (EP) which are branch point singularities of the spectrum and eigenfunctions. In the paper, we demonstrate that operating the system at EP can enhance our measurement of G. In addition, we derive the relationship between EP enlarged eigenfrequency shift and the Newtonian constant. This work provides a way to engineer EP-assisted optomechanical devices for applications in the field of precision measurement of G

preprint2020arXiv

Adversarial Training Based Multi-Source Unsupervised Domain Adaptation for Sentiment Analysis

Multi-source unsupervised domain adaptation (MS-UDA) for sentiment analysis (SA) aims to leverage useful information in multiple source domains to help do SA in an unlabeled target domain that has no supervised information. Existing algorithms of MS-UDA either only exploit the shared features, i.e., the domain-invariant information, or based on some weak assumption in NLP, e.g., smoothness assumption. To avoid these problems, we propose two transfer learning frameworks based on the multi-source domain adaptation methodology for SA by combining the source hypotheses to derive a good target hypothesis. The key feature of the first framework is a novel Weighting Scheme based Unsupervised Domain Adaptation framework (WS-UDA), which combine the source classifiers to acquire pseudo labels for target instances directly. While the second framework is a Two-Stage Training based Unsupervised Domain Adaptation framework (2ST-UDA), which further exploits these pseudo labels to train a target private extractor. Importantly, the weights assigned to each source classifier are based on the relations between target instances and source domains, which measured by a discriminator through the adversarial training. Furthermore, through the same discriminator, we also fulfill the separation of shared features and private features. Experimental results on two SA datasets demonstrate the promising performance of our frameworks, which outperforms unsupervised state-of-the-art competitors.

preprint2020arXiv

Comprehensive Control of Metamagnetic Transition of Antiferromagnetic Mott Insulator Sr2IrO4 by in-situ Anisotropic Strain

Metamagnetism in antiferromagnets exhibits distinct critical behaviors and dynamics when invoking spin reversal and rotation. Here we show a 0.05% anisotropic strain suffices to in-situ modulate the metamagnetic critical field of the Mott insulator Sr2IrO4 by over 50%, enabling electrical switching of the transition. Resonant x-ray scattering and model simulation reveal that the transition is completely tuned from the spin-flop to spin-flip type as the strain introduces C4-symmetry-breaking magnetic anisotropy. Simultaneous transport study indicates the metamagnetic responses are reflected in the large elasto- and magnetoconductance, highlighting the active charge degree of freedom in the spin-orbit-coupled Mott state and its potential for spin-electronics.

preprint2020arXiv

Evolution of clustering structure through the momentum distributions in $^{8-10}$Be isotopes

We investigate the evolution of clustering structure through the momentum distributions in the $^{8-10}$Be isotopes. The nucleon dynamics within the inter-cluster antisymmetrization are discussed via the momentum distribution of a Brink type $α$-$α$ wave function. For the state with a small $α$-$α$ distance, we observe a significant depression with a dip structure at zero-momentum and an enhanced tail at relatively higher momentum region. In addition, we find the "cluster structure" in the intrinsic frame of momentum space, which is complementary to its significant $α$-cluster dissolution in the coordinate space because of the strong antisymmetrization. For the physical $^{8-10}$Be isotopes, the Tohsaki-Horiuchi-Schuck-R{ö}pke (THSR) wave functions are adopted. The evolution from the dilute clustering state to the compact one is demonstrated by a successive depression at the zero-momentum of nucleon distribution for the two $α$-clusters within $^{8-10}$Be isotopes. For the compact $^{10}$Be nucleus, the momentum distribution of all nucleons shows significant depression at zero-momentum with a dip structure, which is found to be contributed by both the inter-cluster antisymmetrization and the $p$-orbit occupation of the valence neutrons. This study proposes a new window for the investigations of the $α$-clustering effects via the low-momentum components of nuclei, which is expected to be extended to the heavier nuclear clustering states.

preprint2020arXiv

Gate-tunable van der Waals heterostructure for reconfigurable neural network vision sensor

Early processing of visual information takes place in the human retina. Mimicking neurobiological structures and functionalities of the retina provide a promising pathway to achieving vision sensor with highly efficient image processing. Here, we demonstrate a prototype vision sensor that operates via the gate-tunable positive and negative photoresponses of the van der Waals (vdW) vertical heterostructures. The sensor emulates not only the neurobiological functionalities of bipolar cells and photoreceptors but also the unique synaptic connectivity between bipolar cells and photoreceptors. By tuning gate voltage for each pixel, we achieve reconfigurable vision sensor for simultaneously image sensing and processing. Furthermore, our prototype vision sensor itself can be trained to classify the input images, via updating the gate voltages applied individually to each pixel in the sensor. Our work indicates that vdW vertical heterostructures offer a promising platform for the development of neural network vision sensor.

preprint2020arXiv

Gravitational waves detection with exceptional points in micro cavities

Here we propose a new gravitational waves(GWs) detector in broad frequency band, which is operated at exceptional points(EPs) in micro cavities. The detected signal is an eigenfrequency split of the mechanical modes caused by the spatial strain. Due to the complex square root topology near the EP, the splitting is greatly enhanced for sufficiently small perturbations. Compared to current strategies, it can be achieved at the room temperature and has advantages in micro device scale, wide frequency band and higher sensitivity.

preprint2020arXiv

LogiQA: A Challenge Dataset for Machine Reading Comprehension with Logical Reasoning

Machine reading is a fundamental task for testing the capability of natural language understanding, which is closely related to human cognition in many aspects. With the rising of deep learning techniques, algorithmic models rival human performances on simple QA, and thus increasingly challenging machine reading datasets have been proposed. Though various challenges such as evidence integration and commonsense knowledge have been integrated, one of the fundamental capabilities in human reading, namely logical reasoning, is not fully investigated. We build a comprehensive dataset, named LogiQA, which is sourced from expert-written questions for testing human Logical reasoning. It consists of 8,678 QA instances, covering multiple types of deductive reasoning. Results show that state-of-the-art neural models perform by far worse than human ceiling. Our dataset can also serve as a benchmark for reinvestigating logical AI under the deep learning NLP setting. The dataset is freely available at https://github.com/lgw863/LogiQA-dataset

preprint2020arXiv

Real-time, Universal, and Robust Adversarial Attacks Against Speaker Recognition Systems

As the popularity of voice user interface (VUI) exploded in recent years, speaker recognition system has emerged as an important medium of identifying a speaker in many security-required applications and services. In this paper, we propose the first real-time, universal, and robust adversarial attack against the state-of-the-art deep neural network (DNN) based speaker recognition system. Through adding an audio-agnostic universal perturbation on arbitrary enrolled speaker's voice input, the DNN-based speaker recognition system would identify the speaker as any target (i.e., adversary-desired) speaker label. In addition, we improve the robustness of our attack by modeling the sound distortions caused by the physical over-the-air propagation through estimating room impulse response (RIR). Experiment using a public dataset of 109 English speakers demonstrates the effectiveness and robustness of our proposed attack with a high attack success rate of over 90%. The attack launching time also achieves a 100X speedup over contemporary non-universal attacks.

preprint2020arXiv

Review and Examination of Input Feature Preparation Methods and Machine Learning Models for Turbulence Modeling

Model extrapolation to unseen flow is one of the biggest challenges facing data-driven turbulence modeling, especially for models with high dimensional inputs that involve many flow features. In this study we review previous efforts on data-driven Reynolds-Averaged Naiver Stokes (RANS) turbulence modeling and model extrapolation, with main focus on the popular methods being used in the field of transfer learning. Several potential metrics to measure the dissimilarity between training flows and testing flows are examined. Different Machine Learning (ML) models are compared to understand how the capacity or complexity of the model affects its behavior in the face of dataset shift. Data preprocessing schemes which are robust to covariate shift, like normalization, transformation, and importance re-weighted likelihood, are studied to understand whether it is possible to find projections of the data that attenuate the differences in the training and test distributions while preserving predictability. Three metrics are proposed to assess the dissimilarity between training/testing dataset. To attenuate the dissimilarity, a distribution matching framework is used to align the statistics of the distributions. These modifications also allow the regression tasks to have better accuracy in forecasting under-represented extreme values of the target variable. These findings are useful for future ML based turbulence models to evaluate their model predictability and provide guidance to systematically generate diversified high-fidelity simulation database.

preprint2020arXiv

Strain-modulated Slater-Mott crossover of pseudospin-half square-lattice in (SrIrO3)1/ (SrTiO3)1 superlattices

We report on the epitaxial strain-driven electronic and antiferromagnetic modulations of a pseudospin-half square lattice realized in superlattices of (SrIrO3)1/(SrTiO3)1. With increasing compressive strain, we find the low-temperature insulating behavior to be strongly suppressed with a corresponding systematic reduction of both the Neel temperature and the staggered moment. However, despite such a suppression, the system remains weakly insulating above the Neel transition. The emergence of metallicity is observed under large compressive strain but only at temperatures far above the Néel transition. These behaviors are characteristics of the Slater-Mott crossover regime, providing a unique experimental model system of the spin-half Hubbard Hamiltonian with a tunable intermediate coupling strength.

preprint2019arXiv

Drastic suppression of superconducting $T_{c}$ by anisotropic strain near a nematic quantum critical point

High temperature superconductivity emerges in the vicinity of competing strongly correlated phases. In the iron-based superconductor $Ba(Fe_{1-x}Co_{x})_{2}As_{2}$, the superconducting state shares the composition-temperature phase diagram with an electronic nematic phase and an antiferromagnetic phase that break the crystalline rotational symmetry. Symmetry considerations suggest that anisotropic strain can enhance these competing phases and thus suppress the superconductivity. Here we study the effect of anisotropic strain on the superconducting transition in single crystals of $Ba(Fe_{1-x}Co_{x})_{2}As_{2}$ through electrical transport, magnetic susceptibility, and x-ray diffraction measurements. We find that in the underdoped and near-optimally doped regions of the phase diagram, the superconducting critical temperature is rapidly suppressed by both compressive and tensile stress, and in the underdoped case this suppression is enough to induce a strain-tuned superconductor to metal quantum phase transition.

preprint2019arXiv

Epitaxial growth and antiferromagnetism of Sn-substituted perovskite iridate SrIr$_{0.8}$Sn$_{0.2}$O$_3$

5d iridates have shown vast emergent phenomena due to a strong interplay among its lattice, charge and spin degrees of freedom, because of which the potential in spintronic application of the thin-film form is highly leveraged. Here we have epitaxially stabilized perovskite SrIr$_{0.8}$Sn$_{0.2}$O$_3$ on [001] SrTiO$_3$ substrates through pulsed laser deposition and systematically characterized the structural, electronic and magnetic properties. Physical properties measurements unravel an insulating ground state with a weak ferromagnetism in the compressively strained epitaxial film. The octahedral rotation pattern is identified by synchrotron x-ray diffraction, resolving a mix of $a^+b^-c^-$ and $a^-b^+c^-$ domains. X-ray magnetic resonant scattering directly demonstrates a G-type antiferromagnetic structure of the magnetic order and the spin canting nature of the weak ferromagnetism.

preprint2017arXiv

Universal Scaling in Intrinsic Resistivity of Two-Dimensional Metal Borophene

Two-dimensional boron sheets (borophenes) have been successfully synthesized in experiments and are expected to exhibit intriguing transport properties such as the emergence of superconductivity and Dirac Fermions. However, quantitative understanding of intrinsic electrical transport of borophene has not been achieved. Here, we report a comprehensive first-principles study on electron-phonon driven intrinsic electrical resistivity (\r{ho}) of emerging borophene structures. We find that the resistivity is highly dependent on the atomic structures and electron density of borophene. Low-temperature resistivity of borophene \r{ho} exhibits a universal scaling behavior, which increases rapidly with temperature T (\r{ho}~T^4), while \r{ho} increases linearly for a large temperature window T > 100 K. It is observed that this universal behavior of intrinsic resistivity is well described by Bloch-Grünesisen model. Different from graphene and conventional three-dimensional metals, the intrinsic resistivity of borophenes can be easily tuned by adjusting carrier densities while the Bloch-Grünesisen temperature is nearly fixed at ~100 K. Our work suggests monolayer boron can serve as an intriguing platform for realizing high-tunable two-dimensional electronic devices.