Source author record

Xiaoqin Wang

Xiaoqin Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision eess.IV Machine Learning Methodology Quantitative Methods Artificial Intelligence Computation Cryptography and Security Applications Computation and Language

Catalog footprint

What is connected

14works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Binary Representation via Jointly Personalized Sparse Hashing

Unsupervised hashing has attracted much attention for binary representation learning due to the requirement of economical storage and efficiency of binary codes. It aims to encode high-dimensional features in the Hamming space with similarity preservation between instances. However, most existing methods learn hash functions in manifold-based approaches. Those methods capture the local geometric structures (i.e., pairwise relationships) of data, and lack satisfactory performance in dealing with real-world scenarios that produce similar features (e.g. color and shape) with different semantic information. To address this challenge, in this work, we propose an effective unsupervised method, namely Jointly Personalized Sparse Hashing (JPSH), for binary representation learning. To be specific, firstly, we propose a novel personalized hashing module, i.e., Personalized Sparse Hashing (PSH). Different personalized subspaces are constructed to reflect category-specific attributes for different clusters, adaptively mapping instances within the same cluster to the same Hamming space. In addition, we deploy sparse constraints for different personalized subspaces to select important features. We also collect the strengths of the other clusters to build the PSH module with avoiding over-fitting. Then, to simultaneously preserve semantic and pairwise similarities in our JPSH, we incorporate the PSH and manifold-based hash learning into the seamless formulation. As such, JPSH not only distinguishes the instances from different clusters, but also preserves local neighborhood structures within the cluster. Finally, an alternating optimization algorithm is adopted to iteratively capture analytical solutions of the JPSH model. Extensive experiments on four benchmark datasets verify that the JPSH outperforms several hashing algorithms on the similarity search task.

preprint2022arXiv

CE-based white-box adversarial attacks will not work using super-fitting

Deep neural networks are widely used in various fields because of their powerful performance. However, recent studies have shown that deep learning models are vulnerable to adversarial attacks, i.e., adding a slight perturbation to the input will make the model obtain wrong results. This is especially dangerous for some systems with high-security requirements, so this paper proposes a new defense method by using the model super-fitting state to improve the model's adversarial robustness (i.e., the accuracy under adversarial attacks). This paper mathematically proves the effectiveness of super-fitting and enables the model to reach this state quickly by minimizing unrelated category scores (MUCS). Theoretically, super-fitting can resist any existing (even future) CE-based white-box adversarial attacks. In addition, this paper uses a variety of powerful attack algorithms to evaluate the adversarial robustness of super-fitting, and the proposed method is compared with nearly 50 defense models from recent conferences. The experimental results show that the super-fitting method in this paper can make the trained model obtain the highest adversarial robustness.

preprint2022arXiv

Rethinking Classifier and Adversarial Attack

Various defense models have been proposed to resist adversarial attack algorithms, but existing adversarial robustness evaluation methods always overestimate the adversarial robustness of these models (i.e., not approaching the lower bound of robustness). To solve this problem, this paper uses the proposed decouple space method to divide the classifier into two parts: non-linear and linear. Then, this paper defines the representation vector of the original example (and its space, i.e., the representation space) and uses the iterative optimization of Absolute Classification Boundaries Initialization (ACBI) to obtain a better attack starting point. Particularly, this paper applies ACBI to nearly 50 widely-used defense models (including 8 architectures). Experimental results show that ACBI achieves lower robust accuracy in all cases.

preprint2021arXiv

Contrastive Cross-Modal Pre-Training: A General Strategy for Small Sample Medical Imaging

A key challenge in training neural networks for a given medical imaging task is often the difficulty of obtaining a sufficient number of manually labeled examples. In contrast, textual imaging reports, which are often readily available in medical records, contain rich but unstructured interpretations written by experts as part of standard clinical practice. We propose using these textual reports as a form of weak supervision to improve the image interpretation performance of a neural network without requiring additional manually labeled examples. We use an image-text matching task to train a feature extractor and then fine-tune it in a transfer learning setting for a supervised task using a small labeled dataset. The end result is a neural network that automatically interprets imagery without requiring textual reports during inference. This approach can be applied to any task for which text-image pairs are readily available. We evaluate our method on three classification tasks and find consistent performance improvements, reducing the need for labeled data by 67%-98%.

preprint2020arXiv

2D Convolutional Neural Networks for 3D Digital Breast Tomosynthesis Classification

Automated methods for breast cancer detection have focused on 2D mammography and have largely ignored 3D digital breast tomosynthesis (DBT), which is frequently used in clinical practice. The two key challenges in developing automated methods for DBT classification are handling the variable number of slices and retaining slice-to-slice changes. We propose a novel deep 2D convolutional neural network (CNN) architecture for DBT classification that simultaneously overcomes both challenges. Our approach operates on the full volume, regardless of the number of slices, and allows the use of pre-trained 2D CNNs for feature extraction, which is important given the limited amount of annotated training data. In an extensive evaluation on a real-world clinical dataset, our approach achieves 0.854 auROC, which is 28.80% higher than approaches based on 3D CNNs. We also find that these improvements are stable across a range of model configurations.

preprint2020arXiv

Hypothesis Testing of Blip Effects in Sequential Causal Inference

In this article, we study the hypothesis testing of the blip / net effects of treatments in a treatment sequence. We illustrate that the likelihood ratio test and the score test may suffer from the curse of dimensionality, the null paradox and the high-dimensional constraint on standard parameters under the null hypothesis. On the other hand, we construct the Wald test via a small number of point effects of treatments in single-point causal inference. We show that the Wald test can avoid these problems under the same assumptions as the Wald test for testing the point effect of treatment. The simulation study illustrates that the Wald test achieves the nominal level of type I error and a low level of type II error. A real medical example illustrates how to conduct the Wald test in practice.

preprint2020arXiv

Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification

Recent works have shown that deep neural networks can achieve super-human performance in a wide range of image classification tasks in the medical imaging domain. However, these works have primarily focused on classification accuracy, ignoring the important role of uncertainty quantification. Empirically, neural networks are often miscalibrated and overconfident in their predictions. This miscalibration could be problematic in any automatic decision-making system, but we focus on the medical field in which neural network miscalibration has the potential to lead to significant treatment errors. We propose a novel calibration approach that maintains the overall classification accuracy while significantly improving model calibration. The proposed approach is based on expected calibration error, which is a common metric for quantifying miscalibration. Our approach can be easily integrated into any classification task as an auxiliary loss term, thus not requiring an explicit training round for calibration. We show that our approach reduces calibration error significantly across various architectures and datasets.

preprint2020arXiv

Joint 2D-3D Breast Cancer Classification

Breast cancer is the malignant tumor that causes the highest number of cancer deaths in females. Digital mammograms (DM or 2D mammogram) and digital breast tomosynthesis (DBT or 3D mammogram) are the two types of mammography imagery that are used in clinical practice for breast cancer detection and diagnosis. Radiologists usually read both imaging modalities in combination; however, existing computer-aided diagnosis tools are designed using only one imaging modality. Inspired by clinical practice, we propose an innovative convolutional neural network (CNN) architecture for breast cancer classification, which uses both 2D and 3D mammograms, simultaneously. Our experiment shows that the proposed method significantly improves the performance of breast cancer classification. By assembling three CNN classifiers, the proposed model achieves 0.97 AUC, which is 34.72% higher than the methods using only one imaging modality.

preprint2020arXiv

Unsupervised Domain Adaptation for Mammogram Image Classification: A Promising Tool for Model Generalization

Generalization is one of the key challenges in the clinical validation and application of deep learning models to medical images. Studies have shown that such models trained on publicly available datasets often do not work well on real-world clinical data due to the differences in patient population and image device configurations. Also, manually annotating clinical images is expensive. In this work, we propose an unsupervised domain adaptation (UDA) method using Cycle-GAN to improve the generalization ability of the model without using any additional manual annotations.

preprint2015arXiv

Estimating confidence regions of common measures of (baseline, treatment effect) on dichotomous outcome of a population

In this article we estimate confidence regions of the common measures of (baseline, treatment effect) in observational studies, where the measure of baseline is baseline risk or baseline odds while the measure of treatment effect is odds ratio, risk difference, risk ratio or attributable fraction, and where confounding is controlled in estimation of both baseline and treatment effect. To avoid high complexity of the normal approximation method and the parametric or non-parametric bootstrap method, we obtain confidence regions for measures of (baseline, treatment effect) by generating approximate distributions of the ML estimates of these measures based on one logistic model.

preprint2015arXiv

Measuring and estimating interaction between exposures on dichotomous outcome of a population

In observational studies for the interaction between exposures on dichotomous outcome of a population, one usually uses one parameter of a regression model to describe the interaction, leading to one measure of the interaction. In this article, we use the conditional risk of outcome given exposures and covariates to describe the interaction and obtain five different measures for the interaction in observational studies, i.e. difference between the marginal risk differences, ratio of the marginal risk ratios, ratio of the marginal odds ratios, ratio of the conditional risk ratios, and ratio of the conditional odds ratios. By using only one regression model for the conditional risk of outcome given exposures and covariates, we obtain the maximum-likelihood estimates of all these measures. By generating approximate distributions of the maximum-likelihood estimates of these measures, we obtain interval estimates of these measures. The method is presented by studying the interaction between a therapy and the environment on eradication of Helicobacter pylori among Vietnamese children.

preprint2015arXiv

Parametric Sequential Causal Inference in Point Parametrization

Suppose that a sequence of treatments are assigned to influence an outcome of interest that occurs after the last treatment. Between treatments there exist time-dependent covariates that may be posttreatment variables of the earlier treatments and confounders of the subsequent treatments. In this article, we develop a parametric approach to inference of the causal effect of the treatment sequence on the outcome called the sequential causal effect. We construct a point parametrization for the conditional distribution of an outcome given all treatments and time-dependent covariates, in which the point parameters of interest are the point effects of treatments considered as single-point treatments. We (1) identify net effects of treatments by point effects of treatments, (2) express patterns of net effects of treatments by constraints on point effects of treatments, and (3) show that all sequential causal effects are determined by net effects of treatments. Accordingly we (1) estimate net effects of treatments through point effects of treatments by maximum likelihood, (2) improve the estimation by constraints on point effects of treatments and assignment conditions of treatments, and (3) use the estimates of net effects of treatments to obtain those of sequential causal effects. As a result, we obtain unbiased consistent maximum-likelihood estimates of sequential causal effects even for long treatment sequences. For illustration of our method, we study the causal effects of various sequences of recreational drugs on the CD4 count among HIV patients.

preprint2015arXiv

Point and interval estimation of exposure effects and interaction between the exposures based on logistic model for observational studies

In observational studies with dichotomous outcome of a population, researchers need to present the effects of exposures and interaction between the exposures jointly in order to learn the relationship between the exposure effects and the interaction. In this article we study point and interval estimation of exposure effects and the interaction based on logistic model, where the exposure effects are measured by risk differences while the interaction is measured by difference between risk differences. Using approximate normal distribution of the maximum-likelihood (ML) estimate of the model parameters, we obtain approximate non-normal distribution of the ML estimate of the exposure effects and the interaction. Using the obtained distribution, we obtain point estimate and confidence region of (exposure effect, interaction) as well as point estimate and confidence interval of the interaction when the ML estimate of an exposure effect falls into specified range. Our maximum-likelihood-based approach provides a simple but reliable method of interval estimation of exposure effects and the interaction.

preprint2014arXiv

Estimating Net Effects of Treatments in Treatment Sequence without the Assumption of Strongly Ignorable Treatment Assignment

In sequential causal inference, one estimates the causal net effect of treatment in treatment sequence on an outcome after last treatment in the presence of time-dependent covariates between treatments, improves the estimation by the untestable assumption of strongly ignorable treatment assignment, and obtains consistent but non-genuine likelihood-based estimate. In this article, we introduce the net effect of treatment as parameter for the conditional distribution of outcome given all treatments and time-dependent covariates and show that it is equal to the causal net effect of treatment under the assumption of strongly ignorable treatment assignment. As a result, we can estimate the net effect of treatment and evaluate its causal interpretation in two separate steps. The first step is fucus of this article while the second step can be accomplished by usual sensitivity analyses. We construct point parametrization for the conditional outcome distribution in which the parameters of interest are the point effects of single-point treatments. With point parametrization and without the untestable assumption, we estimate the net effect of treatment by maximum likelihood, improve the estimation by testable pattern of the net effect of treatment, and obtain unbiased consistent maximum-likelihood estimate for the net effect of treatment with finite-dimensional pattern.

Xiaoqin Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

Binary Representation via Jointly Personalized Sparse Hashing

CE-based white-box adversarial attacks will not work using super-fitting

Rethinking Classifier and Adversarial Attack

Contrastive Cross-Modal Pre-Training: A General Strategy for Small Sample Medical Imaging

2D Convolutional Neural Networks for 3D Digital Breast Tomosynthesis Classification

Hypothesis Testing of Blip Effects in Sequential Causal Inference

Improved Trainable Calibration Method for Neural Networks on Medical Imaging Classification

Joint 2D-3D Breast Cancer Classification

Unsupervised Domain Adaptation for Mammogram Image Classification: A Promising Tool for Model Generalization

Estimating confidence regions of common measures of (baseline, treatment effect) on dichotomous outcome of a population

Measuring and estimating interaction between exposures on dichotomous outcome of a population

Parametric Sequential Causal Inference in Point Parametrization

Point and interval estimation of exposure effects and interaction between the exposures based on logistic model for observational studies

Estimating Net Effects of Treatments in Treatment Sequence without the Assumption of Strongly Ignorable Treatment Assignment