Source author record

Weiwei Pan

Weiwei Pan appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning cond-mat.mes-hall cond-mat.mtrl-sci

Catalog footprint

What is connected

10works

3topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Failure Modes of Variational Autoencoders and Their Effects on Downstream Tasks

Variational Auto-encoders (VAEs) are deep generative latent variable models that are widely used for a number of downstream tasks. While it has been demonstrated that VAE training can suffer from a number of pathologies, existing literature lacks characterizations of exactly when these pathologies occur and how they impact downstream task performance. In this paper, we concretely characterize conditions under which VAE training exhibits pathologies and connect these failure modes to undesirable effects on specific downstream tasks, such as learning compressed and disentangled representations, adversarial robustness, and semi-supervised learning.

preprint2022arXiv

Policy Optimization with Sparse Global Contrastive Explanations

We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes. Our goal is to make minimal changes while gaining as much benefit as possible. We define a minimal change as having a sparse, global contrastive explanation between the original and proposed policy. We improve the current policy with the constraint of keeping that global contrastive explanation short. We demonstrate our framework with a discrete MDP and a continuous 2D navigation domain.

preprint2022arXiv

Success of Uncertainty-Aware Deep Models Depends on Data Manifold Geometry

For responsible decision making in safety-critical settings, machine learning models must effectively detect and process edge-case data. Although existing works show that predictive uncertainty is useful for these tasks, it is not evident from literature which uncertainty-aware models are best suited for a given dataset. Thus, we compare six uncertainty-aware deep learning models on a set of edge-case tasks: robustness to adversarial attacks as well as out-of-distribution and adversarial detection. We find that the geometry of the data sub-manifold is an important factor in determining the success of various models. Our finding suggests an interesting direction in the study of uncertainty-aware deep learning models.

preprint2022arXiv

Wide Mean-Field Bayesian Neural Networks Ignore the Data

Bayesian neural networks (BNNs) combine the expressive power of deep learning with the advantages of Bayesian formalism. In recent years, the analysis of wide, deep BNNs has provided theoretical insight into their priors and posteriors. However, we have no analogous insight into their posteriors under approximate inference. In this work, we show that mean-field variational inference entirely fails to model the data when the network width is large and the activation function is odd. Specifically, for fully-connected BNNs with odd activation functions and a homoscedastic Gaussian likelihood, we show that the optimal mean-field variational posterior predictive (i.e., function space) distribution converges to the prior predictive distribution as the width tends to infinity. We generalize aspects of this result to other likelihoods. Our theoretical results are suggestive of underfitting behavior previously observered in BNNs. While our convergence bounds are non-asymptotic and constants in our analysis can be computed, they are currently too loose to be applicable in standard training regimes. Finally, we show that the optimal approximate posterior need not tend to the prior if the activation function is not odd, showing that our statements cannot be generalized arbitrarily.

preprint2020arXiv

BaCOUn: Bayesian Classifers with Out-of-Distribution Uncertainty

Traditional training of deep classifiers yields overconfident models that are not reliable under dataset shift. We propose a Bayesian framework to obtain reliable uncertainty estimates for deep classifiers. Our approach consists of a plug-in "generator" used to augment the data with an additional class of points that lie on the boundary of the training data, followed by Bayesian inference on top of features that are trained to distinguish these "out-of-distribution" points.

preprint2020arXiv

Characterizing and Avoiding Problematic Global Optima of Variational Autoencoders

Variational Auto-encoders (VAEs) are deep generative latent variable models consisting of two components: a generative model that captures a data distribution p(x) by transforming a distribution p(z) over latent space, and an inference model that infers likely latent codes for each data point (Kingma and Welling, 2013). Recent work shows that traditional training methods tend to yield solutions that violate modeling desiderata: (1) the learned generative model captures the observed data distribution but does so while ignoring the latent codes, resulting in codes that do not represent the data (e.g. van den Oord et al. (2017); Kim et al. (2018)); (2) the aggregate of the learned latent codes does not match the prior p(z). This mismatch means that the learned generative model will be unable to generate realistic data with samples from p(z)(e.g. Makhzani et al. (2015); Tomczak and Welling (2017)). In this paper, we demonstrate that both issues stem from the fact that the global optima of the VAE training objective often correspond to undesirable solutions. Our analysis builds on two observations: (1) the generative model is unidentifiable - there exist many generative models that explain the data equally well, each with different (and potentially unwanted) properties and (2) bias in the VAE objective - the VAE objective may prefer generative models that explain the data poorly but have posteriors that are easy to approximate. We present a novel inference method, LiBI, mitigating the problems identified in our analysis. On synthetic datasets, we show that LiBI can learn generative models that capture the data distribution and inference models that better satisfy modeling assumptions when traditional methods struggle to do so.

preprint2020arXiv

Ensembles of Locally Independent Prediction Models

Ensembles depend on diversity for improved performance. Many ensemble training methods, therefore, attempt to optimize for diversity, which they almost always define in terms of differences in training set predictions. In this paper, however, we demonstrate the diversity of predictions on the training set does not necessarily imply diversity under mild covariate shift, which can harm generalization in practical settings. To address this issue, we introduce a new diversity metric and associated method of training ensembles of models that extrapolate differently on local patches of the data manifold. Across a variety of synthetic and real-world tasks, we find that our method improves generalization and diversity in qualitatively novel ways, especially under data limits and covariate shift.

preprint2016arXiv

An Empirical Comparison of Sampling Quality Metrics: A Case Study for Bayesian Nonnegative Matrix Factorization

In this work, we empirically explore the question: how can we assess the quality of samples from some target distribution? We assume that the samples are provided by some valid Monte Carlo procedure, so we are guaranteed that the collection of samples will asymptotically approximate the true distribution. Most current evaluation approaches focus on two questions: (1) Has the chain mixed, that is, is it sampling from the distribution? and (2) How independent are the samples (as MCMC procedures produce correlated samples)? Focusing on the case of Bayesian nonnegative matrix factorization, we empirically evaluate standard metrics of sampler quality as well as propose new metrics to capture aspects that these measures fail to expose. The aspect of sampling that is of particular interest to us is the ability (or inability) of sampling methods to move between multiple optima in NMF problems. As a proxy, we propose and study a number of metrics that might quantify the diversity of a set of NMF factorizations obtained by a sampler through quantifying the coverage of the posterior distribution. We compare the performance of a number of standard sampling methods for NMF in terms of these new metrics.

preprint2011arXiv

Effect of Zn substitution on morphology and magnetic properties of copper ferrite nanofibers

Spinel ferrite Cu1-xZnxFe2O4 nanofibers over a compositional range 0 < x < 1 were prepared by electrospinning combined with sol-gel method. The influence of Zn2+ ions substitution on morphology, structure, and magnetic properties of copper ferrite has been investigated. The results show that surface of CuFe2O4 nanofibers consists of small open porosity, while surface of doped nanofibers reveals smooth and densified nature. With increasing Zn substitution, saturation magnetization initially increases and then decreases with a maximum value of 58.4 emu/g at x = 0.4, coercivity and square ratio all decrease. The influence of substitution on magnetic properties is related with the cation distraction and exchange interactions between spinel lattices.

preprint2011arXiv

Microstructure and magnetic anisotropy of electrospun Cu$_{1-x}$Zn$_x$Fe$_2$O$_4$ nanofibers: A local probe study

Understanding the phenomena at the nanometer scale is of fundamental importance for future improvements of desired properties of nanomaterials. We report a detailed investigation of the microstructure and the resulting magnetic anisotropy by magnetic, transmission electron microscope (TEM) and Mössbauer measurements of the electrospun Cu$_{1-x}$Zn$_x$Fe$_2$O$_4$ nanofibers. Our results show that the electrospun Cu$_{1-x}$Zn$_x$Fe$_2$O$_4$ nanofibers exhibit nearly isotropic magnetic anisotropy. TEM measurements indicate that the nanofibers are composed of loosely connected and randomly aligned nanograins. As revealed by the Henkel plot, these nanofibers and the nanograins within the nanofibers are dipolar coupled, which reduces the effective shape anisotropy leading to a nearly random configuration of the magnetic moments inside the nanofibers, hence, the observed nearly isotropic magnetic anisotropy can be easily understood.

Weiwei Pan

What is connected

Connect this record

See the researcher in context

Building this map preview

10 published item(s)

Failure Modes of Variational Autoencoders and Their Effects on Downstream Tasks

Policy Optimization with Sparse Global Contrastive Explanations

Success of Uncertainty-Aware Deep Models Depends on Data Manifold Geometry

Wide Mean-Field Bayesian Neural Networks Ignore the Data

BaCOUn: Bayesian Classifers with Out-of-Distribution Uncertainty

Characterizing and Avoiding Problematic Global Optima of Variational Autoencoders

Ensembles of Locally Independent Prediction Models

An Empirical Comparison of Sampling Quality Metrics: A Case Study for Bayesian Nonnegative Matrix Factorization

Effect of Zn substitution on morphology and magnetic properties of copper ferrite nanofibers

Microstructure and magnetic anisotropy of electrospun Cu$_{1-x}$Zn$_x$Fe$_2$O$_4$ nanofibers: A local probe study