Source author record

Bei Wang

Bei Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Catalog footprint

What is connected

22 works

27 topics

4 close collaborators

preprint, 2026, arXiv

pMSz: A Distributed Parallel Algorithm for Correcting Extrema and Morse Smale Segmentations in Lossy Compression

Lossy compression, widely used by scientists to reduce data from simulations, experiments, and observations, can distort features of interest even under bounded error. Such distortions may compromise downstream analyses and lead to incorrect scientific conclusions in applications such as combustion and cosmology. This paper presents a distributed and parallel algorithm for correcting topological features, specifically, piecewise linear Morse-Smale segmentations (PLMSS), which decompose the domain into monotone regions labeled by their corresponding local minima and maxima. While a single-GPU algorithm (MSz) exists for PLMSS correction after compression, no methodology has been developed that scales beyond a single GPU for extreme-scale data. We identify the key bottleneck in scaling PLMSS correction as the parallel computation of integral paths, a communication-intensive computation that is notoriously difficult to scale. Instead of explicitly computing and correcting integral paths, our algorithm simplifies MSz by preserving steepest ascending and descending directions across all locations, thereby minimizing interprocess communication while introducing negligible additional storage overhead. With this simplified algorithm and relaxed synchronization, our method achieves over 90% parallel efficiency on 128 GPUs on the Perlmutter supercomputer for real-world datasets.
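The idea of preserving steepest directions rather than whole integral paths can be illustrated on a small serial example. The sketch below is not the pMSz or MSz implementation: the function names, the 4-neighbor grid connectivity, and the assumption that all values are distinct are illustrative choices. It stores one steepest-descent pointer per vertex, labels each vertex by the minimum its pointers lead to, and checks that a perturbed field with identical pointers yields the same segmentation.

```python
def steepest_neighbor(field, i, j, descending=True):
    """Return the lowest (or highest) 4-neighbor of (i, j), or (i, j) at an extremum."""
    rows, cols = len(field), len(field[0])
    best = (i, j)
    for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        ni, nj = i + di, j + dj
        if 0 <= ni < rows and 0 <= nj < cols:
            candidate, current = field[ni][nj], field[best[0]][best[1]]
            if (candidate < current) if descending else (candidate > current):
                best = (ni, nj)
    return best


def direction_map(field, descending=True):
    """One pointer per vertex: the only per-vertex state the labeling needs."""
    return {
        (i, j): steepest_neighbor(field, i, j, descending)
        for i in range(len(field))
        for j in range(len(field[0]))
    }


def label_by_extremum(field, descending=True):
    """Label every vertex with the extremum reached by following its pointers."""
    direction = direction_map(field, descending)
    labels = {}
    for start in direction:
        path, cur = [], start
        # Walk until we hit an extremum or a vertex that is already labeled.
        while cur not in labels and direction[cur] != cur:
            path.append(cur)
            cur = direction[cur]
        root = labels.get(cur, cur)
        labels[cur] = root
        for vertex in path:
            labels[vertex] = root
    return labels


original = [[0, 3, 6], [4, 5, 7], [9, 8, 1]]
# Hypothetical "decompressed" field: values changed, neighbor ordering unchanged.
perturbed = [[0.1, 3, 6.2], [4, 5.1, 7], [9, 8.2, 1.1]]

same_directions = direction_map(original) == direction_map(perturbed)
labels_original = label_by_extremum(original)
labels_perturbed = label_by_extremum(perturbed)
```

Because the labels are a function of the pointer map alone, matching pointers at every vertex is sufficient for matching labels, which is a purely local condition to check.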

preprint, 2026, arXiv

Spatiotemporal Detection and Uncertainty Visualization of Atmospheric Blocking Events

Atmospheric blocking events are quasi-stationary high-pressure systems that disrupt the typical paths of polar and subtropical air currents, often producing prolonged extreme weather events such as summer heat waves or winter cold spells. Despite their critical role in shaping mid-latitude weather, accurately modeling and analyzing blocking events in long meteorological records remains a significant challenge. To address this challenge, we present an uncertainty visualization framework for detecting and characterizing atmospheric blocking events. First, we introduce a geometry-based detection and tracking method, evaluated on both pre-industrial climate model simulations (UKESM) and reanalysis data (ERA5), which represent historical Earth observations assimilated from satellite and station measurements onto regular numerical grids using weather models. Second, we propose a suite of uncertainty-aware summaries: contour boxplots that capture representative boundaries and their variability, frequency heatmaps that encode occurrences, and 3D temporal stacks that situate these patterns in time. Third, we demonstrate our framework in a case study of the 2003 European heatwave, mapping the spatiotemporal occurrences of blocking events using these summaries. Collectively, these uncertainty visualizations reveal where blocking events are most likely to occur and how their spatial footprints evolve over time. We envision our framework as a valuable tool for climate scientists and meteorologists: by analyzing how blocking frequency, duration, and intensity vary across regions and climate scenarios, it supports both the study of historical blocking events and the assessment of scenario-dependent climate risks associated with changes in extreme weather linked to blocking.

preprint, 2022, arXiv

Intrinsic Interleaving Distance for Merge Trees

Merge trees are a type of graph-based topological summary that tracks the evolution of connected components in the sublevel sets of scalar functions. They enjoy widespread applications in data analysis and scientific visualization. In this paper, we consider the problem of comparing two merge trees via the notion of interleaving distance in the metric space setting. We investigate various theoretical properties of such a metric. In particular, we show that the interleaving distance is intrinsic on the space of labeled merge trees and provide an algorithm to construct metric 1-centers for collections of labeled merge trees. We further prove that the intrinsic property of the interleaving distance also holds for the space of unlabeled merge trees. Our results are a first step toward performing statistics on graph-based topological summaries.

preprint, 2022, arXiv

The SVD of Convolutional Weights: A CNN Interpretability Framework

Deep neural networks used for image classification often use convolutional filters to extract distinguishing features before passing them to a linear classifier. Most interpretability literature focuses on providing semantic meaning to convolutional filters to explain a model's reasoning process and confirm its use of relevant information from the input domain. Fully connected layers can be studied by decomposing their weight matrices using a singular value decomposition, in effect studying the correlations between the rows in each matrix to discover the dynamics of the map. In this work we define a singular value decomposition for the weight tensor of a convolutional layer, which provides an analogous understanding of the correlations between filters, exposing the dynamics of the convolutional map. We validate our definition using recent results in random matrix theory. By applying the decomposition across the linear layers of an image classification network we suggest a framework against which interpretability methods might be applied using hypergraphs to model class separation. Rather than looking to the activations to explain the network, we use the singular vectors with the greatest corresponding singular values for each linear layer to identify those features most important to the network. We illustrate our approach with examples and introduce the DeepDataProfiler library, the analysis tool used for this study.

preprint, 2022, arXiv

Untangling Force-Directed Layouts Using Persistent Homology

Force-directed layouts belong to a popular class of methods used to position nodes in a node-link diagram. However, they typically lack direct consideration of global structures, which can result in visual clutter and the overlap of unrelated structures. In this paper, we use the principles of persistent homology to untangle force-directed layouts, thus mitigating these issues. First, we devise a new method to use 0-dimensional persistent homology to efficiently generate an initial graph layout. The approach results in faster convergence and better quality graph layouts. Second, we provide a new definition and an efficient algorithm for 1-dimensional persistent homology features (i.e., tunnels/cycles) on graphs. We provide users the ability to interact with the 1-dimensional features by highlighting them and adding cycle-emphasizing forces to the layout. Finally, we evaluate our approach with 32 synthetic and real-world graphs by computing various metrics, e.g., co-ranking and edge crossing, to demonstrate the efficacy of our proposed method.

preprint, 2021, arXiv

Parallelizing the Unpacking and Clustering of Detector Data for Reconstruction of Charged Particle Tracks on Multi-core CPUs and Many-core GPUs

We present results from parallelizing the unpacking and clustering steps of the raw data from the silicon strip modules for reconstruction of charged particle tracks. Throughput is further improved by concurrently processing multiple events using nested OpenMP parallelism on CPU or CUDA streams on GPU. The new implementation along with earlier work in developing a parallelized and vectorized implementation of the combinatoric Kalman filter algorithm has enabled efficient global reconstruction of the entire event on modern computer architectures. We demonstrate the performance of the new implementation on Intel Xeon and NVIDIA GPU architectures.

preprint, 2020, arXiv

FIS-Nets: Full-image Supervised Networks for Monocular Depth Estimation

This paper addresses the importance of full-image supervision for monocular depth estimation. We propose a semi-supervised architecture that combines an unsupervised framework based on image consistency with a supervised framework for dense depth completion. The latter provides full-image depth as supervision for the former. Ego-motion from a navigation system is also embedded into the unsupervised framework as output supervision for an inner temporal transform network, which improves monocular depth estimation. In the evaluation, we show that our proposed model outperforms other approaches on depth estimation.

preprint, 2020, arXiv

Moduli Spaces of Morse Functions for Persistence

We consider different notions of equivalence for Morse functions on the sphere in the context of persistent homology, and introduce new invariants to study these equivalence classes. These new invariants are as simple as, but more discerning than, existing topological invariants, such as persistence barcodes and Reeb graphs. We give a method to relate any two Morse-Smale vector fields on the sphere by a sequence of fundamental moves by considering graph-equivalent Morse functions. We also explore the combinatorially rich world of height-equivalent Morse functions, considered as height functions of embedded spheres in $\mathbf R^3$. Their level-set invariant, a poset generated by nested disks and annuli from level sets, gives insight into the moduli space of Morse functions sharing the same persistence barcode.

preprint, 2020, arXiv

Probabilistic Convergence and Stability of Random Mapper Graphs

We study the probabilistic convergence between the mapper graph and the Reeb graph of a topological space $\mathbb{X}$ equipped with a continuous function $f: \mathbb{X} \rightarrow \mathbb{R}$. We first give a categorification of the mapper graph and the Reeb graph by interpreting them in terms of cosheaves and stratified covers of the real line $\mathbb{R}$. We then introduce a variant of the classic mapper graph of Singh et al. (2007), referred to as the enhanced mapper graph, and demonstrate that such a construction approximates the Reeb graph of $(\mathbb{X}, f)$ when it is applied to points randomly sampled from a probability density function concentrated on $(\mathbb{X}, f)$. Our techniques are based on the interleaving distance of constructible cosheaves and topological estimation via kernel density estimates. Following Munch and Wang (2018), we first show that the mapper graph of $(\mathbb{X}, f)$, a constructible $\mathbb{R}$-space (with a fixed open cover), approximates the Reeb graph of the same space. We then construct an isomorphism between the mapper of $(\mathbb{X},f)$ and the mapper of a super-level set of a probability density function concentrated on $(\mathbb{X}, f)$. Finally, building on the approach of Bobrowski et al. (2017), we show that, with high probability, we can recover the mapper of the super-level set given a sufficiently large sample. Our work is the first to consider the mapper construction using the theory of cosheaves in a probabilistic setting. It is part of an ongoing effort to combine sheaf theory, probability, and statistics, to support topological data analysis with random data.

preprint, 2020, arXiv

Reconstruction of Charged Particle Tracks in Realistic Detector Geometry Using a Vectorized and Parallelized Kalman Filter Algorithm

One of the most computationally challenging problems expected for the High-Luminosity Large Hadron Collider (HL-LHC) is finding and fitting particle tracks during event reconstruction. Algorithms used at the LHC today rely on Kalman filtering, which builds physical trajectories incrementally while incorporating material effects and error estimation. Recognizing the need for faster computational throughput, we have adapted Kalman-filter-based methods for highly parallel, many-core SIMD and SIMT architectures that are now prevalent in high-performance hardware. Previously we observed significant parallel speedups, with physics performance comparable to CMS standard tracking, on Intel Xeon, Intel Xeon Phi, and (to a limited extent) NVIDIA GPUs. While early tests were based on artificial events occurring inside an idealized barrel detector, we showed subsequently that our mkFit software builds tracks successfully from complex simulated events (including detector pileup) occurring inside a geometrically accurate representation of the CMS-2017 tracker. Here, we report on advances in both the computational and physics performance of mkFit, as well as progress toward integration with CMS production software. Recently we have improved the overall efficiency of the algorithm by preserving short track candidates at a relatively early stage rather than attempting to extend them over many layers. Moreover, mkFit formerly produced an excess of duplicate tracks; these are now explicitly removed in an additional processing step. We demonstrate that with these enhancements, mkFit becomes a suitable choice for the first iteration of CMS tracking, and eventually for later iterations as well. We plan to test this capability in the CMS High Level Trigger during Run 3 of the LHC, with an ultimate goal of using it in both the CMS HLT and offline reconstruction for the HL-LHC CMS tracker.
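The incremental fitting that Kalman filtering performs can be illustrated with a textbook scalar filter. This is not mkFit code: the function names, the one-dimensional static state model, and the use of a process-noise term `q` as a loose stand-in for material effects are all assumptions made for illustration.

```python
def kalman_predict(x, p, q):
    """Propagate the state estimate x and its variance p to the next layer.

    The state model is static here, so only the variance grows by q.
    """
    return x, p + q


def kalman_update(x, p, z, r):
    """Fold one measurement z (variance r) into the estimate (x, p)."""
    gain = p / (p + r)
    x_new = x + gain * (z - x)
    p_new = (1.0 - gain) * p
    return x_new, p_new


def filter_hits(hits, x0, p0, r, q):
    """Build the estimate incrementally, one hit at a time."""
    x, p = x0, p0
    for z in hits:
        x, p = kalman_predict(x, p, q)
        x, p = kalman_update(x, p, z, r)
    return x, p


one_hit = filter_hits([2.0], x0=0.0, p0=1.0, r=1.0, q=0.0)
two_hits = filter_hits([2.0, 2.0], x0=0.0, p0=1.0, r=1.0, q=0.0)
```

Each hit pulls the estimate toward the measurement and shrinks the variance, which is why the update is naturally expressed as a loop over detector layers.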

preprint, 2020, arXiv

Sheaf-Theoretic Stratification Learning from Geometric and Topological Perspectives

In this paper, we investigate a sheaf-theoretic interpretation of stratification learning from geometric and topological perspectives. Our main result is the construction of stratification learning algorithms framed in terms of a sheaf on a partially ordered set with the Alexandroff topology. We prove that the resulting decomposition is the unique minimal stratification for which the strata are homogeneous and the given sheaf is constructible. In particular, when we choose to work with the local homology sheaf, our algorithm gives an alternative to the local homology transfer algorithm given in Bendich et al. (2012), and the cohomology stratification algorithm given in Nanda (2017). Additionally, we give examples of stratifications based on the geometric techniques of Breiding et al. (2018), illustrating how the sheaf-theoretic approach can be used to study stratifications from both topological and geometric perspectives. This approach also points toward future applications of sheaf theory in the study of topological data analysis by illustrating the utility of the language of sheaf theory in generalizing existing algorithms.

preprint, 2019, arXiv

Persistent Homology Guided Force-Directed Graph Layouts

Graphs are commonly used to encode relationships among entities, yet their abstractness makes them difficult to analyze. Node-link diagrams are popular for drawing graphs, and force-directed layouts provide a flexible method for node arrangements that use local relationships in an attempt to reveal the global shape of the graph. However, clutter and overlap of unrelated structures can lead to confusing graph visualizations. This paper leverages the persistent homology features of an undirected graph as derived information for interactive manipulation of force-directed layouts. We first discuss how to efficiently extract 0-dimensional persistent homology features from both weighted and unweighted undirected graphs. We then introduce the interactive persistence barcode used to manipulate the force-directed graph layout. In particular, the user adds and removes contracting and repulsing forces generated by the persistent homology features, eventually selecting the set of persistent homology features that most improve the layout. Finally, we demonstrate the utility of our approach across a variety of synthetic and real datasets.
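One standard way to extract 0-dimensional persistent homology features from a weighted graph is a union-find sweep over edges in increasing weight order, as in Kruskal's minimum spanning tree algorithm. The sketch below assumes that every vertex is born at 0 and that a smaller weight means a closer pair. The function name and the tuple layout are hypothetical, and the paper's own procedure may differ in detail.

```python
def zero_dim_persistence(num_vertices, edges):
    """Return (birth, death, edge) for each component merge.

    edges is a list of (u, v, weight) tuples. Each edge that joins two
    previously separate components kills one 0-dimensional feature.
    """
    parent = list(range(num_vertices))

    def find(v):
        # Path halving keeps the trees shallow.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    features = []
    for u, v, weight in sorted(edges, key=lambda e: e[2]):
        root_u, root_v = find(u), find(v)
        if root_u != root_v:
            parent[root_u] = root_v
            features.append((0.0, weight, (u, v)))
    return features


edges = [(0, 1, 1.0), (1, 2, 2.0), (0, 2, 3.0), (2, 3, 0.5)]
features = zero_dim_persistence(4, edges)
deaths = [death for _, death, _ in features]
```

For a connected graph with n vertices this yields n - 1 finite features, one per spanning tree edge. The edge (0, 2) above closes a cycle and therefore produces no 0-dimensional feature.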

preprint, 2015, arXiv

Geometric Inference on Kernel Density Estimates

We show that geometric inference of a point cloud can be calculated by examining its kernel density estimate with a Gaussian kernel. This allows one to consider kernel density estimates, which are robust to spatial noise, subsampling, and approximate computation in comparison to raw point sets. This is achieved by examining the sublevel sets of the kernel distance, which isomorphically map to superlevel sets of the kernel density estimate. We prove new properties about the kernel distance, demonstrating stability results and allowing it to inherit reconstruction results from recent advances in distance-based topological reconstruction. Moreover, we provide an algorithm to estimate its topology using weighted Vietoris-Rips complexes.
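The link between sublevel sets of the kernel distance and superlevel sets of the kernel density estimate can be checked numerically. The sketch below assumes an unnormalized Gaussian kernel and the usual squared kernel distance between a point set P and a query x, namely kappa(P, P) + K(x, x) - 2 kde(x). The function names and the choice of normalization are illustrative and may not match the paper's conventions.

```python
import math


def gaussian_kernel(p, x, sigma):
    """Unnormalized Gaussian kernel between two points (an assumed convention)."""
    squared = sum((a - b) ** 2 for a, b in zip(p, x))
    return math.exp(-squared / (2.0 * sigma ** 2))


def kde(points, x, sigma):
    """Kernel density estimate at x: the average kernel value over the point set."""
    return sum(gaussian_kernel(p, x, sigma) for p in points) / len(points)


def kernel_distance(points, x, sigma):
    """Kernel distance from the point set to x."""
    n = len(points)
    self_term = sum(
        gaussian_kernel(p, q, sigma) for p in points for q in points
    ) / (n * n)
    squared = self_term + gaussian_kernel(x, x, sigma) - 2.0 * kde(points, x, sigma)
    return math.sqrt(max(squared, 0.0))  # guard against tiny negative round-off


points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
near, far = (0.3, 0.3), (5.0, 5.0)
sigma = 1.0
```

Since the first two terms of the squared distance do not depend on x, the kernel distance decreases exactly when the density estimate increases, so thresholding one from above is the same as thresholding the other from below.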

preprint, 2015, arXiv

Modern Gyrokinetic Particle-In-Cell Simulation of Fusion Plasmas on Top Supercomputers

The Gyrokinetic Toroidal Code at Princeton (GTC-P) is a highly scalable and portable particle-in-cell (PIC) code. It solves the 5D Vlasov-Poisson equation featuring efficient utilization of modern parallel computer architectures at the petascale and beyond. Motivated by the goal of developing a modern code capable of dealing with the physics challenge of increasing problem size with sufficient resolution, new thread-level optimizations have been introduced as well as a key additional domain decomposition. GTC-P's multiple levels of parallelism, including inter-node 2D domain decomposition and particle decomposition, as well as intra-node shared memory partition and vectorization have enabled pushing the scalability of the PIC method to extreme computational scales. In this paper, we describe the methods developed to build a highly parallelized PIC code across a broad range of supercomputer designs. This particularly includes implementations on heterogeneous systems using NVIDIA GPU accelerators and Intel Xeon Phi (MIC) co-processors and performance comparisons with state-of-the-art homogeneous HPC systems such as Blue Gene/Q. New discovery science capabilities in the magnetic fusion energy application domain are enabled, including investigations of Ion-Temperature-Gradient (ITG) driven turbulence simulations with unprecedented spatial resolution and long temporal duration. Performance studies with realistic fusion experimental parameters are carried out on multiple supercomputing systems spanning a wide range of cache capacities, cache-sharing configurations, memory bandwidth, interconnects and network topologies. These performance comparisons using a realistic discovery-science-capable domain application code provide valuable insights on optimization techniques across one of the broadest sets of current high-end computing platforms worldwide.

preprint, 2014, arXiv

Turbulence Mitigation Scheme for Optical Communications using Orbital Angular Momentum Multiplexing Based on Channel Coding and Wavefront Correction

Free-space optical (FSO) communication links with orbital angular momentum (OAM) multiplexing have been shown to greatly enhance system capacity without a corresponding increase in spectral bandwidth, but their performance is unavoidably degraded by atmospheric turbulence (AT). Unlike the AT disturbance seen in existing systems, AT in OAM-multiplexed systems causes both burst and random errors on a single OAM state carrier and 'crosstalk' interference between different OAM state carriers. In this paper, we propose a turbulence mitigation method to improve the AT tolerance of OAM-multiplexed FSO communication links. In the proposed scheme, we use channel codes to correct the burst and random errors caused by AT on a single OAM state carrier, and we use a wavefront correction method to correct the 'crosstalk' interference between different OAM state carriers. The improvement in AT tolerance is assessed by comparing the performance of OAM-multiplexed FSO communication links with and without channel coding or the Shack-Hartmann wavefront correction method. The numerical simulation results show that the scheme enhances the AT tolerance of OAM-multiplexed FSO communication links, and that using channel codes and wavefront correction together greatly improves link performance under atmospheric turbulence.

preprint, 2013, arXiv

Approximating Local Homology from Samples

Recently, multi-scale notions of local homology (a variant of persistent homology) have been used to study the local structure of spaces around a given point from a point cloud sample. Current reconstruction guarantees rely on constructing embedded complexes, which become difficult in high dimensions. We show that the persistence diagrams used for estimating local homology can be approximated using families of Vietoris-Rips complexes, whose simple constructions are robust in any dimension. To the best of our knowledge, our results, for the first time, make applications based on local homology, such as stratification learning, feasible in high dimensions.
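The simplicity of the Vietoris-Rips construction is easy to see in code: a simplex is included whenever all of its vertices are pairwise close, so only pairwise distances are needed regardless of the ambient dimension. The sketch below is a brute-force illustration rather than the construction analyzed in the paper. The function name is hypothetical, and the convention that two points are "close" when their distance is at most `scale` (rather than twice the scale) is an assumption.

```python
import math
from itertools import combinations


def vietoris_rips(points, scale, max_dim=2):
    """Return all simplices up to max_dim as tuples of point indices."""
    n = len(points)

    def close(i, j):
        return math.dist(points[i], points[j]) <= scale

    simplices = [(i,) for i in range(n)]
    # A k-vertex simplex is present when every pair of its vertices is close.
    for k in range(2, max_dim + 2):
        for combo in combinations(range(n), k):
            if all(close(i, j) for i, j in combinations(combo, 2)):
                simplices.append(combo)
    return simplices


points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (5.0, 5.0)]
coarse = vietoris_rips(points, scale=1.5)
fine = vietoris_rips(points, scale=1.0)
```

Varying `scale` produces the nested family of complexes over which persistence is computed. The enumeration here is exponential in `max_dim` and is meant only for small inputs.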

preprint, 2013, arXiv

Local, Smooth, and Consistent Jacobi Set Simplification

The relation between two Morse functions defined on a common domain can be studied in terms of their Jacobi set. The Jacobi set contains points in the domain where the gradients of the functions are aligned. Both the Jacobi set itself as well as the segmentation of the domain it induces have been shown to be useful in various applications. Unfortunately, in practice functions often contain noise and discretization artifacts causing their Jacobi set to become unmanageably large and complex. While there exist techniques to simplify Jacobi sets, these are unsuitable for most applications as they lack fine-grained control over the process and heavily restrict the type of simplifications possible. In this paper, we introduce a new framework that generalizes critical point cancellations in scalar functions to Jacobi sets in two dimensions. We focus on simplifications that can be realized by smooth approximations of the corresponding functions and show how this implies simultaneously simplifying contiguous subsets of the Jacobi set. These extended cancellations form the atomic operations in our framework, and we introduce an algorithm to successively cancel subsets of the Jacobi set with minimal modifications according to some user-defined metric. We prove that the algorithm is correct and terminates only once no more local, smooth and consistent simplifications are possible. We disprove a previous claim on the minimal Jacobi set for manifolds with arbitrary genus and show that for simply connected domains, our algorithm reduces a given Jacobi set to its simplest configuration.

preprint, 2012, arXiv

An adaptive, high-order phase-space remapping for the two-dimensional Vlasov-Poisson equations

The numerical solution of the high-dimensional Vlasov equation is usually performed by particle-in-cell (PIC) methods. However, due to the well-known numerical noise, it is challenging to use PIC methods to get a precise description of the distribution function in phase space. To control the numerical error, we introduce an adaptive phase-space remapping which regularizes the particle distribution by periodically reconstructing the distribution function on a hierarchy of phase-space grids with high-order interpolations. The positivity of the distribution function can be preserved by using a local redistribution technique. The method has been successfully applied to a set of classical plasma problems in one dimension. In this paper, we present the algorithm for the two-dimensional Vlasov-Poisson equations. An efficient Poisson solver with infinite domain boundary conditions is used. The parallel scalability of the algorithm on massively parallel computers will be discussed.

preprint, 2012, arXiv

Extraordinary room-temperature photoluminescence in WS2 monolayers

Individual monolayers of metal dichalcogenides are atomically thin two-dimensional crystals with attractive physical properties different from their bulk layered counterpart. Here we describe the direct synthesis of WS2 monolayers with triangular morphologies and strong room-temperature photoluminescence (PL). Bulk WS2 does not present PL due to its indirect band gap nature. The edges of these monolayers exhibit PL signals with extraordinary intensity, around 25 times stronger than the platelet center. The structure and composition of the platelet edges appear to be critical for the PL enhancement effect. Electron diffraction revealed that platelets present zigzag edges, while first-principles calculations indicate that sulfur-rich zigzag WS2 edges possess metallic edge states, which might tailor the optical response reported here. These novel 2D nanoscale light sources could find diverse applications including the fabrication of flexible/transparent/low-energy optoelectronic devices.

preprint, 2012, arXiv

Performance of Polar Codes on wireless communications Channel

We discuss the performance of polar codes, a class of capacity-achieving channel codes, on wireless communication channels. By generalizing the definition of the Bhattacharyya parameter for discrete memoryless channels, we present expressions of the parameter for two continuous channels, the Gaussian channel and the Rayleigh fading channel, including the recursive formulas and the initial values. We analyze the application of polar codes with the defined parameter over the Rayleigh fading channel by transmitting image and speech. Compared with low-density parity-check (LDPC) codes under the same conditions, our simulation results show that polar codes perform better. Polar codes are therefore a good candidate for wireless communication channels.
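The role of the Bhattacharyya parameter recursion in polar code construction can be illustrated on the binary erasure channel, where the recursion is exact: one polarization step maps Z to 2Z - Z^2 for the degraded channel and to Z^2 for the upgraded channel. The sketch below covers only that textbook case. It does not reproduce the Gaussian or Rayleigh fading expressions from the paper, and the function names are hypothetical.

```python
def bhattacharyya_bec(erasure_prob, levels):
    """Bhattacharyya parameters of the 2**levels synthesized channels of a BEC."""
    z = [erasure_prob]  # initial value: Z equals the erasure probability
    for _ in range(levels):
        next_z = []
        for value in z:
            next_z.append(2.0 * value - value * value)  # degraded channel
            next_z.append(value * value)                # upgraded channel
        z = next_z
    return z


def choose_information_set(erasure_prob, levels, k):
    """Indices of the k most reliable synthesized channels (smallest Z)."""
    z = bhattacharyya_bec(erasure_prob, levels)
    ranked = sorted(range(len(z)), key=lambda i: z[i])
    return sorted(ranked[:k])


one_level = bhattacharyya_bec(0.5, 1)
two_levels = bhattacharyya_bec(0.5, 2)
info_set = choose_information_set(0.5, 2, 2)
```

Information bits are placed on the channels with the smallest parameters and the remaining positions are frozen. For other channels the same selection rule applies once a suitable recursion and initial value for the parameter are available.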

preprint, 2010, arXiv

Photoluminescence from nanocrystalline graphite monofluoride

We synthesize and study the structural and optical properties of nanocrystalline graphene monofluoride and graphite monofluoride, which are carbon-based wide bandgap materials. Using laser excitations of 2.41 to 5.08 eV, we identify six emission modes of graphite monofluoride, spanning the visible spectrum from red to violet. The energy and linewidth of the modes point to defect-induced midgap states as the source of the photoemission. We discuss possible candidates. Our findings open the window to electro-optical applications of graphene fluoride.

preprint, 2010, arXiv

Towards Stratification Learning through Homology Inference

A topological approach to stratification learning is developed for point cloud data drawn from a stratified space. Given such data, our objective is to infer which points belong to the same strata. First we define a multi-scale notion of a stratified space, giving a stratification for each radius level. We then use methods derived from kernel and cokernel persistent homology to cluster the data points into different strata, and we prove a result which guarantees the correctness of our clustering, given certain topological conditions; some geometric intuition for these topological conditions is also provided. Our correctness result is then given a probabilistic flavor: we give bounds on the minimum number of sample points required to infer, with probability, which points belong to the same strata. Finally, we give an explicit algorithm for the clustering, prove its correctness, and apply it to some simulated data.
