Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
20works
0followers
20topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

20 published item(s)

preprint2026arXiv

A Mesh-Adaptive Hypergraph Neural Network for Unsteady Flow Around Oscillating and Rotating Structures

Graph neural networks, recently introduced into the field of fluid flow surrogate modeling, have been successfully applied to model the temporal evolution of various fluid flow systems. Existing applications, however, are mostly restricted to cases where the domain is time-invariant. The present work extends the application of graph neural network-based modeling to fluid flow around structures rotating with respect to a certain axis. Specifically, we propose to apply a graph neural network-based surrogate model with part of the mesh/graph co-rotating with the structure and part of the mesh/graph static. A single layer of interface cells are constructed at the interface between the two parts and are allowed to distort and adapt, which helps in circumventing the difficulty of interpolating information encoded by the neural network at every graph neural network layer. Dedicated reconstruction and re-projection schemes are designed to counter the error caused by the distortion and connectivity change of the interface cells. The effectiveness of our proposed framework is examined on two test cases: (i) fluid flow around a 2D oscillating airfoil, and (ii) fluid flow past a 3D rotating cube. Our results show that the model achieves stable rollout predictions over hundreds or even a thousand time steps. We further demonstrate that one could enforce accurate, error-bounded prediction results by incorporating the measurements from sparse pressure sensors. In addition to the accurate flow field predictions, the lift and drag force predictions closely match with the computational fluid dynamics calculations, highlighting the potential of the framework for modeling fluid flow around rotating structures, and paving the path towards a graph neural network-based surrogate model for more complex scenarios like flow around marine propellers.

preprint2026arXiv

DeepHalo: A Neural Choice Model with Controllable Context Effects

Modeling human decision-making is central to applications such as recommendation, preference learning, and human-AI alignment. While many classic models assume context-independent choice behavior, a large body of behavioral research shows that preferences are often influenced by the composition of the choice set itself -- a phenomenon known as the context effect or Halo effect. These effects can manifest as pairwise (first-order) or even higher-order interactions among the available alternatives. Recent models that attempt to capture such effects either focus on the featureless setting or, in the feature-based setting, rely on restrictive interaction structures or entangle interactions across all orders, which limits interpretability. In this work, we propose DeepHalo, a neural modeling framework that incorporates features while enabling explicit control over interaction order and principled interpretation of context effects. Our model enables systematic identification of interaction effects by order and serves as a universal approximator of context-dependent choice functions when specialized to a featureless setting. Experiments on synthetic and real-world datasets demonstrate strong predictive performance while providing greater transparency into the drivers of choice.

preprint2022arXiv

Absolute Zero-Shot Learning

Considering the increasing concerns about data copyright and privacy issues, we present a novel Absolute Zero-Shot Learning (AZSL) paradigm, i.e., training a classifier with zero real data. The key innovation is to involve a teacher model as the data safeguard to guide the AZSL model training without data leaking. The AZSL model consists of a generator and student network, which can achieve date-free knowledge transfer while maintaining the performance of the teacher network. We investigate `black-box' and `white-box' scenarios in AZSL task as different levels of model security. Besides, we also provide discussion of teacher model in both inductive and transductive settings. Despite embarrassingly simple implementations and data-missing disadvantages, our AZSL framework can retain state-of-the-art ZSL and GZSL performance under the `white-box' scenario. Extensive qualitative and quantitative analysis also demonstrates promising results when deploying the model under `black-box' scenario.

preprint2022arXiv

Distributionally Robust Stochastic Optimization with Wasserstein Distance

Distributionally robust stochastic optimization (DRSO) is an approach to optimization under uncertainty in which, instead of assuming that there is a known true underlying probability distribution, one hedges against a chosen set of distributions. In this paper we first point out that the set of distributions should be chosen to be appropriate for the application at hand, and that some of the choices that have been popular until recently are, for many applications, not good choices. We next consider sets of distributions that are within a chosen Wasserstein distance from a nominal distribution. Such a choice of sets has two advantages: (1) The resulting distributions hedged against are more reasonable than those resulting from other popular choices of sets. (2) The problem of determining the worst-case expectation over the resulting set of distributions has desirable tractability properties. We derive a strong duality reformulation of the corresponding DRSO problem and construct approximate worst-case distributions explicitly via the first-order optimality conditions of the dual problem. Our contributions are four-fold. (i) We identify necessary and sufficient conditions for the existence of a worst-case distribution, which are naturally related to the growth rate of the objective function. (ii) We show that the worst-case distributions resulting from an appropriate Wasserstein distance have a concise structure and a clear interpretation. (iii) Using this structure, we show that data-driven DRSO problems can be approximated to any accuracy by robust optimization problems, and thereby many DRSO problems become tractable by using tools from robust optimization. (iv) Our strong duality result holds in a very general setting. As examples, we show that it can be applied to infinite-dimensional process control and intensity estimation for point processes.

preprint2022arXiv

Distributionally Robust Weighted $k$-Nearest Neighbors

Learning a robust classifier from a few samples remains a key challenge in machine learning. A major thrust of research has been focused on developing $k$-nearest neighbor ($k$-NN) based algorithms combined with metric learning that captures similarities between samples. When the samples are limited, robustness is especially crucial to ensure the generalization capability of the classifier. In this paper, we study a minimax distributionally robust formulation of weighted $k$-nearest neighbors, which aims to find the optimal weighted $k$-NN classifiers that hedge against feature uncertainties. We develop an algorithm, \texttt{Dr.k-NN}, that efficiently solves this functional optimization problem and features in assigning minimax optimal weights to training samples when performing classification. These weights are class-dependent, and are determined by the similarities of sample features under the least favorable scenarios. When the size of the uncertainty set is properly tuned, the robust classifier has a smaller Lipschitz norm than the vanilla $k$-NN, and thus improves the generalization capability. We also couple our framework with neural-network-based feature embedding. We demonstrate the competitive performance of our algorithm compared to the state-of-the-art in the few-training-sample setting with various real-data experiments.

preprint2022arXiv

Finite-Sample Guarantees for Wasserstein Distributionally Robust Optimization: Breaking the Curse of Dimensionality

Wasserstein distributionally robust optimization (DRO) aims to find robust and generalizable solutions by hedging against data perturbations in Wasserstein distance. Despite its recent empirical success in operations research and machine learning, existing performance guarantees for generic loss functions are either overly conservative due to the curse of dimensionality, or plausible only in large sample asymptotics. In this paper, we develop a non-asymptotic framework for analyzing the out-of-sample performance for Wasserstein robust learning and the generalization bound for its related Lipschitz and gradient regularization problems. To the best of our knowledge, this gives the first finite-sample guarantee for generic Wasserstein DRO problems without suffering from the curse of dimensionality. Our results highlight that Wasserstein DRO, with a properly chosen radius, balances between the empirical mean of the loss and the variation of the loss, measured by the Lipschitz norm or the gradient norm of the loss. Our analysis is based on two novel methodological developments that are of independent interest: 1) a new concentration inequality controlling the decay rate of large deviation probabilities by the variation of the loss and, 2) a localized Rademacher complexity theory based on the variation of the loss.

preprint2022arXiv

Low complexity of optimizing measures over an expanding circle map

In this paper, we prove that for real analytic expanding circle maps, all optimizing measures of a real analytic potential function have zero entropy, unless the potential is cohomologous to constant. We use the group structure of the symbolic space to solve a transversality problem involved. We also discuss applications to optimizing measures for generic smooth potentials and to Lyapunov optimizing measures.

preprint2022arXiv

Phase-SLAM: Phase Based Simultaneous Localization and Mapping for Mobile Structured Light Illumination Systems

Structured Light Illumination (SLI) systems have been used for reliable indoor dense 3D scanning via phase triangulation. However, mobile SLI systems for 360 degree 3D reconstruction demand 3D point cloud registration, involving high computational complexity. In this paper, we propose a phase based Simultaneous Localization and Mapping (Phase-SLAM) framework for fast and accurate SLI sensor pose estimation and 3D object reconstruction. The novelty of this work is threefold: (1) developing a reprojection model from 3D points to 2D phase data towards phase registration with low computational complexity; (2) developing a local optimizer to achieve SLI sensor pose estimation (odometry) using the derived Jacobian matrix for the 6 DoF variables; (3) developing a compressive phase comparison method to achieve high-efficiency loop closure detection. The whole Phase-SLAM pipeline is then exploited using existing global pose graph optimization techniques. We build datasets from both the unreal simulation platform and a robotic arm based SLI system in real-world to verify the proposed approach. The experiment results demonstrate that the proposed Phase-SLAM outperforms other state-of-the-art methods in terms of the efficiency and accuracy of pose estimation and 3D reconstruction. The open-source code is available at https://github.com/ZHENGXi-git/Phase-SLAM.

preprint2022arXiv

Picosecond timing of charged particles using the TORCH detector

TORCH is a large-area, high-precision time-of-flight (ToF) detector designed to provide charged-particle identification in the 2-20 GeV$/c$ momentum range. Prompt Cherenkov photons emitted by charged hadrons as they traverse a 10mm quartz radiator are propagated to the periphery of the detector, where they are focused onto an array of micro-channel plate photomultiplier tubes (MCP-PMTs). The position and arrival times of the photons are used to infer the particles' time of entry in the radiator, to identify hadrons based on their ToF. The MCP-PMTs were developed with an industrial partner to satisfy the stringent requirements of the TORCH detector. The requirements include a finely segmented anode, excellent time resolution, and a long lifetime. Over an approximately 10m flight distance, the difference in ToF between a kaon and a pion with 10GeV$/c$ momentum is 35ps, leading to a 10-15ps per track timing resolution requirement. On average 30 photons per hadron are detected, which translates to a single-photon time resolution of 70ps. The TORCH research and development program aims to demonstrate the validity of the detector concept through laboratory and beam tests, results from which are presented. A timing resolution of 70-100ps was reached in beam tests, approaching the TORCH design goal. Laboratory timing tests consist of operating the MCP-PMTs coupled to the TORCH readout electronics. A time resolution of about 50ps was measured, meeting the TORCH target timing resolution.

preprint2022arXiv

Regularity of calibrated sub-actions for circle expanding maps and Sturmian optimization

In this short and elementary note, we study some ergodic optimization problems for circle expanding maps. We first make an observation that if a function is not far from being convex, then its calibrated sub-actions are closer to convex functions in certain effective way. As an application of this simple observation, for circle doubling map, we generalize a result of Bousch saying that translations of the cosine function are uniquely optimized by Sturmian measures. Our argument follows the mainline of Bousch's original proof, while the technical part is simplified by the observation mentioned above, and no numerical calculation is needed.

preprint2022arXiv

Reinforcement Learned Distributed Multi-Robot Navigation with Reciprocal Velocity Obstacle Shaped Rewards

The challenges to solving the collision avoidance problem lie in adaptively choosing optimal robot velocities in complex scenarios full of interactive obstacles. In this paper, we propose a distributed approach for multi-robot navigation which combines the concept of reciprocal velocity obstacle (RVO) and the scheme of deep reinforcement learning (DRL) to solve the reciprocal collision avoidance problem under limited information. The novelty of this work is threefold: (1) using a set of sequential VO and RVO vectors to represent the interactive environmental states of static and dynamic obstacles, respectively; (2) developing a bidirectional recurrent module based neural network, which maps the states of a varying number of surrounding obstacles to the actions directly; (3) developing a RVO area and expected collision time based reward function to encourage reciprocal collision avoidance behaviors and trade off between collision risk and travel time. The proposed policy is trained through simulated scenarios and updated by the actor-critic based DRL algorithm. We validate the policy in complex environments with various numbers of differential drive robots and obstacles. The experiment results demonstrate that our approach outperforms the state-of-art methods and other learning based approaches in terms of the success rate, travel time, and average speed. Source code of this approach is available at https://github.com/hanruihua/rl_rvo_nav.

preprint2022arXiv

SSGCNet: A Sparse Spectra Graph Convolutional Network for Epileptic EEG Signal Classification

In this article, we propose a sparse spectra graph convolutional network (SSGCNet) for solving Epileptic EEG signal classification problems. The aim is to achieve a lightweight deep learning model without losing model classification accuracy. We propose a weighted neighborhood field graph (WNFG) to represent EEG signals, which reduces the redundant edges between graph nodes. WNFG has lower time complexity and memory usage than the conventional solutions. Using the graph representation, the sequential graph convolutional network is based on a combination of sparse weight pruning technique and the alternating direction method of multipliers (ADMM). Our approach can reduce computation complexity without effect on classification accuracy. We also present convergence results for the proposed approach. The performance of the approach is illustrated in public and clinical-real datasets. Compared with the existing literature, our WNFG of EEG signals achieves up to 10 times of redundant edge reduction, and our approach achieves up to 97 times of model pruning without loss of classification accuracy.

preprint2022arXiv

Two-sample Test with Kernel Projected Wasserstein Distance

We develop a kernel projected Wasserstein distance for the two-sample test, an essential building block in statistics and machine learning: given two sets of samples, to determine whether they are from the same distribution. This method operates by finding the nonlinear mapping in the data space which maximizes the distance between projected distributions. In contrast to existing works about projected Wasserstein distance, the proposed method circumvents the curse of dimensionality more efficiently. We present practical algorithms for computing this distance function together with the non-asymptotic uncertainty quantification of empirical estimates. Numerical examples validate our theoretical results and demonstrate good performance of the proposed method.

preprint2021arXiv

Generalize Ultrasound Image Segmentation via Instant and Plug & Play Style Transfer

Deep segmentation models that generalize to images with unknown appearance are important for real-world medical image analysis. Retraining models leads to high latency and complex pipelines, which are impractical in clinical settings. The situation becomes more severe for ultrasound image analysis because of their large appearance shifts. In this paper, we propose a novel method for robust segmentation under unknown appearance shifts. Our contribution is three-fold. First, we advance a one-stage plug-and-play solution by embedding hierarchical style transfer units into a segmentation architecture. Our solution can remove appearance shifts and perform segmentation simultaneously. Second, we adopt Dynamic Instance Normalization to conduct precise and dynamic style transfer in a learnable manner, rather than previously fixed style normalization. Third, our solution is fast and lightweight for routine clinical adoption. Given 400*400 image input, our solution only needs an additional 0.2ms and 1.92M FLOPs to handle appearance shifts compared to the baseline pipeline. Extensive experiments are conducted on a large dataset from three vendors demonstrate our proposed method enhances the robustness of deep segmentation models.

preprint2020arXiv

On fair entropy of the tent family

The notions of fair measure and fair entropy were introduced by Misiurewicz and Rodrigues recently, and discussed in detail for piecewise monotone interval maps. In particular, they showed that the fair entropy $h(a)$ of the tent map $f_a$, as a function of the parameter $a=\exp(h_{top}(f_a))$, is continuous and strictly increasing on $[\sqrt{2},2]$. In this short note, we extend the last result and characterize regularity of the function $h$ precisely. We prove that $h$ is $\frac{1}{2}$-Hölder continuous on $[\sqrt{2},2]$ and identify its best Hölder exponent on each subinterval of $[\sqrt{2},2]$. On the other hand, parallel to a recent result on topological entropy of the quadratic family due to Dobbs and Mihalache, we give a formula of pointwise Hölder exponents of $h$ at parameters chosen in an explicitly constructed set of full measure. This formula particularly implies that the derivative of $h$ vanishes almost everywhere.

preprint2020arXiv

Remove Appearance Shift for Ultrasound Image Segmentation via Fast and Universal Style Transfer

Deep Neural Networks (DNNs) suffer from the performance degradation when image appearance shift occurs, especially in ultrasound (US) image segmentation. In this paper, we propose a novel and intuitive framework to remove the appearance shift, and hence improve the generalization ability of DNNs. Our work has three highlights. First, we follow the spirit of universal style transfer to remove appearance shifts, which was not explored before for US images. Without sacrificing image structure details, it enables the arbitrary style-content transfer. Second, accelerated with Adaptive Instance Normalization block, our framework achieved real-time speed required in the clinical US scanning. Third, an efficient and effective style image selection strategy is proposed to ensure the target-style US image and testing content US image properly match each other. Experiments on two large US datasets demonstrate that our methods are superior to state-of-the-art methods on making DNNs robust against various appearance shifts.

preprint2020arXiv

Thermal behaviors of light scalar resonances at low temperatures

We study the thermal properties of the lowest multiplet of the QCD light-flavor scalar resonances, including the $f_0(500)/σ$, $K_{0}^{*}(700)/κ$, $f_0(980)$ and $a_0(980)$, in the framework of unitarized $U(3)$ chiral perturbation theory. After the successful fits to the meson-meson scattering inputs, such as the phase shifts and inelasticities, we obtain the unknown parameters and further calculate the resonance poles and their residues at zero temperature. By including the finite-temperature effects in the unitarized meson-meson scattering amplitudes, the thermal behaviors of the scalar resonance poles in the complex energy plane are studied. The masses of $σ$ and $κ$ are found to considerably decrease when increasing the temperatures, while their widths turn out to be still large when the temperatures reach around $200$ MeV. In contrast, both the masses and widths of the $f_0(980)$ and $a_0(980)$ are only slightly changed.

preprint2020arXiv

UO2/BeO interfacial thermal resistance and its effect on fuel thermal conductivity

UO2/BeO interfacial thermal resistance (ITR) is calculated by diffuse mismatch model (DMM) and the effects of ITR on UO2-BeO thermal conductivity are investigated. ITR predicted by DMM is on the order of 10-9 m2K/W. Using this ITR, UO2-BeO thermal conductivities are calculated by theoretical models and compared with experimental data. The results indicate that DMM prediction is applicable to the interface between UO2 and dispersed BeO, while not applicable to the interface between UO2 and continuous BeO. If the thermal conductivity of UO2 containing continuous BeO was to be in agreement with experimental data, its ITR should be on the order of 10-6 - 10-5 m2K/W. Therefore, the vibrational mismatch between UO2 and BeO considered by DMM is the major mechanism for attenuating the heat flux through UO2/dispersed-BeO interface, but not for UO2/continuous-BeO interface. Furthermore, it is found that the presence of ITR leads to the dependence of the thermal conductivity of UO2 containing dispersed BeO on BeO size. With the decrease in BeO size, UO2-BeO thermal conductivity decreases. When BeO size is smaller than a critical value, UO2-BeO thermal conductivity becomes even smaller than UO2 thermal conductivity. For UO2 containing continuous BeO, the thermal conductivity decreases with the decrease in the size of UO2 granule surrounded by BeO, but not necessarily smaller than UO2 thermal conductivity. Under a critical temperature, UO2-BeO thermal conductivity is always larger than UO2 thermal conductivity. Above the critical temperature, UO2-BeO thermal conductivity is larger than UO2 thermal conductivity only when UO2 granule size is large enough. The conditions for achieving the targeted enhancement of UO2 thermal conductivity by doping with BeO are derived. These conditions can be used to design and optimize the distribution, content, size of BeO, and the size of UO2 granule.

preprint2010arXiv

ISIS2: Pixel Sensor with Local Charge Storage for ILC Vertex Detector

ISIS (In-situ Storage Imaging Sensor) is a novel CMOS sensor with multiple charge storage capability developed for the ILC vertex detector by the Linear Collider Flavour Identification (LCFI) collaboration. This paper reports test results for ISIS2, the second generation of ISIS sensors implemented in a 0.18 micron CMOS process. The local charge storage and charge transfer were unambiguously demonstrated.

preprint2009arXiv

The LCFIVertex package: vertexing, flavour tagging and vertex charge reconstruction with an ILC vertex detector

The precision measurements envisaged at the International Linear Collider (ILC) depend on excellent instrumentation and reconstruction software. The correct identification of heavy flavour jets, placing unprecedented requirements on the quality of the vertex detector, will be central for the ILC programme. This paper describes the LCFIVertex software, which provides tools for vertex finding and for identification of the flavour and charge of the leading hadron in heavy flavour jets. These tools are essential for the ongoing optimisation of the vertex detector design for linear colliders such as the ILC. The paper describes the algorithms implemented in the LCFIVertex package, as well as the scope of the code and its performance for a typical vertex detector design.