Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
17works
0followers
16topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

17 published item(s)

preprint2026arXiv

The Potential of Erroneous Outbound Traffic Analysis to Unveil Silent Internal Anomalies

Passive measurement has traditionally focused on inbound traffic to detect malicious activity, based on the assumption that threats originate externally. In this paper, we offer a complementary perspective by examining outbound traffic, and argue that a narrow subset -- what we term erroneous outbound traffic -- is a lighter and revealing yet overlooked data source for identifying a broad range of security threats and network problems. This traffic consists of packets sent by internal hosts that either receive no response, trigger ICMP errors, or are ICMP error messages themselves generated in response to unsolicited requests. To demonstrate its potential, we collect and analyse erroneous traffic from a large network, uncovering a variety of previously unnoticed issues, including misconfigurations, obsolete deployments and compromised hosts.

preprint2025arXiv

Competing Antiferromagnetic Phases in Multiferroic Wurtzite Transition-Metal Chalcogenides

Antiferromagnetic (AFM) spintronics offers a pathway toward electrically controllable spin-based devices beyond ferromagnets. Here, we identify wurtzite MnX (X = S, Se, Te) as a family of multiferroic materials hosting competing AFM phases, including altermagnetic, where nonrelativistic spin splitting can be controlled by ferroelectric polarization. Using density-functional theory and atomistic spin-model calculations, we show that all pristine MnX compounds stabilize a stripe type collinear AFM ground state, contrary to earlier predictions of an altermagnetic ground state, with the magnetic order governed by frustrated Heisenberg and biquadratic exchange interactions. We further demonstrate that Cr doping drives a transition to an A-type AFM phase that breaks Kramers spin degeneracy and realizes a g-wave altermagnetic state with large nonrelativistic spin splitting near the Fermi level. Importantly, this spin splitting can be deterministically reversed by polarization switching, enabling electric-field control of altermagnetic electronic structure without reorienting the Neel vector or relying on spin-orbit coupling. The close energetic proximity of the stripe AFM to a noncollinear all-in-all-out configuration indicates that wurtzite MnX lies near a topological magnetic phase with finite scalar spin chirality, which may be stabilized by modest perturbations such as temperature, strain or chemical tuning. The distinct magnetic phases exhibit symmetry selective linear and non-linear Hall responses, providing direct transport signatures of altermagnetism and polarization control. Together, these results establish doped wurtzite MnX as a promising platform for altermagnet-ferroelectric multiferroics and electrically AFM spintronics.

preprint2022arXiv

A Learned Index for Exact Similarity Search in Metric Spaces

Indexing is an effective way to support efficient query processing in large databases. Recently the concept of learned index, which replaces or complements traditional index structures with machine learning models, has been actively explored to reduce storage and search costs. However, accurate and efficient similarity query processing in high-dimensional metric spaces remains to be an open challenge. In this paper, we propose a novel indexing approach called LIMS that uses data clustering, pivot-based data transformation techniques and learned indexes to support efficient similarity query processing in metric spaces. In LIMS, the underlying data is partitioned into clusters such that each cluster follows a relatively uniform data distribution. Data redistribution is achieved by utilizing a small number of pivots for each cluster. Similar data are mapped into compact regions and the mapped values are totally ordinal. Machine learning models are developed to approximate the position of each data record on disk. Efficient algorithms are designed for processing range queries and nearest neighbor queries based on LIMS, and for index maintenance with dynamic updates. Extensive experiments on real-world and synthetic datasets demonstrate the superiority of LIMS compared with traditional indexes and state-of-the-art learned indexes.

preprint2022arXiv

Automation of Radiation Treatment Planning for Rectal Cancer

To develop an automated workflow for rectal cancer three-dimensional conformal radiotherapy treatment planning that combines deep-learning(DL) aperture predictions and forward-planning algorithms. We designed an algorithm to automate the clinical workflow for planning with field-in-field. DL models were trained, validated, and tested on 555 patients to automatically generate aperture shapes for primary and boost fields. Network inputs were digitally reconstructed radiography, gross tumor volume(GTV), and nodal GTV. A physician scored each aperture for 20 patients on a 5-point scale(>3 acceptable). A planning algorithm was then developed to create a homogeneous dose using a combination of wedges and subfields. The algorithm iteratively identifies a hotspot volume, creates a subfield, and optimizes beam weight all without user intervention. The algorithm was tested on 20 patients using clinical apertures with different settings, and the resulting plans(4 plans/patient) were scored by a physician. The end-to-end workflow was tested and scored by a physician on 39 patients using DL-generated apertures and planning algorithms. The predicted apertures had Dice scores of 0.95, 0.94, and 0.90 for posterior-anterior, laterals, and boost fields, respectively. 100%, 95%, and 87.5% of the posterior-anterior, laterals, and boost apertures were scored as clinically acceptable, respectively. Wedged and non-wedged plans were clinically acceptable for 85% and 50% of patients, respectively. The final plans hotspot dose percentage was reduced from 121%($\pm$ 14%) to 109%($\pm$ 5%) of prescription dose. The integrated end-to-end workflow of automatically generated apertures and optimized field-in-field planning gave clinically acceptable plans for 38/39(97%) of patients. We have successfully automated the clinical workflow for generating radiotherapy plans for rectal cancer for our institution.

preprint2022arXiv

BigDL 2.0: Seamless Scaling of AI Pipelines from Laptops to Distributed Cluster

Most AI projects start with a Python notebook running on a single laptop; however, one usually needs to go through a mountain of pains to scale it to handle larger dataset (for both experimentation and production deployment). These usually entail many manual and error-prone steps for the data scientists to fully take advantage of the available hardware resources (e.g., SIMD instructions, multi-processing, quantization, memory allocation optimization, data partitioning, distributed computing, etc.). To address this challenge, we have open sourced BigDL 2.0 at https://github.com/intel-analytics/BigDL/ under Apache 2.0 license (combining the original BigDL and Analytics Zoo projects); using BigDL 2.0, users can simply build conventional Python notebooks on their laptops (with possible AutoML support), which can then be transparently accelerated on a single node (with up-to 9.6x speedup in our experiments), and seamlessly scaled out to a large cluster (across several hundreds servers in real-world use cases). BigDL 2.0 has already been adopted by many real-world users (such as Mastercard, Burger King, Inspur, etc.) in production.

preprint2022arXiv

Ferroelectric control of magnetic skyrmions in two-dimensional van der Waals heterostructures

Magnetic skyrmions are chiral nanoscale spin textures which are usually induced by Dzyaloshinskii-Moriya interaction (DMI). Recently, magnetic skyrmions have been observed in two-dimensional (2D) van der Waals (vdW) ferromagnetic materials, such as Fe$_{3}$GeTe$_{2}$. The electric control of skyrmions is important for their potential application in low-power memory technologies. Here, we predict that DMI and magnetic skyrmions in a Fe$_{3}$GeTe$_{2}$ monolayer can be controlled by ferroelectric polarization of an adjacent 2D vdW ferroelectric In$_{2}$Se$_{3}$. Based on density functional theory and atomistic spin-dynamics modeling, we find that the interfacial symmetry breaking produces a sizable DMI in a Fe$_{3}$GeTe$_{2}$/In$_{2}$Se$_{3}$ vdW heterostructure. We show that the magnitude of DMI can be controlled by ferroe-lectric polarization reversal, leading to creation and annihilation of skyrmions. Furthermore, we find that the sign of DMI in a In$_{2}$Se$_{3}$/Fe$_{3}$GeTe$_{2}$/In$_{2}$Se$_{3}$ heterostructure changes with ferroelectric switching reversing the skyrmion chirality. The predicted electrically controlled skyrmion formation may be interesting for spintronic applications.

preprint2022arXiv

Gaia: Graph Neural Network with Temporal Shift aware Attention for Gross Merchandise Value Forecast in E-commerce

E-commerce has gone a long way in empowering merchants through the internet. In order to store the goods efficiently and arrange the marketing resource properly, it is important for them to make the accurate gross merchandise value (GMV) prediction. However, it's nontrivial to make accurate prediction with the deficiency of digitized data. In this article, we present a solution to better forecast GMV inside Alipay app. Thanks to graph neural networks (GNN) which has great ability to correlate different entities to enrich information, we propose Gaia, a graph neural network (GNN) model with temporal shift aware attention. Gaia leverages the relevant e-seller' sales information and learn neighbor correlation based on temporal dependencies. By testing on Alipay's real dataset and comparing with other baselines, Gaia has shown the best performance. And Gaia is deployed in the simulated online environment, which also achieves great improvement compared with baselines.

preprint2022arXiv

Granular dynamics in auger sampling

From geotechnical applications to space exploration, auger drilling is often used as a standard tool for soil sample collection, instrument installation, and others. Focusing on granular flow associated with the rotary drilling process, we investigate the performance of auger drilling in terms of sampling efficiency, defined as the mass ratio of the soil sample collected in the coring tube to its total volume at a given penetration depth, by means of experiments, numerical simulations, as well as theoretical analysis. The ratio of rotation to penetration speed is found to play a crucial role in the sampling process. A continuum model for the coupled granular flow in both coring and discharging channels is proposed to elucidate the physical mechanism behind the sampling process. Supported by a comparison to experimental results, the continuum model provides a practical way to predict the performance of auger drilling. Further analysis reveals that the drilling process approaches a steady state with constant granular flow speeds in both channels. In the steady state, sampling efficiency decreases linearly with the growth of the rotation to penetration speed ratio, which can be well captured by the analytical solution of the model. The analytical solution also suggests that the sampling efficiency is independent of gravity in the steady state, which has profound implications for extraterrestrial sample collection in future space missions.

preprint2022arXiv

Meta-Reinforcement Learning in Broad and Non-Parametric Environments

Recent state-of-the-art artificial agents lack the ability to adapt rapidly to new tasks, as they are trained exclusively for specific objectives and require massive amounts of interaction to learn new skills. Meta-reinforcement learning (meta-RL) addresses this challenge by leveraging knowledge learned from training tasks to perform well in previously unseen tasks. However, current meta-RL approaches limit themselves to narrow parametric task distributions, ignoring qualitative differences between tasks that occur in the real world. In this paper, we introduce TIGR, a Task-Inference-based meta-RL algorithm using Gaussian mixture models (GMM) and gated Recurrent units, designed for tasks in non-parametric environments. We employ a generative model involving a GMM to capture the multi-modality of the tasks. We decouple the policy training from the task-inference learning and efficiently train the inference mechanism on the basis of an unsupervised reconstruction objective. We provide a benchmark with qualitatively distinct tasks based on the half-cheetah environment and demonstrate the superior performance of TIGR compared to state-of-the-art meta-RL approaches in terms of sample efficiency (3-10 times faster), asymptotic performance, and applicability in non-parametric environments with zero-shot adaptation.

preprint2022arXiv

Role of cohesion in the formation of kink wave fronts in vibrofluidized granular materials

The formation of kink wave fronts (KWFs) in a quasi-two-dimensional granular system is investigated numerically with a focus on the role of cohesive interactions between individual particles. The cohesive particle-particle interaction is achieved through tuning the velocity-dependent coefficient of restitution, based on an analytical model introduced recently. A comparison with experimental results indicates that the threshold for the emergence of traveling KWFs matches the regime in which the center of mass height of the granular layers fluctuates with a period that triples the vibration period. Further comparisons between dry and wet granular dynamics reveal that KWFs are more pronounced in wet granular layers because of enhanced collective motion induced by cohesion.

preprint2022arXiv

Study on the Kinetics of Rayleigh Particle Jets Converging by Laser Beams

This paper discusses laser-induced flow stabilizing of Rayleigh particle jets. Laser technology, has important applications in micro/nano-scale static monomer particle operations, such as optical tweezers, or is used for the passive measurement of macroscopic physical features of particle groups. However, it is relatively rare for the laser beam to directly interfere with the behavior of particle populations dynamically, so as to achieve the purpose of instant group manipulations. Based on the theoretical analysis of particle dynamics and hydrodynamic stability theory, the effects of light induced convergence on rarified jets (as a point source emitting particle off a nozzle) and denser jets consists of Rayleigh sized particles have been considered. For rarified particle jet's analysis, compared with the classical vacuum evaporation deposition theory, we found that the laser positively guided the movement of particles, leading their pathes into more concentrated targets. Such convergence effect also happens in the case of denser Rayleigh particle jets. Particle dynamics simulations and hydrodynamic stability analysis mutually authenticated that the optical field forces suppress the instability both of long-wave and short-wave on particle jet interfaces, and have broad-spectrum stabilization characteristics. Therefor, diffusive particles in vacuum evaporation can also have very good targeted aggregations by laser.

preprint2021arXiv

Structure Assisted NMF Methods for Separation of Degenerate Mixture Data with Application to NMR Spectroscopy

In this paper, we develop structure assisted nonnegative matrix factorization (NMF) methods for blind source separation of degenerate data. The motivation originates from nuclear magnetic resonance (NMR) spectroscopy, where a multiple mixture NMR spectra are recorded to identify chemical compounds with similar structures. Consider the linear mixing model (LMM), we aim to identify the chemical compounds involved when the mixing process is known to be nearly singular. We first consider a class of data with dominant interval(s) (DI) where each of source signals has dominant peaks over others. Besides, a nearly singular mixing process produces degenerate mixtures. The DI condition implies clustering structures in the data points. Hence, the estimation of the mixing matrix could be achieved by data clustering. Due to the presence of the noise and the degeneracy of the data, a small deviation in the estimation may introduce errors in the output. To resolve this problem and improve robustness of the separation, methods are developed in two aspects. One is to find better estimation of the mixing matrix by allowing a constrained perturbation to the clustering output, and it can be achieved by a quadratic programming. The other is to seek sparse source signals by exploiting the DI condition, and it solves an $\ell_1$ optimization. If no source information is available, we propose to adopt the nonnegative matrix factorization approach by incorporating the matrix structure (parallel columns of the mixing matrix) into the cost function and develop multiplicative iteration rules for the numerical solutions. We present experimental results of NMR data to show the performance and reliability of the method in the applications arising in NMR spectroscopy.

preprint2020arXiv

3D Object Detection and Tracking Based on Streaming Data

Recent approaches for 3D object detection have made tremendous progresses due to the development of deep learning. However, previous researches are mostly based on individual frames, leading to limited exploitation of information between frames. In this paper, we attempt to leverage the temporal information in streaming data and explore 3D streaming based object detection as well as tracking. Toward this goal, we set up a dual-way network for 3D object detection based on keyframes, and then propagate predictions to non-key frames through a motion based interpolation algorithm guided by temporal information. Our framework is not only shown to have significant improvements on object detection compared with frame-by-frame paradigm, but also proven to produce competitive results on KITTI Object Tracking Benchmark, with 76.68% in MOTA and 81.65% in MOTP respectively.

preprint2020arXiv

Complex Robotic Manipulation via Graph-Based Hindsight Goal Generation

Reinforcement learning algorithms such as hindsight experience replay (HER) and hindsight goal generation (HGG) have been able to solve challenging robotic manipulation tasks in multi-goal settings with sparse rewards. HER achieves its training success through hindsight replays of past experience with heuristic goals, but under-performs in challenging tasks in which goals are difficult to explore. HGG enhances HER by selecting intermediate goals that are easy to achieve in the short term and promising to lead to target goals in the long term. This guided exploration makes HGG applicable to tasks in which target goals are far away from the object's initial position. However, HGG is not applicable to manipulation tasks with obstacles because the euclidean metric used for HGG is not an accurate distance metric in such environments. In this paper, we propose graph-based hindsight goal generation (G-HGG), an extension of HGG selecting hindsight goals based on shortest distances in an obstacle-avoiding graph, which is a discrete representation of the environment. We evaluated G-HGG on four challenging manipulation tasks with obstacles, where significant enhancements in both sample efficiency and overall success rate are shown over HGG and HER. Videos can be viewed at https://sites.google.com/view/demos-g-hgg/.

preprint2020arXiv

Indirect and Direct Training of Spiking Neural Networks for End-to-End Control of a Lane-Keeping Vehicle

Building spiking neural networks (SNNs) based on biological synaptic plasticities holds a promising potential for accomplishing fast and energy-efficient computing, which is beneficial to mobile robotic applications. However, the implementations of SNNs in robotic fields are limited due to the lack of practical training methods. In this paper, we therefore introduce both indirect and direct end-to-end training methods of SNNs for a lane-keeping vehicle. First, we adopt a policy learned using the \textcolor{black}{Deep Q-Learning} (DQN) algorithm and then subsequently transfer it to an SNN using supervised learning. Second, we adopt the reward-modulated spike-timing-dependent plasticity (R-STDP) for training SNNs directly, since it combines the advantages of both reinforcement learning and the well-known spike-timing-dependent plasticity (STDP). We examine the proposed approaches in three scenarios in which a robot is controlled to keep within lane markings by using an event-based neuromorphic vision sensor. We further demonstrate the advantages of the R-STDP approach in terms of the lateral localization accuracy and training time steps by comparing them with other three algorithms presented in this paper.

preprint2020arXiv

Large-scale Uncertainty Estimation and Its Application in Revenue Forecast of SMEs

The economic and banking importance of the small and medium enterprise (SME) sector is well recognized in contemporary society. Business credit loans are very important for the operation of SMEs, and the revenue is a key indicator of credit limit management. Therefore, it is very beneficial to construct a reliable revenue forecasting model. If the uncertainty of an enterprise's revenue forecasting can be estimated, a more proper credit limit can be granted. Natural gradient boosting approach, which estimates the uncertainty of prediction by a multi-parameter boosting algorithm based on the natural gradient. However, its original implementation is not easy to scale into big data scenarios, and computationally expensive compared to state-of-the-art tree-based models (such as XGBoost). In this paper, we propose a Scalable Natural Gradient Boosting Machines that is simple to implement, readily parallelizable, interpretable and yields high-quality predictive uncertainty estimates. According to the characteristics of revenue distribution, we derive an uncertainty quantification function. We demonstrate that our method can distinguish between samples that are accurate and inaccurate on revenue forecasting of SMEs. What's more, interpretability can be naturally obtained from the model, satisfying the financial needs.

preprint2020arXiv

Set Voronoi Tessellation for Particulate Systems in Two Dimensions

Given a countable set of points in a continuous space, Voronoi tessellation is an intuitive way of partitioning the space according to the distance to the individual points. As a powerful approach to obtain structural information, it has a long history and widespread applications in diverse disciplines, from astronomy to urban planning. For particulate systems in real life, such as a pile of sand or a crowd of pedestrians, the realization of Voronoi tessellation needs to be modified to accommodate the fact that the particles cannot be simply treated as points. Here, we elucidate the use of Set Voronoi tessellation (i.e., considering for a non-spherical particle a set of points on its surface) to extract meaningful local information in a quasi-two-dimensional system of granular rods. In addition, we illustrate how it can be applied to arbitrarily shaped particles such as an assembly of honey bees or pedestrians for obtaining structural information. Details on the implementation of this algorithm with the strategy of balancing computational cost and accuracy are discussed. Furthermore, we provide our python code as open source in order to facilitate Set Voronoi calculations in two dimensions for arbitrarily shaped objects.